Showing 1–26 of 26 results for author: Metaxas, D

Searching in archive eess.
  1. arXiv:2406.04324

    cs.CV eess.IV

    SF-V: Single Forward Video Generation Model

    Authors: Zhixing Zhang, Yanyu Li, Yushu Wu, Yanwu Xu, Anil Kag, Ivan Skorokhodov, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Dimitris Metaxas, Sergey Tulyakov, Jian Ren

    Abstract: Diffusion-based video generation models have demonstrated remarkable success in obtaining high-fidelity videos through the iterative denoising process. However, these models require multiple denoising steps during sampling, resulting in high computational costs. In this work, we propose a novel approach to obtain single-step video generation models by leveraging adversarial training to fine-tune p…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://snap-research.github.io/SF-V

  2. arXiv:2404.01082

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Liping Zhang, Weitian Chen, Yidong Zhao, et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p…

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  3. arXiv:2309.13839

    eess.IV cs.CV

    Fill the K-Space and Refine the Image: Prompting for Dynamic and Multi-Contrast MRI Reconstruction

    Authors: Bingyu Xin, Meng Ye, Leon Axel, Dimitris N. Metaxas

    Abstract: The key to dynamic or multi-contrast magnetic resonance imaging (MRI) reconstruction lies in exploring inter-frame or inter-contrast information. Currently, the unrolled model, an approach combining iterative MRI reconstruction steps with learnable neural network layers, stands as the best-performing method for MRI reconstruction. However, there are two main limitations to overcome: firstly, the u…

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: STACOM 2023; Code is available at https://github.com/hellopipu/PromptMR

  4. arXiv:2308.09223

    eess.IV cs.CV cs.LG

    DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction

    Authors: Xiaoxiao He, Chaowei Tan, Ligong Han, Bo Liu, Leon Axel, Kang Li, Dimitris N. Metaxas

    Abstract: Accurate 3D cardiac reconstruction from cine magnetic resonance imaging (cMRI) is crucial for improved cardiovascular disease diagnosis and understanding of the heart's motion. However, current cardiac MRI-based reconstruction technology used in clinical settings is 2D with limited through-plane resolution, resulting in low-quality reconstructed cardiac volumes. To better reconstruct 3D cardiac vo…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted in MICCAI 2023

  5. arXiv:2308.04663

    eess.IV cs.CV cs.LG

    Classification of lung cancer subtypes on CT images with synthetic pathological priors

    Authors: Wentao Zhu, Yuan Jin, Gege Ma, Geng Chen, Jan Egger, Shaoting Zhang, Dimitris N. Metaxas

    Abstract: The accurate diagnosis on pathological subtypes for lung cancer is of significant importance for the follow-up treatments and prognosis managements. In this paper, we propose self-generating hybrid feature network (SGHF-Net) for accurately classifying lung cancer subtypes on computed tomography (CT) images. Inspired by studies stating that cross-scale associations exist in the image patterns betwe…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 16 pages, 7 figures

    Journal ref: Medical Image Analysis 95, July 2024, 103199

  6. arXiv:2306.05705

    eess.IV cs.CV

    On the Challenges and Perspectives of Foundation Models for Medical Image Analysis

    Authors: Shaoting Zhang, Dimitris Metaxas

    Abstract: This article discusses the opportunities, applications and future directions of large-scale pre-trained models, i.e., foundation models, for analyzing medical images. Medical foundation models have immense potential in solving a wide range of downstream tasks, as they can help to accelerate the development of accurate and robust models, reduce the large amounts of required labeled data, preserve t…

    Submitted 21 November, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  7. arXiv:2303.14357

    eess.IV cs.CV cs.LG

    Dealing With Heterogeneous 3D MR Knee Images: A Federated Few-Shot Learning Method With Dual Knowledge Distillation

    Authors: Xiaoxiao He, Chaowei Tan, Bo Liu, Liping Si, Weiwu Yao, Liang Zhao, Di Liu, Qilong Zhangli, Qi Chang, Kang Li, Dimitris N. Metaxas

    Abstract: Federated Learning has gained popularity among medical institutions since it enables collaborative training between clients (e.g., hospitals) without aggregating data. However, due to the high cost associated with creating annotations, especially for large 3D image datasets, clinical institutions do not have enough supervised data for training locally. Thus, the performance of the collaborative mo…

    Submitted 17 April, 2023; v1 submitted 25 March, 2023; originally announced March 2023.

  8. arXiv:2206.07163

    cs.CV cs.LG eess.IV

    DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative Method

    Authors: Qi Chang, Zhennan Yan, Mu Zhou, Di Liu, Khalid Sawalha, Meng Ye, Qilong Zhangli, Mikael Kanski, Subhi Al Aref, Leon Axel, Dimitris Metaxas

    Abstract: Joint 2D cardiac segmentation and 3D volume reconstruction are fundamental to building statistical cardiac anatomy models and understanding functional mechanisms from motion patterns. However, due to the low through-plane resolution of cine MR and high inter-subject variance, accurately segmenting cardiac images and reconstructing the 3D volume are challenging. In this study, we propose an end-to-…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: MICCAI2022

  9. arXiv:2203.10726

    eess.IV cs.CV

    TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers

    Authors: Di Liu, Yunhe Gao, Qilong Zhangli, Ligong Han, Xiaoxiao He, Zhaoyang Xia, Song Wen, Qi Chang, Zhennan Yan, Mu Zhou, Dimitris Metaxas

    Abstract: Combining information from multi-view images is crucial to improve the performance and robustness of automated methods for disease diagnosis. However, due to the non-alignment characteristics of multi-view images, building correlation and data fusion across views largely remain an open problem. In this study, we present TransFusion, a Transformer-based architecture to merge divergent multi-view im…

    Submitted 5 September, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  10. arXiv:2203.00131

    eess.IV cs.CV

    A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark

    Authors: Yunhe Gao, Mu Zhou, Di Liu, Zhennan Yan, Shaoting Zhang, Dimitris N. Metaxas

    Abstract: Transformers have demonstrated remarkable performance in natural language processing and computer vision. However, existing vision Transformers struggle to learn from limited medical data and are unable to generalize on diverse medical image tasks. To tackle these challenges, we present MedFormer, a data-scalable Transformer designed for generalizable 3D medical image segmentation. Our approach in…

    Submitted 4 April, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

  11. arXiv:2202.08916

    eess.IV cs.AI cs.CV cs.LG

    Graph Convolutional Networks for Multi-modality Medical Imaging: Methods, Architectures, and Clinical Applications

    Authors: Kexin Ding, Mu Zhou, Zichen Wang, Qiao Liu, Corey W. Arnold, Shaoting Zhang, Dimitri N. Metaxas

    Abstract: Image-based characterization and disease understanding involve integrative analysis of morphological, spatial, and topological information across biological scales. The development of graph convolutional networks (GCNs) has created the opportunity to address this information complexity via graph-driven architectures, since GCNs can perform feature aggregation, interaction, and reasoning with remar…

    Submitted 20 April, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

  12. Modality Bank: Learn multi-modality images across data centers without sharing medical data

    Authors: Qi Chang, Hui Qu, Zhennan Yan, Yunhe Gao, Lohendran Baskaran, Dimitris Metaxas

    Abstract: Multi-modality images have been widely used and provide comprehensive information for medical image analysis. However, acquiring all modalities among all institutes is costly and often impossible in clinical settings. To leverage more comprehensive multi-modality information, we propose a privacy secured decentralized multi-modality adaptive learning architecture named ModalityBank. Our method cou…

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.08604

    Journal ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2022, pp. 4758-4763

  13. arXiv:2112.09760

    eess.IV cs.CV

    Learned Half-Quadratic Splitting Network for MR Image Reconstruction

    Authors: Bingyu Xin, Timothy S. Phan, Leon Axel, Dimitris N. Metaxas

    Abstract: Magnetic Resonance (MR) image reconstruction from highly undersampled $k$-space data is critical in accelerated MR imaging (MRI) techniques. In recent years, deep learning-based methods have shown great potential in this task. This paper proposes a learned half-quadratic splitting algorithm for MR image reconstruction and implements the algorithm in an unrolled deep learning network architecture.…

    Submitted 23 August, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: accepted for MIDL2022

  14. WORD: A large scale dataset, benchmark and clinical applicable study for abdominal organ segmentation from CT image

    Authors: Xiangde Luo, Wenjun Liao, Jianghong Xiao, Jieneng Chen, Tao Song, Xiaofan Zhang, Kang Li, Dimitris N. Metaxas, Guotai Wang, Shaoting Zhang

    Abstract: Whole abdominal organ segmentation is important in diagnosing abdomen lesions, radiotherapy, and follow-up. However, oncologists' delineating all abdominal organs from 3D volumes is time-consuming and very expensive. Deep learning-based medical image segmentation has shown the potential to reduce manual delineation efforts, but it still requires a large-scale fine annotated dataset for training, a…

    Submitted 12 February, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted to Medical Image Analysis, dataset at: https://github.com/HiLab-git/WORD (we corrected the results or description in this version.)

  15. arXiv:2110.08718

    cs.CV eess.IV

    AE-StyleGAN: Improved Training of Style-Based Auto-Encoders

    Authors: Ligong Han, Sri Harsha Musunuri, Martin Renqiang Min, Ruijiang Gao, Yu Tian, Dimitris Metaxas

    Abstract: StyleGANs have shown impressive results on data generation and manipulation in recent years, thanks to its disentangled style latent space. A lot of efforts have been made in inverting a pretrained generator, where an encoder is trained ad hoc after the generator is trained in a two-stage fashion. In this paper, we focus on style-based generators asking a scientific question: Does forcing such a g…

    Submitted 17 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV-22

  16. Semi-Supervised Segmentation of Radiation-Induced Pulmonary Fibrosis from Lung CT Scans with Multi-Scale Guided Dense Attention

    Authors: Guotai Wang, Shuwei Zhai, Giovanni Lasio, Baoshe Zhang, Byong Yi, Shifeng Chen, Thomas J. Macvittie, Dimitris Metaxas, Jinghao Zhou, Shaoting Zhang

    Abstract: Computed Tomography (CT) plays an important role in monitoring radiation-induced Pulmonary Fibrosis (PF), where accurate segmentation of the PF lesions is highly desired for diagnosis and treatment follow-up. However, the task is challenged by ambiguous boundary, irregular shape, various position and size of the lesions, as well as the difficulty in acquiring a large set of annotated volumetric im…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 12 pages, 9 figures. Submitted to IEEE TMI

  17. FocusNetv2: Imbalanced Large and Small Organ Segmentation with Adversarial Shape Constraint for Head and Neck CT Images

    Authors: Yunhe Gao, Rui Huang, Yiwei Yang, Jie Zhang, Kainan Shao, Changjuan Tao, Yuanyuan Chen, Dimitris N. Metaxas, Hongsheng Li, Ming Chen

    Abstract: Radiotherapy is a treatment where radiation is used to eliminate cancer cells. The delineation of organs-at-risk (OARs) is a vital step in radiotherapy treatment planning to avoid damage to healthy organs. For nasopharyngeal cancer, more than 20 OARs are needed to be precisely segmented in advance. The challenge of this task lies in complex anatomical structure, low-contrast organ contours, and th…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: Accepted by Medical Image Analysis

  18. arXiv:2103.16493

    cs.CV eess.IV

    Enabling Data Diversity: Efficient Automatic Augmentation via Regularized Adversarial Training

    Authors: Yunhe Gao, Zhiqiang Tang, Mu Zhou, Dimitris Metaxas

    Abstract: Data augmentation has proved extremely useful by increasing training data variance to alleviate overfitting and improve deep neural networks' generalization performance. In medical image analysis, a well-designed augmentation policy usually requires much expert knowledge and is difficult to generalize to multiple tasks due to the vast discrepancies among pixel intensities, image appearances, and o…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted by IPMI 2021

  19. arXiv:2103.03761

    eess.IV cs.CV

    Liver Fibrosis and NAS scoring from CT images using self-supervised learning and texture encoding

    Authors: Ananya Jana, Hui Qu, Carlos D. Minacapelli, Carolyn Catalano, Vinod Rustgi, Dimitris Metaxas

    Abstract: Non-alcoholic fatty liver disease (NAFLD) is one of the most common causes of chronic liver diseases (CLD) which can progress to liver cancer. The severity and treatment of NAFLD is determined by NAFLD Activity Scores (NAS) and liver fibrosis stage, which are usually obtained from liver biopsy. However, biopsy is invasive in nature and involves risk of procedural complications. Current methods to p…

    Submitted 15 March, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 5 pages, 2 figures, accepted at ISBI 2021, code at this URL: https://github.com/ananyajana/fibrosis_code

  20. arXiv:2009.10687

    eess.IV cs.CV

    Deep Learning based NAS Score and Fibrosis Stage Prediction from CT and Pathology Data

    Authors: Ananya Jana, Hui Qu, Puru Rattan, Carlos D. Minacapelli, Vinod Rustgi, Dimitris Metaxas

    Abstract: Non-Alcoholic Fatty Liver Disease (NAFLD) is becoming increasingly prevalent in the world population. Without diagnosis at the right time, NAFLD can lead to non-alcoholic steatohepatitis (NASH) and subsequent liver damage. The diagnosis and treatment of NAFLD depend on the NAFLD activity score (NAS) and the liver fibrosis stage, which are usually evaluated from liver biopsies by pathologists. In t…

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 6 pages, 3 figures. Accepted in IEEE BIBE 2020

  21. arXiv:2008.11109

    eess.IV cs.CV cs.LG

    Measure Anatomical Thickness from Cardiac MRI with Deep Neural Networks

    Authors: Qiaoying Huang, Eric Z. Chen, Hanchao Yu, Yimo Guo, Terrence Chen, Dimitris Metaxas, Shanhui Sun

    Abstract: Accurate estimation of shape thickness from medical images is crucial in clinical applications. For example, the thickness of myocardium is one of the keys to cardiac disease diagnosis. While mathematical models are available to obtain accurate dense thickness estimation, they suffer from heavy computational overhead due to iterative solvers. To this end, we propose novel methods for dense thicknes…

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Accepted by STACOM 2020

  22. arXiv:2008.08248

    eess.IV cs.CV cs.LG

    Enhanced MRI Reconstruction Network using Neural Architecture Search

    Authors: Qiaoying Huang, Dong Yang, Yikun Xian, Pengxiang Wu, Jingru Yi, Hui Qu, Dimitris Metaxas

    Abstract: The accurate reconstruction of under-sampled magnetic resonance imaging (MRI) data using modern deep learning technology, requires significant effort to design the necessary complex neural network architectures. The cascaded network architecture for MRI reconstruction has been widely used, while it suffers from the "vanishing gradient" problem when the network becomes deep. In addition, homogeneou…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: 10 pages. Code will be released soon

  23. arXiv:2007.09221

    cs.CV eess.IV

    Learn distributed GAN with Temporary Discriminators

    Authors: Hui Qu, Yikai Zhang, Qi Chang, Zhennan Yan, Chao Chen, Dimitris Metaxas

    Abstract: In this work, we propose a method for training distributed GAN with sequential temporary discriminators. Our proposed method tackles the challenge of training GAN in the federated learning manner: How to update the generator with a flow of temporary discriminators? We apply our proposed method to learn a self-adaptive generator with a series of local discriminators from multiple data centers. We s…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted by ECCV2020. Code: https://github.com/huiqu18/TDGAN-PyTorch

  24. arXiv:2006.00080

    eess.IV cs.CV cs.LG

    Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data

    Authors: Qi Chang, Hui Qu, Yikai Zhang, Mert Sabuncu, Chao Chen, Tong Zhang, Dimitris Metaxas

    Abstract: In this paper, we propose a data privacy-preserving and communication efficient distributed GAN learning framework named Distributed Asynchronized Discriminator GAN (AsynDGAN). Our proposed framework aims to train a central generator learns from distributed discriminator, and use the generated synthetic image solely to train the segmentation model. We validate the proposed framework on the applicat…

    Submitted 14 June, 2020; v1 submitted 29 May, 2020; originally announced June 2020.

    Journal ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 13856-13866

  25. arXiv:2001.03187

    eess.IV cs.CV

    Vertebra-Focused Landmark Detection for Scoliosis Assessment

    Authors: Jingru Yi, Pengxiang Wu, Qiaoying Huang, Hui Qu, Dimitris N. Metaxas

    Abstract: Adolescent idiopathic scoliosis (AIS) is a lifetime disease that arises in children. Accurate estimation of Cobb angles of the scoliosis is essential for clinicians to make diagnosis and treatment decisions. The Cobb angles are measured according to the vertebrae landmarks. Existing regression-based methods for the vertebra landmark detection typically suffer from large dense mapping parameters an…

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: Accepted to ISBI2020

  26. arXiv:1908.04469

    eess.IV cs.CV cs.LG cs.MA

    Collaborative Multi-agent Learning for MR Knee Articular Cartilage Segmentation

    Authors: Chaowei Tan, Zhennan Yan, Shaoting Zhang, Kang Li, Dimitris N. Metaxas

    Abstract: The 3D morphology and quantitative assessment of knee articular cartilages (i.e., femoral, tibial, and patellar cartilage) in magnetic resonance (MR) imaging is of great importance for knee radiographic osteoarthritis (OA) diagnostic decision making. However, effective and efficient delineation of all the knee articular cartilages in large-sized and high-resolution 3D MR knee data is still an open…

    Submitted 12 August, 2019; originally announced August 2019.