Search | arXiv e-print repository

doi 10.1117/12.2652947

A slice classification neural network for automated classification of axial PET/CT slices from a multi-centric lymphoma dataset

Authors: Shadab Ahamed, Yixi Xu, Ingrid Bloise, Joo H. O, Carlos F. Uribe, Rahul Dodhia, Juan L. Ferres, Arman Rahmim

Abstract: Automated slice classification is clinically relevant since it can be incorporated into medical image segmentation workflows as a preprocessing step that would flag slices with a higher probability of containing tumors, thereby directing physicians attention to the important slices. In this work, we train a ResNet-18 network to classify axial slices of lymphoma PET/CT images (collected from two in… ▽ More Automated slice classification is clinically relevant since it can be incorporated into medical image segmentation workflows as a preprocessing step that would flag slices with a higher probability of containing tumors, thereby directing physicians attention to the important slices. In this work, we train a ResNet-18 network to classify axial slices of lymphoma PET/CT images (collected from two institutions) depending on whether the slice intercepted a tumor (positive slice) in the 3D image or if the slice did not (negative slice). Various instances of the network were trained on 2D axial datasets created in different ways: (i) slice-level split and (ii) patient-level split; inputs of different types were used: (i) only PET slices and (ii) concatenated PET and CT slices; and different training strategies were employed: (i) center-aware (CAW) and (ii) center-agnostic (CAG). Model performances were compared using the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC), and various binary classification metrics. We observe and describe a performance overestimation in the case of slice-level split as compared to the patient-level split training. The model trained using patient-level split data with the network input containing only PET slices in the CAG training regime was the best performing/generalizing model on a majority of metrics. Our models were additionally more closely compared using the sensitivity metric on the positive slices from their respective test sets. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: 10 pages, 6 figures, 2 tables

Journal ref: Proc. SPIE 12464, Medical Imaging 2023: Image Processing, 124641Q (3 April 2023)

arXiv:2403.07092 [pdf, other]

doi 10.1117/12.2612684

A cascaded deep network for automated tumor detection and segmentation in clinical PET imaging of diffuse large B-cell lymphoma

Authors: Shadab Ahamed, Natalia Dubljevic, Ingrid Bloise, Claire Gowdy, Patrick Martineau, Don Wilson, Carlos F. Uribe, Arman Rahmim, Fereshteh Yousefirizi

Abstract: Accurate detection and segmentation of diffuse large B-cell lymphoma (DLBCL) from PET images has important implications for estimation of total metabolic tumor volume, radiomics analysis, surgical intervention and radiotherapy. Manual segmentation of tumors in whole-body PET images is time-consuming, labor-intensive and operator-dependent. In this work, we develop and validate a fast and efficient… ▽ More Accurate detection and segmentation of diffuse large B-cell lymphoma (DLBCL) from PET images has important implications for estimation of total metabolic tumor volume, radiomics analysis, surgical intervention and radiotherapy. Manual segmentation of tumors in whole-body PET images is time-consuming, labor-intensive and operator-dependent. In this work, we develop and validate a fast and efficient three-step cascaded deep learning model for automated detection and segmentation of DLBCL tumors from PET images. As compared to a single end-to-end network for segmentation of tumors in whole-body PET images, our three-step model is more effective (improves 3D Dice score from 58.9% to 78.1%) since each of its specialized modules, namely the slice classifier, the tumor detector and the tumor segmentor, can be trained independently to a high degree of skill to carry out a specific task, rather than a single network with suboptimal performance on overall segmentation. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: 8 pages, 3 figures, 3 tables

Journal ref: Proc. SPIE 12032, Medical Imaging 2022: Image Processing, 120323M (4 April 2022)

arXiv:2309.01977 [pdf, other]

PyTomography: A Python Library for Quantitative Medical Image Reconstruction

Authors: Lucas Polson, Roberto Fedrigo, Chenguang Li, Maziar Sabouri, Obed Dzikunu, Shadab Ahamed, Nikolaos Karakatsanis, Arman Rahmim, Carlos Uribe

Abstract: There is a need for open-source libraries in emission tomography that (i) use modern and popular backend code to encourage community contributions and (ii) offer support for the multitude of reconstruction algorithms available in recent literature, such as those that employ artificial intelligence. The purpose of this research was to create and evaluate a GPU-accelerated, open-source, and user-fri… ▽ More There is a need for open-source libraries in emission tomography that (i) use modern and popular backend code to encourage community contributions and (ii) offer support for the multitude of reconstruction algorithms available in recent literature, such as those that employ artificial intelligence. The purpose of this research was to create and evaluate a GPU-accelerated, open-source, and user-friendly image reconstruction library, designed to serve as a central platform for the development, validation, and deployment of various tomographic reconstruction algorithms. PyTomography was developed using Python and inherits the GPU-accelerated functionality of PyTorch and parallelproj for fast computations. Its flexible and modular design decouples system matrices, likelihoods, and reconstruction algorithms, simplifying the process of integrating new imaging modalities using various python tools. Example use cases demonstrate the software capabilities in parallel hole SPECT and listmode PET imaging. Overall, we have developed and publicly share PyTomography, a highly optimized and user-friendly software for medical image reconstruction, with a class hierarchy that fosters the development of novel imaging applications. △ Less

Submitted 7 July, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: 27 pages, 7 figures

arXiv:2305.09627 [pdf, other]

Addressing computational challenges in physical system simulations with machine learning

Authors: Sabber Ahamed, Md Mesbah Uddin

Abstract: In this paper, we present a machine learning-based data generator framework tailored to aid researchers who utilize simulations to examine various physical systems or processes. High computational costs and the resulting limited data often pose significant challenges to gaining insights into these systems or processes. Our approach involves a two-step process: initially, we train a supervised pred… ▽ More In this paper, we present a machine learning-based data generator framework tailored to aid researchers who utilize simulations to examine various physical systems or processes. High computational costs and the resulting limited data often pose significant challenges to gaining insights into these systems or processes. Our approach involves a two-step process: initially, we train a supervised predictive model using a limited simulated dataset to predict simulation outcomes. Subsequently, a reinforcement learning agent is trained to generate accurate, simulation-like data by leveraging the supervised model. With this framework, researchers can generate more accurate data and know the outcomes without running high computational simulations, which enables them to explore the parameter space more efficiently and gain deeper insights into physical systems or processes. We demonstrate the effectiveness of the proposed framework by applying it to two case studies, one focusing on earthquake rupture physics and the other on new material development. △ Less

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2208.00274 [pdf]

Convolutional neural network with a hybrid loss function for fully automated segmentation of lymphoma lesions in FDG PET images

Authors: Fereshteh Yousefirizi, Natalia Dubljevic, Shadab Ahamed, Ingrid Bloise, Claire Gowdy, Joo Hyun O, Youssef Farag, Rodrigue de Schaetzen, Patrick Martineau, Don Wilson, Carlos F. Uribe, Arman Rahmim

Abstract: Segmentation of lymphoma lesions is challenging due to their varied sizes and locations in whole-body PET scans. This work presents a fully-automated segmentation technique using a multi-center dataset of diffuse large B-cell lymphoma (DLBCL) with heterogeneous characteristics. We utilized a dataset of [18F]FDG-PET scans (n=194) from two different imaging centers, including cases with primary medi… ▽ More Segmentation of lymphoma lesions is challenging due to their varied sizes and locations in whole-body PET scans. This work presents a fully-automated segmentation technique using a multi-center dataset of diffuse large B-cell lymphoma (DLBCL) with heterogeneous characteristics. We utilized a dataset of [18F]FDG-PET scans (n=194) from two different imaging centers, including cases with primary mediastinal large B-cell lymphoma (PMBCL) (n=104). Automated brain and bladder removal approaches were utilized as preprocessing steps to tackle false positives caused by normal hypermetabolic uptake in these organs. Our segmentation model is a convolutional neural network (CNN) based on a 3D U-Net architecture that includes squeeze and excitation (SE) modules. Hybrid distribution, region, and boundary-based losses (Unified Focal and Mumford-Shah (MS)) were utilized that showed the best performance compared to other combinations (p<0.05). Cross-validation between different centers, DLBCL and PMBCL cases, and three random splits were applied on train/validation data. The ensemble of these six models achieved a Dice similarity coefficient (DSC) of 0.77 +- 0.08 and Hausdorff distance (HD) of 16.5 +-12.5. Our 3D U-net model with SE modules for segmentation with hybrid loss performed significantly better (p<0.05) as compared to the 3D U-Net (without SE modules) using the same loss function (Unified Focal and MS loss) (DSC= 0.64 +-0.21 and HD= 26.3 +- 18.7). Our model can facilitate a fully automated quantification pipeline in a multi-center context that opens the possibility for routine reporting of total metabolic tumor volume (TMTV) and other metrics shown useful for the management of lymphoma. △ Less

Submitted 10 August, 2022; v1 submitted 30 July, 2022; originally announced August 2022.

arXiv:2004.02734 [pdf, other]

Exploring Basement Surface relationship of north-west Bengal Basin using satellite images and tectonic modeling

Authors: Sabber Ahamed, Delwar Hossain, Jahangir Alam

Abstract: The Bengal basin is one of the thickest sedimentary basins and is being constantly affected by the collision of the Indian plate with the Burma and Tibetan plates. The northwest part of the basin, our study area, is one of the least explored areas where the shallowest faulted basement is present. Controversies exist about the origin of the basement and its role to the formation of surface landform… ▽ More The Bengal basin is one of the thickest sedimentary basins and is being constantly affected by the collision of the Indian plate with the Burma and Tibetan plates. The northwest part of the basin, our study area, is one of the least explored areas where the shallowest faulted basement is present. Controversies exist about the origin of the basement and its role to the formation of surface landforms. We analyze satellite images, Bouguer anomaly data, and develop a geodynamic model to explore the relationship between the faulted basement and surface landforms. Satellite images and gravity anomalies show a spatial correlation between the surface topography and basement fault structures. The elevated tracts and the low-lying flood plains are located on top of the gravity highs (horsts) and lows (grabens). The geodynamic model suggests that conjugate thrust faults may exist beneath the horsts that push the horst block upward. Our observations suggest the regional compression and basement faults have a more considerable influence on the development of surface landforms such as the uplifted tracts and the low-lying flood plains. △ Less

Submitted 6 April, 2020; originally announced April 2020.

arXiv:1911.09660 [pdf, other]

Estimating uncertainty of earthquake rupture using Bayesian neural network

Authors: Sabber Ahamed, Md Mesbah Uddin

Abstract: Bayesian neural networks (BNN) are the probabilistic model that combines the strengths of both neural network (NN) and stochastic processes. As a result, BNN can combat overfitting and perform well in applications where data is limited. Earthquake rupture study is such a problem where data is insufficient, and scientists have to rely on many trial and error numerical or physical models. Lack of re… ▽ More Bayesian neural networks (BNN) are the probabilistic model that combines the strengths of both neural network (NN) and stochastic processes. As a result, BNN can combat overfitting and perform well in applications where data is limited. Earthquake rupture study is such a problem where data is insufficient, and scientists have to rely on many trial and error numerical or physical models. Lack of resources and computational expenses, often, it becomes hard to determine the reasons behind the earthquake rupture. In this work, a BNN has been used (1) to combat the small data problem and (2) to find out the parameter combinations responsible for earthquake rupture and (3) to estimate the uncertainty associated with earthquake rupture. Two thousand rupture simulations are used to train and test the model. A simple 2D rupture geometry is considered where the fault has a Gaussian geometric heterogeneity at the center, and eight parameters vary in each simulation. The test F1-score of BNN (0.8334), which is 2.34% higher than plain NN score. Results show that the parameters of rupture propagation have higher uncertainty than the rupture arrest. Normal stresses play a vital role in determining rupture propagation and are also the highest source of uncertainty, followed by the dynamic friction coefficient. Shear stress has a moderate role, whereas the geometric features such as the width and height of the fault are least significant and uncertain. △ Less

Submitted 11 April, 2023; v1 submitted 21 November, 2019; originally announced November 2019.

arXiv:1906.06250 [pdf, other]

Machine Learning Approach to Earthquake Rupture Dynamics

Authors: Sabber Ahamed, Eric G. Daub

Abstract: Simulating dynamic rupture propagation is challenging due to the uncertainties involved in the underlying physics of fault slip, stress conditions, and frictional properties of the fault. A trial and error approach is often used to determine the unknown parameters describing rupture, but running many simulations usually requires human review to determine how to adjust parameter values and is thus… ▽ More Simulating dynamic rupture propagation is challenging due to the uncertainties involved in the underlying physics of fault slip, stress conditions, and frictional properties of the fault. A trial and error approach is often used to determine the unknown parameters describing rupture, but running many simulations usually requires human review to determine how to adjust parameter values and is thus not very efficient. To reduce the computational cost and improve our ability to determine reasonable stress and friction parameters, we take advantage of the machine learning approach. We develop two models for earthquake rupture propagation using the artificial neural network (ANN) and the random forest (RF) algorithms to predict if a rupture can break a geometric heterogeneity on a fault. We train the models using a database of 1600 dynamic rupture simulations computed numerically. Fault geometry, stress conditions, and friction parameters vary in each simulation. We cross-validate and test the predictive power of the models using an additional 400 simulated ruptures, respectively. Both RF and ANN models predict rupture propagation with more than 81% accuracy, and model parameters can be used to infer the underlying factors most important for rupture propagation. Both of the models are computationally efficient such that the 400 testings require a fraction of a second, leading to potential applications of dynamic rupture that have previously not been possible due to the computational demands of physics-based rupture simulations. △ Less

Submitted 14 June, 2019; originally announced June 2019.

arXiv:1812.09849 [pdf, ps, other]

Incorporating Deformation Energetics in Long-Term Tectonic Modeling

Authors: Sabber Ahamed, Eunseo Choi

Abstract: The deformation-related energy budget is usually considered in the simplest form or even completely omitted from the energy balance equation. We derive a full energy balance equation that accounts not only for heat energy but also for mechanical (elastic, plastic and viscous) work. The derived equation is implemented in DES3D, an unstructured finite element solver for long-term tectonic deformatio… ▽ More The deformation-related energy budget is usually considered in the simplest form or even completely omitted from the energy balance equation. We derive a full energy balance equation that accounts not only for heat energy but also for mechanical (elastic, plastic and viscous) work. The derived equation is implemented in DES3D, an unstructured finite element solver for long-term tectonic deformation. We verify the implementation by comparing numerical solutions to the corresponding semi-analytic solutions in three benchmarks extended from the classical oedometer test. Two of the benchmarks are designed to evaluate the temperature change in a Mohr-Coulomb elasto-plastic square governed by a simplified equation involving plastic power only and by the full temperature evolution equation, respectively. The third benchmark differs in that it computes thermal stresses associated with a prescribed uniform temperature increase. All the solutions from DES3D show relative error less than 0.1%. We also investigate the long-term effects of deformation energetics on the evolution of large offset normal faults. We find that the models considering the full energy balance equation tend to produce more secondary faults and an elongated core complex. Our results for the normal fault system confirm that persistent inelastic deformation has a significant impact on the long-term evolution of faults, motivating further exploration of the role of the full energy balance equation in other geodynamic systems. △ Less

Submitted 24 December, 2018; originally announced December 2018.

Showing 1–9 of 9 results for author: Ahamed, S