-
DALSA: Domain Adaptation for Supervised Learning From Sparsely Annotated MR Images
Authors:
Michael Götz,
Christian Weber,
Franciszek Binczyk,
Joanna Polanska,
Rafal Tarnawski,
Barbara Bobek-Billewicz,
Ullrich Köthe,
Jens Kleesiek,
Bram Stieltjes,
Klaus H. Maier-Hein
Abstract:
We propose a new method that employs transfer learning techniques to effectively correct sampling selection errors introduced by sparse annotations during supervised learning for automated tumor segmentation. The practicality of current learning-based automated tissue classification approaches is severely impeded by their dependency on manually segmented training databases that need to be recreate…
▽ More
We propose a new method that employs transfer learning techniques to effectively correct sampling selection errors introduced by sparse annotations during supervised learning for automated tumor segmentation. The practicality of current learning-based automated tissue classification approaches is severely impeded by their dependency on manually segmented training databases that need to be recreated for each scenario of application, site, or acquisition setup. The comprehensive annotation of reference datasets can be highly labor-intensive, complex, and error-prone. The proposed method derives high-quality classifiers for the different tissue classes from sparse and unambiguous annotations and employs domain adaptation techniques for effectively correcting sampling selection errors introduced by the sparse sampling. The new approach is validated on labeled, multi-modal MR images of 19 patients with malignant gliomas and by comparative analysis on the BraTS 2013 challenge data sets. Compared to training on fully labeled data, we reduced the time for labeling and training by a factor greater than 70 and 180 respectively without sacrificing accuracy. This dramatically eases the establishment and constant extension of large annotated databases in various scenarios and imaging setups and thus represents an important step towards practical applicability of learning-based approaches in tissue classification.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Combining low-dose CT-based radiomics and metabolomics for early lung cancer screening support
Authors:
Joanna Zyla,
Michal Marczyk,
Wojciech Prazuch,
Marek Socha,
Aleksandra Suwalska,
Agata Durawa,
Malgorzata Jelitto-Gorska,
Katarzyna Dziadziuszko,
Edyta Szurowska,
Witold Rzyman,
Piotr Widlak,
Joanna Polanska
Abstract:
Due to its predominantly asymptomatic or mildly symptomatic progression, lung cancer is often diagnosed in advanced stages, resulting in poorer survival rates for patients. As with other cancers, early detection significantly improves the chances of successful treatment. Early diagnosis can be facilitated through screening programs designed to detect lung tissue tumors when they are still small, t…
▽ More
Due to its predominantly asymptomatic or mildly symptomatic progression, lung cancer is often diagnosed in advanced stages, resulting in poorer survival rates for patients. As with other cancers, early detection significantly improves the chances of successful treatment. Early diagnosis can be facilitated through screening programs designed to detect lung tissue tumors when they are still small, typically around 3mm in size. However, the analysis of extensive screening program data is hampered by limited access to medical experts. In this study, we developed a procedure for identifying potential malignant neoplastic lesions within lung parenchyma. The system leverages machine learning (ML) techniques applied to two types of measurements: low-dose Computed Tomography-based radiomics and metabolomics. Using data from two Polish screening programs, two ML algorithms were tested, along with various integration methods, to create a final model that combines both modalities to support lung cancer screening.
△ Less
Submitted 20 September, 2023;
originally announced November 2023.
-
BRONCO: Automated modelling of the bronchovascular bundle using the Computed Tomography Images
Authors:
Wojciech Prażuch,
Marek Socha,
Anna Mrukwa,
Aleksandra Suwalska,
Agata Durawa,
Malgorzata Jelitto-Górska,
Katarzyna Dziadziuszko,
Edyta Szurowska,
Pawel Bożek,
Michal Marczyk,
Witold Rzyman,
Joanna Polanska
Abstract:
Segmentation of the bronchovascular bundle within the lung parenchyma is a key step for the proper analysis and planning of many pulmonary diseases. It might also be considered the preprocessing step when the goal is to segment the nodules from the lung parenchyma. We propose a segmentation pipeline for the bronchovascular bundle based on the Computed Tomography images, returning either binary or…
▽ More
Segmentation of the bronchovascular bundle within the lung parenchyma is a key step for the proper analysis and planning of many pulmonary diseases. It might also be considered the preprocessing step when the goal is to segment the nodules from the lung parenchyma. We propose a segmentation pipeline for the bronchovascular bundle based on the Computed Tomography images, returning either binary or labelled masks of vessels and bronchi situated in the lung parenchyma. The method consists of two modules, modeling of the bronchial tree and vessels. The core revolves around a similar pipeline, the determination of the initial perimeter by the GMM method, skeletonization, and hierarchical analysis of the created graph. We tested our method on both low-dose CT and standard-dose CT, with various pathologies, reconstructed with various slice thicknesses, and acquired from various machines. We conclude that the method is invariant with respect to the origin and parameters of the CT series. Our pipeline is best suited for studies with healthy patients, patients with lung nodules, and patients with emphysema.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
Isotopic envelope identification by analysis of the spatial distribution of components in MALDI-MSI data
Authors:
Anna Glodek,
Joanna Polańska,
Marta Gawin
Abstract:
One of the significant steps in the process leading to the identification of proteins is mass spectrometry, which allows for obtaining information about the structure of proteins. Removing isotope peaks from the mass spectrum is vital and it is done in a process called deisoto**. There are different algorithms for deisoto**, but they have their limitations, they are dedicated to different meth…
▽ More
One of the significant steps in the process leading to the identification of proteins is mass spectrometry, which allows for obtaining information about the structure of proteins. Removing isotope peaks from the mass spectrum is vital and it is done in a process called deisoto**. There are different algorithms for deisoto**, but they have their limitations, they are dedicated to different methods of mass spectrometry. Data from experiments performed with the MALDI-ToF technique are characterized by high dimensionality. This paper presents a method for identifying isotope envelopes in MALDI-ToF molecular imaging data based on the Mamdani-Assilan fuzzy system and spatial maps of the molecular distribution of peaks included in the isotopic envelope. Several image texture measures were used to evaluate spatial molecular distribution maps. The algorithm was tested on eight datasets obtained from the MALDI-ToF experiment on samples from the National Institute of Oncology in Gliwice from patients with cancer of the head and neck region. The data were subjected to pre-processing and feature extraction. The results were collected and compared with three existing deisoto** algorithms. The analysis of the obtained results showed that the method for identifying isotopic envelopes proposed in this paper enables the detection of overlap** envelopes by using the approach oriented to study peak pairs. Moreover, the proposed algorithm enables the analysis of large data sets.
△ Less
Submitted 14 February, 2023; v1 submitted 12 February, 2023;
originally announced February 2023.
-
POLCOVID: a multicenter multiclass chest X-ray database (Poland, 2020-2021)
Authors:
Aleksandra Suwalska,
Joanna Tobiasz,
Wojciech Prazuch,
Marek Socha,
Pawel Foszner,
Damian Piotrowski,
Katarzyna Gruszczynska,
Magdalena Sliwinska,
Jerzy Walecki,
Tadeusz Popiela,
Grzegorz Przybylski,
Mateusz Nowak,
Piotr Fiedor,
Malgorzata Pawlowska,
Robert Flisiak,
Krzysztof Simon,
Gabriela Zapolska,
Barbara Gizycka,
Edyta Szurowska,
POLCOVID Study Group,
Michal Marczyk,
Andrzej Cieszanowski,
Joanna Polanska
Abstract:
The outbreak of the SARS-CoV-2 pandemic has put healthcare systems worldwide to their limits, resulting in increased waiting time for diagnosis and required medical assistance. With chest radiographs (CXR) being one of the most common COVID-19 diagnosis methods, many artificial intelligence tools for image-based COVID-19 detection have been developed, often trained on a small number of images from…
▽ More
The outbreak of the SARS-CoV-2 pandemic has put healthcare systems worldwide to their limits, resulting in increased waiting time for diagnosis and required medical assistance. With chest radiographs (CXR) being one of the most common COVID-19 diagnosis methods, many artificial intelligence tools for image-based COVID-19 detection have been developed, often trained on a small number of images from COVID-19-positive patients. Thus, the need for high-quality and well-annotated CXR image databases increased. This paper introduces POLCOVID dataset, containing chest X-ray (CXR) images of patients with COVID-19 or other-type pneumonia, and healthy individuals gathered from 15 Polish hospitals. The original radiographs are accompanied by the preprocessed images limited to the lung area and the corresponding lung masks obtained with the segmentation model. Moreover, the manually created lung masks are provided for a part of POLCOVID dataset and the other four publicly available CXR image collections. POLCOVID dataset can help in pneumonia or COVID-19 diagnosis, while the set of matched images and lung masks may serve for the development of lung segmentation solutions.
△ Less
Submitted 15 December, 2022; v1 submitted 29 November, 2022;
originally announced November 2022.
-
CIRCA: comprehensible online system in support of chest X-rays-based COVID-19 diagnosis
Authors:
Wojciech Prazuch,
Aleksandra Suwalska,
Marek Socha,
Joanna Tobiasz,
Pawel Foszner,
Jerzy Jaroszewicz,
Katarzyna Gruszczynska,
Magdalena Sliwinska,
Jerzy Walecki,
Tadeusz Popiela,
Grzegorz Przybylski,
Andrzej Cieszanowski,
Mateusz Nowak,
Malgorzata Pawlowska,
Robert Flisiak,
Krzysztof Simon,
Gabriela Zapolska,
Barbara Gizycka,
Edyta Szurowska,
POLCOVID Study Group,
Michal Marczyk,
Joanna Polanska
Abstract:
Due to the large accumulation of patients requiring hospitalization, the COVID-19 pandemic disease caused a high overload of health systems, even in developed countries. Deep learning techniques based on medical imaging data can help in the faster detection of COVID-19 cases and monitoring of disease progression. Regardless of the numerous proposed solutions for lung X-rays, none of them is a prod…
▽ More
Due to the large accumulation of patients requiring hospitalization, the COVID-19 pandemic disease caused a high overload of health systems, even in developed countries. Deep learning techniques based on medical imaging data can help in the faster detection of COVID-19 cases and monitoring of disease progression. Regardless of the numerous proposed solutions for lung X-rays, none of them is a product that can be used in the clinic. Five different datasets (POLCOVID, AIforCOVID, COVIDx, NIH, and artificially generated data) were used to construct a representative dataset of 23 799 CXRs for model training; 1 050 images were used as a hold-out test set, and 44 247 as independent test set (BIMCV database). A U-Net-based model was developed to identify a clinically relevant region of the CXR. Each image class (normal, pneumonia, and COVID-19) was divided into 3 subtypes using a 2D Gaussian mixture model. A decision tree was used to aggregate predictions from the InceptionV3 network based on processed CXRs and a dense neural network on radiomic features. The lung segmentation model gave the Sorensen-Dice coefficient of 94.86% in the validation dataset, and 93.36% in the testing dataset. In 5-fold cross-validation, the accuracy for all classes ranged from 91% to 93%, kee** slightly higher specificity than sensitivity and NPV than PPV. In the hold-out test set, the balanced accuracy ranged between 68% and 100%. The highest performance was obtained for the subtypes N1, P1, and C1. A similar performance was obtained on the independent dataset for normal and COVID-19 class subtypes. Seventy-six percent of COVID-19 patients wrongly classified as normal cases were annotated by radiologists as with no signs of disease. Finally, we developed the online service (https://circa.aei.polsl.pl) to provide access to fast diagnosis support tools.
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
Classification supporting COVID-19 diagnostics based on patient survey data
Authors:
Joanna Henzel,
Joanna Tobiasz,
Michał Kozielski,
Małgorzata Bach,
Paweł Foszner,
Aleksandra Gruca,
Mateusz Kania,
Justyna Mika,
Anna Papiez,
Aleksandra Werner,
Joanna Zyla,
Jerzy Jaroszewicz,
Joanna Polanska,
Marek Sikora
Abstract:
Distinguishing COVID-19 from other flu-like illnesses can be difficult due to ambiguous symptoms and still an initial experience of doctors. Whereas, it is crucial to filter out those sick patients who do not need to be tested for SARS-CoV-2 infection, especially in the event of the overwhelming increase in disease. As a part of the presented research, logistic regression and XGBoost classifiers,…
▽ More
Distinguishing COVID-19 from other flu-like illnesses can be difficult due to ambiguous symptoms and still an initial experience of doctors. Whereas, it is crucial to filter out those sick patients who do not need to be tested for SARS-CoV-2 infection, especially in the event of the overwhelming increase in disease. As a part of the presented research, logistic regression and XGBoost classifiers, that allow for effective screening of patients for COVID-19, were generated. Each of the methods was tuned to achieve an assumed acceptable threshold of negative predictive values during classification. Additionally, an explanation of the obtained classification models was presented. The explanation enables the users to understand what was the basis of the decision made by the model. The obtained classification models provided the basis for the DECODE service (decode.polsl.pl), which can serve as support in screening patients with COVID-19 disease. Moreover, the data set constituting the basis for the analyses performed is made available to the research community. This data set consisting of more than 3,000 examples is based on questionnaires collected at a hospital in Poland.
△ Less
Submitted 24 November, 2020;
originally announced November 2020.