-
Identification of pneumonia on chest x-ray images through machine learning
Authors:
Eduardo Augusto Roeder
Abstract:
Pneumonia is the leading infectious cause of infant death in the world. When it is identified early, the patient's prognosis can be altered, and imaging exams can help confirm the diagnosis. Performing and interpreting the exams as soon as possible is vital for good treatment, with the most common exam for this pathology being chest X-ray. The objective of this study was to develop software that identifies the presence or absence of pneumonia in chest radiographs. The software was developed as a computational model based on machine learning using the transfer learning technique. For the training process, images were collected from a database available online with children's chest X-ray images taken at a hospital in China. After training, the model was exposed to new images, achieving relevant results in identifying the pathology, reaching 98% sensitivity and 97.3% specificity for the sample used for testing. It can be concluded that it is possible to develop software that identifies pneumonia in chest X-ray images.
Submitted 21 September, 2023;
originally announced September 2023.
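The sensitivity and specificity quoted in this abstract are standard confusion-matrix ratios. The sketch below shows how they could be computed for a binary classifier. The function name and the toy labels are hypothetical and are not taken from the paper, whose evaluation code is not shown here.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, where 1 = pneumonia."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)  # fraction of true pneumonia cases detected
    specificity = tn / (tn + fp)  # fraction of healthy cases correctly cleared
    return sensitivity, specificity


# Hypothetical toy labels for illustration only (not the paper's test set).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Reporting both numbers matters in screening: sensitivity tracks missed cases, while specificity tracks false alarms.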
-
Biclustering random matrix partitions with an application to classification of forensic body fluids
Authors:
Chieh-Hsi Wu,
Amy D. Roeder,
Geoff K. Nicholls
Abstract:
Classification of unlabeled data is usually achieved by supervised learning from labeled samples. Although there exist many sophisticated supervised machine learning methods that can predict the missing labels with a high level of accuracy, they often lack the required transparency in situations where it is important to provide interpretable results and meaningful measures of confidence. Body fluid classification of forensic casework data is a case in point. We develop a new Biclustering Dirichlet Process for Class-assignment with Random Matrices (BDP-CaRMa), with a three-level hierarchy of clustering, and a model-based approach to classification that adapts to block structure in the data matrix. As the class labels of some observations are missing, the number of rows in the data matrix for each class is unknown. BDP-CaRMa handles this and extends existing biclustering methods by simultaneously biclustering multiple matrices each having a randomly variable number of rows. We demonstrate our method by applying it to the motivating problem, which is the classification of body fluids based on mRNA profiles taken from crime scenes. The analyses of casework-like data show that our method is interpretable and produces well-calibrated posterior probabilities. Our model can be more generally applied to other types of data with a similar structure to the forensic data.
Submitted 14 October, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.
-
Accurate Virus Identification with Interpretable Raman Signatures by Machine Learning
Authors:
Jiarong Ye,
Yin-Ting Yeh,
Yuan Xue,
Ziyang Wang,
Na Zhang,
He Liu,
Kunyan Zhang,
RyeAnne Ricker,
Zhuohang Yu,
Allison Roder,
Nestor Perea Lopez,
Lindsey Organtini,
Wallace Greene,
Susan Hafenstein,
Huaguang Lu,
Elodie Ghedin,
Mauricio Terrones,
Shengxi Huang,
Sharon Xiaolei Huang
Abstract:
Rapid identification of newly emerging or circulating viruses is an important first step toward managing the public health response to potential outbreaks. A portable virus capture device coupled with label-free Raman Spectroscopy holds the promise of fast detection by rapidly obtaining the Raman signature of a virus followed by a machine learning approach applied to recognize the virus based on its Raman spectrum, which is used as a fingerprint. We present such a machine learning approach for analyzing Raman spectra of human and avian viruses. A Convolutional Neural Network (CNN) classifier specifically designed for spectral data achieves very high accuracy for a variety of virus type or subtype identification tasks. In particular, it achieves 99% accuracy for classifying influenza virus type A vs. type B, 96% accuracy for classifying four subtypes of influenza A, 95% accuracy for differentiating enveloped and non-enveloped viruses, and 99% accuracy for differentiating avian coronavirus (infectious bronchitis virus, IBV) from other avian viruses. Furthermore, interpretation of neural net responses in the trained CNN model using a full-gradient algorithm highlights Raman spectral ranges that are most important to virus identification. By correlating ML-selected salient Raman ranges with the signature ranges of known biomolecules and chemical functional groups (for example, amide, amino acid, carboxylic acid), we verify that our ML model effectively recognizes the Raman signatures of proteins, lipids and other vital functional groups present in different viruses and uses a weighted combination of these signatures to identify viruses.
Submitted 5 June, 2022;
originally announced June 2022.
-
Unmanned Aerial Vehicle Forensic Investigation Process: Dji Phantom 3 Drone As A Case Study
Authors:
Alan Roder,
Kim-Kwang Raymon Choo,
Nhien-An Le-Khac
Abstract:
Drones (also known as Unmanned Aerial Vehicles, UAVs) are a potential source of evidence in a digital investigation, partly due to their increasing popularity in our society. However, existing UAV/drone forensics generally rely on conventional digital forensic investigation guidelines such as those of ACPO and NIST, which may not be entirely fit for purpose. In this paper, we identify the challenges associated with UAV/drone forensics. We then explore and evaluate existing forensic guidelines in terms of their effectiveness for UAV/drone forensic investigations. Next, we present our set of guidelines for UAV/drone investigations. Finally, we demonstrate how the proposed guidelines can be used to guide a drone forensic investigation using the DJI Phantom 3 drone as a case study.
Submitted 23 April, 2018;
originally announced April 2018.
-
Dark Matter Search Results from the PICO-60 C$_3$F$_8$ Bubble Chamber
Authors:
C. Amole,
M. Ardid,
I. J. Arnquist,
D. M. Asner,
D. Baxter,
E. Behnke,
P. Bhattacharjee,
H. Borsodi,
M. Bou-Cabo,
P. Campion,
G. Cao,
C. J. Chen,
U. Chowdhury,
K. Clark,
J. I. Collar,
P. S. Cooper,
M. Crisler,
G. Crowder,
C. E. Dahl,
M. Das,
S. Fallows,
J. Farine,
I. Felis,
R. Filgas,
F. Girard,
et al. (37 additional authors not shown)
Abstract:
New results are reported from the operation of the PICO-60 dark matter detector, a bubble chamber filled with 52 kg of C$_3$F$_8$ located in the SNOLAB underground laboratory. As in previous PICO bubble chambers, PICO-60 C$_3$F$_8$ exhibits excellent electron recoil and alpha decay rejection, and the observed multiple-scattering neutron rate indicates a single-scatter neutron background of less than 1 event per month. A blind analysis of an efficiency-corrected 1167-kg-day exposure at a 3.3-keV thermodynamic threshold reveals no single-scattering nuclear recoil candidates, consistent with the predicted background. These results set the most stringent direct-detection constraint to date on the WIMP-proton spin-dependent cross section at $3.4 \times 10^{-41}$ cm$^2$ for a 30-GeV c$^{-2}$ WIMP, an improvement of more than one order of magnitude over previous PICO results.
Submitted 2 August, 2017; v1 submitted 24 February, 2017;
originally announced February 2017.
-
Tessellations and Pattern Formation in Plant Growth and Development
Authors:
Bruce E Shapiro,
Henrik Jonsson,
Patrick Sahlin,
Marcus Heisler,
Adrienne Roeder,
Michael Burl,
Elliot M Meyerowitz,
Eric D Mjolsness
Abstract:
The shoot apical meristem (SAM) is a dome-shaped collection of cells at the apex of growing plants from which all above-ground tissue ultimately derives. In Arabidopsis thaliana (thale cress), a small flowering weed of the Brassicaceae family (related to mustard and cabbage), the SAM typically contains some three to five hundred cells that range from five to ten microns in diameter. These cells are organized into several distinct zones that maintain their topological and functional relationships throughout the life of the plant. As the plant grows, organs (primordia) form on its surface flanks in a phyllotactic pattern and develop into new shoots, leaves, and flowers. Cross-sections through the meristem reveal a pattern of polygonal tessellation that is suggestive of Voronoi diagrams derived from the centroids of cellular nuclei. In this chapter we examine some of the properties of these patterns within the meristem and explore the applicability of simple, standard mathematical models of their geometry.
Submitted 13 September, 2012;
originally announced September 2012.
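A Voronoi diagram assigns every point in the plane to its nearest generating site. The sketch below illustrates that rule on a small grid, treating nucleus centroids as the sites. The function names and the coordinates are hypothetical. This is a brute-force illustration of the definition, not the chapter's own model.

```python
import math


def voronoi_cell(point, sites):
    """Return the index of the site nearest to point, i.e. its Voronoi cell."""
    return min(range(len(sites)), key=lambda i: math.dist(point, sites[i]))


def label_grid(sites, width, height):
    """Label the centre of each unit pixel with the index of its Voronoi cell."""
    return [
        [voronoi_cell((x + 0.5, y + 0.5), sites) for x in range(width)]
        for y in range(height)
    ]


# Hypothetical nucleus centroids (arbitrary units), for illustration only.
sites = [(1.0, 1.0), (5.0, 1.0), (3.0, 5.0)]
grid = label_grid(sites, 6, 6)
for row in grid:
    print("".join(str(cell) for cell in row))
```

Comparing such labels with segmented cell outlines is one simple way to check how closely a tissue cross-section resembles a Voronoi tessellation.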
-
Structure and Dynamics of amorphous Silica Surfaces
Authors:
Alexandra Roder,
Walter Kob,
Kurt Binder
Abstract:
We use molecular dynamics computer simulations to study the equilibrium properties of the surface of amorphous silica. Two types of geometries are investigated: i) clusters with different diameters (13.5Å, 19Å, and 26.5Å) and ii) a thin film with thickness 29Å. We find that the shape of the clusters is independent of temperature and that it becomes more spherical with increasing size. The surface energy is in qualitative agreement with the experimental value for the surface tension. The density distribution function shows a small peak just below the surface, the origin of which is traced back to a local chemical ordering at the surface. Close to the surface the partial radial distribution functions as well as the distributions of the bond-bond angles show features which are not observed in the interior of the systems. By calculating the distribution of the length of the Si-O rings we can show that these additional features are related to the presence of two-membered rings at the surface. The surface density of these structures is around 0.6/nm^2 in good agreement with experimental estimates. From the behavior of the mean-squared displacement at low temperatures we conclude that at the surface the cage of the particles is larger than the one in the bulk. Close to the surface the diffusion constant is somewhat larger than the one in the bulk and with decreasing temperature the relative difference grows. The total vibrational density of states at the surface is similar to the one in the bulk. However, if only the one for the silicon atoms is considered, significant differences are found.
Submitted 20 November, 2000;
originally announced November 2000.
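The mean-squared displacement mentioned in this abstract is the squared distance particles travel over a time lag, averaged over particles and time origins. The sketch below shows one way to compute it from stored trajectories, together with an estimate of the diffusion constant from the Einstein relation in three dimensions, $D \approx \mathrm{MSD}(t)/(6t)$, which holds at long times. The function names, the data layout, and the toy trajectory are assumptions for illustration, not the authors' analysis code.

```python
def mean_squared_displacement(trajectory, lag):
    """Average |r(t0 + lag) - r(t0)|^2 over all particles and time origins.

    trajectory[t][i] is the (x, y, z) position of particle i at frame t.
    """
    if not 0 < lag < len(trajectory):
        raise ValueError("lag must be between 1 and the number of frames minus 1")
    total = 0.0
    count = 0
    for t0 in range(len(trajectory) - lag):
        for r0, r1 in zip(trajectory[t0], trajectory[t0 + lag]):
            total += sum((b - a) ** 2 for a, b in zip(r0, r1))
            count += 1
    return total / count


def diffusion_estimate(msd, lag, dt):
    """Einstein relation in 3D, valid in the long-time diffusive regime."""
    return msd / (6.0 * lag * dt)


# Hypothetical toy data: particle 0 moves one unit in x per frame, particle 1 is fixed.
trajectory = [[(float(t), 0.0, 0.0), (2.0, 2.0, 2.0)] for t in range(4)]
msd_1 = mean_squared_displacement(trajectory, 1)
msd_2 = mean_squared_displacement(trajectory, 2)
print(msd_1, msd_2)
```

Restricting the particle average to atoms near the surface or in the interior would give the kind of surface-versus-bulk comparison the abstract describes.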
-
High-Temperature Series Analysis of the Free Energy and Susceptibility of the 2D Random-Bond Ising Model
Authors:
Alexandra Roder,
Joan Adler,
Wolfhard Janke
Abstract:
We derive high-temperature series expansions for the free energy and susceptibility of the two-dimensional random-bond Ising model with a symmetric bimodal distribution of two positive coupling strengths $J_1$ and $J_2$ and study the influence of the quenched, random bond-disorder on the critical behavior of the model. By analysing the series expansions over a wide range of coupling ratios $J_2/J_1$, covering the crossover from weak to strong disorder, and using two different methods, we obtain compelling evidence that the susceptibility has a singularity of the form $\chi \sim t^{-7/4} |\ln t|^{7/8}$, as predicted theoretically by Shalaev, Shankar, and Ludwig. For the specific heat our results are less convincing, but still compatible with the theoretically predicted log-log singularity.
Submitted 18 May, 1999;
originally announced May 1999.
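The singular forms named in this abstract can be written out explicitly. In the worked equations below, $t$ is assumed to be the reduced temperature $|T - T_c|/T_c$ with $0 < t < 1$ (the abstract does not define it), and $\gamma_{\mathrm{eff}}$ is a conventional effective exponent introduced here only for illustration.

```latex
\begin{align}
  \chi(t) &\sim t^{-7/4}\,|\ln t|^{7/8}, &
  C(t) &\sim \ln|\ln t|, \\
  \gamma_{\mathrm{eff}}(t) &\equiv -\frac{d\ln\chi}{d\ln t}
    = \frac{7}{4} + \frac{7}{8\,|\ln t|}
    \;\xrightarrow[t\to 0]{}\; \frac{7}{4}.
\end{align}
```

The logarithmic factor makes the effective exponent approach the pure-model value $7/4$ only slowly as $t \to 0$, which suggests why the series must be analysed with care to separate a logarithmic correction from a changed power law.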