Search | arXiv e-print repository

Heterogeneous Image-based Classification Using Distributional Data Analysis

Authors: Alec Reinhardt, Newsha Nikzad, Raven J. Hollis, Galia Jacobson, Millicent A. Roach, Mohamed Badawy, Peter Chul Park, Laura Beretta, Prasun K Jalal, David T. Fuentes, Eugene J. Koay, Suprateek Kundu

Abstract: Diagnostic imaging has gained prominence as potential biomarkers for early detection and diagnosis in a diverse array of disorders including cancer. However, existing methods routinely face challenges arising from various factors such as image heterogeneity. We develop a novel imaging-based distributional data analysis (DDA) approach that incorporates the probability (quantile) distribution of the… ▽ More Diagnostic imaging has gained prominence as potential biomarkers for early detection and diagnosis in a diverse array of disorders including cancer. However, existing methods routinely face challenges arising from various factors such as image heterogeneity. We develop a novel imaging-based distributional data analysis (DDA) approach that incorporates the probability (quantile) distribution of the pixel-level features as covariates. The proposed approach uses a smoothed quantile distribution (via a suitable basis representation) as functional predictors in a scalar-on-functional quantile regression model. Some distinctive features of the proposed approach include the ability to: (i) account for heterogeneity within the image; (ii) incorporate granular information spanning the entire distribution; and (iii) tackle variability in image sizes for unregistered images in cancer applications. Our primary goal is risk prediction in Hepatocellular carcinoma that is achieved via predicting the change in tumor grades at post-diagnostic visits using pre-diagnostic enhancement pattern map** (EPM) images of the liver. Along the way, the proposed DDA approach is also used for case versus control diagnosis and risk stratification objectives. Our analysis reveals that when coupled with global structural radiomics features derived from the corresponding T1-MRI scans, the proposed smoothed quantile distributions derived from EPM images showed considerable improvements in sensitivity and comparable specificity in contrast to classification based on routinely used summary measures that do not account for image heterogeneity. Given that there are limited predictive modeling approaches based on heterogeneous images in cancer, the proposed method is expected to provide considerable advantages in image-based early detection and risk prediction. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: 16, 2 figures, 3 tables

arXiv:2210.03694 [pdf, other]

Cluster Amplitudes and Their Interplay with Self-Consistency in Density Functional Methods

Authors: Greta Jacobson, Juan M. Marmolejo-Tejada, Martín A. Mosquera

Abstract: Density functional theory (DFT) provides convenient electronic structure methods for the study of molecular systems and materials. Regular Kohn-Sham DFT calculations rely on unitary transformations to determine the ground-state electronic density, ground state energy, and related properties. However, for dissociation of molecular systems into open-shell fragments, due to the self-interaction error… ▽ More Density functional theory (DFT) provides convenient electronic structure methods for the study of molecular systems and materials. Regular Kohn-Sham DFT calculations rely on unitary transformations to determine the ground-state electronic density, ground state energy, and related properties. However, for dissociation of molecular systems into open-shell fragments, due to the self-interaction error present in a large number of density functional approximations, the self-consistent procedure based on the this type of transformation gives rise to the well-known charge delocalization problem. To avoid this issue, we showed previously that the cluster operator of coupled-cluster theory can be utilized within the context of DFT to solve in an alternative and approximate fashion the ground-state self-consistent problem. This work further examines the application of the singles cluster operator to molecular ground state calculations. Two approximations are derived and explored: i), A linearized scheme of the quadratic equation used to determine the cluster amplitudes, and, ii), the effect of carrying the calculations in a non-self-consistent field fashion. These approaches are found to be capable of improving the energy and density of the system and are quite stable in either case. The theoretical framework discussed in this work could be used to describe, with an added flexibility, quantum systems that display challenging features and require expanded theoretical methods. △ Less

Submitted 7 October, 2022; originally announced October 2022.

Comments: 17 pages, 5 figures, 1 table, submitted to ChemPhysChem

arXiv:2106.08946 [pdf, other]

FGLP: A Federated Fine-Grained Location Prediction System for Mobile Users

Authors: Xiaopeng Jiang, Shuai Zhao, Guy Jacobson, Rittwik Jana, Wen-Ling Hsu, Manoop Talasila, Syed Anwar Aftab, Yi Chen, Cristian Borcea

Abstract: Fine-grained location prediction on smart phones can be used to improve app/system performance. Application scenarios include video quality adaptation as a function of the 5G network quality at predicted user locations, and augmented reality apps that speed up content rendering based on predicted user locations. Such use cases require prediction error in the same range as the GPS error, and no exi… ▽ More Fine-grained location prediction on smart phones can be used to improve app/system performance. Application scenarios include video quality adaptation as a function of the 5G network quality at predicted user locations, and augmented reality apps that speed up content rendering based on predicted user locations. Such use cases require prediction error in the same range as the GPS error, and no existing works on location prediction can achieve this level of accuracy. We present a system for fine-grained location prediction (FGLP) of mobile users, based on GPS traces collected on the phones. FGLP has two components: a federated learning framework and a prediction model. The framework runs on the phones of the users and also on a server that coordinates learning from all users in the system. FGLP represents the user location data as relative points in an abstract 2D space, which enables learning across different physical spaces. The model merges Bidirectional Long Short-Term Memory (BiLSTM) and Convolutional Neural Networks (CNN), where BiLSTM learns the speed and direction of the mobile users, and CNN learns information such as user movement preferences. FGLP uses federated learning to protect user privacy and reduce bandwidth consumption. Our experimental results, using a dataset with over 600,000 users, demonstrate that FGLP outperforms baseline models in terms of prediction accuracy. We also demonstrate that FGLP works well in conjunction with transfer learning, which enables model reusability. Finally, benchmark results on several types of Android phones demonstrate FGLP's feasibility in real life. △ Less

Submitted 12 June, 2021; originally announced June 2021.

arXiv:1810.07159 [pdf]

Packaging and Sharing Machine Learning Models via the Acumos AI Open Platform

Authors: Shuai Zhao, Manoop Talasila, Guy Jacobson, Cristian Borcea, Syed Anwar Aftab, John F Murray

Abstract: Applying Machine Learning (ML) to business applications for automation usually faces difficulties when integrating diverse ML dependencies and services, mainly because of the lack of a common ML framework. In most cases, the ML models are developed for applications which are targeted for specific business domain use cases, leading to duplicated effort, and making reuse impossible. This paper prese… ▽ More Applying Machine Learning (ML) to business applications for automation usually faces difficulties when integrating diverse ML dependencies and services, mainly because of the lack of a common ML framework. In most cases, the ML models are developed for applications which are targeted for specific business domain use cases, leading to duplicated effort, and making reuse impossible. This paper presents Acumos, an open platform capable of packaging ML models into portable containerized microservices which can be easily shared via the platform's catalog, and can be integrated into various business applications. We present a case study of packaging sentiment analysis and classification ML models via the Acumos platform, permitting easy sharing with others. We demonstrate that the Acumos platform reduces the technical burden on application developers when applying machine learning models to their business applications. Furthermore, the platform allows the reuse of readily available ML microservices in various business domains. △ Less

Submitted 16 October, 2018; originally announced October 2018.

Comments: ICMLA 2018: International Conference on Machine Learning and Applications

Showing 1–4 of 4 results for author: Jacobson, G