-
CohortFinder: an open-source tool for data-driven partitioning of biomedical image cohorts to yield robust machine learning models
Authors:
Fan Fan,
Georgia Martinez,
Thomas Desilvio,
John Shin,
Yijiang Chen,
Bangchen Wang,
Takaya Ozeki,
Maxime W. Lafarge,
Viktor H. Koelzer,
Laura Barisoni,
Anant Madabhushi,
Satish E. Viswanath,
Andrew Janowczyk
Abstract:
Batch effects (BEs) refer to systematic technical differences in data collection unrelated to biological variations whose noise is shown to negatively impact machine learning (ML) model generalizability. Here we release CohortFinder, an open-source tool aimed at mitigating BEs via data-driven cohort partitioning. We demonstrate CohortFinder improves ML model performance in downstream medical image processing tasks. CohortFinder is freely available for download at cohortfinder.com.
Submitted 17 July, 2023;
originally announced July 2023.
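
The abstract does not spell out how the partitioning works, so the following is only a minimal sketch of one plausible reading: assuming each image has already been assigned to a batch-effect group (for example by clustering quality metrics), every group is split across the training and testing sets so that both see each group. The function name `partition_by_group`, the `test_fraction` parameter, and the group labels are hypothetical and are not taken from CohortFinder itself.

```python
import random
from collections import defaultdict


def partition_by_group(group_of, test_fraction=0.25, seed=0):
    """Split items into train/test so every batch-effect group appears in both.

    group_of: dict mapping item id -> group label (assumed to be precomputed).
    """
    rng = random.Random(seed)
    by_group = defaultdict(list)
    for item, group in group_of.items():
        by_group[group].append(item)

    train, test = [], []
    for group in sorted(by_group):
        members = sorted(by_group[group])
        rng.shuffle(members)
        n_test = round(len(members) * test_fraction)
        # With two or more members, keep at least one item on each side.
        if len(members) >= 2:
            n_test = min(max(n_test, 1), len(members) - 1)
        test.extend(members[:n_test])
        train.extend(members[n_test:])
    return train, test


if __name__ == "__main__":
    # Hypothetical labels: 8 images from two scanners ("A" and "B").
    labels = {f"img{i}": ("A" if i < 4 else "B") for i in range(8)}
    train_ids, test_ids = partition_by_group(labels)
    print("train:", train_ids)
    print("test:", test_ids)
```

Splitting within each group, rather than over the pooled cohort, is what keeps a single site or scanner from ending up entirely in one partition.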
-
Geometric Scattering on Measure Spaces
Authors:
Joyce Chew,
Matthew Hirn,
Smita Krishnaswamy,
Deanna Needell,
Michael Perlmutter,
Holly Steach,
Siddharth Viswanath,
Hau-Tieng Wu
Abstract:
The scattering transform is a multilayered, wavelet-based transform initially introduced as a model of convolutional neural networks (CNNs) that has played a foundational role in our understanding of these networks' stability and invariance properties. Subsequently, there has been widespread interest in extending the success of CNNs to data sets with non-Euclidean structure, such as graphs and manifolds, leading to the emerging field of geometric deep learning. In order to improve our understanding of the architectures used in this new field, several papers have proposed generalizations of the scattering transform for non-Euclidean data structures such as undirected graphs and compact Riemannian manifolds without boundary.
In this paper, we introduce a general, unified model for geometric scattering on measure spaces. Our proposed framework includes previous work on geometric scattering as special cases but also applies to more general settings such as directed graphs, signed graphs, and manifolds with boundary. We propose a new criterion that identifies to which groups a useful representation should be invariant and show that this criterion is sufficient to guarantee that the scattering transform has desirable stability and invariance properties. Additionally, we consider finite measure spaces that are obtained from randomly sampling an unknown manifold. We propose two methods for constructing a data-driven graph on which the associated graph scattering transform approximates the scattering transform on the underlying manifold. Moreover, we use a diffusion-maps based approach to prove quantitative estimates on the rate of convergence of one of these approximations as the number of sample points tends to infinity. Lastly, we showcase the utility of our method on spherical images, directed graphs, and on high-dimensional single-cell data.
Submitted 13 October, 2022; v1 submitted 17 August, 2022;
originally announced August 2022.
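
To make the idea of a graph scattering transform concrete, here is a minimal sketch of first-order scattering features on an undirected graph. It assumes a construction common in the earlier graph scattering literature, diffusion wavelets built from a lazy random walk, with P = (I + A D^{-1}) / 2 and Psi_j = P^{2^(j-1)} - P^{2^j}. This is not necessarily the construction used in the paper, and all function names are illustrative.

```python
def lazy_walk_step(adj, x):
    """Apply P = (I + A D^{-1}) / 2 to the signal x (adj is a dense 0/1 matrix)."""
    n = len(adj)
    deg = [sum(row) for row in adj]
    # D^{-1} x, guarding against isolated nodes.
    scaled = [x[j] / deg[j] if deg[j] > 0 else 0.0 for j in range(n)]
    walked = [sum(adj[i][j] * scaled[j] for j in range(n)) for i in range(n)]
    return [0.5 * (x[i] + walked[i]) for i in range(n)]


def diffuse(adj, x, steps):
    """Apply the lazy random walk `steps` times, i.e. compute P^steps x."""
    for _ in range(steps):
        x = lazy_walk_step(adj, x)
    return x


def first_order_scattering(adj, x, num_scales=3):
    """Return the sums over nodes of |Psi_j x| for j = 1..num_scales."""
    features = []
    for j in range(1, num_scales + 1):
        coarse = diffuse(adj, x, 2 ** (j - 1))
        coarser = diffuse(adj, x, 2 ** j)
        features.append(sum(abs(a - b) for a, b in zip(coarse, coarser)))
    return features


if __name__ == "__main__":
    # Path graph on three nodes with a unit impulse on the first node.
    path = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
    print(first_order_scattering(path, [1.0, 0.0, 0.0]))
```

Summing the absolute wavelet coefficients over all nodes makes each feature independent of how the nodes are ordered, a simple instance of the invariance properties the abstract refers to.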
-
The Manifold Scattering Transform for High-Dimensional Point Cloud Data
Authors:
Joyce Chew,
Holly R. Steach,
Siddharth Viswanath,
Hau-Tieng Wu,
Matthew Hirn,
Deanna Needell,
Smita Krishnaswamy,
Michael Perlmutter
Abstract:
The manifold scattering transform is a deep feature extractor for data defined on a Riemannian manifold. It is one of the first examples of extending convolutional neural network-like operators to general manifolds. The initial work on this model focused primarily on its theoretical stability and invariance properties but did not provide methods for its numerical implementation except in the case of two-dimensional surfaces with predefined meshes. In this work, we present practical schemes, based on the theory of diffusion maps, for implementing the manifold scattering transform to datasets arising in naturalistic systems, such as single cell genetics, where the data is a high-dimensional point cloud modeled as lying on a low-dimensional manifold. We show that our methods are effective for signal classification and manifold classification tasks.
Submitted 21 January, 2024; v1 submitted 20 June, 2022;
originally announced June 2022.
-
Correlation between image quality metrics of magnetic resonance images and the neural network segmentation accuracy
Authors:
Rajarajeswari Muthusivarajan,
Adrian Celaya,
Joshua P. Yung,
Satish Viswanath,
Daniel S. Marcus,
Caroline Chung,
David Fuentes
Abstract:
Deep neural networks with multilevel connections process input data in complex ways to learn the information. A network's learning efficiency depends not only on the complex neural network architecture but also on the input training images. Medical image segmentation with deep neural networks for skull stripping or tumor segmentation from magnetic resonance images enables learning both global and local features of the images. Though medical images are collected in a controlled environment, there may be artifacts or equipment-based variance that cause inherent bias in the input set. In this study, we investigated the correlation between the image quality metrics (IQMs) of MR images and the neural network segmentation accuracy. To that end, we used the 3D DenseNet architecture and trained the network on the same input while applying different methodologies to select the training data set based on the IQM values. The difference in segmentation accuracy between models trained on randomly selected inputs and models trained on IQM-based inputs sheds light on the role of image quality metrics in segmentation accuracy. By using the image quality metrics to choose the training inputs, we may further tune the learning efficiency of the network and the segmentation accuracy.
Submitted 1 November, 2021;
originally announced November 2021.
-
MRQy: An Open-Source Tool for Quality Control of MR Imaging Data
Authors:
Amir Reza Sadri,
Andrew Janowczyk,
Ren Zou,
Ruchika Verma,
Niha Beig,
Jacob Antunes,
Anant Madabhushi,
Pallavi Tiwari,
Satish E. Viswanath
Abstract:
We sought to develop a quantitative tool to quickly determine relative differences in MRI volumes both within and between large MR imaging cohorts (such as available in The Cancer Imaging Archive (TCIA)), in order to help determine the generalizability of radiomics and machine learning schemes to unseen datasets. The tool is intended to help quantify presence of (a) site- or scanner-specific variations in image resolution, field-of-view, or image contrast, or (b) imaging artifacts such as noise, motion, inhomogeneity, ringing, or aliasing; which can adversely affect relative image quality between data cohorts. We present MRQy, a new open-source quality control tool to (a) interrogate MRI cohorts for site- or equipment-based differences, and (b) quantify the impact of MRI artifacts on relative image quality; to help determine how to correct for these variations prior to model development. MRQy extracts a series of quality measures (e.g. noise ratios, variation metrics, entropy and energy criteria) and MR image metadata (e.g. voxel resolution, image dimensions) for subsequent interrogation via a specialized HTML5 based front-end designed for real-time filtering and trend visualization. MRQy was used to evaluate (a) n=133 brain MRIs from TCIA (7 sites), and (b) n=104 rectal MRIs (3 local sites). MRQy measures revealed significant site-specific variations in both cohorts, indicating potential batch effects. Marked differences in specific MRQy measures were also able to identify outlier MRI datasets that needed to be corrected for common MR imaging artifacts. MRQy is designed to be a standalone, unsupervised tool that can be efficiently run on a standard desktop computer. It has been made freely accessible at http://github.com/ccipd/MRQy for wider community use and feedback.
Submitted 17 August, 2020; v1 submitted 9 April, 2020;
originally announced April 2020.
-
Identifying Indoor Points of Interest via Mobile Crowdsensing: An Experimental Study
Authors:
Sumudu Hasala Marakkalage,
Ran Liu,
Sanjana Kadaba Viswanath,
Chau Yuen
Abstract:
This paper presents a mobile crowdsensing approach to identify the indoor points of interest (POI) by exploiting Wi-Fi similarity measurements. Since indoor environments are lacking the GPS positioning accuracy when compared to outdoors, we rely on widely available Wi-Fi access points (AP) in contemporary urban indoor environments, to accurately identify user POI. We propose a smartphone application based system architecture to scan the surrounding Wi-Fi AP and measure the cosine similarity of received signal strengths (RSS), and demonstrate through the experimental results that it is possible to identify the distinct POI of users, and the common POI among users of a given indoor environment.
Submitted 20 August, 2019;
originally announced August 2019.
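
The cosine similarity of received signal strengths mentioned in the abstract can be illustrated with a minimal sketch. It assumes each Wi-Fi scan is a mapping from access-point identifier to RSS in dBm, that access points missing from a scan contribute zero, and that RSS values are shifted by a -100 dBm floor so stronger signals get larger positive weights. These conventions and the name `rss_cosine_similarity` are assumptions made for illustration, not details from the paper.

```python
import math

# Assumption: anything at or below this level (in dBm) counts as "not heard".
RSS_FLOOR_DBM = -100.0


def rss_cosine_similarity(scan_a, scan_b):
    """Cosine similarity between two Wi-Fi scans given as {ap_id: rss_dbm}."""
    access_points = sorted(set(scan_a) | set(scan_b))

    def to_vector(scan):
        # Shift dBm values above the floor; unseen access points map to 0.
        return [max(scan.get(ap, RSS_FLOOR_DBM) - RSS_FLOOR_DBM, 0.0)
                for ap in access_points]

    va, vb = to_vector(scan_a), to_vector(scan_b)
    dot = sum(x * y for x, y in zip(va, vb))
    norm_a = math.sqrt(sum(x * x for x in va))
    norm_b = math.sqrt(sum(y * y for y in vb))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Hypothetical scans: two taken near each other, one taken elsewhere.
    cafe_1 = {"ap:01": -45.0, "ap:02": -60.0, "ap:03": -80.0}
    cafe_2 = {"ap:01": -50.0, "ap:02": -58.0}
    library = {"ap:07": -40.0, "ap:08": -70.0}
    print(rss_cosine_similarity(cafe_1, cafe_2))
    print(rss_cosine_similarity(cafe_1, library))
```

Scans that share strong access points score close to 1, while scans with no access points in common score 0, which is what lets a similarity threshold group scans into the same point of interest.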
-
Towards Comfortable Cycling: A Practical Approach to Monitor the Conditions in Cycling Paths
Authors:
Nipun Wijerathne,
Sanjana Kadaba Viswanath,
Marakkalage Sumudu Hasala,
Victoria Beltran,
Chau Yuen,
Hock Beng Lim
Abstract:
This is a no-brainer. Using bicycles to commute is the most sustainable form of transport, is the least expensive to use, and is pollution-free. Towns and cities have to be made bicycle-friendly to encourage their wide usage. Therefore, cycling paths should be more convenient, comfortable, and safe to ride. This paper investigates a smartphone application that passively monitors the road conditions while cyclists ride. To overcome the problems of monitoring roads, we present novel algorithms that sense rough cycling paths and locate road bumps. Each event is detected in real time to improve the user-friendliness of the application. Cyclists may keep their smartphones at any random orientation and placement. Moreover, different smartphones sense the same incident dissimilarly and hence report discrepant sensor values. We further address the aforementioned difficulties that limit such crowd-sourcing applications. We evaluate our sensing application on cycling paths in Singapore, and show that it can successfully detect such bad road conditions.
Submitted 15 December, 2017;
originally announced December 2017.
-
System Design of Internet-of-Things for Residential Smart Grid
Authors:
Sanjana Kadaba Viswanath,
Chau Yuen,
Wayes Tushar,
Wen-Tai Li,
Chao-Kai Wen,
Kun Hu,
Cheng Chen,
Xiang Liu
Abstract:
The Internet of Things (IoT) aims to integrate and coordinate real-world objects, and to let them communicate and collaborate, in order to perform daily tasks in a more intelligent and efficient manner. To realize this vision, this paper studies the design of a large-scale IoT system for smart grid applications, which comprises a large number of home users and requires a fast response time. In particular, we focus on the messaging protocol of a universal IoT home gateway, where our cloud-enabled system consists of a backend server, a unified home gateway (UHG) at the end users, and a user interface for mobile devices. We discuss the features such an IoT system needs to support a large-scale deployment with a UHG and real-time residential smart grid applications. Based on the requirements, we design an IoT system using the XMPP protocol and implement it in a testbed for energy management applications. To show the effectiveness of the designed testbed, we present some results using the proposed IoT architecture.
Submitted 13 April, 2016;
originally announced April 2016.