-
Cycle-consistent Generative Adversarial Network Synthetic CT for MR-only Adaptive Radiation Therapy on MR-Linac
Authors:
Gabriel L. Asher,
Bassem I. Zaki,
Gregory A. Russo,
Gobind S. Gill,
Charles R. Thomas,
Temiloluwa O. Prioleau,
Rongxiao Zhang,
Brady Hunt
Abstract:
Purpose: This study assesses the effectiveness of Deep Learning (DL) for creating synthetic CT (sCT) images in MR-guided adaptive radiation therapy (MRgART).
Methods: A Cycle-GAN model was trained with MRI and CT scan slices from MR-LINAC treatments, generating sCT volumes. The analysis involved retrospective treatment plan data from patients with various tumors. sCT images were compared with st…
▽ More
Purpose: This study assesses the effectiveness of Deep Learning (DL) for creating synthetic CT (sCT) images in MR-guided adaptive radiation therapy (MRgART).
Methods: A Cycle-GAN model was trained with MRI and CT scan slices from MR-LINAC treatments, generating sCT volumes. The analysis involved retrospective treatment plan data from patients with various tumors. sCT images were compared with standard CT scans using mean absolute error in Hounsfield Units (HU) and image similarity metrics (SSIM, PSNR, NCC). sCT volumes were integrated into a clinical treatment system for dosimetric re-evaluation.
Results: The model, trained on 8405 frames from 57 patients and tested on 357 sCT frames from 17 patients, showed sCTs comparable to dCTs in electron density and structural similarity with MRI scans. The MAE between sCT and dCT was 49.2 +/- 13.2 HU, with sCT NCC exceeding dCT by 0.06, and SSIM and PSNR at 0.97 +/- 0.01 and 19.9 +/- 1.6 respectively. Dosimetric evaluations indicated minimal differences between sCTs and dCTs, with sCTs showing better air-bubble reconstruction.
Conclusions: DL-based sCT generation on MR-Linacs is accurate for dose calculation and optimization in MRgART. This could facilitate MR-only treatment planning, enhancing simulation and adaptive planning efficiency on MR-Linacs.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Robust language-based mental health assessments in time and space through social media
Authors:
Siddharth Mangalik,
Johannes C. Eichstaedt,
Salvatore Giorgi,
Jihu Mun,
Farhan Ahmed,
Gilvir Gill,
Adithya V. Ganesan,
Shashanka Subrahmanya,
Nikita Soni,
Sean A. P. Clouston,
H. Andrew Schwartz
Abstract:
Compared to physical health, population mental health measurement in the U.S. is very coarse-grained. Currently, in the largest population surveys, such as those carried out by the Centers for Disease Control or Gallup, mental health is only broadly captured through "mentally unhealthy days" or "sadness", and limited to relatively infrequent state or metropolitan estimates. Through the large scale…
▽ More
Compared to physical health, population mental health measurement in the U.S. is very coarse-grained. Currently, in the largest population surveys, such as those carried out by the Centers for Disease Control or Gallup, mental health is only broadly captured through "mentally unhealthy days" or "sadness", and limited to relatively infrequent state or metropolitan estimates. Through the large scale analysis of social media data, robust estimation of population mental health is feasible at much higher resolutions, up to weekly estimates for counties. In the present work, we validate a pipeline that uses a sample of 1.2 billion Tweets from 2 million geo-located users to estimate mental health changes for the two leading mental health conditions, depression and anxiety. We find moderate to large associations between the language-based mental health assessments and survey scores from Gallup for multiple levels of granularity, down to the county-week (fixed effects $β= .25$ to $1.58$; $p<.001$). Language-based assessment allows for the cost-effective and scalable monitoring of population mental health at weekly time scales. Such spatially fine-grained time series are well suited to monitor effects of societal events and policies as well as enable quasi-experimental study designs in population health and other disciplines. Beyond mental health in the U.S., this method generalizes to a broad set of psychological outcomes and allows for community measurement in under-resourced settings where no traditional survey measures - but social media data - are available.
△ Less
Submitted 24 February, 2023;
originally announced February 2023.
-
Radio-frequency manipulation of state populations in an entangled fluorine-muon-fluorine system
Authors:
David Billington,
Edward Riordan,
Majdi Salman,
Daniel Margineda,
George J. W. Gill,
Stephen P. Cottrell,
Iain McKenzie,
Tom Lancaster,
Michael J. Graf,
Sean R. Giblin
Abstract:
Entangled spin states are created by implanting muons into single crystal LiY0.95Ho0.05F4 to form a cluster of correlated, dipole-coupled local magnetic moments. The resulting states have well-defined energy levels allowing experimental manipulation of the state populations by electromagnetic excitation. Experimental control of the evolution of the muon spin polarization is demonstrated through ap…
▽ More
Entangled spin states are created by implanting muons into single crystal LiY0.95Ho0.05F4 to form a cluster of correlated, dipole-coupled local magnetic moments. The resulting states have well-defined energy levels allowing experimental manipulation of the state populations by electromagnetic excitation. Experimental control of the evolution of the muon spin polarization is demonstrated through application of continuous, radio-frequency magnetic excitation fields. A semiclassical model of quantum, dipole-coupled spins interacting with a classical, oscillating magnetic field accounts for the muon spin evolution. On application of the excitation field, this model shows how changes in the state populations lead to the experimentally observed effects, thus enabling a spectroscopic probe of entangled spin states with muons.
△ Less
Submitted 11 April, 2022;
originally announced April 2022.
-
Measuring Geometric Similarity Across Possible Plans for Automated Redistricting
Authors:
Gilvir Gill
Abstract:
Algorithmic and statistical approaches to congressional redistricting are becoming increasingly valuable tools in courts and redistricting commissions for quantifying gerrymandering in the United States. While there is existing literature covering how various Markov chain Monte Carlo distributions differ in terms of projected electoral outcomes and geometric quantifiers of compactness, there is st…
▽ More
Algorithmic and statistical approaches to congressional redistricting are becoming increasingly valuable tools in courts and redistricting commissions for quantifying gerrymandering in the United States. While there is existing literature covering how various Markov chain Monte Carlo distributions differ in terms of projected electoral outcomes and geometric quantifiers of compactness, there is still work to be done on measuring similarities between different congressional redistricting plans. This paper briefly introduces an intuitive and interpretive measure of similarity, and a corresponding assignment matrix, that corresponds to the percentage of a state's area or population that stays in the same congressional district between two plans. We then show how to calculate this measure in polynomial time and briefly demonstrate some potential use-cases.
△ Less
Submitted 16 November, 2021;
originally announced November 2021.
-
Optimizing Graph Transformer Networks with Graph-based Techniques
Authors:
Loc Hoang,
Udit Agarwal,
Gurbinder Gill,
Roshan Dathathri,
Abhik Seal,
Brian Martin,
Keshav **ali
Abstract:
Graph transformer networks (GTN) are a variant of graph convolutional networks (GCN) that are targeted to heterogeneous graphs in which nodes and edges have associated type information that can be exploited to improve inference accuracy. GTNs learn important metapaths in the graph, create weighted edges for these metapaths, and use the resulting graph in a GCN. Currently, the only available implem…
▽ More
Graph transformer networks (GTN) are a variant of graph convolutional networks (GCN) that are targeted to heterogeneous graphs in which nodes and edges have associated type information that can be exploited to improve inference accuracy. GTNs learn important metapaths in the graph, create weighted edges for these metapaths, and use the resulting graph in a GCN. Currently, the only available implementation of GTNs uses dense matrix multiplication to find metapaths. Unfortunately, the space overhead of this approach can be large, so in practice it is used only for small graphs. In addition, the matrix-based implementation is not fine-grained enough to use random-walk based methods to optimize metapath finding. In this paper, we present a graph-based formulation and implementation of the GTN metapath finding problem. This graph-based formulation has two advantages over the matrix-based approach. First, it is more space efficient than the original GTN implementation and more compute-efficient for metapath sizes of practical interest. Second, it permits us to implement a sampling method that reduces the number of metapaths that must be enumerated, allowing the implementation to be used for larger graphs and larger metapath sizes. Experimental results show that our implementation is $6.5\times$ faster than the original GTN implementation on average for a metapath length of 4, and our sampling implementation is $155\times$ faster on average than this implementation without compromising on the accuracy of the GTN.
△ Less
Submitted 15 June, 2021;
originally announced June 2021.
-
Sandslash: A Two-Level Framework for Efficient Graph Pattern Mining
Authors:
Xuhao Chen,
Roshan Dathathri,
Gurbinder Gill,
Loc Hoang,
Keshav **ali
Abstract:
Graph pattern mining (GPM) is used in diverse application areas including social network analysis, bioinformatics, and chemical engineering. Existing GPM frameworks either provide high-level interfaces for productivity at the cost of expressiveness or provide low-level interfaces that can express a wide variety of GPM algorithms at the cost of increased programming complexity. Moreover, existing s…
▽ More
Graph pattern mining (GPM) is used in diverse application areas including social network analysis, bioinformatics, and chemical engineering. Existing GPM frameworks either provide high-level interfaces for productivity at the cost of expressiveness or provide low-level interfaces that can express a wide variety of GPM algorithms at the cost of increased programming complexity. Moreover, existing systems lack the flexibility to explore combinations of optimizations to achieve performance competitive with hand-optimized applications.
We present Sandslash, an in-memory Graph Pattern Mining (GPM) framework that uses a novel programming interface to support productive, expressive, and efficient GPM on large graphs. Sandslash provides a high-level API that needs only a specification of the GPM problem, and it implements fast subgraph enumeration, provides efficient data structures, and applies high-level optimizations automatically. To achieve performance competitive with expert-optimized implementations, Sandslash also provides a low-level API that allows users to express algorithm-specific optimizations. This enables Sandslash to support both high-productivity and high-efficiency without losing expressiveness. We evaluate Sandslash on shared-memory machines using five GPM applications and a wide range of large real-world graphs. Experimental results demonstrate that applications written using Sandslash high-level or low-level API outperforms state-of-the-art GPM systems AutoMine, Pangolin, and Peregrine on average by 13.8x, 7.9x, and 5.4x, respectively. We also show that these Sandslash applications outperform expert-optimized GPM implementations by 2.3x on average with less programming effort.
△ Less
Submitted 5 November, 2020;
originally announced November 2020.
-
An Adaptive Load Balancer For Graph Analytical Applications on GPUs
Authors:
Vishwesh Jatala,
Loc Hoang,
Roshan Dathathri,
Gurbinder Gill,
V Krishna Nandivada,
Keshav **ali
Abstract:
Load-balancing among the threads of a GPU for graph analytics workloads is difficult because of the irregular nature of graph applications and the high variability in vertex degrees, particularly in power-law graphs. We describe a novel load balancing scheme to address this problem. Our scheme is implemented in the IrGL compiler to allow users to generate efficient load balanced code for a GPU fro…
▽ More
Load-balancing among the threads of a GPU for graph analytics workloads is difficult because of the irregular nature of graph applications and the high variability in vertex degrees, particularly in power-law graphs. We describe a novel load balancing scheme to address this problem. Our scheme is implemented in the IrGL compiler to allow users to generate efficient load balanced code for a GPU from high-level sequential programs. We evaluated several graph analytics applications on up to 16 distributed GPUs using IrGL to compile the code and the Gluon substrate for inter-GPU communication. Our experiments show that this scheme can achieve an average speed-up of 2.2x on inputs that suffer from severe load imbalance problems when previous state-of-the-art load-balancing schemes are used.
△ Less
Submitted 27 February, 2020; v1 submitted 20 November, 2019;
originally announced November 2019.
-
Pangolin: An Efficient and Flexible Graph Pattern Mining System on CPU and GPU
Authors:
Xuhao Chen,
Roshan Dathathri,
Gurbinder Gill,
Keshav **ali
Abstract:
There is growing interest in graph pattern mining (GPM) problems such as motif counting. GPM systems have been developed to provide unified interfaces for programming algorithms for these problems and for running them on parallel systems. However, existing systems may take hours to mine even simple patterns in moderate-sized graphs, which significantly limits their real-world usability.
We prese…
▽ More
There is growing interest in graph pattern mining (GPM) problems such as motif counting. GPM systems have been developed to provide unified interfaces for programming algorithms for these problems and for running them on parallel systems. However, existing systems may take hours to mine even simple patterns in moderate-sized graphs, which significantly limits their real-world usability.
We present Pangolin, a high-performance and flexible in-memory GPM framework targeting shared-memory CPUs and GPUs. Pangolin is the first GPM system that provides high-level abstractions for GPU processing. It provides a simple programming interface based on the extend-reduce-filter model, which enables users to specify application-specific knowledge for search space pruning and isomorphism test elimination. We describe novel optimizations that exploit locality, reduce memory consumption, and mitigate the overheads of dynamic memory allocation and synchronization.
Evaluation on a 28-core CPU demonstrates that Pangolin outperforms existing GPM frameworks Arabesque, RStream, and Fractal by 49x, 88x, and 80x on average, respectively. Acceleration on a V100 GPU further improves performance of Pangolin by 15x on average. Compared to state-of-the-art hand-optimized GPM applications, Pangolin provides competitive performance with less programming effort.
△ Less
Submitted 17 January, 2020; v1 submitted 16 November, 2019;
originally announced November 2019.
-
Distributed Training of Embeddings using Graph Analytics
Authors:
Gurbinder Gill,
Roshan Dathathri,
Saeed Maleki,
Madan Musuvathi,
Todd Mytkowicz,
Olli Saarikivi
Abstract:
Many applications today, such as NLP, network analysis, and code analysis, rely on semantically embedding objects into low-dimensional fixed-length vectors. Such embeddings naturally provide a way to perform useful downstream tasks, such as identifying relations among objects or predicting objects for a given context, etc. Unfortunately, the training necessary for accurate embeddings is usually co…
▽ More
Many applications today, such as NLP, network analysis, and code analysis, rely on semantically embedding objects into low-dimensional fixed-length vectors. Such embeddings naturally provide a way to perform useful downstream tasks, such as identifying relations among objects or predicting objects for a given context, etc. Unfortunately, the training necessary for accurate embeddings is usually computationally intensive and requires processing large amounts of data. Furthermore, distributing this training is challenging. Most embedding training uses stochastic gradient descent (SGD), an "inherently" sequential algorithm. Prior approaches to parallelizing SGD do not honor these dependencies and thus potentially suffer poor convergence.
This paper presents a distributed training framework for a class of applications that use Skip-gram-like models to generate embeddings. We call this class Any2Vec and it includes Word2Vec, DeepWalk, and Node2Vec among others. We first formulate Any2Vec training algorithm as a graph application and leverage the state-of-the-art distributed graph analytics framework, D-Galois. We adapt D-Galois to support dynamic graph generation and repartitioning, and incorporate novel communication optimizations. Finally, we introduce a novel way to combine gradients during distributed training to prevent accuracy loss. We show that our framework, called GraphAny2Vec, matches on a cluster of 32 hosts the accuracy of the state-of-the-art shared-memory implementations of Word2Vec and Vertex2Vec on 1 host, and gives a geo-mean speedup of 12x and 5x respectively. Furthermore, GraphAny2Vec is on average 2x faster than the state-of-the-art distributed Word2Vec implementation, DMTK, on 32 hosts. We also show the superiority of our Gradient Combiner independent of GraphAny2Vec by incorporating it in DMTK, which raises its accuracy by > 30%.
△ Less
Submitted 23 February, 2020; v1 submitted 7 September, 2019;
originally announced September 2019.
-
Single Machine Graph Analytics on Massive Datasets Using Intel Optane DC Persistent Memory
Authors:
Gurbinder Gill,
Roshan Dathathri,
Loc Hoang,
Ramesh Peri,
Keshav **ali
Abstract:
Intel Optane DC Persistent Memory (Optane PMM) is a new kind of byte-addressable memory with higher density and lower cost than DRAM. This enables the design of affordable systems that support up to 6TB of randomly accessible memory. In this paper, we present key runtime and algorithmic principles to consider when performing graph analytics on extreme-scale graphs on large-memory platforms of this…
▽ More
Intel Optane DC Persistent Memory (Optane PMM) is a new kind of byte-addressable memory with higher density and lower cost than DRAM. This enables the design of affordable systems that support up to 6TB of randomly accessible memory. In this paper, we present key runtime and algorithmic principles to consider when performing graph analytics on extreme-scale graphs on large-memory platforms of this sort.
To demonstrate the importance of these principles, we evaluate four existing shared-memory graph frameworks on large real-world web-crawls, using a machine with 6TB of Optane PMM. Our results show that frameworks based on the runtime and algorithmic principles advocated in this paper (i) perform significantly better than the others, and (ii) are competitive with graph analytics frameworks running on large production clusters.
△ Less
Submitted 23 February, 2020; v1 submitted 15 April, 2019;
originally announced April 2019.
-
A Universal Algorithm for Continuous Time Random Walks Limit Distributions
Authors:
Gurtek Gill,
Peter Straka
Abstract:
In this article, we generalize the recent Discrete Time Random Walk (DTRW) algorithm, which was introduced for the computation of probability densities of fractional diffusion. Although it has the same computational complexity and shares the same desirable features (consistency, conservation of mass, strictly non-negative solutions), it applies to virtually every conceivable Continuous Time Random…
▽ More
In this article, we generalize the recent Discrete Time Random Walk (DTRW) algorithm, which was introduced for the computation of probability densities of fractional diffusion. Although it has the same computational complexity and shares the same desirable features (consistency, conservation of mass, strictly non-negative solutions), it applies to virtually every conceivable Continuous Time Random Walk (CTRW) limit process, which we define broadly as the limit of a sequence of jump processes with renewals at every jump. Our only restrictive assumption is the boundedness and continuity of coefficients of the underlying Langevin proceesses.
We highlight three main novel use-cases: i) CTRWs with spatially varying waiting times, e.g. for interface problems between two differently anomalous media; ii) (varying) temporal drift, which limits the short-time speed of subdiffusive processes; and iii) the computation of probability densities for generalized inverse subordinators.
△ Less
Submitted 16 August, 2018;
originally announced August 2018.
-
A Semi-Markov Algorithm for Continuous Time Random Walk Limit Distributions
Authors:
G. Gill,
P. Straka
Abstract:
The Semi-Markov property of Continuous Time Random Walks (CTRWs) and their limit processes is utilized, and the probability distributions of the bivariate Markov process $(X(t),V(t))$ are calculated: $X(t)$ is a CTRW limit and $V(t)$ a process tracking the age, i.e. the time since the last jump. For a given CTRW limit process $X(t)$, a sequence of discrete CTRWs in discrete time is given which con…
▽ More
The Semi-Markov property of Continuous Time Random Walks (CTRWs) and their limit processes is utilized, and the probability distributions of the bivariate Markov process $(X(t),V(t))$ are calculated: $X(t)$ is a CTRW limit and $V(t)$ a process tracking the age, i.e. the time since the last jump. For a given CTRW limit process $X(t)$, a sequence of discrete CTRWs in discrete time is given which converges to $X(t)$ (weakly in the Skorokhod topology). Master equations for the discrete CTRWs are implemented numerically, thus approximating the distribution of $X(t)$. A consequence of the derived algorithm is that any distribution of initial age can be assumed as an initial condition for the CTRW limit dynamics. Four examples with different temporal scaling are discussed: subdiffusion, tempered subdiffusion, the fractal mobile/immobile model and the tempered fractal mobile/immobile model.
△ Less
Submitted 19 July, 2016; v1 submitted 10 March, 2016;
originally announced March 2016.
-
CheepSync: A Time Synchronization Service for Resource Constrained Bluetooth Low Energy Advertisers
Authors:
Sabarish Sridhar,
Prasant Misra,
Gurinder Singh Gill,
Jay Warrior
Abstract:
Clock synchronization is highly desirable in distributed systems, including many applications in the Internet of Things and Humans (IoTH). It improves the efficiency, modularity and scalability of the system, and optimizes use of event triggers. For IoTH, Bluetooth Low Energy (BLE) - a subset of the recent Bluetooth v4.0 stack - provides a low-power and loosely coupled mechanism for sensor data co…
▽ More
Clock synchronization is highly desirable in distributed systems, including many applications in the Internet of Things and Humans (IoTH). It improves the efficiency, modularity and scalability of the system, and optimizes use of event triggers. For IoTH, Bluetooth Low Energy (BLE) - a subset of the recent Bluetooth v4.0 stack - provides a low-power and loosely coupled mechanism for sensor data collection with ubiquitous units (e.g., smartphones and tablets) carried by humans. This fundamental design paradigm of BLE is enabled by a range of broadcast advertising modes. While its operational benefits are numerous, the lack of a common time reference in the broadcast mode of BLE has been a fundamental limitation. This paper presents and describes CheepSync: a time synchronization service for BLE advertisers, especially tailored for applications requiring high time precision on resource constrained BLE platforms. Designed on top of the existing Bluetooth v4.0 standard, the CheepSync framework utilizes low-level timestam** and comprehensive error compensation mechanisms for overcoming uncertainties in message transmission, clock drift and other system specific constraints. CheepSync was implemented on custom designed nRF24Cheep beacon platforms (as broadcasters) and commercial off-the-shelf Android ported smartphones (as passive listeners). We demonstrate the efficacy of CheepSync by numerous empirical evaluations in a variety of experimental setups, and show that its average (single-hop) time synchronization accuracy is in the 10us range.
△ Less
Submitted 8 April, 2016; v1 submitted 26 January, 2015;
originally announced January 2015.
-
Magnetic Small World Nanomaterials: Physical Small World Networks
Authors:
M. A. Novotny,
X. Zhang,
J. Yancey,
T. Dubreus,
M. L. Cook,
S. G. Gill,
I. T. Norwood,
A. M. Novotny
Abstract:
The question addressed is whether magnetic materials based on physical small world networks are possible. Physical constraints, such as uniform bond length and embedding in three dimensions, are the new features added to make small world networks physical. Results are presented to further determine if physical small world networks can exist, and the effect of the small world connections on the c…
▽ More
The question addressed is whether magnetic materials based on physical small world networks are possible. Physical constraints, such as uniform bond length and embedding in three dimensions, are the new features added to make small world networks physical. Results are presented to further determine if physical small world networks can exist, and the effect of the small world connections on the critical phenomena of Ising models on such networks. Spectra of the Laplacian on randomly-collapsed bead-chain networks are studied. The scaling function for the order parameter of an Ising model with physical small world connections is presented.
△ Less
Submitted 22 October, 2004;
originally announced October 2004.