-
Efficient Reconstruction of Free Breathing Under-Sampled Cardiac Cine MRI
Authors:
Abdul Haseeb Ahmed,
Ijaz M. Qureshi,
Jawad Ali Shah,
Hammad Omer
Abstract:
Respiratory motion can cause strong blurring artifacts in reconstructed MR images. These artifacts become more prominent when the data are undersampled. Compressed sensing (CS) was recently developed as an MR reconstruction technique that recovers good-quality images from compressive k-space samples. To maximize the benefits of CS for free-breathing data, it is sensible to apply CS to motion-corrected images. In this paper, we develop a new CS-based motion-corrected image reconstruction technique. In this two-stage technique, we first use a similarity measure to sort the motion-corrupted data into different respiratory states. We then use a new reconstruction algorithm that iteratively performs reconstruction and motion correction. The performance of the proposed method is evaluated qualitatively and quantitatively using simulated and clinical data. The results show that this method reconstructs respiratory motion-corrected cardiac cine images better than the CS-based reconstruction method.
Submitted 9 April, 2019;
originally announced April 2019.
-
Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness
Authors:
Yilun Zhou,
Steven Schockaert,
Julie A. Shah
Abstract:
In many applications, it is important to characterize the way in which two concepts are semantically related. Knowledge graphs such as ConceptNet provide a rich source of information for such characterizations by encoding relations between concepts as edges in a graph. When two concepts are not directly connected by an edge, their relationship can still be described in terms of the paths that connect them. Unfortunately, many of these paths are uninformative and noisy, which means that the success of applications that use such path features crucially relies on their ability to select high-quality paths. In existing applications, this path selection process is based on relatively simple heuristics. In this paper we instead propose to learn to predict path quality from crowdsourced human assessments. Since we are interested in a generic task-independent notion of quality, we simply ask human participants to rank paths according to their subjective assessment of the paths' naturalness, without attempting to define naturalness or steering the participants towards particular indicators of quality. We show that a neural network model trained on these assessments is able to predict human judgments on unseen paths with near optimal performance. Most notably, we find that the resulting path selection method is substantially better than the current heuristic approaches at identifying meaningful paths.
Submitted 20 February, 2019;
originally announced February 2019.
-
Experimental evidence of monolayer arsenene: An exotic two-dimensional semiconducting material
Authors:
J. Shah,
W. Wang,
H. M. Sohail,
R. I. G. Uhrberg
Abstract:
Group V element analogues of graphene have attracted a lot of attention recently due to their semiconducting band structures, which make them promising for next-generation electronic and optoelectronic devices based on two-dimensional materials. Theoretical investigations predict high electron mobility, large band gaps, band gap tuning by strain, formation of topological phases, quantum spin Hall effect at room temperature, and superconductivity, amongst others. Here, we report the successful formation of freestanding-like monolayer arsenene on Ag(111). This was concluded from our experimental atomic and electronic structure data by comparison with the results of our theoretical calculations. Arsenene forms a buckled honeycomb layer on Ag(111) with a lattice constant of 3.6 Å, showing an indirect band gap of about 1.4 eV as deduced from the position of the Fermi level pinning.
Submitted 14 December, 2018;
originally announced December 2018.
-
Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge
Authors:
Spyridon Bakas,
Mauricio Reyes,
Andras Jakab,
Stefan Bauer,
Markus Rempfler,
Alessandro Crimi,
Russell Takeshi Shinohara,
Christoph Berger,
Sung Min Ha,
Martin Rozycki,
Marcel Prastawa,
Esther Alberts,
Jana Lipkova,
John Freymann,
Justin Kirby,
Michel Bilello,
Hassan Fathallah-Shaykh,
Roland Wiest,
Jan Kirschke,
Benedikt Wiestler,
Rivka Colen,
Aikaterini Kotrotsou,
Pamela Lamontagne,
Daniel Marcus,
Mikhail Milchenko
, et al. (402 additional authors not shown)
Abstract:
Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles disseminated across multi-parametric magnetic resonance imaging (mpMRI) scans, reflecting varying biological properties. Their heterogeneous shape, extent, and location are some of the factors that make these tumors difficult to resect, and in some cases inoperable. The amount of resected tumor is a factor also considered in longitudinal scans, when evaluating the apparent tumor for potential diagnosis of progression. Furthermore, there is mounting evidence that accurate segmentation of the various tumor sub-regions can offer the basis for quantitative image analysis towards prediction of patient overall survival. This study assesses the state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans, during the last seven instances of the International Brain Tumor Segmentation (BraTS) challenge, i.e., 2012-2018. Specifically, we focus on i) evaluating segmentations of the various glioma sub-regions in pre-operative mpMRI scans, ii) assessing potential tumor progression by virtue of longitudinal growth of tumor sub-regions, beyond use of the RECIST/RANO criteria, and iii) predicting the overall survival from pre-operative mpMRI scans of patients that underwent gross total resection. Finally, we investigate the challenge of identifying the best ML algorithms for each of these tasks, considering that apart from being diverse on each instance of the challenge, the multi-institutional mpMRI BraTS dataset has also been a continuously evolving/growing dataset.
Submitted 23 April, 2019; v1 submitted 5 November, 2018;
originally announced November 2018.
-
Pose consensus based on dual quaternion algebra with application to decentralized formation control of mobile manipulators
Authors:
Heitor J. Savino,
Luciano C. A. Pimenta,
Julie A. Shah,
Bruno V. Adorno
Abstract:
This paper presents a solution based on dual quaternion algebra to the general problem of pose (i.e., position and orientation) consensus for systems composed of multiple rigid-bodies. The dual quaternion algebra is used to model the agents' poses and also in the distributed control laws, making the proposed technique easily applicable to time-varying formation control of general robotic systems. The proposed pose consensus protocol has guaranteed convergence when the interaction among the agents is represented by directed graphs with directed spanning trees, which is a more general result when compared to the literature on formation control. In order to illustrate the proposed pose consensus protocol and its extension to the problem of formation control, we present a numerical simulation with a large number of free-flying agents and also an application of cooperative manipulation by using real mobile manipulators.
Submitted 14 June, 2019; v1 submitted 20 October, 2018;
originally announced October 2018.
-
Parametrized higher category theory
Authors:
Jay Shah
Abstract:
We develop foundations for the category theory of $\infty$-categories parametrized by a base $\infty$-category. Our main contribution is a theory of indexed homotopy limits and colimits, which specializes to a theory of $G$-colimits for $G$ a finite group when the base is chosen to be the orbit category of $G$. We apply this theory to show that the $G$-$\infty$-category of $G$-spaces is freely generated under $G$-colimits by the contractible $G$-space, thereby affirming a conjecture of Mike Hill.
Submitted 6 January, 2022; v1 submitted 16 September, 2018;
originally announced September 2018.
-
Multi-Source Domain Adaptation with Mixture of Experts
Authors:
Jiang Guo,
Darsh J Shah,
Regina Barzilay
Abstract:
We propose a mixture-of-experts approach for unsupervised domain adaptation from multiple sources. The key idea is to explicitly capture the relationship between a target example and different source domains. This relationship, expressed by a point-to-set metric, determines how to combine predictors trained on various domains. The metric is learned in an unsupervised fashion using meta-training. Experimental results on sentiment analysis and part-of-speech tagging demonstrate that our approach consistently outperforms multiple baselines and can robustly handle negative transfer.
Submitted 16 October, 2018; v1 submitted 6 September, 2018;
originally announced September 2018.
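Editorial illustration: the abstract above says a point-to-set metric determines how predictors trained on different source domains are combined. The sketch below shows one way such a combination could look. It is not the authors' method: the metric here is a plain Euclidean distance to each domain's centroid (an assumed stand-in for the learned metric), and the names `point_to_set_distance`, `expert_weights`, and `combine_predictions` are hypothetical.

```python
import math

def point_to_set_distance(x, domain_examples):
    """Distance from x to a source domain.

    Assumption: Euclidean distance to the domain centroid stands in
    for the learned point-to-set metric described in the abstract.
    """
    dim = len(x)
    count = len(domain_examples)
    centroid = [sum(e[d] for e in domain_examples) / count for d in range(dim)]
    return math.sqrt(sum((xi - ci) ** 2 for xi, ci in zip(x, centroid)))

def expert_weights(x, domains):
    """Softmax over negative distances: closer domains get larger weights."""
    scores = [-point_to_set_distance(x, examples) for examples in domains]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def combine_predictions(x, domains, experts):
    """Weighted average of the per-domain experts' predictions for x."""
    weights = expert_weights(x, domains)
    return sum(w * expert(x) for w, expert in zip(weights, experts))

# Two toy source domains and one constant "expert" per domain (hypothetical data).
domains = [
    [(0.0, 0.0), (1.0, 0.0)],
    [(10.0, 10.0), (11.0, 10.0)],
]
experts = [lambda x: 0.0, lambda x: 1.0]

target = (0.5, 0.2)
weights = expert_weights(target, domains)
prediction = combine_predictions(target, domains, experts)
```

Because the target point lies close to the first domain, that domain's expert dominates the combined prediction. A learned metric would replace the centroid distance but leave the weighting step unchanged.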
-
Adversarial Domain Adaptation for Duplicate Question Detection
Authors:
Darsh J Shah,
Tao Lei,
Alessandro Moschitti,
Salvatore Romeo,
Preslav Nakov
Abstract:
We address the problem of detecting duplicate questions in forums, which is an important step towards automating the process of answering new questions. As finding and annotating such potential duplicates manually is very tedious and costly, automatic methods based on machine learning are a viable alternative. However, many forums do not have annotated data, i.e., questions labeled by experts as duplicates, and thus a promising solution is to use domain adaptation from another forum that has such annotations. Here we focus on adversarial domain adaptation, deriving important findings about when it performs well and what properties of the domains are important in this regard. Our experiments with StackExchange data show an average improvement of 5.6% over the best baseline across multiple pairs of domains.
Submitted 6 September, 2018;
originally announced September 2018.
-
Combining Many-objective Radiomics and 3-dimensional Convolutional Neural Network through Evidential Reasoning to Predict Lymph Node Metastasis in Head and Neck Cancer
Authors:
Liyuan Chen,
Zhiguo Zhou,
David Sher,
Qiongwen Zhang,
Jennifer Shah,
Nhat-Long Pham,
Steve Jiang,
**g Wang
Abstract:
Lymph node metastasis (LNM) is a significant prognostic factor in patients with head and neck cancer, and the ability to predict it accurately is essential for treatment optimization. PET and CT imaging are routinely used for LNM identification. However, uncertainties in LNM always exist, especially for small or reactive nodes. Radiomics and deep learning are the two preferred imaging-based strategies for node malignancy prediction. Radiomics models are built on handcrafted features, whereas deep learning can learn the features automatically. We proposed a hybrid predictive model that combines many-objective radiomics (MO-radiomics) and a 3-dimensional convolutional neural network (3D-CNN) through an evidential reasoning (ER) approach. To build a more reliable model, we proposed a new many-objective radiomics model. Meanwhile, we designed a 3D-CNN that fully utilizes spatial contextual information. Finally, the outputs were fused through the ER approach. To study the predictability of the two modalities, three models were built for PET, CT, and PET&CT. The results showed that the model performed best when the two modalities were combined. Moreover, we showed that the quantitative results obtained from the hybrid model were better than those obtained from MO-radiomics and 3D-CNN.
Submitted 5 September, 2018;
originally announced September 2018.
-
Real Time Elbow Angle Estimation Using Single RGB Camera
Authors:
Muhammad Yahya,
Jawad Ali Shah,
Arif Warsi,
Kushsairy Kadir,
Sheroz Khan,
M Izani
Abstract:
The use of motion capture has increased over the last decade across a varied spectrum of applications such as film special effects, game and robot control, rehabilitation systems, and animation. Current human motion capture techniques use markers, structured environments, and high-resolution cameras in a dedicated environment. Because of rapid movement, elbow angle estimation is regarded as the most difficult problem in human motion capture systems. In this paper, we take elbow angle estimation as our research subject and propose a novel, markerless, and cost-effective solution that uses an RGB camera to estimate the elbow angle in real time using part affinity fields. We recruited five participants to perform a cup-to-mouth movement while the angle was measured simultaneously by both the RGB camera and Microsoft Kinect. The experimental results show that the markerless and cost-effective RGB camera has median RMS errors of 3.06° and 0.95° in the sagittal and coronal planes, respectively, compared to Microsoft Kinect.
Submitted 21 August, 2018;
originally announced August 2018.
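Editorial illustration: once a pose estimator has located the shoulder, elbow, and wrist in an image, the elbow angle follows from the two limb vectors that meet at the elbow. The sketch below shows that geometric step only. It assumes 2D pixel coordinates for the three keypoints are already available, and the function name `joint_angle_deg` is hypothetical. It does not reproduce the paper's keypoint detection or its projection onto the sagittal and coronal planes.

```python
import math

def joint_angle_deg(shoulder, elbow, wrist):
    """Angle at the elbow, in degrees, between the upper arm and the forearm.

    Each argument is an (x, y) keypoint; 180 degrees means a straight arm.
    """
    # Vectors pointing from the elbow to the two neighbouring joints.
    ux, uy = shoulder[0] - elbow[0], shoulder[1] - elbow[1]
    fx, fy = wrist[0] - elbow[0], wrist[1] - elbow[1]

    norm_u = math.hypot(ux, uy)
    norm_f = math.hypot(fx, fy)
    if norm_u == 0 or norm_f == 0:
        raise ValueError("keypoints coincide; angle undefined")

    cos_theta = (ux * fx + uy * fy) / (norm_u * norm_f)
    # Clamp against rounding error before taking the arc cosine.
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))

right_angle = joint_angle_deg((0, 1), (0, 0), (1, 0))
straight_arm = joint_angle_deg((0, 1), (0, 0), (0, -1))
```

Comparing such per-frame angles against a reference sensor over a whole movement is what an RMS error, as reported in the abstract, would summarize.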
-
Algorithmic canonical stratifications of simplicial complexes
Authors:
Ryo Asai,
Jay Shah
Abstract:
We introduce a new algorithm for the structural analysis of finite abstract simplicial complexes based on local homology. Through an iterative and top-down procedure, our algorithm computes a stratification $\pi$ of the poset $P$ of simplices of a simplicial complex $K$, such that for each stratum $P_{\pi=i} \subset P$, $P_{\pi=i}$ is maximal among all open subposets $U \subset \overline{P_{\pi=i}}$ in its closure such that the restriction of the local $\mathbb{Z}$-homology sheaf of $\overline{P_{\pi=i}}$ to $U$ is locally constant. Passage to the localization of $P$ dictated by $\pi$ then attaches a canonical stratified homotopy type to $K$.
Using $\infty$-categorical methods, we first prove that the proposed algorithm correctly computes the canonical stratification of a simplicial complex; along the way, we prove a few general results about sheaves on posets and the homotopy types of links that may be of independent interest. We then present a pseudocode implementation of the algorithm, with special focus given to the case of dimension $\leq 3$, and show that it runs in polynomial time. In particular, an $n$-dimensional simplicial complex with size $s$ and $n\leq3$ can be processed in O($s^2$) time or O($s$) given one further assumption on the structure. Processing Delaunay triangulations of $2$-spheres and $3$-balls provides experimental confirmation of this linear running time.
Submitted 18 January, 2022; v1 submitted 20 August, 2018;
originally announced August 2018.
-
MoCoNet: Motion Correction in 3D MPRAGE images using a Convolutional Neural Network approach
Authors:
Kamlesh Pawar,
Zhaolin Chen,
N. Jon Shah,
Gary F. Egan
Abstract:
Purpose: The suppression of motion artefacts from MR images is a challenging task. The purpose of this paper is to develop a standalone novel technique to suppress motion artefacts from MR images using a data-driven deep learning approach. Methods: A deep learning convolutional neural network (CNN) was developed to remove motion artefacts in brain MR images. A CNN was trained on simulated motion corrupted images to identify and suppress artefacts due to the motion. The network was an encoder-decoder CNN architecture where the encoder decomposed the motion corrupted images into a set of feature maps. The feature maps were then combined by the decoder network to generate a motion-corrected image. The network was tested on an unseen simulated dataset and an experimental, motion corrupted in vivo brain dataset. Results: The trained network was able to suppress the motion artefacts in the simulated motion corrupted images, and the mean percentage error in the motion corrected images was 2.69 % with a standard deviation of 0.95 %. The network was able to effectively suppress the motion artefacts from the experimental dataset, demonstrating the generalisation capability of the trained network. Conclusion: A novel and generic motion correction technique has been developed that can suppress motion artefacts from motion corrupted MR images. The proposed technique is a standalone post-processing method that does not interfere with data acquisition or reconstruction parameters, thus making it suitable for a multitude of MR sequences.
Submitted 29 July, 2018;
originally announced July 2018.
-
Multifactorial cancer treatment outcome prediction through multifaceted radiomics
Authors:
Zhiguo Zhou,
David Sher,
Qiongwen Zhang,
**kun Yan,
Jennifer Shah,
Nhat-Long Pham,
Michael Folkert,
Steve Jiang,
**g Wang
Abstract:
Accurately predicting treatment outcome plays an important role in tailoring and adapting treatment planning in cancer therapy. Although the development of different modalities and personalized medicine can greatly improve the accuracy of outcome prediction, they also bring three simultaneous challenges: multi-modality, multi-classifier, and multi-criteria, which are summarized as multifactorial outcome prediction (MFOP) in this paper. Compared with traditional outcome prediction, MFOP is a more generalized problem. To handle this novel problem, building on recently proposed radiomics, we propose a new unified framework termed multifaceted radiomics (M-radiomics). M-radiomics first trains multiple modality-specific classifiers and then optimally combines the outputs of the different classifiers, which are trained according to multiple criteria such as sensitivity and specificity. It brings multi-modality, multi-classifier, and multi-criteria into a unified framework, which makes the prediction more accurate. Furthermore, to obtain more reliable predictive performance, that is, to maximize the similarity between the predicted output and the labelled vector, a new validation-set-based reliable fusion (VRF) strategy and reliable optimization models, as well as a new recursive two-stage hybrid optimization algorithm (RTSH), were also developed. Two clinical problems, predicting distant metastasis and locoregional recurrence in head & neck cancer, were investigated to validate the performance and reliability of the proposed M-radiomics. Using the proposed RF strategy and RTSH optimization algorithm, the experimental results demonstrated that M-radiomics performed better than current radiomic models that rely on a single objective, modality, or classifier.
Submitted 24 July, 2018;
originally announced July 2018.
-
Efficient Graph Compression Using Huffman Coding Based Techniques
Authors:
Rushabh Jitendrakumar Shah
Abstract:
Graphs have been extensively used to represent data from various domains. In the era of Big Data, information is being generated at a fast pace, and analyzing it is a challenge. Various methods have been proposed to speed up the analysis of the data and to mine it for information. All of this often involves using a massive array of compute nodes and transmitting the data over the network. With the huge quantity of data, this poses a major obstacle to gathering intelligence from the data. Therefore, to address such issues with Big Data, using data compression techniques is a viable option. Since graphs represent most real-world data, methods to compress graphs have been at the forefront of such endeavors. In this paper we propose techniques to compress graphs by finding specific patterns and replacing them with identifiers of variable length, an idea inspired by Huffman coding. Specifically, given a graph G = (V, E), where V is the set of vertices and E is the set of edges, and |V| = n, we propose methods to reduce the space requirements of the graph by compressing its adjacency representation. The proposed methods show up to 80% reduction in the space required to store the graphs compared to using the adjacency matrix. The methods can also be applied to other representations. The proposed techniques help solve the issues related to computing on graphs on resource-limited compute nodes, and reduce the latency of data transfer over the network in distributed computing.
Submitted 15 June, 2018;
originally announced June 2018.
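Editorial illustration: the abstract above describes replacing patterns in the adjacency representation with variable-length identifiers, in the spirit of Huffman coding. The sketch below is a minimal version of that idea under stated assumptions: the patterns are fixed-width chunks of each adjacency-matrix row (the abstract does not say how patterns are chosen), and `huffman_codes` and `row_patterns` are hypothetical names. The toy graph and the resulting bit count say nothing about the paper's reported results.

```python
import heapq
from collections import Counter

def huffman_codes(symbols):
    """Return {symbol: bitstring}; frequent symbols get shorter codes."""
    freq = Counter(symbols)
    if not freq:
        return {}
    if len(freq) == 1:
        return {next(iter(freq)): "0"}
    # Heap entries are (weight, tie_breaker, {symbol: code_so_far}).
    # The unique tie_breaker keeps Python from ever comparing the dicts.
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]

def row_patterns(adjacency, width):
    """Split each adjacency-matrix row into fixed-width bit patterns."""
    for row in adjacency:
        for start in range(0, len(row), width):
            yield tuple(row[start:start + width])

# Toy undirected graph on 8 vertices with edges 0-1, 2-3, 4-5, 6-7.
adjacency = [
    [0, 1, 0, 0, 0, 0, 0, 0],
    [1, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0, 0, 0],
    [0, 0, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 1],
    [0, 0, 0, 0, 0, 0, 1, 0],
]

patterns = list(row_patterns(adjacency, width=4))
codes = huffman_codes(patterns)
encoded_bits = "".join(codes[p] for p in patterns)
raw_bits = sum(len(row) for row in adjacency)
```

In this sparse example the all-zero chunk is by far the most common pattern, so it receives the shortest code and the encoded payload is smaller than the raw matrix. A complete scheme would also have to store the code table, which this sketch leaves out of the size comparison.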
-
Graph Compression Using Pattern Matching Techniques
Authors:
Rushabh Jitendrakumar Shah
Abstract:
Graphs can be used to represent a wide variety of data belonging to different domains. Graphs can capture the relationship among data in an efficient way, and have been widely used. In recent times, with the advent of Big Data, there has been a need to store and compute on large data sets efficiently. However, considering the size of the data sets in question, finding optimal methods to store and process the data has been a challenge. Therefore, in this paper, we study different graph compression techniques and propose novel algorithms to do the same. Specifically, given a graph G = (V, E), where V is the set of vertices and E is the set of edges, and |V| = n, we propose techniques to compress the adjacency matrix representation of the graph. Our algorithms are based on finding patterns within the adjacency matrix data, and replacing the common patterns with specific markers. All the techniques proposed here are lossless compression of graphs. Based on the experimental results, it is observed that our proposed techniques achieve almost 70% compression as compared to adjacency matrix representation. The results show that large graphs can be efficiently stored in smaller memory and exploit the parallel processing power of compute nodes as well as efficiently transfer data between resources.
Submitted 5 June, 2018;
originally announced June 2018.
-
Detection of Paroxysmal Atrial Fibrillation using Attention-based Bidirectional Recurrent Neural Networks
Authors:
Supreeth P. Shashikumar,
Amit J. Shah,
Gari D. Clifford,
Shamim Nemati
Abstract:
Detection of atrial fibrillation (AF), a type of cardiac arrhythmia, is difficult since many cases of AF are clinically silent and undiagnosed. In particular, paroxysmal AF is a form of AF that occurs occasionally and has a higher probability of going undetected. In this work, we present an attention-based deep learning framework for detection of paroxysmal AF episodes from a sequence of windows. Time-frequency representations of 30-second recording windows, over a 10-minute data segment, are fed sequentially into a deep convolutional neural network for image-based feature extraction; the extracted features are then presented to a bidirectional recurrent neural network with an attention layer for AF detection. To demonstrate the effectiveness of the proposed framework for transient AF detection, we use a database of 24-hour Holter electrocardiogram (ECG) recordings acquired from 2850 patients at the University of Virginia heart station. The algorithm achieves an AUC of 0.94 on the testing set, which exceeds the performance of baseline models. We also demonstrate the cross-domain generalizability of the approach by adapting the learned model parameters from one recording modality (ECG) to another (photoplethysmogram) with improved AF detection performance. The proposed high-accuracy, low-false-alarm algorithm for detecting paroxysmal AF has potential applications in long-term monitoring using wearable sensors.
Submitted 7 May, 2018;
originally announced May 2018.
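Editorial illustration: an attention layer over a sequence of windows can be read as a learned weighting that lets a few informative windows dominate the segment-level summary. The sketch below shows a generic softmax attention pooling step over per-window feature vectors. It is not the paper's architecture: the CNN and bidirectional RNN are omitted, the feature values and the `scoring_vector` are invented for illustration, and `attention_pool` is a hypothetical name. Twenty windows are used because 30-second windows over a 10-minute segment give 20 windows.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_pool(window_features, scoring_vector):
    """Weight each window's features by a softmax attention score and sum.

    Assumption: the score is a dot product with a fixed vector; in a
    trained network this vector would be learned.
    """
    scores = [
        sum(f * v for f, v in zip(feat, scoring_vector))
        for feat in window_features
    ]
    weights = softmax(scores)
    dim = len(window_features[0])
    pooled = [
        sum(w * feat[d] for w, feat in zip(weights, window_features))
        for d in range(dim)
    ]
    return pooled, weights

# 20 windows with unremarkable features, plus one hypothetical outlier window.
windows = [[0.1, 0.0] for _ in range(20)]
windows[7] = [2.0, 1.0]

pooled, weights = attention_pool(windows, scoring_vector=[1.5, 0.5])
```

With these made-up numbers the single outlier window receives more than half of the total attention weight, which is the behaviour that makes attention attractive for events that occur only briefly within a long recording.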
-
Discovering Blind Spots in Reinforcement Learning
Authors:
Ramya Ramakrishnan,
Ece Kamar,
Debadeepta Dey,
Julie Shah,
Eric Horvitz
Abstract:
Agents trained in simulation may make errors in the real world due to mismatches between training and execution environments. These mistakes can be dangerous and difficult to discover because the agent cannot predict them a priori. We propose using oracle feedback to learn a predictive model of these blind spots to reduce costly errors in real-world applications. We focus on blind spots in reinforcement learning (RL) that occur due to incomplete state representation: The agent does not have the appropriate features to represent the true state of the world and thus cannot distinguish among numerous states. We formalize the problem of discovering blind spots in RL as a noisy supervised learning problem with class imbalance. We learn models to predict blind spots in unseen regions of the state space by combining techniques for label aggregation, calibration, and supervised learning. The models take into consideration noise emerging from different forms of oracle feedback, including demonstrations and corrections. We evaluate our approach on two domains and show that it achieves higher predictive performance than baseline methods, and that the learned model can be used to selectively query an oracle at execution time to prevent errors. We also empirically analyze the biases of various feedback types and how they influence the discovery of blind spots.
Submitted 23 May, 2018;
originally announced May 2018.
-
Predicting Lymph Node Metastasis in Head and Neck Cancer by Combining Many-objective Radiomics and 3-dimensional Convolutional Neural Network through Evidential Reasoning
Authors:
Zhiguo Zhou,
Liyuan Chen,
David Sher,
Qiongwen Zhang,
Jennifer Shah,
Nhat-Long Pham,
Steve Jiang,
Jing Wang
Abstract:
Lymph node metastasis (LNM) is a significant prognostic factor in patients with head and neck cancer, and the ability to predict it accurately is essential for treatment optimization. PET and CT imaging are routinely used for LNM identification. However, uncertainties of LNM always exist especially for small size or reactive nodes. Radiomics and deep learning are the two preferred imaging-based strategies for node malignancy prediction. Radiomics models are built based on handcrafted features, and deep learning can learn the features automatically. We proposed a hybrid predictive model that combines many-objective radiomics (MO-radiomics) and 3-dimensional convolutional neural network (3D-CNN) through evidential reasoning (ER) approach. To build a more reliable model, we proposed a new many-objective radiomics model. Meanwhile, we designed a 3D-CNN that fully utilizes spatial contextual information. Finally, the outputs were fused through the ER approach. To study the predictability of the two modalities, three models were built for PET, CT, and PET&CT. The results showed that the model performed best when the two modalities were combined. Moreover, we showed that the quantitative results obtained from the hybrid model were better than those obtained from MO-radiomics and 3D-CNN.
Submitted 17 May, 2018;
originally announced May 2018.
-
Integrating Hypertension Phenotype and Genotype with Hybrid Non-negative Matrix Factorization
Authors:
Yuan Luo,
Chengsheng Mao,
Yiben Yang,
Fei Wang,
Faraz S. Ahmad,
Donna Arnett,
Marguerite R. Irvin,
Sanjiv J. Shah
Abstract:
Hypertension is a heterogeneous syndrome in need of improved subtyping using phenotypic and genetic measurements so that patients in different subtypes share similar pathophysiologic mechanisms and respond more uniformly to targeted treatments. Existing machine learning approaches often face challenges in integrating phenotype and genotype information and presenting to clinicians an interpretable model. We aim to provide informed patient stratification by introducing Hybrid Non-negative Matrix Factorization (HNMF) on phenotype and genotype matrices. HNMF simultaneously approximates the phenotypic and genetic matrices using different appropriate loss functions, and generates patient subtypes, phenotypic groups and genetic groups. Unlike previous methods, HNMF approximates the phenotypic matrix under Frobenius loss, and the genetic matrix under Kullback-Leibler (KL) loss. We propose an alternating projected gradient method to solve the approximation problem. Simulation shows HNMF converges fast and accurately to the true factor matrices. On a real-world clinical dataset, we used the patient factor matrix as features to predict main cardiac mechanistic outcomes. We compared HNMF with six different models using phenotype or genotype features alone, with or without NMF, or using joint NMF with only one type of loss. HNMF significantly outperforms all comparison models. HNMF also reveals intuitive phenotype-genotype interactions that characterize cardiac abnormalities.
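As a rough illustration of the hybrid objective described in this abstract, the Python sketch below evaluates a Frobenius loss on a phenotype matrix plus a generalized KL loss on a genotype matrix, with both approximations sharing one patient factor matrix. The shared-factor layout, the names (`hybrid_objective`, `u`, `v_pheno`, `v_geno`) and the `weight` trade-off parameter are assumptions made for illustration; the abstract does not specify them, and this is not the authors' alternating projected gradient solver.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def frobenius_loss(x, approx):
    """Sum of squared element-wise differences."""
    return sum((xv - av) ** 2
               for xr, ar in zip(x, approx) for xv, av in zip(xr, ar))

def kl_loss(x, approx, eps=1e-12):
    """Generalized KL divergence between non-negative matrices."""
    total = 0.0
    for xr, ar in zip(x, approx):
        for xv, av in zip(xr, ar):
            av = max(av, eps)  # guard against log of / division by zero
            if xv > 0:
                total += xv * math.log(xv / av) - xv + av
            else:
                total += av  # limit of the KL term as xv -> 0
    return total

def hybrid_objective(x_pheno, x_geno, u, v_pheno, v_geno, weight=1.0):
    """Frobenius loss on phenotypes plus weighted KL loss on genotypes.

    `u` (patients x groups) is shared by both approximations; this
    sharing and `weight` are illustrative assumptions.
    """
    return (frobenius_loss(x_pheno, matmul(u, v_pheno))
            + weight * kl_loss(x_geno, matmul(u, v_geno)))

# Toy data built from known factors, so the exact factors give zero loss.
u = [[1.0, 0.0], [0.0, 1.0]]
v_pheno = [[1.0, 2.0], [3.0, 4.0]]
v_geno = [[1.0, 2.0], [2.0, 1.0]]
x_pheno = matmul(u, v_pheno)
x_geno = matmul(u, v_geno)

exact = hybrid_objective(x_pheno, x_geno, u, v_pheno, v_geno)
perturbed = hybrid_objective(x_pheno, x_geno, u,
                             [[1.0, 2.0], [3.0, 5.0]], v_geno)
```

Using a different loss per matrix lets each data type be scored by a measure suited to its distribution, which is the design choice the abstract highlights.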
Submitted 18 May, 2018; v1 submitted 14 May, 2018;
originally announced May 2018.
-
Human-Machine Collaborative Optimization via Apprenticeship Scheduling
Authors:
Matthew Gombolay,
Reed Jensen,
Jessica Stigile,
Toni Golen,
Neel Shah,
Sung-Hyun Son,
Julie Shah
Abstract:
Coordinating agents to complete a set of tasks with intercoupled temporal and resource constraints is computationally challenging, yet human domain experts can solve these difficult scheduling problems using paradigms learned through years of apprenticeship. A process for manually codifying this domain knowledge within a computational framework is necessary to scale beyond the "single-expert, single-trainee" apprenticeship model. However, human domain experts often have difficulty describing their decision-making processes, causing the codification of this knowledge to become laborious. We propose a new approach for capturing domain-expert heuristics through a pairwise ranking formulation. Our approach is model-free and does not require enumerating or iterating through a large state space. We empirically demonstrate that this approach accurately learns multifaceted heuristics on a synthetic data set incorporating job-shop scheduling and vehicle routing problems, as well as on two real-world data sets consisting of demonstrations of experts solving a weapon-to-target assignment problem and a hospital resource allocation problem. We also demonstrate that policies learned from human scheduling demonstration via apprenticeship learning can substantially improve the efficiency of a branch-and-bound search for an optimal schedule. We employ this human-machine collaborative optimization technique on a variant of the weapon-to-target assignment problem. We demonstrate that this technique generates solutions substantially superior to those produced by human domain experts at a rate up to 9.5 times faster than an optimization approach and can be applied to optimally solve problems twice as complex as those solved by a human demonstrator.
Submitted 10 May, 2018;
originally announced May 2018.
-
Bridgeout: stochastic bridge regularization for deep neural networks
Authors:
Najeeb Khan,
Jawad Shah,
Ian Stavness
Abstract:
A major challenge in training deep neural networks is overfitting, i.e. inferior performance on unseen test examples compared to performance on training examples. To reduce overfitting, stochastic regularization methods have shown superior performance compared to deterministic weight penalties on a number of image recognition tasks. Stochastic methods such as Dropout and Shakeout, in expectation, are equivalent to imposing a ridge and elastic-net penalty on the model parameters, respectively. However, the choice of the norm of weight penalty is problem dependent and is not restricted to $\{L_1,L_2\}$. Therefore, in this paper we propose the Bridgeout stochastic regularization technique and prove that it is equivalent to an $L_q$ penalty on the weights, where the norm $q$ can be learned as a hyperparameter from data. Experimental results show that Bridgeout results in sparse model weights, improved gradients and superior classification performance compared to Dropout and Shakeout on synthetic and real datasets.
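To make the $L_q$ penalty mentioned in this abstract concrete, the short Python sketch below computes a deterministic bridge penalty $\lambda \sum_i |w_i|^q$ for a list of weights. It shows only the penalty that the abstract says Bridgeout is equivalent to, not the stochastic Bridgeout procedure itself; the function name `bridge_penalty` and the scale `lam` are assumptions introduced for illustration.

```python
def bridge_penalty(weights, q, lam=1.0):
    """Return lam * sum(|w| ** q) over the given weights.

    q = 1 gives the lasso (L1) penalty and q = 2 the ridge
    (squared L2) penalty; other positive q interpolate or extend them.
    """
    if q <= 0:
        raise ValueError("q must be positive")
    return lam * sum(abs(w) ** q for w in weights)

weights = [0.5, -2.0, 0.0, 1.0]
lasso = bridge_penalty(weights, q=1)  # sum of absolute values
ridge = bridge_penalty(weights, q=2)  # sum of squares
```

Treating `q` as a tunable value rather than fixing it to 1 or 2 is the flexibility the abstract argues for.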
Submitted 21 April, 2018;
originally announced April 2018.
-
A Novel Approach for Image Segmentation based on Histograms computed from Hue-data
Authors:
Viraj Mavani,
Ayesha Gurnani,
Jhanvi Shah
Abstract:
Computer Vision is growing day by day in terms of user-specific applications. The first step of any such application is segmenting an image. In this paper, we propose a novel, grass-roots-level image segmentation algorithm for cases in which the background has a uniform color distribution. This algorithm can be used for images of flowers, birds, insects and many more where such background conditions occur. With image segmentation, the visualization capability of a computer increases manifold, and it can even attain near-human accuracy during classification.
Submitted 30 July, 2017;
originally announced July 2017.
-
Mathematical Models of Adaptation in Human-Robot Collaboration
Authors:
Stefanos Nikolaidis,
Jodi Forlizzi,
David Hsu,
Julie Shah,
Siddhartha Srinivasa
Abstract:
A robot operating in isolation needs to reason over the uncertainty in its model of the world and adapt its own actions to account for this uncertainty. Similarly, a robot interacting with people needs to reason over its uncertainty over the human internal state, as well as over how this state may change, as humans adapt to the robot. This paper summarizes our own work in this area, which depicts the different ways that probabilistic planning and game-theoretic algorithms can enable such reasoning in robotic systems that collaborate with people. We start with a general formulation of the problem as a two-player game with incomplete information. We then articulate the different assumptions within this general formulation, and we explain how these lead to exciting and diverse robot behaviors in real-time interactions with actual human subjects, in a variety of manufacturing, personal robotics and assistive care settings.
Submitted 4 August, 2017; v1 submitted 9 July, 2017;
originally announced July 2017.
-
Design Activism for Minimum Wage Crowd Work
Authors:
Akash Mankar,
Riddhi J. Shah,
Matthew Lease
Abstract:
Entry-level crowd work is often reported to pay less than minimum wage. While this may be appropriate or even necessary, due to various legal, economic, and pragmatic factors, some Requesters and workers continue to question this status quo. To promote further discussion on the issue, we survey Requesters and workers on whether they would support restricting tasks to require minimum wage pay. As a form of design activism, we confronted workers with this dilemma directly by posting a dummy Mechanical Turk task which told them that they could not work on it because it paid less than their local minimum wage, and we invited their feedback. Strikingly, for those workers expressing an opinion, two-thirds of Indians favored the policy while two-thirds of Americans opposed it. Though a majority of Requesters supported minimum wage pay, only 20% would enforce it. To further empower Requesters, and to ensure that effort or ignorance are not barriers to change, we provide a simple public API to make it easy to find a worker's local minimum wage by his/her IP address.
Submitted 27 August, 2017; v1 submitted 30 June, 2017;
originally announced June 2017.
-
A Computer Vision Pipeline for Automated Determination of Cardiac Structure and Function and Detection of Disease by Two-Dimensional Echocardiography
Authors:
Jeffrey Zhang,
Sravani Gajjala,
Pulkit Agrawal,
Geoffrey H. Tison,
Laura A. Hallock,
Lauren Beussink-Nelson,
Eugene Fan,
Mandar A. Aras,
ChaRandle Jordan,
Kirsten E. Fleischmann,
Michelle Melisko,
Atif Qasim,
Alexei Efros,
Sanjiv J. Shah,
Ruzena Bajcsy,
Rahul C. Deo
Abstract:
Automated cardiac image interpretation has the potential to transform clinical practice in multiple ways including enabling low-cost serial assessment of cardiac function in the primary care and rural setting. We hypothesized that advances in computer vision could enable building a fully automated, scalable analysis pipeline for echocardiogram (echo) interpretation. Our approach entailed: 1) preprocessing; 2) convolutional neural networks (CNN) for view identification, image segmentation, and phasing of the cardiac cycle; 3) quantification of chamber volumes and left ventricular mass; 4) particle tracking to compute longitudinal strain; and 5) targeted disease detection. CNNs accurately identified views (e.g. 99% for apical 4-chamber) and segmented individual cardiac chambers. Cardiac structure measurements agreed with study report values (e.g. mean absolute deviations (MAD) of 7.7 mL/kg/m2 for left ventricular diastolic volume index, 2918 studies). We computed automated ejection fraction and longitudinal strain measurements (within 2 cohorts), which agreed with commercial software-derived values [for ejection fraction, MAD=5.3%, N=3101 studies; for strain, MAD=1.5% (n=197) and 1.6% (n=110)], and demonstrated applicability to serial monitoring of breast cancer patients for trastuzumab cardiotoxicity. Overall, we found that, compared to manual measurements, automated measurements had superior performance across seven internal consistency metrics with an average increase in the Spearman correlation coefficient of 0.05 (p=0.02). Finally, we developed disease detection algorithms for hypertrophic cardiomyopathy and cardiac amyloidosis, with C-statistics of 0.93 and 0.84, respectively. Our pipeline lays the groundwork for using automated interpretation to support point-of-care handheld cardiac ultrasound and large-scale analysis of the millions of echos archived within healthcare systems.
Submitted 12 January, 2018; v1 submitted 22 June, 2017;
originally announced June 2017.
-
Categorifying rationalization
Authors:
Clark Barwick,
Saul Glasman,
Marc Hoyois,
Denis Nardin,
Jay Shah
Abstract:
We solve a problem proposed by Khovanov by constructing, for any set of primes $S$, a triangulated category (in fact a stable $\infty$-category) whose Grothendieck group is $S^{-1}\mathbf{Z}$. More generally, for any exact $\infty$-category $E$, we construct an exact $\infty$-category $S^{-1}E$ of equivariant sheaves on the Cantor space with respect to an action of a dense subgroup of the circle. We show that this $\infty$-category is precisely the result of categorifying division by the primes in $S$. In particular, $K_n(S^{-1}E)\cong S^{-1}K_n(E)$.
Submitted 23 October, 2016;
originally announced October 2016.
-
Toward a Science of Autonomy for Physical Systems: Paths
Authors:
Pieter Abbeel,
Ken Goldberg,
Gregory Hager,
Julie Shah
Abstract:
An Autonomous Physical System (APS) will be expected to reliably and independently evaluate, execute, and achieve goals while respecting surrounding rules, laws, or conventions. In doing so, an APS must rely on a broad spectrum of dynamic, complex, and often imprecise information about its surroundings, the task it is to perform, and its own sensors and actuators. For example, cleaning in a home or commercial setting requires the ability to perceive, grasp, and manipulate many physical objects, the ability to reliably perform a variety of subtasks such as washing, folding, and stacking, and knowledge about local conventions such as how objects are classified and where they should be stored. The information required for reliable autonomous operation may come from external sources and from the robot's own sensor observations or in the form of direct instruction by a trainer. Similar considerations apply across many domains - construction, manufacturing, in-home assistance, and healthcare. For example, surgeons spend many years learning about physiology and anatomy before they touch a patient. They then perform roughly 1000 surgeries under the tutelage of an expert surgeon, and they practice basic maneuvers such as suture tying thousands of times outside the operating room. All of these elements come together to achieve expertise at this task. Endowing a system with robust autonomy by traditional programming methods has thus far had limited success. Several promising new paths to acquiring and processing such data are emerging. This white paper outlines three promising research directions for enabling an APS to learn the physical and information skills necessary to perform tasks with independence and flexibility: Deep Reinforcement Learning, Human-Robot Interaction, and Cloud Robotics.
Submitted 19 September, 2016;
originally announced September 2016.
-
An Information Extraction Approach to Prescreen Heart Failure Patients for Clinical Trials
Authors:
Abhishek Kalyan Adupa,
Ravi Prakash Garg,
Jessica Corona-Cox,
Sanjiv J. Shah,
Siddhartha R. Jonnalagadda
Abstract:
To reduce the large amount of time spent screening, identifying, and recruiting patients into clinical trials, we need prescreening systems that are able to automate the data extraction and decision-making tasks that are typically relegated to clinical research study coordinators. However, a major obstacle is the vast amount of patient data available as unstructured free-form text in electronic health records. Here we propose an information extraction-based approach that first automatically converts unstructured text into a structured form. The structured data are then compared against a list of eligibility criteria using a rule-based system to determine which patients qualify for enrollment in a heart failure clinical trial. We show that we can achieve highly accurate results, with recall and precision values of 0.95 and 0.86, respectively. Our system allowed us to significantly reduce the time needed for prescreening patients from a few weeks to a few minutes. Our open-source information extraction modules are available for researchers and could be tested and validated in other cardiovascular trials. An approach such as the one we demonstrate here may decrease costs and expedite clinical trials, and could enhance the reproducibility of trials across institutions and populations.
Submitted 6 September, 2016;
originally announced September 2016.
-
Parametrized higher category theory and higher algebra: Exposé I -- Elements of parametrized higher category theory
Authors:
Clark Barwick,
Emanuele Dotto,
Saul Glasman,
Denis Nardin,
Jay Shah
Abstract:
We introduce the basic elements of the theory of parametrized $\infty$-categories and functors between them. These notions are defined as suitable fibrations of $\infty$-categories and functors between them. We give as many examples as we are able at this stage. Simple operations, such as the formation of opposites and the formation of functor $\infty$-categories, become slightly more involved in the parametrized setting, but we explain precisely how to perform these constructions. All of these constructions can be performed explicitly, without resorting to such acts of desperation as straightening. The key results of this Exposé are: (1) a universal characterization of the $T$-$\infty$-category of $T$-objects in any $\infty$-category, (2) the existence of an internal Hom for $T$-$\infty$-categories, and (3) a parametrized Yoneda lemma.
Submitted 11 August, 2016;
originally announced August 2016.
-
Parametrized higher category theory and higher algebra: A general introduction
Authors:
Clark Barwick,
Emanuele Dotto,
Saul Glasman,
Denis Nardin,
Jay Shah
Abstract:
We introduce the study of parametrized higher category theory and parametrized higher algebra, and we describe the main theorems of the series of Exposés that make up the monograph.
Submitted 11 August, 2016;
originally announced August 2016.
-
Fibrations in $\infty$-category theory
Authors:
Clark Barwick,
Jay Shah
Abstract:
In this short expository note, we discuss, with plenty of examples, the bestiary of fibrations in quasicategory theory. We underscore the simplicity and clarity of the constructions these fibrations make available to end-users of higher category theory.
Submitted 11 August, 2016; v1 submitted 14 July, 2016;
originally announced July 2016.
-
Quality assessment of voice converted speech using articulatory features
Authors:
Avni Rajpal,
Nirmesh J. Shah,
Mohammadi Zaki,
Hemant A. Patil
Abstract:
We propose a novel application based on acoustic-to-articulatory inversion towards quality assessment of voice converted speech. The ability of humans to speak effortlessly requires coordinated movements of various articulators, muscles, etc. This effortless movement contributes towards naturalness, intelligibility and speaker identity, which are only partially present in voice converted speech. Hence, during voice conversion, the information related to speech production is lost. In this paper, this loss is quantified for a male voice by showing an increase in RMSE for voice converted speech, followed by a decrease in mutual information. Similar results are obtained for a female voice. This observation is extended by showing that articulatory features can be used as an objective measure. The effectiveness of the proposed measure over MCD is illustrated by comparing their correlation with the Mean Opinion Score.
Submitted 23 November, 2015; v1 submitted 16 November, 2015;
originally announced November 2015.
-
Spectral Mackey functors and equivariant algebraic K-theory (II)
Authors:
C. Barwick,
S. Glasman,
J. Shah
Abstract:
We study the "higher algebra" of spectral Mackey functors, which the first named author introduced in Part I of this paper. In particular, armed with our new theory of symmetric promonoidal $\infty$-categories and a suitable generalization of the second named author's Day convolution, we endow the $\infty$-category of Mackey functors with a well-behaved symmetric monoidal structure. This makes it possible to speak of spectral Green functors for any operad $O$. We also answer a question of A. Mathew, proving that the algebraic $K$-theory of group actions is lax symmetric monoidal. We also show that the algebraic $K$-theory of derived stacks provides an example. Finally, we give a very short, new proof of the equivariant Barratt-Priddy-Quillen theorem, which states that the algebraic $K$-theory of the category of finite $G$-sets is simply the $G$-equivariant sphere spectrum.
Submitted 28 March, 2016; v1 submitted 12 May, 2015;
originally announced May 2015.
-
The Bayesian Case Model: A Generative Approach for Case-Based Reasoning and Prototype Classification
Authors:
Been Kim,
Cynthia Rudin,
Julie Shah
Abstract:
We present the Bayesian Case Model (BCM), a general framework for Bayesian case-based reasoning (CBR) and prototype classification and clustering. BCM brings the intuitive power of CBR to a Bayesian generative framework. The BCM learns prototypes, the "quintessential" observations that best represent clusters in a dataset, by performing joint inference on cluster labels, prototypes and important features. Simultaneously, BCM pursues sparsity by learning subspaces, the sets of features that play important roles in the characterization of the prototypes. The prototype and subspace representation provides quantitative benefits in interpretability while preserving classification accuracy. Human subject experiments verify statistically significant improvements to participants' understanding when using explanations produced by BCM, compared to those given by prior art.
Submitted 3 March, 2015;
originally announced March 2015.
-
Robust diffusion imaging framework for clinical studies
Authors:
Ivan I. Maximov,
Farida Grinberg,
Irene Neuner,
N. Jon Shah
Abstract:
Clinical diffusion imaging requires short acquisition times and good image quality to permit its use in various medical applications. In turn, these demands require the development of a robust and efficient post-processing framework in order to guarantee useful and reliable results. However, multiple artefacts abound in in vivo measurements, arising either from the subject, such as cardiac pulsation, bulk head motion, respiratory motion and involuntary tics and tremor, or from imaging-hardware-related problems, such as table vibrations. These artefacts can severely degrade the resulting images and render diffusion analysis difficult or impossible. In order to overcome these problems, we developed a robust and efficient framework enabling the use of initially corrupted images from a clinical study. At the heart of this framework is an improved least trimmed squares diffusion tensor estimation algorithm that works well with severely degraded datasets with low signal-to-noise ratio. This approach has been compared with other diffusion imaging post-processing algorithms using simulations and in vivo experiments. Exploiting track-based spatial statistics analysis, we demonstrate that corrupted datasets can be restored and reused in further clinical studies rather than being discarded due to poor quality. The developed robust framework is shown to exhibit a high efficiency and accuracy and can, in principle, be exploited in other MR studies where artefact/outlier suppression is needed.
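The idea behind least trimmed squares, on which this abstract's estimator is based, can be sketched on a toy straight-line fit: repeatedly refit using only the points with the smallest squared residuals, so that a corrupted measurement stops influencing the estimate. The Python sketch below is a plain, textbook-style version under that assumption, applied to a line rather than a diffusion tensor; it is not the authors' improved algorithm, and the names `fit_line`, `least_trimmed_squares` and `keep` are illustrative.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

def least_trimmed_squares(xs, ys, keep, iterations=20):
    """Fit a line to the `keep` points with the smallest squared residuals.

    Starts from a fit on all points, then alternates between selecting
    the best-fitting subset and refitting until the subset is stable.
    """
    subset = list(range(len(xs)))
    slope, intercept = fit_line(xs, ys)
    for _ in range(iterations):
        order = sorted(
            range(len(xs)),
            key=lambda i: (ys[i] - (slope * xs[i] + intercept)) ** 2,
        )
        new_subset = sorted(order[:keep])
        if new_subset == subset:
            break  # the retained points no longer change
        subset = new_subset
        slope, intercept = fit_line([xs[i] for i in subset],
                                    [ys[i] for i in subset])
    return slope, intercept, subset

# Points on y = 2x + 1, with the last one corrupted (a stand-in for an artefact).
xs = [float(i) for i in range(10)]
ys = [2.0 * x + 1.0 for x in xs]
ys[9] = 100.0

ols_slope, ols_intercept = fit_line(xs, ys)
lts_slope, lts_intercept, kept = least_trimmed_squares(xs, ys, keep=8)
```

On this toy data the ordinary fit is pulled toward the corrupted point while the trimmed fit excludes it. Starting from a single all-points fit can settle on a poor subset in harder cases, so practical implementations typically try several starting subsets.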
Submitted 17 February, 2015;
originally announced February 2015.
-
Efficient Model Learning for Human-Robot Collaborative Tasks
Authors:
Stefanos Nikolaidis,
Keren Gu,
Ramya Ramakrishnan,
Julie Shah
Abstract:
We present a framework for learning human user models from joint-action demonstrations that enables the robot to compute a robust policy for a collaborative task with a human. The learning takes place completely automatically, without any human intervention. First, we describe the clustering of demonstrated action sequences into different human types using an unsupervised learning algorithm. These demonstrated sequences are also used by the robot to learn a reward function that is representative for each type, through the employment of an inverse reinforcement learning algorithm. The learned model is then used as part of a Mixed Observability Markov Decision Process formulation, wherein the human type is a partially observable variable. With this framework, we can infer, either offline or online, the human type of a new user that was not included in the training set, and can compute a policy for the robot that will be aligned to the preference of this new user and will be robust to deviations of the human actions from prior demonstrations. Finally we validate the approach using data collected in human subject experiments, and conduct proof-of-concept demonstrations in which a person performs a collaborative task with a small industrial robot.
Submitted 24 May, 2014;
originally announced May 2014.
-
Inferring Robot Task Plans from Human Team Meetings: A Generative Modeling Approach with Logic-Based Prior
Authors:
Been Kim,
Caleb M. Chacha,
Julie Shah
Abstract:
We aim to reduce the burden of programming and deploying autonomous systems to work in concert with people in time-critical domains, such as military field operations and disaster response. Deployment plans for these operations are frequently negotiated on-the-fly by teams of human planners. A human operator then translates the agreed upon plan into machine instructions for the robots. We present an algorithm that reduces this translation burden by inferring the final plan from a processed form of the human team's planning conversation. Our approach combines probabilistic generative modeling with logical plan validation used to compute a highly structured prior over possible plans. This hybrid approach enables us to overcome the challenge of performing inference over the large solution space with only a small amount of noisy data from the team planning session. We validate the algorithm through human subject experimentation and show we are able to infer a human team's final plan with 83% accuracy on average. We also describe a robot demonstration in which two people plan and execute a first-response collaborative task with a PR2 robot. To the best of our knowledge, this is the first work that integrates a logical planning technique within a generative model to perform plan inference.
Submitted 4 June, 2013;
originally announced June 2013.
-
Discrete Dynamical Modeling and Analysis of the R-S Flip-Flop Circuit
Authors:
Denis Blackmore,
Aminur Rahman,
Jigar Shah
Abstract:
A simple discrete planar dynamical model for the ideal (logical) R-S flip-flop circuit is developed with an eye toward mimicking the dynamical behavior observed for actual physical realizations of this circuit. It is shown that the model exhibits most of the qualitative features ascribed to the R-S flip-flop circuit, such as an intrinsic instability associated with unit set and reset inputs, manifested in a chaotic sequence of output states that tend to oscillate among all possible output states, and the existence of periodic orbits of arbitrarily high period that depend on the various intrinsic system parameters. The investigation involves a combination of analytical methods from the modern theory of discrete dynamical systems, and numerical simulations that illustrate the dazzling array of dynamics that can be generated by the model. Validation of the discrete model is accomplished by comparison with certain Poincaré map like representations of the dynamics corresponding to three-dimensional differential equation models of electrical circuits that produce R-S flip-flop behavior.
Submitted 3 June, 2013;
originally announced June 2013.
-
A New Guess-and-Determine Attack on the A5/1 Stream Cipher
Authors:
Jay Shah,
Ayan Mahalanobis
Abstract:
In Europe and North America, the most widely used stream cipher to ensure privacy and confidentiality of conversations in GSM mobile phones is the A5/1. In this paper, we present a new attack on the A5/1 stream cipher with an average time complexity of 2^(48.5), which is much less than the brute-force attack with a complexity of 2^(64). The attack has a 100% success rate and requires about 5.65 GB of storage. We provide a detailed description of our new attack along with its implementation and results.
Submitted 3 May, 2012; v1 submitted 20 April, 2012;
originally announced April 2012.
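To put the two complexity figures quoted in the A5/1 abstract above side by side, the following minimal sketch computes the ratio between brute-force work and the stated attack work. Only the exponents 64 and 48.5 come from the abstract; the function names and the `trials_per_second` parameter are hypothetical, introduced for illustration, and this is not the authors' implementation.

```python
# Exponents quoted in the abstract: the work is 2**exponent cipher trials.
BRUTE_FORCE_EXP = 64.0
ATTACK_EXP = 48.5


def speedup_factor(brute_exp: float, attack_exp: float) -> float:
    """Ratio of brute-force work to attack work, given log2 exponents."""
    return 2.0 ** (brute_exp - attack_exp)


def seconds_at_rate(exp: float, trials_per_second: float) -> float:
    """Wall-clock seconds for 2**exp trials at a hypothetical trial rate."""
    return 2.0 ** exp / trials_per_second


if __name__ == "__main__":
    factor = speedup_factor(BRUTE_FORCE_EXP, ATTACK_EXP)
    print(f"work ratio: 2^{BRUTE_FORCE_EXP - ATTACK_EXP} = {factor:,.0f}")
```

The ratio depends only on the difference of the exponents (15.5 bits), so it holds regardless of how fast a single trial runs.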
-
Performance Study on Image Encryption Schemes
Authors:
Jolly Shah,
Vikas Saxena
Abstract:
Image applications have been increasing in recent years. Encryption is used to provide the security needed for image applications. In this paper, we classify various image encryption schemes and analyze them with respect to various parameters like tunability, visual degradation, compression friendliness, format compliance, encryption ratio, speed, and cryptographic security.
Submitted 5 December, 2011;
originally announced December 2011.
-
Video Encryption: A Survey
Authors:
Jolly Shah,
Dr. Vikas Saxena
Abstract:
Multimedia data security is becoming important with the continuous increase of digital communications on the internet. The encryption algorithms developed to secure text data are not suitable for multimedia applications because of the large data size and real-time constraints. In this paper, a classification and description of various video encryption algorithms are presented. An analysis and comparison of these algorithms with respect to various parameters like visual degradation, encryption ratio, speed, compression friendliness, format compliance and cryptographic security are also presented.
Submitted 5 April, 2011;
originally announced April 2011.
-
High-Q/V air-mode photonic crystal cavities at microwave frequencies
Authors:
Yinan Zhang,
Irfan Bulu,
Wai-Ming Tam,
Ben Levitt,
Jagdish Shah,
Tancredi Botto,
Marko Loncar
Abstract:
We present results for a photonic microwave resonator designed and fabricated at 17.4 GHz with a record high quality factor (Q = 26,400) at room temperature over a mode volume smaller than one cubic wavelength. The cavity is uniquely designed to have its electric field concentrated in air, which allows for efficient coupling to free space and facilitates interactions with gaseous atomic systems and fluids.
Submitted 28 January, 2011;
originally announced January 2011.
-
Water coordination structures and the excess free energy of the liquid
Authors:
Safir Merchant,
Jindal K. Shah,
D. Asthagiri
Abstract:
For a distinguished water molecule, the solute water, we assess the contribution of each coordination state to its excess chemical potential, using a molecular aufbau approach. In this approach, we define a coordination sphere, the inner-shell, and separate the excess chemical potential into packing, outer-shell, and local chemical contributions; the coordination state is defined by the number of solvent water molecules within the coordination sphere. The packing term accounts for the free energy of creating a solute-free coordination sphere in the liquid. The outer-shell term accounts for the interaction of the solute with the fluid outside the coordination sphere and it is accurately described by a Gaussian model of hydration for coordination radii greater than the minimum of the oxygen-oxygen pair correlation function. Consistent with the conventional radial cut-off used for defining hydrogen-bonds in liquid water, theory helps identify a chemically meaningful coordination radius. The local chemical contribution is recast as a sum over coordination states. The n-th term in this sum is given by the probability of observing n water molecules inside the coordination sphere in the absence of the solute water times a factor accounting for the interaction of the solute with the inner-shell solvent water molecules. Using this molecular aufbau expansion, we monitor the change in the chemical contribution due to the incremental increase in n. We find that though four water molecules are needed to fully account for the chemical term, the first added water accounts for nearly half the chemical term. Our results emphasize the need to acknowledge the intrinsic occupancy of a solute-free coordination sphere together with solute-solvent interactions in rationalizing the tetrahedral coordination of the solute water.
Submitted 5 January, 2011;
originally announced January 2011.
-
Comparative study of the Pros and Cons of Programming languages Java, Scala, C++, Haskell, VB .NET, AspectJ, Perl, Ruby, PHP & Scheme - a Team 11 COMP6411-S10 Term Report
Authors:
Venkatreddy Dwarampudi,
Shahbaz Singh Dhillon,
Jivitesh Shah,
Nikhil Joseph Sebastian,
Nitin Kanigicharla
Abstract:
With the advent of numerous languages it is difficult to realize the edge of one language in a particular scope over another one. We are making an effort, realizing these few issues and comparing some mainstream languages like Java, Scala, C++, Haskell, VB .NET, AspectJ, Perl, Ruby, PHP and Scheme, keeping in mind some core issues in program development.
Submitted 19 August, 2010;
originally announced August 2010.
-
C Implementation & comparison of companding & silence audio compression techniques
Authors:
Kruti Dangarwala,
Jigar Shah
Abstract:
Just about all the newest living room audio-video electronics and PC multimedia products being designed today will incorporate some form of compressed digitized-audio processing capability. Audio compression reduces the bit rate required to represent an analog audio signal while maintaining the perceived audio quality. Discarding inaudible data reduces the storage, transmission and compute requirements of handling high-quality audio files. This paper covers the wave audio file format and the algorithms of the silence compression and companding methods used to compress and decompress wave audio files. It then compares the results of the two methods.
Submitted 19 April, 2010;
originally announced April 2010.
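As a rough illustration of the companding idea named in the abstract above, the sketch below applies the standard mu-law curve to samples normalized to [-1, 1] and quantizes the result to 8 bits. The abstract does not say which companding law or parameters the paper uses, so the choice of mu-law with mu = 255, the 8-bit code width, and all function names are assumptions; the paper's own implementation is in C and may differ.

```python
import math

MU = 255.0  # assumption: mu-law with mu = 255; the abstract names no law


def compress(x: float, mu: float = MU) -> float:
    """Map a sample in [-1, 1] through the mu-law compression curve."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)


def expand(y: float, mu: float = MU) -> float:
    """Inverse of compress(): recover the linear sample from the curve."""
    return math.copysign(math.expm1(abs(y) * math.log1p(mu)) / mu, y)


def encode_8bit(x: float) -> int:
    """Compress, then quantize uniformly to an integer code in 0..255."""
    return round((compress(x) + 1.0) / 2.0 * 255)


def decode_8bit(code: int) -> float:
    """Map an 8-bit code back to [-1, 1], then expand to a linear sample."""
    return expand(code / 255 * 2.0 - 1.0)


if __name__ == "__main__":
    for sample in (0.01, 0.1, 0.5, -0.9):
        restored = decode_8bit(encode_8bit(sample))
        print(f"{sample:+.3f} -> code {encode_8bit(sample):3d} -> {restored:+.5f}")
```

The logarithmic curve spends more of the 256 codes on quiet samples, which is why small amplitudes survive the round trip with less error than under uniform 8-bit quantization.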
-
Reply to Wernsdorfer's post: "Correspondence on: Quantum interference of tunnel trajectories between states of different spin length in a dimeric molecular nanomagnet"
Authors:
Christopher M. Ramsey,
Enrique del Barco,
Stephen Hill,
Sonali J. Shah,
Christopher C. Beedle,
David N. Hendrickson
Abstract:
We present here an exact version of our response (dated April 27) to Wernsdorfer's correspondence submitted to Nature Physics on March 31, 2008. After consultation with a referee, Nature Physics chose not to publish any part of this exchange. We would therefore like to point out that our original study has now been considered favorably by four separate referees chosen by Nature Physics. Unfortunately, Wernsdorfer subsequently posted two further variations of his correspondence on this archive (arXiv:0804.1246v1 and arXiv:0804.1246v2). We note that aspects of the most recent posting (dated after submission of our response) contradict the version submitted to Nature Physics. However, none of the revisions add weight to Wernsdorfer's original correspondence.
Submitted 11 June, 2008;
originally announced June 2008.
-
Balancing Local Order and Long-Ranged Interactions in the Molecular Theory of Liquid Water
Authors:
J. K. Shah,
D. Asthagiri,
L. R. Pratt,
M. E. Paulaitis
Abstract:
A molecular theory of liquid water is identified and studied on the basis of computer simulation of the TIP3P model of liquid water. This theory would be exact for models of liquid water in which the intermolecular interactions vanish outside a finite spatial range, and therefore provides a precise analysis tool for investigating the effects of longer-ranged intermolecular interactions. We show how local order can be introduced through quasi-chemical theory. Long-ranged interactions are characterized generally by a conditional distribution of binding energies, and this formulation is interpreted as a regularization of the primitive statistical thermodynamic problem. These binding-energy distributions for liquid water are observed to be unimodal. The gaussian approximation proposed is remarkably successful in predicting the Gibbs free energy and the molar entropy of liquid water, as judged by comparison with numerically exact results. The remaining discrepancies are subtle quantitative problems that do have significant consequences for the thermodynamic properties that distinguish water from many other liquids. The basic subtlety of liquid water is found then in the competition of several effects which must be quantitatively balanced for realistic results.
Submitted 7 July, 2007;
originally announced July 2007.
-
A Metric on Shape Space with Explicit Geodesics
Authors:
Peter W. Michor,
David Mumford,
Jayant Shah,
Laurent Younes
Abstract:
This paper studies a specific metric on plane curves that has the property of being isometric to a classical manifold (sphere, complex projective, Stiefel, Grassmann) modulo change of parametrization, each of these classical manifolds being associated to specific qualifications of the space of curves (closed-open, modulo rotation, etc.). Using these isometries, we are able to explicitly describe the geodesics, first in the parametric case, then by modding out the parametrization and considering horizontal vectors. We also compute the sectional curvature for these spaces, and show, in particular, that the space of closed curves modulo rotation and change of parameter has positive curvature. Experimental results that explicitly compute minimizing geodesics between two closed curves are finally provided.
Submitted 5 May, 2008; v1 submitted 28 June, 2007;
originally announced June 2007.
-
The Fe-Line Feature In The X-Ray Spectrum of Solar Flares: First Results From The SOXS Mission
Authors:
Rajmal Jain,
Anil K. Pradhan,
Vishal Joshi,
K. J. Shah,
Jayshree J. Trivedi,
S. L. Kayasth,
Vishal M. Shah,
M. R. Deshpande
Abstract:
We present the first results from the "Low Energy Detector" payload of the "Solar X-ray Spectrometer (SOXS)" mission, which was launched onboard the GSAT-2 Indian spacecraft on 08 May 2003 by the GSLV-D2 rocket to study solar flares. The SOXS Low Energy Detector (SLD) payload was designed, developed, and fabricated by the Physical Research Laboratory (PRL) in collaboration with the Space Application Centre (SAC), Ahmedabad and the ISRO Satellite Centre (ISAC), Bangalore of the Indian Space Research Organization (ISRO). The energy ranges of the Si PIN and CZT detectors are 4 - 25 keV and 4 - 56 keV respectively. The Si PIN provides sub-keV energy resolution while the CZT reveals ~1.7 keV energy resolution throughout the energy range. The high sensitivity and sub-keV energy resolution of the Si PIN detector allow measuring the intensity, peak energy, and the equivalent width of the Fe-line complex at approximately 6.7 keV as a function of time in all ten M-class flares studied in this investigation. The peak energy (Ep) of the Fe-line feature varies between 6.4 and 6.7 keV with an increase in temperature from 9 to 58 MK. We found that the equivalent width (w) of the Fe-line feature increases exponentially with temperature up to 30 MK and then increases very slowly up to 40 MK. It remains between 3.5 and 4 keV in the temperature range of 30 - 45 MK. We compare our measurements of w with calculations made earlier by various investigators and propose that these measurements may improve theoretical models. We interpret the variation of both Ep and w with temperature as due to the changes in the ionization and recombination conditions in the plasma during the flare interval; as a consequence, the contribution from different ionic emission lines also varies.
Submitted 13 December, 2006;
originally announced December 2006.
-
Gaussian Models for the Statistical Thermodynamics of Liquid Water
Authors:
J. K. Shah,
D. Asthagiri,
L. R. Pratt,
M. E. Paulaitis
Abstract:
A gaussian distribution of binding energies, but conditioned to exploit generally available information on packing in liquids, provides a statistical-thermodynamic theory of liquid water that is structurally non-committal, molecularly realistic, and surprisingly accurate. Neglect of fluctuation contributions to this gaussian model yields a mean-field theory that produces useless results. A refinement that accounts for sharper-than-gaussian behavior at high binding energies recognizes contributions from a discrete number of water molecules and permits a natural matching of numerically exact results. These gaussian models, which can be understood as vigorous simplifications of quasi-chemical theories, are applicable to aqueous environments where the utility of structural models based on geometrical considerations of water hydrogen bonding has not been established.
Submitted 21 August, 2006;
originally announced August 2006.
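The role of the fluctuation term mentioned in the abstract above can be seen in a simplified, unconditioned version of a gaussian binding-energy model. The notation here is an assumption and does not come from the abstract: \varepsilon is the binding energy of a distinguished water molecule, P(\varepsilon) its distribution with the molecule fully coupled to the liquid, \beta = 1/k_B T, and \mu^{ex} the excess chemical potential. The theory described in the abstract is additionally conditioned on packing information and carries further terms that are not shown here.

```latex
% Inverse potential-distribution relation (average taken with the
% distinguished molecule fully coupled to the liquid):
\begin{align}
  e^{\beta \mu^{\mathrm{ex}}}
    &= \left\langle e^{\beta \varepsilon} \right\rangle
     = \int P(\varepsilon)\, e^{\beta \varepsilon}\, \mathrm{d}\varepsilon .
\intertext{If $P(\varepsilon)$ is gaussian with mean
  $\langle \varepsilon \rangle$ and variance
  $\langle \delta\varepsilon^{2} \rangle$, the integral is a gaussian
  moment-generating function, which gives}
  \mu^{\mathrm{ex}}
    &= \langle \varepsilon \rangle
     + \frac{\beta}{2} \left\langle \delta\varepsilon^{2} \right\rangle .
\intertext{Dropping the fluctuation (variance) term leaves the mean-field
  estimate}
  \mu^{\mathrm{ex}}_{\mathrm{MF}}
    &= \langle \varepsilon \rangle .
\end{align}
```

In this simplified form the two estimates differ by exactly \beta \langle \delta\varepsilon^{2} \rangle / 2, so the mean-field result is only as good as that variance is small.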