-
Maximum Likelihood Estimation of Flexible Survival Densities with Importance Sampling
Authors:
Mert Ketenci,
Shreyas Bhave,
Noémie Elhadad,
Adler Perotte
Abstract:
Survival analysis is a widely-used technique for analyzing time-to-event data in the presence of censoring. In recent years, numerous survival analysis methods have emerged which scale to large datasets and relax traditional assumptions such as proportional hazards. These models, while being performant, are very sensitive to model hyperparameters including: (1) number of bins and bin size for disc…
▽ More
Survival analysis is a widely-used technique for analyzing time-to-event data in the presence of censoring. In recent years, numerous survival analysis methods have emerged which scale to large datasets and relax traditional assumptions such as proportional hazards. These models, while being performant, are very sensitive to model hyperparameters including: (1) number of bins and bin size for discrete models and (2) number of cluster assignments for mixture-based models. Each of these choices requires extensive tuning by practitioners to achieve optimal performance. In addition, we demonstrate in empirical studies that: (1) optimal bin size may drastically differ based on the metric of interest (e.g., concordance vs brier score), and (2) mixture models may suffer from mode collapse and numerical instability. We propose a survival analysis approach which eliminates the need to tune hyperparameters such as mixture assignments and bin sizes, reducing the burden on practitioners. We show that the proposed approach matches or outperforms baselines on several real-world datasets.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Adapting model-based deep learning to multiple acquisition conditions: Ada-MoDL
Authors:
Aniket Pramanik,
Sampada Bhave,
Saurav Sajib,
Samir D. Sharma,
Mathews Jacob
Abstract:
Purpose: The aim of this work is to introduce a single model-based deep network that can provide high-quality reconstructions from undersampled parallel MRI data acquired with multiple sequences, acquisition settings and field strengths.
Methods: A single unrolled architecture, which offers good reconstructions for multiple acquisition settings, is introduced. The proposed scheme adapts the mode…
▽ More
Purpose: The aim of this work is to introduce a single model-based deep network that can provide high-quality reconstructions from undersampled parallel MRI data acquired with multiple sequences, acquisition settings and field strengths.
Methods: A single unrolled architecture, which offers good reconstructions for multiple acquisition settings, is introduced. The proposed scheme adapts the model to each setting by scaling the CNN features and the regularization parameter with appropriate weights. The scaling weights and regularization parameter are derived using a multi-layer perceptron model from conditional vectors, which represents the specific acquisition setting. The perceptron parameters and the CNN weights are jointly trained using data from multiple acquisition settings, including differences in field strengths, acceleration, and contrasts. The conditional network is validated using datasets acquired with different acquisition settings.
Results: The comparison of the adaptive framework, which trains a single model using the data from all the settings, shows that it can offer consistently improved performance for each acquisition condition. The comparison of the proposed scheme with networks that are trained independently for each acquisition setting shows that it requires less training data per acquisition setting to offer good performance.
Conclusion: The Ada-MoDL framework enables the use of a single model-based unrolled network for multiple acquisition settings. In addition to eliminating the need to train and store multiple networks for different acquisition settings, this approach reduces the training data needed for each acquisition setting.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Assessing Phenotype Definitions for Algorithmic Fairness
Authors:
Tony Y. Sun,
Shreyas Bhave,
Jaan Altosaar,
Noémie Elhadad
Abstract:
Disease identification is a core, routine activity in observational health research. Cohorts impact downstream analyses, such as how a condition is characterized, how patient risk is defined, and what treatments are studied. It is thus critical to ensure that selected cohorts are representative of all patients, independently of their demographics or social determinants of health. While there are m…
▽ More
Disease identification is a core, routine activity in observational health research. Cohorts impact downstream analyses, such as how a condition is characterized, how patient risk is defined, and what treatments are studied. It is thus critical to ensure that selected cohorts are representative of all patients, independently of their demographics or social determinants of health. While there are multiple potential sources of bias when constructing phenotype definitions which may affect their fairness, it is not standard in the field of phenoty** to consider the impact of different definitions across subgroups of patients. In this paper, we propose a set of best practices to assess the fairness of phenotype definitions. We leverage established fairness metrics commonly used in predictive models and relate them to commonly used epidemiological cohort description metrics. We describe an empirical study for Crohn's disease and diabetes type 2, each with multiple phenotype definitions taken from the literature across two sets of patient subgroups (gender and race). We show that the different phenotype definitions exhibit widely varying and disparate performance according to the different fairness metrics and subgroups. We hope that the proposed best practices can help in constructing fair and inclusive phenotype definitions.
△ Less
Submitted 27 August, 2022; v1 submitted 10 March, 2022;
originally announced March 2022.
-
Zero-Shot Clinical Acronym Expansion via Latent Meaning Cells
Authors:
Griffin Adams,
Mert Ketenci,
Shreyas Bhave,
Adler Perotte,
Noémie Elhadad
Abstract:
We introduce Latent Meaning Cells, a deep latent variable model which learns contextualized representations of words by combining local lexical context and metadata. Metadata can refer to granular context, such as section type, or to more global context, such as unique document ids. Reliance on metadata for contextualized representation learning is apropos in the clinical domain where text is semi…
▽ More
We introduce Latent Meaning Cells, a deep latent variable model which learns contextualized representations of words by combining local lexical context and metadata. Metadata can refer to granular context, such as section type, or to more global context, such as unique document ids. Reliance on metadata for contextualized representation learning is apropos in the clinical domain where text is semi-structured and expresses high variation in topics. We evaluate the LMC model on the task of zero-shot clinical acronym expansion across three datasets. The LMC significantly outperforms a diverse set of baselines at a fraction of the pre-training cost and learns clinically coherent representations. We demonstrate that not only is metadata itself very helpful for the task, but that the LMC inference algorithm provides an additional large benefit.
△ Less
Submitted 12 November, 2020; v1 submitted 28 September, 2020;
originally announced October 2020.
-
Energy-efficient Hybrid CMOS-NEMS LIF Neuron Circuit in 28 nm CMOS Process
Authors:
Saber Moradi,
Sunil A. Bhave,
Rajit Manohar
Abstract:
Designing analog sub-threshold neuromorphic circuits in deep sub-micron technologies e.g. 28 nm can be a daunting task due to the problem of excessive leakage current. We propose novel energy-efficient hybrid CMOS-nano electro-mechanical switches (NEMS) Leaky Integrate and Fire (LIF) neuron and synapse circuits and investigate the impact of NEM switches on the leakage power and overall energy cons…
▽ More
Designing analog sub-threshold neuromorphic circuits in deep sub-micron technologies e.g. 28 nm can be a daunting task due to the problem of excessive leakage current. We propose novel energy-efficient hybrid CMOS-nano electro-mechanical switches (NEMS) Leaky Integrate and Fire (LIF) neuron and synapse circuits and investigate the impact of NEM switches on the leakage power and overall energy consumption. We analyze the performance of biologically-inspired neuron circuit in terms of leakage power consumption and present new energy-efficient neural circuits that operate with biologically plausible firing rates. Our results show the proposed CMOS-NEMS neuron circuit is, on average, 35% more energy-efficient than its CMOS counterpart with same complexity in 28 nm process. Moreover, we discuss how NEM switches can be utilized to further improve the scalability of mixed-signal neuromorphic circuits.
△ Less
Submitted 19 December, 2017;
originally announced December 2017.
-
Experimental Demonstration of Efficient Spin-Orbit Torque Switching of an MTJ with sub-100 ns Pulses
Authors:
Tanay A. Gosavi,
Sasikanth Manipatruni,
Sriharsha V. Aradhya,
Graham E. Rowlands,
Dmitri Nikonov,
Ian A. Young,
Sunil A. Bhave
Abstract:
Efficient generation of spin currents from charge currents is of high importance for memory and logic applications of spintronics. In particular, generation of spin currents from charge currents in high spin-orbit coupling metals has the potential to provide a scalable solution for embedded memory. We demonstrate a net reduction in critical charge current for spin torque driven magnetization rever…
▽ More
Efficient generation of spin currents from charge currents is of high importance for memory and logic applications of spintronics. In particular, generation of spin currents from charge currents in high spin-orbit coupling metals has the potential to provide a scalable solution for embedded memory. We demonstrate a net reduction in critical charge current for spin torque driven magnetization reversal via using spin-orbit mediated spin current generation. We scaled the dimensions of the spin-orbit electrode to 400 nm and the nanomagnet to 270 nm x 68 nm in a three terminal spin-orbit torque, magnetic tunnel junction (SOT-MTJ) geometry. Our estimated effective spin Hall angle is 0.15-0.20 using the ratio of zero temperature critical current from spin Hall switching and estimated spin current density for switching the magnet. We show bidirectional transient switching using spin-orbit generated spin torque at 100 ns switching pulses reliably followed by transient read operations. We finally compare the static and dynamic response of the SOT-MTJ with transient spin circuit modeling showing the performance of scaled SOT-MTJs to enable nanosecond class non-volatile MTJs.
△ Less
Submitted 9 August, 2016; v1 submitted 30 June, 2015;
originally announced June 2015.
-
Association Rule Based Flexible Machine Learning Module for Embedded System Platforms like Android
Authors:
Amiraj Dhawan,
Shruti Bhave,
Amrita Aurora,
Vishwanathan Iyer
Abstract:
The past few years have seen a tremendous growth in the popularity of smartphones. As newer features continue to be added to smartphones to increase their utility, their significance will only increase in future. Combining machine learning with mobile computing can enable smartphones to become 'intelligent' devices, a feature which is hitherto unseen in them. Also, the combination of machine learn…
▽ More
The past few years have seen a tremendous growth in the popularity of smartphones. As newer features continue to be added to smartphones to increase their utility, their significance will only increase in future. Combining machine learning with mobile computing can enable smartphones to become 'intelligent' devices, a feature which is hitherto unseen in them. Also, the combination of machine learning and context aware computing can enable smartphones to gauge user's requirements proactively, depending upon their environment and context. Accordingly, necessary services can be provided to users.
In this paper, we have explored the methods and applications of integrating machine learning and context aware computing on the Android platform, to provide higher utility to the users. To achieve this, we define a Machine Learning (ML) module which is incorporated in the basic Android architecture. Firstly, we have outlined two major functionalities that the ML module should provide. Then, we have presented three architectures, each of which incorporates the ML module at a different level in the Android architecture. The advantages and shortcomings of each of these architectures have been evaluated. Lastly, we have explained a few applications in which our proposed system can be incorporated such that their functionality is improved.
△ Less
Submitted 14 November, 2014;
originally announced November 2014.