Search | arXiv e-print repository

doi 10.1109/isbi53787.2023.10230816

Contrastive Self-Supervised Learning for Spatio-Temporal Analysis of Lung Ultrasound Videos

Authors: Li Chen, Jonathan Rubin, Jiahong Ouyang, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Sourabh Kulhare, Rachel Millin, Kenton W Gregory, Cynthia R Gregory, Meihua Zhu, David O Kessler, Laurie Malia, Almaz Dessie, Joni Rabiner, Di Coneybeare, Bo Shopsin, Andrew Hersh, Cristian Madar, Jeffrey Shupp, Laura S Johnson, Jacob Avila, Kristin Dwyer, Peter Weimersheimer, Balasundar Raju, et al. (2 additional authors not shown)

Abstract: Self-supervised learning (SSL) methods have shown promise for medical imaging applications by learning meaningful visual representations, even when the amount of labeled data is limited. Here, we extend state-of-the-art contrastive learning SSL methods to 2D+time medical ultrasound video data by introducing a modified encoder and augmentation method capable of learning meaningful spatio-temporal representations, without requiring constraints on the input data. We evaluate our method on the challenging clinical task of identifying lung consolidations (an important pathological feature) in ultrasound videos. Using a multi-center dataset of over 27k lung ultrasound videos acquired from over 500 patients, we show that our method can significantly improve performance on downstream localization and classification of lung consolidation. Comparisons against baseline models trained without SSL show that the proposed methods are particularly advantageous when the size of labeled training data is limited (e.g., as little as 5% of the training set).

Submitted 14 October, 2023; originally announced October 2023.

Comments: ISBI 2023, 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI)

arXiv:2308.04463 [pdf, other]

Weakly Semi-Supervised Detection in Lung Ultrasound Videos

Authors: Jiahong Ouyang, Li Chen, Gary Y. Li, Naveen Balaraju, Shubham Patil, Courosh Mehanian, Sourabh Kulhare, Rachel Millin, Kenton W. Gregory, Cynthia R. Gregory, Meihua Zhu, David O. Kessler, Laurie Malia, Almaz Dessie, Joni Rabiner, Di Coneybeare, Bo Shopsin, Andrew Hersh, Cristian Madar, Jeffrey Shupp, Laura S. Johnson, Jacob Avila, Kristin Dwyer, Peter Weimersheimer, Balasundar Raju, et al. (2 additional authors not shown)

Abstract: Frame-by-frame annotation of bounding boxes by clinical experts is often required to train fully supervised object detection models on medical video data. We propose a method for improving object detection in medical videos through weak supervision from video-level labels. More concretely, we aggregate individual detection predictions into video-level predictions and extend a teacher-student training strategy to provide additional supervision via a video-level loss. We also introduce improvements to the underlying teacher-student framework, including methods to improve the quality of pseudo-labels based on weak supervision and adaptive schemes to optimize knowledge transfer between the student and teacher networks. We apply this approach to the clinically important task of detecting lung consolidations (seen in respiratory infections such as COVID-19 pneumonia) in medical ultrasound videos. Experiments reveal that our framework improves detection accuracy and robustness compared to baseline semi-supervised models, and improves efficiency in data and annotation usage.

Submitted 7 August, 2023; originally announced August 2023.

Comments: IPMI 2023
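
The abstract above mentions aggregating individual detection predictions into a video-level prediction that is then supervised by a video-level loss. A minimal sketch of that idea follows. The aggregator (max-pooling over boxes and frames), the binary cross-entropy loss, and the names `video_score` and `video_level_loss` are all assumptions made for illustration; the listing does not say which aggregation or loss the authors use.

```python
import math

def video_score(frame_detections):
    """Aggregate per-frame detection confidences into one video-level score.

    frame_detections: list of lists; each inner list holds the confidence
    scores (between 0 and 1) of the boxes detected in one frame.
    Max-pooling is an assumed aggregator, not necessarily the paper's choice.
    """
    # Best box per frame; a frame with no boxes contributes 0.0.
    frame_scores = [max(scores, default=0.0) for scores in frame_detections]
    # Best frame per video; a video with no frames scores 0.0.
    return max(frame_scores, default=0.0)

def video_level_loss(frame_detections, video_label, eps=1e-7):
    """Binary cross-entropy between the aggregated score and a 0/1 video label."""
    # Clamp the score away from 0 and 1 so the logarithms stay finite.
    p = min(max(video_score(frame_detections), eps), 1.0 - eps)
    return -(video_label * math.log(p) + (1 - video_label) * math.log(1.0 - p))

# Hypothetical detections for a three-frame video (the middle frame has no boxes).
detections = [[0.1, 0.3], [], [0.8, 0.2]]
print(video_score(detections))
print(video_level_loss(detections, 1))
```

Max-pooling makes the video positive as soon as any single box is confident, which is one simple way to connect box-level outputs to a video-level label; smoother aggregators would spread the gradient over more detections.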

arXiv:2204.08000 [pdf, other]

doi 10.1007/978-3-031-18523-6_18

LRH-Net: A Multi-Level Knowledge Distillation Approach for Low-Resource Heart Network

Authors: Ekansh Chauhan, Swathi Guptha, Likith Reddy, Bapi Raju

Abstract: An electrocardiogram (ECG) monitors the electrical activity generated by the heart and is used to detect fatal cardiovascular diseases (CVDs). Conventionally, to capture the precise electrical activity, clinical experts use multiple-lead ECGs (typically 12 leads). But in recent times, large-size deep learning models have been used to detect these diseases. However, such models require heavy compute resources like huge memory and long inference time. To alleviate these shortcomings, we propose a low-parameter model, named Low Resource Heart-Network (LRH-Net), which uses fewer leads to detect ECG anomalies in a resource-constrained environment. A multi-level knowledge distillation process is used on top of that to get better generalization performance on our proposed model. The multi-level knowledge distillation process distills the knowledge to LRH-Net trained on a reduced number of leads from higher parameter (teacher) models trained on multiple leads to reduce the performance gap. The proposed model is evaluated on the PhysioNet-2020 challenge dataset with constrained input. The parameters of the LRH-Net are 106x less than our teacher model for detecting CVDs. The performance of the LRH-Net was scaled up to 3.2% and the inference time scaled down by 75% compared to the teacher model. In contrast to the compute- and parameter-intensive deep learning techniques, the proposed methodology uses a subset of ECG leads using the low resource LRH-Net, making it eminently suitable for deployment on edge devices.

Submitted 12 August, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

Report number: 978-3-031-18523-6

Journal ref: DeCaF FAIR 2022, MICCAI 2022
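
The LRH-Net abstract above describes distilling knowledge from a larger teacher model into a smaller student. As a rough illustration of a single distillation step, the sketch below uses the commonly used temperature-softened formulation (a weighted sum of hard-label cross-entropy and a KL term between softened teacher and student distributions). This is a generic single-level, single-label sketch, not the paper's multi-level procedure; the function names, `temperature`, and `alpha` are assumptions.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    """alpha * hard-label cross-entropy + (1 - alpha) * T^2 * KL(teacher || student)."""
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    # KL divergence between the softened teacher and student distributions.
    kl = sum(t * math.log(t / s) for t, s in zip(p_teacher, p_student) if t > 0)
    # Standard cross-entropy of the student against the true class index.
    ce = -math.log(softmax(student_logits)[label])
    # The T^2 factor keeps the soft-target term on a comparable gradient scale.
    return alpha * ce + (1 - alpha) * temperature ** 2 * kl

# Hypothetical logits for a three-class problem.
print(distillation_loss([1.0, 2.0, 0.5], [0.5, 3.0, 0.2], label=1))
```

With `alpha=0.0` only the teacher's soft targets drive the student; with `alpha=1.0` the loss reduces to ordinary supervised cross-entropy.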

arXiv:2204.03272 [pdf, other]

mulEEG: A Multi-View Representation Learning on EEG Signals

Authors: Vamsi Kumar, Likith Reddy, Shivam Kumar Sharma, Kamalakar Dadi, Chiranjeevi Yarra, Bapi S. Raju, Srijithesh Rajendran

Abstract: Modeling effective representations using multiple views that positively influence each other is challenging, and the existing methods perform poorly on Electroencephalogram (EEG) signals for sleep-staging tasks. In this paper, we propose a novel multi-view self-supervised method (mulEEG) for unsupervised EEG representation learning. Our method attempts to effectively utilize the complementary information available in multiple views to learn better representations. We introduce a diverse loss that further encourages complementary information across multiple views. Our method with no access to labels beats the supervised training while outperforming multi-view baseline methods on transfer learning experiments carried out on sleep-staging tasks. We posit that our method was able to learn better representations by using complementary multi-views.

Submitted 7 April, 2022; originally announced April 2022.

Comments: Preprint version

arXiv:1912.05453 [pdf, other]

Value-of-Information based Arbitration between Model-based and Model-free Control

Authors: Krishn Bera, Yash Mandilwar, Bapi Raju

Abstract: There have been numerous attempts to explain general learning behaviours using model-based and model-free methods. While model-based control is flexible yet computationally expensive in planning, model-free control is quick but inflexible. The model-based control is therefore immune from reward devaluation and contingency degradation. Multiple arbitration schemes have been suggested to achieve the data efficiency and computational efficiency of model-based and model-free control respectively. In this context, we propose a quantitative 'value of information' based arbitration between both the controllers in order to establish a general computational framework for skill learning. The interacting model-based and model-free reinforcement learning processes are arbitrated using an uncertainty-based value of information. We further show that our algorithm performs better than Q-learning as well as Q-learning with experience replay.

Submitted 8 December, 2019; originally announced December 2019.

arXiv:1901.01856 [pdf, other]

A Computational Framework for Motor Skill Acquisition

Authors: Krishn Bera, Tejas Savalia, Bapi Raju

Abstract: There have been numerous attempts to explain general learning behaviours with various cognitive models. Multiple hypotheses have been put forward to qualitatively argue the best-fit model for the motor skill acquisition task and its variations. In this context, for a discrete sequence production (DSP) task, one of the most insightful models is Verwey's Dual Processor Model (DPM). It largely explains the learning and behavioural phenomenon of skilled discrete key-press sequences without providing any concrete computational basis of reinforcement. Therefore, we propose a quantitative explanation for Verwey's DPM hypothesis by experimentally establishing a general computational framework for motor skill learning. We attempt to combine the qualitative and quantitative theories based on a best-fit model of the experimental simulations of variations of dual processor models. The fundamental premise of sequential decision making for skill learning is based on interacting model-based (MB) and model-free (MF) reinforcement learning (RL) processes. Our unifying framework shows the proposed idea agrees well with Verwey's DPM and Fitts' three phases of skill learning. The accuracy of our model can further be validated by its statistical fit with the human-generated data on simple environment tasks like the grid-world.

Submitted 3 January, 2019; originally announced January 2019.

arXiv:1805.00967 [pdf]

Use Cases of Computational Reproducibility for Scientific Workflows at Exascale

Authors: Line Pouchard, Sterling Baldwin, Todd Elsethagen, Carlos Gamboa, Shantenu Jha, Bibi Raju, Eric Stephan, Li Tang, Kerstin Kleese Van Dam

Abstract: We propose an approach for improved reproducibility that includes capturing and relating provenance characteristics and performance metrics, in a hybrid queriable system, the ProvEn server. The system capabilities are illustrated on two use cases: scientific reproducibility of results in the ACME climate simulations and performance reproducibility in molecular dynamics workflows on HPC computing platforms.

Submitted 20 April, 2018; originally announced May 2018.

Comments: Presented at SC17, Denver, CO. Full version submitted to IJHPCA March 2018

arXiv:1703.00548 [pdf, other]

Evolving Deep Neural Networks

Authors: Risto Miikkulainen, Jason Liang, Elliot Meyerson, Aditya Rawal, Dan Fink, Olivier Francon, Bala Raju, Hormoz Shahrzad, Arshak Navruzyan, Nigel Duffy, Babak Hodjat

Abstract: The success of deep learning depends on finding an architecture to fit the task. As deep learning has scaled up to more challenging tasks, the architectures have become difficult to design by hand. This paper proposes an automated method, CoDeepNEAT, for optimizing deep learning architectures through evolution. By extending existing neuroevolution methods to topology, components, and hyperparameters, this method achieves results comparable to best human designs in standard benchmarks in object recognition and language modeling. It also supports building a real-world application of automated image captioning on a magazine website. Given the anticipated increases in available computing power, evolution of deep networks is a promising approach to constructing deep learning applications in the future.

Submitted 4 March, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

arXiv:0910.2946 [pdf, ps, other]

12 GHz Radio-Holographic surface measurement of the RRI 10.4 m telescope

Authors: Ramesh Balasubramanyam, Suresh Venkatesh, Sharath B. Raju

Abstract: A modern Q-band low noise amplifier (LNA) front-end is being fitted to the 10.4 m millimeter-wave telescope at the Raman Research Institute (RRI) to support observations in the 40-50 GHz frequency range. To assess the suitability of the surface for this purpose, we measured the deviations of the primary surface from an ideal paraboloid using radio holography. We used the 11.6996 GHz beacon signal from the GSAT3 satellite, a 1.2 m reference antenna, commercial Ku-band Low Noise Block Converters (LNBC) as the receiver front-ends and a Stanford Research Systems (SRS) lock-in amplifier as the backend. The LNBCs had independent free-running first local oscillators (LO). Yet, we recovered the correlation by using a radiatively injected common tone that served as the second local oscillator. With this setup, we mapped the surface deviations on a 64 x 64 grid and measured an rms surface deviation of ~350 um with a measurement accuracy of ~50 um.

Submitted 15 October, 2009; originally announced October 2009.

Comments: 4 pages, 5 figures, ASP Conference Series, Vol. LFRU, 2009

Report number: 2009ASPC..407..434B
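
The radio-holography abstract above reports an rms surface deviation computed from deviations mapped on a grid. The arithmetic behind such a figure can be sketched as the root of the mean squared deviation over all grid cells. The function name `rms_deviation` and the tiny 2 x 2 grid are hypothetical and chosen only to keep the example short; the actual map is 64 x 64, and any masking or weighting the authors applied is not described in the listing.

```python
import math

def rms_deviation(grid):
    """Root-mean-square of surface deviations given as a 2D list (same units out)."""
    values = [d for row in grid for d in row]
    return math.sqrt(sum(d * d for d in values) / len(values))

# Hypothetical deviations from the ideal paraboloid, in micrometres.
grid_um = [[300.0, -400.0],
           [400.0, -300.0]]
print(rms_deviation(grid_um))
```

Because the deviations are squared before averaging, positive and negative departures from the ideal surface contribute equally to the result.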

Showing 1–9 of 9 results for author: Raju, B