-
Multi-Modal Contrastive Learning for Online Clinical Time-Series Applications
Authors:
Fabian Baldenweg,
Manuel Burger,
Gunnar Rätsch,
Rita Kuznetsova
Abstract:
Electronic Health Record (EHR) datasets from Intensive Care Units (ICU) contain a diverse set of data modalities. While prior works have successfully leveraged multiple modalities in supervised settings, we apply advanced self-supervised multi-modal contrastive learning techniques to ICU data, specifically focusing on clinical notes and time-series for clinically relevant online prediction tasks.…
▽ More
Electronic Health Record (EHR) datasets from Intensive Care Units (ICU) contain a diverse set of data modalities. While prior works have successfully leveraged multiple modalities in supervised settings, we apply advanced self-supervised multi-modal contrastive learning techniques to ICU data, specifically focusing on clinical notes and time-series for clinically relevant online prediction tasks. We introduce a loss function Multi-Modal Neighborhood Contrastive Loss (MM-NCL), a soft neighborhood function, and showcase the excellent linear probe and zero-shot performance of our approach.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
On Supercapacitors Time-Domain Spectroscopy. C/R Characteristic Slope
Authors:
Dmitry Valentinovich Agafonov,
Arina Romanovna Kuznetsova,
Mikhail Evgenievich Kompan,
Vladislav Gennadievich Malyshkin
Abstract:
A novel time-domain technique for supercapacitor characterization is developed, modeled numerically, and experimentally tested on a number of commercial supercapacitors. The method involves momentarily shorting a supercapacitor for a brief duration, denoted as $τ$, and measuring first $\int Idt$ and second $\int I^2dt$ moments of current along with the potential before and after shorting. The effe…
▽ More
A novel time-domain technique for supercapacitor characterization is developed, modeled numerically, and experimentally tested on a number of commercial supercapacitors. The method involves momentarily shorting a supercapacitor for a brief duration, denoted as $τ$, and measuring first $\int Idt$ and second $\int I^2dt$ moments of current along with the potential before and after shorting. The effective $C(τ)$ and $R(τ)$ are then obtained from charge preservation and energy dissipation invariants. A linear behavior in $[R(τ),C(τ)]$ parametric plot is observed by several orders of $τ$. This gives a $C/R$ characteristic slope: how much $ΔC$ we can ``gain'' if we are ready to ``lose'' $ΔR$ in internal resistance. The $C/R$ characteristic slope characterizes possible energy and power properties of the device in terms of materials and technology used, this is a measure of supercapacitor perfection. The technique has been proven with experimental measurements and then validated through computer modeling, analytic analysis, and impedance spectroscopy on a number of circuit types: transmission line, binary tree, etc., a new n-tree element (nTE) is introduced. The approach offers an alternative to low-frequency impedance spectroscopy and methods outlined in the IEC 62391 standard. It provides valuable insights into the performance and characteristics of supercapacitors.
△ Less
Submitted 13 February, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
On the Importance of Step-wise Embeddings for Heterogeneous Clinical Time-Series
Authors:
Rita Kuznetsova,
Alizée Pace,
Manuel Burger,
Hugo Yèche,
Gunnar Rätsch
Abstract:
Recent advances in deep learning architectures for sequence modeling have not fully transferred to tasks handling time-series from electronic health records. In particular, in problems related to the Intensive Care Unit (ICU), the state-of-the-art remains to tackle sequence classification in a tabular manner with tree-based methods. Recent findings in deep learning for tabular data are now surpass…
▽ More
Recent advances in deep learning architectures for sequence modeling have not fully transferred to tasks handling time-series from electronic health records. In particular, in problems related to the Intensive Care Unit (ICU), the state-of-the-art remains to tackle sequence classification in a tabular manner with tree-based methods. Recent findings in deep learning for tabular data are now surpassing these classical methods by better handling the severe heterogeneity of data input features. Given the similar level of feature heterogeneity exhibited by ICU time-series and motivated by these findings, we explore these novel methods' impact on clinical sequence modeling tasks. By jointly using such advances in deep learning for tabular data, our primary objective is to underscore the importance of step-wise embeddings in time-series modeling, which remain unexplored in machine learning methods for clinical data. On a variety of clinically relevant tasks from two large-scale ICU datasets, MIMIC-III and HiRID, our work provides an exhaustive analysis of state-of-the-art methods for tabular time-series as time-step embedding models, showing overall performance improvement. In particular, we evidence the importance of feature grou** in clinical time-series, with significant performance gains when considering features within predefined semantic groups in the step-wise embedding module.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Knowledge Graph Representations to enhance Intensive Care Time-Series Predictions
Authors:
Samyak Jain,
Manuel Burger,
Gunnar Rätsch,
Rita Kuznetsova
Abstract:
Intensive Care Units (ICU) require comprehensive patient data integration for enhanced clinical outcome predictions, crucial for assessing patient conditions. Recent deep learning advances have utilized patient time series data, and fusion models have incorporated unstructured clinical reports, improving predictive performance. However, integrating established medical knowledge into these models h…
▽ More
Intensive Care Units (ICU) require comprehensive patient data integration for enhanced clinical outcome predictions, crucial for assessing patient conditions. Recent deep learning advances have utilized patient time series data, and fusion models have incorporated unstructured clinical reports, improving predictive performance. However, integrating established medical knowledge into these models has not yet been explored. The medical domain's data, rich in structural relationships, can be harnessed through knowledge graphs derived from clinical ontologies like the Unified Medical Language System (UMLS) for better predictions. Our proposed methodology integrates this knowledge with ICU data, improving clinical decision modeling. It combines graph representations with vital signs and clinical reports, enhancing performance, especially when data is missing. Additionally, our model includes an interpretability component to understand how knowledge graph nodes affect predictions.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Language Model Training Paradigms for Clinical Feature Embeddings
Authors:
Yurong Hu,
Manuel Burger,
Gunnar Rätsch,
Rita Kuznetsova
Abstract:
In research areas with scarce data, representation learning plays a significant role. This work aims to enhance representation learning for clinical time series by deriving universal embeddings for clinical features, such as heart rate and blood pressure. We use self-supervised training paradigms for language models to learn high-quality clinical feature embeddings, achieving a finer granularity t…
▽ More
In research areas with scarce data, representation learning plays a significant role. This work aims to enhance representation learning for clinical time series by deriving universal embeddings for clinical features, such as heart rate and blood pressure. We use self-supervised training paradigms for language models to learn high-quality clinical feature embeddings, achieving a finer granularity than existing time-step and patient-level representation learning. We visualize the learnt embeddings via unsupervised dimension reduction techniques and observe a high degree of consistency with prior clinical knowledge. We also evaluate the model performance on the MIMIC-III benchmark and demonstrate the effectiveness of using clinical feature embeddings. We publish our code online for replication.
△ Less
Submitted 6 February, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Multi-modal Graph Learning over UMLS Knowledge Graphs
Authors:
Manuel Burger,
Gunnar Rätsch,
Rita Kuznetsova
Abstract:
Clinicians are increasingly looking towards machine learning to gain insights about patient evolutions. We propose a novel approach named Multi-Modal UMLS Graph Learning (MMUGL) for learning meaningful representations of medical concepts using graph neural networks over knowledge graphs based on the unified medical language system. These representations are aggregated to represent entire patient v…
▽ More
Clinicians are increasingly looking towards machine learning to gain insights about patient evolutions. We propose a novel approach named Multi-Modal UMLS Graph Learning (MMUGL) for learning meaningful representations of medical concepts using graph neural networks over knowledge graphs based on the unified medical language system. These representations are aggregated to represent entire patient visits and then fed into a sequence model to perform predictions at the granularity of multiple hospital visits of a patient. We improve performance by incorporating prior medical knowledge and considering multiple modalities. We compare our method to existing architectures proposed to learn representations at different granularities on the MIMIC-III dataset and show that our approach outperforms these methods. The results demonstrate the significance of multi-modal medical concept representations based on prior medical knowledge.
△ Less
Submitted 9 November, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
On the Importance of Clinical Notes in Multi-modal Learning for EHR Data
Authors:
Severin Husmann,
Hugo Yèche,
Gunnar Rätsch,
Rita Kuznetsova
Abstract:
Understanding deep learning model behavior is critical to accepting machine learning-based decision support systems in the medical community. Previous research has shown that jointly using clinical notes with electronic health record (EHR) data improved predictive performance for patient monitoring in the intensive care unit (ICU). In this work, we explore the underlying reasons for these improvem…
▽ More
Understanding deep learning model behavior is critical to accepting machine learning-based decision support systems in the medical community. Previous research has shown that jointly using clinical notes with electronic health record (EHR) data improved predictive performance for patient monitoring in the intensive care unit (ICU). In this work, we explore the underlying reasons for these improvements. While relying on a basic attention-based model to allow for interpretability, we first confirm that performance significantly improves over state-of-the-art EHR data models when combining EHR data and clinical notes. We then provide an analysis showing improvements arise almost exclusively from a subset of notes containing broader context on patient state rather than clinician notes. We believe such findings highlight deep learning models for EHR data to be more limited by partially-descriptive data than by modeling choice, motivating a more data-centric approach in the field.
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
Temporal Label Smoothing for Early Event Prediction
Authors:
Hugo Yèche,
Alizée Pace,
Gunnar Rätsch,
Rita Kuznetsova
Abstract:
Models that can predict the occurrence of events ahead of time with low false-alarm rates are critical to the acceptance of decision support systems in the medical community. This challenging task is typically treated as a simple binary classification, ignoring temporal dependencies between samples, whereas we propose to exploit this structure. We first introduce a common theoretical framework uni…
▽ More
Models that can predict the occurrence of events ahead of time with low false-alarm rates are critical to the acceptance of decision support systems in the medical community. This challenging task is typically treated as a simple binary classification, ignoring temporal dependencies between samples, whereas we propose to exploit this structure. We first introduce a common theoretical framework unifying dynamic survival analysis and early event prediction. Following an analysis of objectives from both fields, we propose Temporal Label Smoothing (TLS), a simpler, yet best-performing method that preserves prediction monotonicity over time. By focusing the objective on areas with a stronger predictive signal, TLS improves performance over all baselines on two large-scale benchmark tasks. Gains are particularly notable along clinically relevant measures, such as event recall at low false-alarm rates. TLS reduces the number of missed events by up to a factor of two over previously used approaches in early event prediction.
△ Less
Submitted 30 January, 2023; v1 submitted 29 August, 2022;
originally announced August 2022.
-
HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on High-resolution ICU Data
Authors:
Hugo Yèche,
Rita Kuznetsova,
Marc Zimmermann,
Matthias Hüser,
Xinrui Lyu,
Martin Faltys,
Gunnar Rätsch
Abstract:
The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for develo** and comparing such methods. While raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet, the choice of tasks and pre-processing is often chosen ad-hoc for each publication, limiting comparab…
▽ More
The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for develo** and comparing such methods. While raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet, the choice of tasks and pre-processing is often chosen ad-hoc for each publication, limiting comparability across publications. In this work, we aim to improve this situation by providing a benchmark covering a large spectrum of ICU-related tasks. Using the HiRID dataset, we define multiple clinically relevant tasks in collaboration with clinicians. In addition, we provide a reproducible end-to-end pipeline to construct both data and labels. Finally, we provide an in-depth analysis of current state-of-the-art sequence modeling methods, highlighting some limitations of deep learning approaches for this type of data. With this benchmark, we hope to give the research community the possibility of a fair comparison of their work.
△ Less
Submitted 17 January, 2022; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Variational learning across domains with triplet information
Authors:
Rita Kuznetsova,
Oleg Bakhteev,
Alexandr Ogaltsov
Abstract:
The work investigates deep generative models, which allow us to use training data from one domain to build a model for another domain. We propose the Variational Bi-domain Triplet Autoencoder (VBTA) that learns a joint distribution of objects from different domains. We extend the VBTAs objective function by the relative constraints or triplets that sampled from the shared latent space across domai…
▽ More
The work investigates deep generative models, which allow us to use training data from one domain to build a model for another domain. We propose the Variational Bi-domain Triplet Autoencoder (VBTA) that learns a joint distribution of objects from different domains. We extend the VBTAs objective function by the relative constraints or triplets that sampled from the shared latent space across domains. In other words, we combine the deep generative models with a metric learning ideas in order to improve the final objective with the triplets information. The performance of the VBTA model is demonstrated on different tasks: image-to-image translation, bi-directional image generation and cross-lingual document classification.
△ Less
Submitted 19 November, 2018; v1 submitted 22 June, 2018;
originally announced June 2018.