Search | arXiv e-print repository

HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on High-resolution ICU Data

Authors: Hugo Yèche, Rita Kuznetsova, Marc Zimmermann, Matthias Hüser, Xinrui Lyu, Martin Faltys, Gunnar Rätsch

Abstract: The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for develo** and comparing such methods. While raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet, the choice of tasks and pre-processing is often chosen ad-hoc for each publication, limiting comparab… ▽ More The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for develo** and comparing such methods. While raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet, the choice of tasks and pre-processing is often chosen ad-hoc for each publication, limiting comparability across publications. In this work, we aim to improve this situation by providing a benchmark covering a large spectrum of ICU-related tasks. Using the HiRID dataset, we define multiple clinically relevant tasks in collaboration with clinicians. In addition, we provide a reproducible end-to-end pipeline to construct both data and labels. Finally, we provide an in-depth analysis of current state-of-the-art sequence modeling methods, highlighting some limitations of deep learning approaches for this type of data. With this benchmark, we hope to give the research community the possibility of a fair comparison of their work. △ Less

Submitted 17 January, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

Comments: NeurIPS 2021 (Datasets and Benchmarks)

arXiv:2106.05142 [pdf, other]

Neighborhood Contrastive Learning Applied to Online Patient Monitoring

Authors: Hugo Yèche, Gideon Dresdner, Francesco Locatello, Matthias Hüser, Gunnar Rätsch

Abstract: Intensive care units (ICU) are increasingly looking towards machine learning for methods to provide online monitoring of critically ill patients. In machine learning, online monitoring is often formulated as a supervised learning problem. Recently, contrastive learning approaches have demonstrated promising improvements over competitive supervised benchmarks. These methods rely on well-understood… ▽ More Intensive care units (ICU) are increasingly looking towards machine learning for methods to provide online monitoring of critically ill patients. In machine learning, online monitoring is often formulated as a supervised learning problem. Recently, contrastive learning approaches have demonstrated promising improvements over competitive supervised benchmarks. These methods rely on well-understood data augmentation techniques developed for image data which do not apply to online monitoring. In this work, we overcome this limitation by supplementing time-series data augmentation techniques with a novel contrastive learning objective which we call neighborhood contrastive learning (NCL). Our objective explicitly groups together contiguous time segments from each patient while maintaining state-specific information. Our experiments demonstrate a marked improvement over existing work applying contrastive methods to medical time-series. △ Less

Submitted 9 June, 2021; originally announced June 2021.

Comments: ICML 2021

Journal ref: Proceedings of 38th International Conference on Machine Learning, 139, (2021) 11964--11974

arXiv:2105.05728 [pdf, other]

Early prediction of respiratory failure in the intensive care unit

Authors: Matthias Hüser, Martin Faltys, Xinrui Lyu, Chris Barber, Stephanie L. Hyland, Tobias M. Merz, Gunnar Rätsch

Abstract: The development of respiratory failure is common among patients in intensive care units (ICU). Large data quantities from ICU patient monitoring systems make timely and comprehensive analysis by clinicians difficult but are ideal for automatic processing by machine learning algorithms. Early prediction of respiratory system failure could alert clinicians to patients at risk of respiratory failure… ▽ More The development of respiratory failure is common among patients in intensive care units (ICU). Large data quantities from ICU patient monitoring systems make timely and comprehensive analysis by clinicians difficult but are ideal for automatic processing by machine learning algorithms. Early prediction of respiratory system failure could alert clinicians to patients at risk of respiratory failure and allow for early patient reassessment and treatment adjustment. We propose an early warning system that predicts moderate/severe respiratory failure up to 8 hours in advance. Our system was trained on HiRID-II, a data-set containing more than 60,000 admissions to a tertiary care ICU. An alarm is typically triggered several hours before the beginning of respiratory failure. Our system outperforms a clinical baseline mimicking traditional clinical decision-making based on pulse-oximetric oxygen saturation and the fraction of inspired oxygen. To provide model introspection and diagnostics, we developed an easy-to-use web browser-based system to explore model input data and predictions visually. △ Less

Submitted 12 May, 2021; originally announced May 2021.

Comments: 14 pages, 5 figures

arXiv:2011.05048 [pdf]

X-ray imaging detector for radiological applications in the harsh environments of low-income countries

Authors: Mario A. Chavarria, Matthias Huser, Sebastien Blanc, Pascal Monnin, Jérôme Schmid, Christophe Chênes, Lazhari Assassi, Hubert Blanchard, Romain Sahli, Jean-Philippe Thiran, René P. Salathé, Klaus Schönenberger

Abstract: This paper describes the development of a novel medical Xray imaging system adapted to the needs and constraints of low and middle income countries. The developed system is based on an indirect conversion chain: a scintillator plate produces visible light when excited by the Xrays, then a calibrated multi camera architecture converts the visible light from the scintillator into a set of digital im… ▽ More This paper describes the development of a novel medical Xray imaging system adapted to the needs and constraints of low and middle income countries. The developed system is based on an indirect conversion chain: a scintillator plate produces visible light when excited by the Xrays, then a calibrated multi camera architecture converts the visible light from the scintillator into a set of digital images. The partial images are then unwarped, enhanced and stitched through parallel processing units and a specialized software. All the detector components were carefully selected focusing on optimizing the system s image quality, robustness, cost, effectiveness and capability to work in harsh tropical environments. With this aim, different customized and commercial components were characterized. The resulting detector can generate high quality medical diagnostic images with DQE levels up to 60 percent, at 2.34 micro Gray, even under harsh environments i.e. 60 degrees Celsius and 98 percent humidity. △ Less

Submitted 10 November, 2020; originally announced November 2020.

arXiv:2011.00865 [pdf, other]

WRSE -- a non-parametric weighted-resolution ensemble for predicting individual survival distributions in the ICU

Authors: Jonathan Heitz, Joanna Ficek, Martin Faltys, Tobias M. Merz, Gunnar Rätsch, Matthias Hüser

Abstract: Dynamic assessment of mortality risk in the intensive care unit (ICU) can be used to stratify patients, inform about treatment effectiveness or serve as part of an early-warning system. Static risk scoring systems, such as APACHE or SAPS, have recently been supplemented with data-driven approaches that track the dynamic mortality risk over time. Recent works have focused on enhancing the informati… ▽ More Dynamic assessment of mortality risk in the intensive care unit (ICU) can be used to stratify patients, inform about treatment effectiveness or serve as part of an early-warning system. Static risk scoring systems, such as APACHE or SAPS, have recently been supplemented with data-driven approaches that track the dynamic mortality risk over time. Recent works have focused on enhancing the information delivered to clinicians even further by producing full survival distributions instead of point predictions or fixed horizon risks. In this work, we propose a non-parametric ensemble model, Weighted Resolution Survival Ensemble (WRSE), tailored to estimate such dynamic individual survival distributions. Inspired by the simplicity and robustness of ensemble methods, the proposed approach combines a set of binary classifiers spaced according to a decay function reflecting the relevance of short-term mortality predictions. Models and baselines are evaluated under weighted calibration and discrimination metrics for individual survival distributions which closely reflect the utility of a model in ICU practice. We show competitive results with state-of-the-art probabilistic models, while greatly reducing training time by factors of 2-9x. △ Less

Submitted 2 November, 2020; originally announced November 2020.

Comments: 9 pages, 6 figures

arXiv:1910.01590 [pdf, other]

DPSOM: Deep Probabilistic Clustering with Self-Organizing Maps

Authors: Laura Manduchi, Matthias Hüser, Julia Vogt, Gunnar Rätsch, Vincent Fortuin

Abstract: Generating interpretable visualizations from complex data is a common problem in many applications. Two key ingredients for tackling this issue are clustering and representation learning. However, current methods do not yet successfully combine the strengths of these two approaches. Existing representation learning models which rely on latent topological structure such as self-organising maps, exh… ▽ More Generating interpretable visualizations from complex data is a common problem in many applications. Two key ingredients for tackling this issue are clustering and representation learning. However, current methods do not yet successfully combine the strengths of these two approaches. Existing representation learning models which rely on latent topological structure such as self-organising maps, exhibit markedly lower clustering performance compared to recent deep clustering methods. To close this performance gap, we (a) present a novel way to fit self-organizing maps with probabilistic cluster assignments (PSOM), (b) propose a new deep architecture for probabilistic clustering (DPSOM) using a VAE, and (c) extend our architecture for time-series clustering (T-DPSOM), which also allows forecasting in the latent space using LSTMs. We show that DPSOM achieves superior clustering performance compared to current deep clustering methods on MNIST/Fashion-MNIST, while maintaining the favourable visualization properties of SOMs. On medical time series, we show that T-DPSOM outperforms baseline methods in time series clustering and time series forecasting, while providing interpretable visualizations of patient state trajectories and uncertainty estimation. △ Less

Submitted 9 June, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

arXiv:1904.07990 [pdf]

Machine learning for early prediction of circulatory failure in the intensive care unit

Authors: Stephanie L. Hyland, Martin Faltys, Matthias Hüser, Xinrui Lyu, Thomas Gumbsch, Cristóbal Esteban, Christian Bock, Max Horn, Michael Moor, Bastian Rieck, Marc Zimmermann, Dean Bodenham, Karsten Borgwardt, Gunnar Rätsch, Tobias M. Merz

Abstract: Intensive care clinicians are presented with large quantities of patient information and measurements from a multitude of monitoring systems. The limited ability of humans to process such complex information hinders physicians to readily recognize and act on early signs of patient deterioration. We used machine learning to develop an early warning system for circulatory failure based on a high-res… ▽ More Intensive care clinicians are presented with large quantities of patient information and measurements from a multitude of monitoring systems. The limited ability of humans to process such complex information hinders physicians to readily recognize and act on early signs of patient deterioration. We used machine learning to develop an early warning system for circulatory failure based on a high-resolution ICU database with 240 patient years of data. This automatic system predicts 90.0% of circulatory failure events (prevalence 3.1%), with 81.8% identified more than two hours in advance, resulting in an area under the receiver operating characteristic curve of 94.0% and area under the precision-recall curve of 63.0%. The model was externally validated in a large independent patient cohort. △ Less

Submitted 19 April, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

Comments: 5 main figures, 1 main table, 13 supplementary figures, 5 supplementary tables; 250ppi images

arXiv:1902.09499 [pdf, other]

doi 10.1088/1361-6579/ab6360

Forecasting intracranial hypertension using multi-scale waveform metrics

Authors: Matthias Hüser, Adrian Kündig, Walter Karlen, Valeria De Luca, Martin Jaggi

Abstract: Objective: Acute intracranial hypertension is an important risk factor of secondary brain damage after traumatic brain injury. Hypertensive episodes are often diagnosed reactively, leading to late detection and lost time for intervention planning. A pro-active approach that predicts critical events several hours ahead of time could assist in directing attention to patients at risk. Approach: We de… ▽ More Objective: Acute intracranial hypertension is an important risk factor of secondary brain damage after traumatic brain injury. Hypertensive episodes are often diagnosed reactively, leading to late detection and lost time for intervention planning. A pro-active approach that predicts critical events several hours ahead of time could assist in directing attention to patients at risk. Approach: We developed a prediction framework that forecasts onsets of acute intracranial hypertension in the next 8 hours. It jointly uses cerebral auto-regulation indices, spectral energies and morphological pulse metrics to describe the neurological state of the patient. One-minute base windows were compressed by computing signal metrics, and then stored in a multi-scale history, from which physiological features were derived. Main results: Our model predicted events up to 8 hours in advance with alarm recall rates of 90% at a precision of 30.3% in the MIMIC-III waveform database, improving upon two baselines from the literature. We found that features derived from high-frequency waveforms substantially improved the prediction performance over simple statistical summaries of low-frequency time series, and each of the three feature classes contributed to the performance gain. The inclusion of long-term history up to 8 hours was especially important. Significance: Our results highlight the importance of information contained in high-frequency waveforms in the neurological intensive care unit. They could motivate future studies on pre-hypertensive patterns and the design of new alarm algorithms for critical events in the injured brain. △ Less

Submitted 4 December, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

Comments: 11 pages, 5 figures

arXiv:1812.00490 [pdf, other]

Improving Clinical Predictions through Unsupervised Time Series Representation Learning

Authors: Xinrui Lyu, Matthias Hueser, Stephanie L. Hyland, George Zerveas, Gunnar Raetsch

Abstract: In this work, we investigate unsupervised representation learning on medical time series, which bears the promise of leveraging copious amounts of existing unlabeled data in order to eventually assist clinical decision making. By evaluating on the prediction of clinically relevant outcomes, we show that in a practical setting, unsupervised representation learning can offer clear performance benefi… ▽ More In this work, we investigate unsupervised representation learning on medical time series, which bears the promise of leveraging copious amounts of existing unlabeled data in order to eventually assist clinical decision making. By evaluating on the prediction of clinically relevant outcomes, we show that in a practical setting, unsupervised representation learning can offer clear performance benefits over end-to-end supervised architectures. We experiment with using sequence-to-sequence (Seq2Seq) models in two different ways, as an autoencoder and as a forecaster, and show that the best performance is achieved by a forecasting Seq2Seq model with an integrated attention mechanism, proposed here for the first time in the setting of unsupervised learning for medical time series. △ Less

Submitted 2 December, 2018; originally announced December 2018.

Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

Report number: ML4H/2018/171

arXiv:1806.02199 [pdf, other]

SOM-VAE: Interpretable Discrete Representation Learning on Time Series

Authors: Vincent Fortuin, Matthias Hüser, Francesco Locatello, Heiko Strathmann, Gunnar Rätsch

Abstract: High-dimensional time series are common in many domains. Since human cognition is not optimized to work well in high-dimensional spaces, these areas could benefit from interpretable low-dimensional representations. However, most representation learning algorithms for time series data are difficult to interpret. This is due to non-intuitive map**s from data features to salient properties of the r… ▽ More High-dimensional time series are common in many domains. Since human cognition is not optimized to work well in high-dimensional spaces, these areas could benefit from interpretable low-dimensional representations. However, most representation learning algorithms for time series data are difficult to interpret. This is due to non-intuitive map**s from data features to salient properties of the representation and non-smoothness over time. To address this problem, we propose a new representation learning framework building on ideas from interpretable discrete dimensionality reduction and deep generative modeling. This framework allows us to learn discrete representations of time series, which give rise to smooth and interpretable embeddings with superior clustering performance. We introduce a new way to overcome the non-differentiability in discrete representation learning and present a gradient-based version of the traditional self-organizing map algorithm that is more performant than the original. Furthermore, to allow for a probabilistic interpretation of our method, we integrate a Markov model in the representation space. This model uncovers the temporal transition structure, improves clustering performance even further and provides additional explanatory insights as well as a natural representation of uncertainty. We evaluate our model in terms of clustering performance and interpretability on static (Fashion-)MNIST data, a time series of linearly interpolated (Fashion-)MNIST images, a chaotic Lorenz attractor system with two macro states, as well as on a challenging real world medical time series application on the eICU data set. Our learned representations compare favorably with competitor methods and facilitate downstream tasks on the real world data. △ Less

Submitted 4 January, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

Comments: Accepted for publication at the Seventh International Conference on Learning Representations (ICLR 2019)

arXiv:astro-ph/9907140 [pdf, ps, other]

Dark Matter in the Dwarf Galaxy NGC 247

Authors: M. Straessle, M. Huser, Ph. Jetzer, F. DePaolis

Abstract: Dwarf galaxies are dominated by dark matter even in the innermost regions and, therefore, provide excellent probes for the investigation of dark halos. To that purpose, we analyse ROSAT PSPC-data of the dwarf galaxy NGC 247. We focus in particular on the diffuse X-ray emission in the $1/4 $keV band. Assuming an isothermal density profile, we find that the mass of the hot emitting gas is about… ▽ More Dwarf galaxies are dominated by dark matter even in the innermost regions and, therefore, provide excellent probes for the investigation of dark halos. To that purpose, we analyse ROSAT PSPC-data of the dwarf galaxy NGC 247. We focus in particular on the diffuse X-ray emission in the $1/4 $keV band. Assuming an isothermal density profile, we find that the mass of the hot emitting gas is about $10^8 {\rm M_{\odot}}$, corresponding to $\lesssim 0.5%$ of the total dynamical mass of the galaxy. The total mass of NGC 247, as derived from the X-ray data agrees quite well with the value obtained from the measured rotation curve (Burlak). The X-ray profile in the $3/4 $keV and $1.5 $keV band shows an excess at a radial distance of about $15 $ arcmin from the center. Such a ``hump'' in the radial X-ray profile can be explained by the presence of a cluster of young low mass stars or brown dwarfs. Therefore, NGC 247 offers the possibility to observe the formation of a halo of MACHOs. △ Less

Submitted 12 July, 1999; originally announced July 1999.

Comments: 6 pages, accepted for publication in A & A

Report number: ZU-TH 20/99

Journal ref: Astron.Astrophys. 349 (1999) 1

Showing 1–11 of 11 results for author: Hueser, M