Skip to main content

Showing 1–50 of 53 results for author: Rätsch, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18450  [pdf, other

    cs.LG cs.AI

    Preference Elicitation for Offline Reinforcement Learning

    Authors: Alizée Pace, Bernhard Schölkopf, Gunnar Rätsch, Giorgia Ramponi

    Abstract: Applying reinforcement learning (RL) to real-world problems is often made challenging by the inability to interact with the environment and the difficulty of designing reward functions. Offline RL addresses the first challenge by considering access to an offline dataset of environment interactions labeled by the reward function. In contrast, Preference-based RL does not assume access to the reward… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2403.18316  [pdf, other

    cs.LG

    Multi-Modal Contrastive Learning for Online Clinical Time-Series Applications

    Authors: Fabian Baldenweg, Manuel Burger, Gunnar Rätsch, Rita Kuznetsova

    Abstract: Electronic Health Record (EHR) datasets from Intensive Care Units (ICU) contain a diverse set of data modalities. While prior works have successfully leveraged multiple modalities in supervised settings, we apply advanced self-supervised multi-modal contrastive learning techniques to ICU data, specifically focusing on clinical notes and time-series for clinically relevant online prediction tasks.… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted as a Workshop Paper at TS4H@ICLR2024

  3. arXiv:2403.12818  [pdf, other

    cs.LG

    Dynamic Survival Analysis for Early Event Prediction

    Authors: Hugo Yèche, Manuel Burger, Dinara Veshchezerova, Gunnar Rätsch

    Abstract: This study advances Early Event Prediction (EEP) in healthcare through Dynamic Survival Analysis (DSA), offering a novel approach by integrating risk localization into alarm policies to enhance clinical event metrics. By adapting and evaluating DSA models against traditional EEP benchmarks, our research demonstrates their ability to match EEP models on a time-step level and significantly improve e… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2312.03865  [pdf, other

    cs.LG q-bio.GN

    Learning Genomic Sequence Representations using Graph Neural Networks over De Bruijn Graphs

    Authors: Kacper Kapuśniak, Manuel Burger, Gunnar Rätsch, Amir Joudaki

    Abstract: The rapid expansion of genomic sequence data calls for new methods to achieve robust sequence representations. Existing techniques often neglect intricate structural details, emphasizing mainly contextual information. To address this, we developed k-mer embeddings that merge contextual and structural string information by enhancing De Bruijn graphs with structural similarity connections. Subsequen… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Poster at "NeurIPS 2023 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2023)"

  5. arXiv:2311.08902  [pdf, other

    cs.LG

    On the Importance of Step-wise Embeddings for Heterogeneous Clinical Time-Series

    Authors: Rita Kuznetsova, Alizée Pace, Manuel Burger, Hugo Yèche, Gunnar Rätsch

    Abstract: Recent advances in deep learning architectures for sequence modeling have not fully transferred to tasks handling time-series from electronic health records. In particular, in problems related to the Intensive Care Unit (ICU), the state-of-the-art remains to tackle sequence classification in a tabular manner with tree-based methods. Recent findings in deep learning for tabular data are now surpass… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Machine Learning for Health (ML4H) 2023 in Proceedings of Machine Learning Research 225

    Journal ref: PMLR 225:268-291, 2023

  6. arXiv:2311.07180  [pdf, other

    cs.LG cs.AI

    Knowledge Graph Representations to enhance Intensive Care Time-Series Predictions

    Authors: Samyak Jain, Manuel Burger, Gunnar Rätsch, Rita Kuznetsova

    Abstract: Intensive Care Units (ICU) require comprehensive patient data integration for enhanced clinical outcome predictions, crucial for assessing patient conditions. Recent deep learning advances have utilized patient time series data, and fusion models have incorporated unstructured clinical reports, improving predictive performance. However, integrating established medical knowledge into these models h… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 11 pages

  7. arXiv:2311.00768  [pdf, other

    cs.LG cs.CL

    Language Model Training Paradigms for Clinical Feature Embeddings

    Authors: Yurong Hu, Manuel Burger, Gunnar Rätsch, Rita Kuznetsova

    Abstract: In research areas with scarce data, representation learning plays a significant role. This work aims to enhance representation learning for clinical time series by deriving universal embeddings for clinical features, such as heart rate and blood pressure. We use self-supervised training paradigms for language models to learn high-quality clinical feature embeddings, achieving a finer granularity t… ▽ More

    Submitted 6 February, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Poster at "NeurIPS 2023 Workshop: Self-Supervised Learning - Theory and Practice"

  8. arXiv:2310.02012  [pdf, other

    cs.LG cs.AI

    Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion

    Authors: Alexandru Meterez, Amir Joudaki, Francesco Orabona, Alexander Immer, Gunnar Rätsch, Hadi Daneshmand

    Abstract: Normalization layers are one of the key building blocks for deep neural networks. Several theoretical studies have shown that batch normalization improves the signal propagation, by avoiding the representations from becoming collinear across the layers. However, results on mean-field theory of batch normalization also conclude that this benefit comes at the expense of exploding gradients in depth.… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  9. arXiv:2307.04461  [pdf, other

    cs.LG

    Multi-modal Graph Learning over UMLS Knowledge Graphs

    Authors: Manuel Burger, Gunnar Rätsch, Rita Kuznetsova

    Abstract: Clinicians are increasingly looking towards machine learning to gain insights about patient evolutions. We propose a novel approach named Multi-Modal UMLS Graph Learning (MMUGL) for learning meaningful representations of medical concepts using graph neural networks over knowledge graphs based on the unified medical language system. These representations are aggregated to represent entire patient v… ▽ More

    Submitted 9 November, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: Machine Learning for Health (ML4H) 2023 in Proceedings of Machine Learning Research 225

    Journal ref: PMLR 225:52-81, 2023

  10. arXiv:2306.03968  [pdf, other

    stat.ML cs.LG

    Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels

    Authors: Alexander Immer, Tycho F. A. van der Ouderaa, Mark van der Wilk, Gunnar Rätsch, Bernhard Schölkopf

    Abstract: Selecting hyperparameters in deep learning greatly impacts its effectiveness but requires manual effort and expertise. Recent works show that Bayesian model selection with Laplace approximations can allow to optimize such hyperparameters just like standard neural network parameters using gradients and on the training data. However, estimating a single hyperparameter gradient requires a pass throug… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  11. arXiv:2306.01157  [pdf, other

    cs.LG cs.AI

    Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding

    Authors: Alizée Pace, Hugo Yèche, Bernhard Schölkopf, Gunnar Rätsch, Guy Tennenholtz

    Abstract: A prominent challenge of offline reinforcement learning (RL) is the issue of hidden confounding: unobserved variables may influence both the actions taken by the agent and the observed outcomes. Hidden confounding can compromise the validity of any causal conclusion drawn from data and presents a major obstacle to effective offline RL. In the present paper, we tackle the problem of hidden confound… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  12. arXiv:2305.16905  [pdf, other

    stat.ML cs.LG

    Improving Neural Additive Models with Bayesian Principles

    Authors: Kouroche Bouchiat, Alexander Immer, Hugo Yèche, Gunnar Rätsch, Vincent Fortuin

    Abstract: Neural additive models (NAMs) enhance the transparency of deep neural networks by handling input features in separate additive sub-networks. However, they lack inherent mechanisms that provide calibrated uncertainties and enable selection of relevant features and interactions. Approaching NAMs from a Bayesian perspective, we augment them in three primary ways, namely by a) providing credible inter… ▽ More

    Submitted 29 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 41st International Conference on Machine Learning (ICML 2024)

  13. arXiv:2212.03044  [pdf, other

    cs.LG

    On the Importance of Clinical Notes in Multi-modal Learning for EHR Data

    Authors: Severin Husmann, Hugo Yèche, Gunnar Rätsch, Rita Kuznetsova

    Abstract: Understanding deep learning model behavior is critical to accepting machine learning-based decision support systems in the medical community. Previous research has shown that jointly using clinical notes with electronic health record (EHR) data improved predictive performance for patient monitoring in the intensive care unit (ICU). In this work, we explore the underlying reasons for these improvem… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Workshop on Learning from Time Series for Health, 36th Conference on Neural Information Processing Systems (NeurIPS 2022) 15 pages (including appendices)

  14. arXiv:2208.13764  [pdf, other

    cs.LG

    Temporal Label Smoothing for Early Event Prediction

    Authors: Hugo Yèche, Alizée Pace, Gunnar Rätsch, Rita Kuznetsova

    Abstract: Models that can predict the occurrence of events ahead of time with low false-alarm rates are critical to the acceptance of decision support systems in the medical community. This challenging task is typically treated as a simple binary classification, ignoring temporal dependencies between samples, whereas we propose to exploit this structure. We first introduce a common theoretical framework uni… ▽ More

    Submitted 30 January, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

  15. arXiv:2202.13212  [pdf, other

    cs.LG math.OC

    Faster One-Sample Stochastic Conditional Gradient Method for Composite Convex Minimization

    Authors: Gideon Dresdner, Maria-Luiza Vladarean, Gunnar Rätsch, Francesco Locatello, Volkan Cevher, Alp Yurtsever

    Abstract: We propose a stochastic conditional gradient method (CGM) for minimizing convex finite-sum objectives formed as a sum of smooth and non-smooth terms. Existing CGM variants for this template either suffer from slow convergence rates, or require carefully increasing the batch size over the course of the algorithm's execution, which leads to computing full gradients. In contrast, the proposed method,… ▽ More

    Submitted 17 April, 2022; v1 submitted 26 February, 2022; originally announced February 2022.

    Comments: Artificial Intelligence and Statistics (AISTATS) 2022

  16. arXiv:2202.10638  [pdf, other

    stat.ML cs.LG

    Invariance Learning in Deep Neural Networks with Differentiable Laplace Approximations

    Authors: Alexander Immer, Tycho F. A. van der Ouderaa, Gunnar Rätsch, Vincent Fortuin, Mark van der Wilk

    Abstract: Data augmentation is commonly applied to improve performance of deep learning by enforcing the knowledge that certain transformations on the input preserve the output. Currently, the data augmentation parameters are chosen by human effort and costly cross-validation, which makes it cumbersome to apply to new datasets. We develop a convenient gradient-based method for selecting the data augmentatio… ▽ More

    Submitted 13 October, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022

  17. arXiv:2111.08536  [pdf, other

    cs.LG

    HiRID-ICU-Benchmark -- A Comprehensive Machine Learning Benchmark on High-resolution ICU Data

    Authors: Hugo Yèche, Rita Kuznetsova, Marc Zimmermann, Matthias Hüser, Xinrui Lyu, Martin Faltys, Gunnar Rätsch

    Abstract: The recent success of machine learning methods applied to time series collected from Intensive Care Units (ICU) exposes the lack of standardized machine learning benchmarks for develo** and comparing such methods. While raw datasets, such as MIMIC-IV or eICU, can be freely accessed on Physionet, the choice of tasks and pre-processing is often chosen ad-hoc for each publication, limiting comparab… ▽ More

    Submitted 17 January, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 (Datasets and Benchmarks)

  18. arXiv:2106.05142  [pdf, other

    cs.LG

    Neighborhood Contrastive Learning Applied to Online Patient Monitoring

    Authors: Hugo Yèche, Gideon Dresdner, Francesco Locatello, Matthias Hüser, Gunnar Rätsch

    Abstract: Intensive care units (ICU) are increasingly looking towards machine learning for methods to provide online monitoring of critically ill patients. In machine learning, online monitoring is often formulated as a supervised learning problem. Recently, contrastive learning approaches have demonstrated promising improvements over competitive supervised benchmarks. These methods rely on well-understood… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: ICML 2021

    Journal ref: Proceedings of 38th International Conference on Machine Learning, 139, (2021) 11964--11974

  19. arXiv:2105.09240  [pdf, other

    cs.LG stat.ML

    Boosting Variational Inference With Locally Adaptive Step-Sizes

    Authors: Gideon Dresdner, Saurav Shekhar, Fabian Pedregosa, Francesco Locatello, Gunnar Rätsch

    Abstract: Variational Inference makes a trade-off between the capacity of the variational family and the tractability of finding an approximate posterior distribution. Instead, Boosting Variational Inference allows practitioners to obtain increasingly good posterior approximations by spending more compute. The main obstacle to widespread adoption of Boosting Variational Inference is the amount of resources… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

  20. arXiv:2105.05728  [pdf, other

    cs.LG stat.ML

    Early prediction of respiratory failure in the intensive care unit

    Authors: Matthias Hüser, Martin Faltys, Xinrui Lyu, Chris Barber, Stephanie L. Hyland, Tobias M. Merz, Gunnar Rätsch

    Abstract: The development of respiratory failure is common among patients in intensive care units (ICU). Large data quantities from ICU patient monitoring systems make timely and comprehensive analysis by clinicians difficult but are ideal for automatic processing by machine learning algorithms. Early prediction of respiratory system failure could alert clinicians to patients at risk of respiratory failure… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 14 pages, 5 figures

  21. arXiv:2104.04975  [pdf, other

    stat.ML cs.LG

    Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning

    Authors: Alexander Immer, Matthias Bauer, Vincent Fortuin, Gunnar Rätsch, Mohammad Emtiyaz Khan

    Abstract: Marginal-likelihood based model-selection, even though promising, is rarely used in deep learning due to estimation difficulties. Instead, most approaches rely on validation data, which may not be readily available. In this work, we present a scalable marginal-likelihood estimation method to select both hyperparameters and network architectures, based on the training data alone. Some hyperparamete… ▽ More

    Submitted 15 June, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: ICML 2021

  22. arXiv:2102.06571  [pdf, other

    stat.ML cs.LG

    Bayesian Neural Network Priors Revisited

    Authors: Vincent Fortuin, Adrià Garriga-Alonso, Sebastian W. Ober, Florian Wenzel, Gunnar Rätsch, Richard E. Turner, Mark van der Wilk, Laurence Aitchison

    Abstract: Isotropic Gaussian priors are the de facto standard for modern Bayesian neural network inference. However, it is unclear whether these priors accurately reflect our true beliefs about the weight distributions or give optimal performance. To find better priors, we study summary statistics of neural network weights in networks trained using stochastic gradient descent (SGD). We find that convolution… ▽ More

    Submitted 16 March, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: Accepted at ICLR 2022

  23. arXiv:2102.05507  [pdf, other

    stat.ML cs.LG

    On Disentanglement in Gaussian Process Variational Autoencoders

    Authors: Simon Bing, Vincent Fortuin, Gunnar Rätsch

    Abstract: Complex multivariate time series arise in many fields, ranging from computer vision to robotics or medicine. Often we are interested in the independent underlying factors that give rise to the high-dimensional data we are observing. While many models have been introduced to learn such disentangled representations, only few attempt to explicitly exploit the structure of sequential data. We investig… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

  24. arXiv:2011.00865  [pdf, other

    stat.ML cs.LG

    WRSE -- a non-parametric weighted-resolution ensemble for predicting individual survival distributions in the ICU

    Authors: Jonathan Heitz, Joanna Ficek, Martin Faltys, Tobias M. Merz, Gunnar Rätsch, Matthias Hüser

    Abstract: Dynamic assessment of mortality risk in the intensive care unit (ICU) can be used to stratify patients, inform about treatment effectiveness or serve as part of an early-warning system. Static risk scoring systems, such as APACHE or SAPS, have recently been supplemented with data-driven approaches that track the dynamic mortality risk over time. Recent works have focused on enhancing the informati… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 9 pages, 6 figures

  25. arXiv:2010.14766  [pdf, other

    cs.LG stat.ML

    A Sober Look at the Unsupervised Learning of Disentangled Representations and their Evaluation

    Authors: Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem

    Abstract: The idea behind the \emph{unsupervised} learning of \emph{disentangled} representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms. In this paper, we provide a sober look at recent progress in the field and challenge some common assumptions. We first theoretically show that the unsupervised learning of d… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.12359

    Journal ref: Journal of Machine Learning Research 2020, Volume 21, Number 209

  26. arXiv:2010.13472  [pdf, other

    stat.ML cs.LG

    Scalable Gaussian Process Variational Autoencoders

    Authors: Metod Jazbec, Matthew Ashman, Vincent Fortuin, Michael Pearce, Stephan Mandt, Gunnar Rätsch

    Abstract: Conventional variational autoencoders fail in modeling correlations between data points due to their use of factorized priors. Amortized Gaussian process inference through GP-VAEs has led to significant improvements in this regard, but is still inhibited by the intrinsic complexity of exact GP inference. We improve the scalability of these methods through principled sparse inference approaches. We… ▽ More

    Submitted 24 February, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: Published at AISTATS 2021

  27. arXiv:2007.14184  [pdf, other

    cs.LG cs.AI stat.ML

    A Commentary on the Unsupervised Learning of Disentangled Representations

    Authors: Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem

    Abstract: The goal of the unsupervised learning of disentangled representations is to separate the independent explanatory factors of variation in the data without access to supervision. In this paper, we summarize the results of Locatello et al., 2019, and focus on their implications for practitioners. We discuss the theoretical result showing that the unsupervised learning of disentangled representations… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Journal ref: The Thirty-Fourth AAAI Conference on Artificial Intelligence 2020 (AAAI-20)

  28. arXiv:2002.02886  [pdf, other

    cs.LG stat.ML

    Weakly-Supervised Disentanglement Without Compromises

    Authors: Francesco Locatello, Ben Poole, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem, Michael Tschannen

    Abstract: Intelligent agents should be able to learn useful representations by observing changes in their environment. We model such observations as pairs of non-i.i.d. images sharing at least one of the underlying factors of variation. First, we theoretically show that only knowing how many factors have changed, but not which ones, is sufficient to learn disentangled representations. Second, we provide pra… ▽ More

    Submitted 20 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Comments: We updated the description of the generation of the dataset compared to the ICML version

    Journal ref: ICML 2020

  29. arXiv:1911.04200  [pdf, other

    cs.CE cs.DC cs.PF q-bio.GN

    Communication-Efficient Jaccard Similarity for High-Performance Distributed Genome Comparisons

    Authors: Maciej Besta, Raghavendra Kanakagiri, Harun Mustafa, Mikhail Karasikov, Gunnar Rätsch, Torsten Hoefler, Edgar Solomonik

    Abstract: The Jaccard similarity index is an important measure of the overlap of two sets, widely used in machine learning, computational genomics, information retrieval, and many other areas. We design and implement SimilarityAtScale, the first communication-efficient distributed algorithm for computing the Jaccard similarity among pairs of large datasets. Our algorithm provides an efficient encoding of th… ▽ More

    Submitted 11 November, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the 34st IEEE International Parallel and Distributed Processing Symposium (IPDPS'20), 2020

  30. arXiv:1910.01590  [pdf, other

    cs.LG stat.ML

    DPSOM: Deep Probabilistic Clustering with Self-Organizing Maps

    Authors: Laura Manduchi, Matthias Hüser, Julia Vogt, Gunnar Rätsch, Vincent Fortuin

    Abstract: Generating interpretable visualizations from complex data is a common problem in many applications. Two key ingredients for tackling this issue are clustering and representation learning. However, current methods do not yet successfully combine the strengths of these two approaches. Existing representation learning models which rely on latent topological structure such as self-organising maps, exh… ▽ More

    Submitted 9 June, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

  31. arXiv:1909.13146  [pdf, other

    q-bio.GN cs.LG stat.ML

    META$^\mathbf{2}$: Memory-efficient taxonomic classification and abundance estimation for metagenomics with deep learning

    Authors: Andreas Georgiou, Vincent Fortuin, Harun Mustafa, Gunnar Rätsch

    Abstract: Metagenomic studies have increasingly utilized sequencing technologies in order to analyze DNA fragments found in environmental samples.One important step in this analysis is the taxonomic classification of the DNA fragments. Conventional read classification methods require large databases and vast amounts of memory to run, with recent deep learning methods suffering from very large model sizes. W… ▽ More

    Submitted 10 February, 2020; v1 submitted 28 September, 2019; originally announced September 2019.

  32. arXiv:1907.04155  [pdf, other

    stat.ML cs.LG

    GP-VAE: Deep Probabilistic Time Series Imputation

    Authors: Vincent Fortuin, Dmitry Baranchuk, Gunnar Rätsch, Stephan Mandt

    Abstract: Multivariate time series with missing values are common in areas such as healthcare and finance, and have grown in number and complexity over the years. This raises the question whether deep learning methodologies can outperform classical data imputation methods in this domain. However, naive applications of deep learning fall short in giving reliable confidence estimates and lack interpretability… ▽ More

    Submitted 20 February, 2020; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020)

  33. arXiv:1905.01258  [pdf, other

    cs.LG cs.AI stat.ML

    Disentangling Factors of Variation Using Few Labels

    Authors: Francesco Locatello, Michael Tschannen, Stefan Bauer, Gunnar Rätsch, Bernhard Schölkopf, Olivier Bachem

    Abstract: Learning disentangled representations is considered a cornerstone problem in representation learning. Recently, Locatello et al. (2019) demonstrated that unsupervised disentanglement learning without inductive biases is theoretically impossible and that existing inductive biases and unsupervised methods do not allow to consistently learn disentangled representations. However, in many practical set… ▽ More

    Submitted 14 February, 2020; v1 submitted 3 May, 2019; originally announced May 2019.

    Journal ref: Eighth International Conference on Learning Representations - ICLR 2020

  34. arXiv:1904.12973  [pdf

    cs.LG cs.CL stat.AP stat.ML

    Unsupervised Extraction of Phenotypes from Cancer Clinical Notes for Association Studies

    Authors: Stefan G. Stark, Stephanie L. Hyland, Melanie F. Pradier, Kjong Lehmann, Andreas Wicki, Fernando Perez Cruz, Julia E. Vogt, Gunnar Rätsch

    Abstract: The recent adoption of Electronic Health Records (EHRs) by health care providers has introduced an important source of data that provides detailed and highly specific insights into patient phenotypes over large cohorts. These datasets, in combination with machine learning and statistical approaches, generate new opportunities for research and clinical care. However, many methods require the patien… ▽ More

    Submitted 3 May, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

  35. arXiv:1904.07990  [pdf

    cs.LG stat.AP stat.ML

    Machine learning for early prediction of circulatory failure in the intensive care unit

    Authors: Stephanie L. Hyland, Martin Faltys, Matthias Hüser, Xinrui Lyu, Thomas Gumbsch, Cristóbal Esteban, Christian Bock, Max Horn, Michael Moor, Bastian Rieck, Marc Zimmermann, Dean Bodenham, Karsten Borgwardt, Gunnar Rätsch, Tobias M. Merz

    Abstract: Intensive care clinicians are presented with large quantities of patient information and measurements from a multitude of monitoring systems. The limited ability of humans to process such complex information hinders physicians to readily recognize and act on early signs of patient deterioration. We used machine learning to develop an early warning system for circulatory failure based on a high-res… ▽ More

    Submitted 19 April, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: 5 main figures, 1 main table, 13 supplementary figures, 5 supplementary tables; 250ppi images

  36. arXiv:1901.08098  [pdf, other

    stat.ML cs.LG

    Meta-Learning Mean Functions for Gaussian Processes

    Authors: Vincent Fortuin, Heiko Strathmann, Gunnar Rätsch

    Abstract: When fitting Bayesian machine learning models on scarce data, the main challenge is to obtain suitable prior knowledge and encode it into the model. Recent advances in meta-learning offer powerful methods for extracting such prior knowledge from data acquired in related tasks. When it comes to meta-learning in Gaussian process models, approaches in this setting have mostly focused on learning the… ▽ More

    Submitted 14 February, 2020; v1 submitted 23 January, 2019; originally announced January 2019.

  37. arXiv:1812.00490  [pdf, other

    cs.LG stat.ML

    Improving Clinical Predictions through Unsupervised Time Series Representation Learning

    Authors: Xinrui Lyu, Matthias Hueser, Stephanie L. Hyland, George Zerveas, Gunnar Raetsch

    Abstract: In this work, we investigate unsupervised representation learning on medical time series, which bears the promise of leveraging copious amounts of existing unlabeled data in order to eventually assist clinical decision making. By evaluating on the prediction of clinically relevant outcomes, we show that in a practical setting, unsupervised representation learning can offer clear performance benefi… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/171

  38. arXiv:1811.12359  [pdf, other

    cs.LG cs.AI stat.ML

    Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations

    Authors: Francesco Locatello, Stefan Bauer, Mario Lucic, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf, Olivier Bachem

    Abstract: The key idea behind the unsupervised learning of disentangled representations is that real-world data is generated by a few explanatory factors of variation which can be recovered by unsupervised learning algorithms. In this paper, we provide a sober look at recent progress in the field and challenge some common assumptions. We first theoretically show that the unsupervised learning of disentangle… ▽ More

    Submitted 18 June, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the 36th International Conference on Machine Learning (ICML 2019)

  39. arXiv:1810.10368  [pdf, other

    stat.ML cs.AI cs.LG

    Scalable Gaussian Processes on Discrete Domains

    Authors: Vincent Fortuin, Gideon Dresdner, Heiko Strathmann, Gunnar Rätsch

    Abstract: Kernel methods on discrete domains have shown great promise for many challenging data types, for instance, biological sequence data and molecular structure data. Scalable kernel methods like Support Vector Machines may offer good predictive performances but do not intrinsically provide uncertainty estimates. In contrast, probabilistic kernel methods like Gaussian Processes offer uncertainty estima… ▽ More

    Submitted 26 May, 2021; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: Published at IEEE Access

  40. arXiv:1806.02199  [pdf, other

    cs.LG stat.ML

    SOM-VAE: Interpretable Discrete Representation Learning on Time Series

    Authors: Vincent Fortuin, Matthias Hüser, Francesco Locatello, Heiko Strathmann, Gunnar Rätsch

    Abstract: High-dimensional time series are common in many domains. Since human cognition is not optimized to work well in high-dimensional spaces, these areas could benefit from interpretable low-dimensional representations. However, most representation learning algorithms for time series data are difficult to interpret. This is due to non-intuitive map**s from data features to salient properties of the r… ▽ More

    Submitted 4 January, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Comments: Accepted for publication at the Seventh International Conference on Learning Representations (ICLR 2019)

  41. arXiv:1806.02185  [pdf, other

    stat.ML cs.LG

    Boosting Black Box Variational Inference

    Authors: Francesco Locatello, Gideon Dresdner, Rajiv Khanna, Isabel Valera, Gunnar Rätsch

    Abstract: Approximating a probability density in a tractable manner is a central task in Bayesian statistics. Variational Inference (VI) is a popular technique that achieves tractability by choosing a relatively simple variational family. Borrowing ideas from the classic boosting framework, recent approaches attempt to \emph{boost} VI by replacing the selection of a single density with a greedily constructe… ▽ More

    Submitted 28 November, 2018; v1 submitted 6 June, 2018; originally announced June 2018.

  42. arXiv:1804.11130  [pdf, other

    cs.LG cs.AI stat.ML

    Competitive Training of Mixtures of Independent Deep Generative Models

    Authors: Francesco Locatello, Damien Vincent, Ilya Tolstikhin, Gunnar Rätsch, Sylvain Gelly, Bernhard Schölkopf

    Abstract: A common assumption in causal modeling posits that the data is generated by a set of independent mechanisms, and algorithms should aim to recover this structure. Standard unsupervised learning, however, is often concerned with training a single model to capture the overall distribution or aspects thereof. Inspired by clustering approaches, we consider mixtures of implicit generative models that ``… ▽ More

    Submitted 3 March, 2019; v1 submitted 30 April, 2018; originally announced April 2018.

  43. arXiv:1803.09539  [pdf, other

    stat.ML cs.LG math.OC

    On Matching Pursuit and Coordinate Descent

    Authors: Francesco Locatello, Anant Raj, Sai Praneeth Karimireddy, Gunnar Rätsch, Bernhard Schölkopf, Sebastian U. Stich, Martin Jaggi

    Abstract: Two popular examples of first-order optimization methods over linear spaces are coordinate descent and matching pursuit algorithms, with their randomized variants. While the former targets the optimization by moving along coordinates, the latter considers a generalized notion of directions. Exploiting the connection between the two algorithms, we present a unified analysis of both, providing affin… ▽ More

    Submitted 31 May, 2019; v1 submitted 26 March, 2018; originally announced March 2018.

    Journal ref: ICML 2018 - Proceedings of the 35th International Conference on Machine Learning

  44. arXiv:1708.01733  [pdf, other

    cs.LG cs.AI stat.ML

    Boosting Variational Inference: an Optimization Perspective

    Authors: Francesco Locatello, Rajiv Khanna, Joydeep Ghosh, Gunnar Rätsch

    Abstract: Variational inference is a popular technique to approximate a possibly intractable Bayesian posterior with a more tractable one. Recently, boosting variational inference has been proposed as a new paradigm to approximate the posterior by a mixture of densities by greedily adding components to the mixture. However, as is the case with many other variational inference algorithms, its theoretical pro… ▽ More

    Submitted 7 March, 2018; v1 submitted 5 August, 2017; originally announced August 2017.

    Journal ref: AISTATS 2018

  45. arXiv:1706.02633  [pdf, other

    stat.ML cs.LG

    Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs

    Authors: Cristóbal Esteban, Stephanie L. Hyland, Gunnar Rätsch

    Abstract: Generative Adversarial Networks (GANs) have shown remarkable success as a framework for training models to produce realistic-looking data. In this work, we propose a Recurrent GAN (RGAN) and Recurrent Conditional GAN (RCGAN) to produce realistic real-valued multi-dimensional time series, with an emphasis on their application to medical data. RGANs make use of recurrent neural networks in the gener… ▽ More

    Submitted 3 December, 2017; v1 submitted 8 June, 2017; originally announced June 2017.

    Comments: 13 pages, 4 figures, 3 tables (update with differential privacy)

  46. arXiv:1705.11041  [pdf, other

    cs.LG stat.ML

    Greedy Algorithms for Cone Constrained Optimization with Convergence Guarantees

    Authors: Francesco Locatello, Michael Tschannen, Gunnar Rätsch, Martin Jaggi

    Abstract: Greedy optimization methods such as Matching Pursuit (MP) and Frank-Wolfe (FW) algorithms regained popularity in recent years due to their simplicity, effectiveness and theoretical guarantees. MP and FW address optimization over the linear span and the convex hull of a set of atoms, respectively. In this paper, we consider the intermediate case of optimization over the convex cone, parametrized as… ▽ More

    Submitted 19 November, 2017; v1 submitted 31 May, 2017; originally announced May 2017.

    Comments: NIPS 2017

  47. arXiv:1607.04903  [pdf, other

    stat.ML cs.LG

    Learning Unitary Operators with Help From u(n)

    Authors: Stephanie L. Hyland, Gunnar Rätsch

    Abstract: A major challenge in the training of recurrent neural networks is the so-called vanishing or exploding gradient problem. The use of a norm-preserving transition operator can address this issue, but parametrization is challenging. In this work we focus on unitary operators and describe a parametrization using the Lie algebra $\mathfrak{u}(n)$ associated with the Lie group $U(n)$ of $n \times n$ uni… ▽ More

    Submitted 10 January, 2017; v1 submitted 17 July, 2016; originally announced July 2016.

    Comments: 9 pages, 3 figures, 5 figures inc. subfigures, to appear at AAAI-17

  48. arXiv:1602.03551  [pdf, other

    cs.CL stat.AP

    Knowledge Transfer with Medical Language Embeddings

    Authors: Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

    Abstract: Identifying relationships between concepts is a key aspect of scientific knowledge synthesis. Finding these links often requires a researcher to laboriously search through scien- tific papers and databases, as the size of these resources grows ever larger. In this paper we describe how distributional semantics can be used to unify structured knowledge graphs with unstructured text to predict new r… ▽ More

    Submitted 10 February, 2016; originally announced February 2016.

    Comments: 6 pages, 2 figures, to appear at SDM-DMMH 2016

  49. arXiv:1510.00259  [pdf, other

    cs.CL cs.LG stat.ML

    A Generative Model of Words and Relationships from Multiple Sources

    Authors: Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

    Abstract: Neural language models are a powerful tool to embed words into semantic vector spaces. However, learning such models generally relies on the availability of abundant and diverse training examples. In highly specialised domains this requirement may not be met due to difficulties in obtaining a large corpus, or the limited range of expression in average use. Such domains may encode prior knowledge a… ▽ More

    Submitted 3 December, 2015; v1 submitted 1 October, 2015; originally announced October 2015.

    Comments: 8 pages, 5 figures; incorporated feedback from reviewers; to appear in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence 2016

  50. arXiv:1506.09153  [pdf, other

    stat.ML cs.CE cs.LG

    Framework for Multi-task Multiple Kernel Learning and Applications in Genome Analysis

    Authors: Christian Widmer, Marius Kloft, Vipin T Sreedharan, Gunnar Rätsch

    Abstract: We present a general regularization-based framework for Multi-task learning (MTL), in which the similarity between tasks can be learned or refined using $\ell_p$-norm Multiple Kernel learning (MKL). Based on this very general formulation (including a general loss function), we derive the corresponding dual formulation using Fenchel duality applied to Hermitian matrices. We show that numerous estab… ▽ More

    Submitted 30 June, 2015; originally announced June 2015.