Skip to main content

Showing 1–11 of 11 results for author: Ilse, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04449  [pdf, other

    cs.CL cs.CV

    MAIRA-2: Grounded Radiology Report Generation

    Authors: Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Anton Schwaighofer, Sam Bond-Taylor, Maximilian Ilse, Fernando Pérez-García, Valentina Salvatelli, Harshita Sharma, Felix Meissen, Mercy Ranjit, Shaury Srivastav, Julia Gong, Fabian Falck, Ozan Oktay, Anja Thieme, Matthew P. Lungren, Maria Teodora Wetscherek, Javier Alvarez-Valle, Stephanie L. Hyland

    Abstract: Radiology reporting is a complex task that requires detailed image understanding, integration of multiple inputs, including comparison with prior imaging, and precise language generation. This makes it ideal for the development and use of generative multimodal models. Here, we extend report generation to include the localisation of individual findings on the image - a task we call grounded report… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 44 pages, 20 figures

  2. arXiv:2401.10815  [pdf, other

    cs.CV

    RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision

    Authors: Fernando Pérez-García, Harshita Sharma, Sam Bond-Taylor, Kenza Bouzid, Valentina Salvatelli, Maximilian Ilse, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Matthew P. Lungren, Maria Wetscherek, Noel Codella, Stephanie L. Hyland, Javier Alvarez-Valle, Ozan Oktay

    Abstract: Language-supervised pre-training has proven to be a valuable method for extracting semantically meaningful features from images, serving as a foundational element in multimodal systems within the computer vision and medical imaging domains. However, resulting features are limited by the information contained within the text. This is particularly problematic in medical imaging, where radiologists'… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  3. arXiv:2312.12865  [pdf, other

    cs.CV cs.AI

    RadEdit: stress-testing biomedical vision models via diffusion image editing

    Authors: Fernando Pérez-García, Sam Bond-Taylor, Pedro P. Sanchez, Boris van Breugel, Daniel C. Castro, Harshita Sharma, Valentina Salvatelli, Maria T. A. Wetscherek, Hannah Richardson, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse

    Abstract: Biomedical imaging datasets are often small and biased, meaning that real-world performance of predictive models can be substantially lower than expected from internal testing. This work proposes using generative image editing to simulate dataset shifts and diagnose failure modes of biomedical vision models; this can be used in advance of deployment to assess readiness, potentially reducing cost a… ▽ More

    Submitted 3 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  4. arXiv:2304.13737  [pdf, other

    q-bio.QM cs.LG

    AIRIVA: A Deep Generative Model of Adaptive Immune Repertoires

    Authors: Melanie F. Pradier, Niranjani Prasad, Paidamoyo Chapfuwa, Sahra Ghalebikesabi, Max Ilse, Steven Woodhouse, Rebecca Elyanow, Javier Zazo, Javier Gonzalez, Julia Greissl, Edward Meeds

    Abstract: Recent advances in immunomics have shown that T-cell receptor (TCR) signatures can accurately predict active or recent infection by leveraging the high specificity of TCR binding to disease antigens. However, the extreme diversity of the adaptive immune repertoire presents challenges in reliably identifying disease-specific TCRs. Population genetics and sequencing depth can also have strong system… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

  5. arXiv:2301.04558  [pdf, other

    cs.CV cs.CL

    Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

    Authors: Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Maximilian Ilse, Daniel C. Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anja Thieme, Anton Schwaighofer, Maria Wetscherek, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay

    Abstract: Self-supervised learning in vision-language processing exploits semantic alignment between imaging and text modalities. Prior work in biomedical VLP has mostly relied on the alignment of single image and report pairs even though clinical notes commonly refer to prior images. This does not only introduce poor alignment between the modalities but also a missed opportunity to exploit rich self-superv… ▽ More

    Submitted 16 March, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: To appear in CVPR 2023

  6. arXiv:2103.04786  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Combining Interventional and Observational Data Using Causal Reductions

    Authors: Maximilian Ilse, Patrick Forré, Max Welling, Joris M. Mooij

    Abstract: Unobserved confounding is one of the main challenges when estimating causal effects. We propose a causal reduction method that, given a causal model, replaces an arbitrary number of possibly high-dimensional latent confounders with a single latent confounder that takes values in the same space as the treatment variable, without changing the observational and interventional distributions the causal… ▽ More

    Submitted 22 February, 2023; v1 submitted 8 March, 2021; originally announced March 2021.

  7. arXiv:2011.01761  [pdf, other

    cs.LG cs.SD eess.AS

    Problems using deep generative models for probabilistic audio source separation

    Authors: Maurice Frank, Maximilian Ilse

    Abstract: Recent advancements in deep generative modeling make it possible to learn prior distributions from complex data that subsequently can be used for Bayesian inference. However, we find that distributions learned by deep generative models for audio signals do not exhibit the right properties that are necessary for tasks like audio source separation using a probabilistic approach. We observe that the… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Journal ref: 1st I Can't Believe It's Not Better Workshop (ICBINB @ NeurIPS 2020), Vancouver, Canada

  8. arXiv:2005.01856  [pdf, other

    stat.ML cs.CV cs.LG

    Selecting Data Augmentation for Simulating Interventions

    Authors: Maximilian Ilse, Jakub M. Tomczak, Patrick Forré

    Abstract: Machine learning models trained with purely observational data and the principle of empirical risk minimization \citep{vapnik_principles_1992} can fail to generalize to unseen domains. In this paper, we focus on the case where the problem arises through spurious correlation between the observed domains and the actual task labels. We find that many domain generalization methods do not explicitly ta… ▽ More

    Submitted 26 October, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

  9. arXiv:1905.10427  [pdf, other

    stat.ML cs.LG

    DIVA: Domain Invariant Variational Autoencoders

    Authors: Maximilian Ilse, Jakub M. Tomczak, Christos Louizos, Max Welling

    Abstract: We consider the problem of domain generalization, namely, how to learn representations given data from a set of domains that generalize to data from a previously unseen domain. We propose the Domain Invariant Variational Autoencoder (DIVA), a generative model that tackles this problem by learning three independent latent subspaces, one for the domain, one for the class, and one for any residual va… ▽ More

    Submitted 7 October, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Code available at https://github.com/AMLab-Amsterdam/DIVA

  10. arXiv:1802.04712  [pdf, other

    cs.LG stat.ML

    Attention-based Deep Multiple Instance Learning

    Authors: Maximilian Ilse, Jakub M. Tomczak, Max Welling

    Abstract: Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the bag label probability is fully parameterized by neural networks. Furthermore, we propose a neural network-based permutation-invariant aggregation operator that c… ▽ More

    Submitted 28 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: ICML 2018 paper, code source: https://github.com/AMLab-Amsterdam/AttentionDeepMIL

  11. arXiv:1712.00310  [pdf, other

    cs.LG stat.ML

    Deep Learning with Permutation-invariant Operator for Multi-instance Histopathology Classification

    Authors: Jakub M. Tomczak, Maximilian Ilse, Max Welling

    Abstract: The computer-aided analysis of medical scans is a longstanding goal in the medical imaging field. Currently, deep learning has became a dominant methodology for supporting pathologists and radiologist. Deep learning algorithms have been successfully applied to digital pathology and radiology, nevertheless, there are still practical issues that prevent these tools to be widely used in practice. The… ▽ More

    Submitted 5 December, 2017; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: Workshop on "Medical Imaging meets NIPS" at NIPS 2017