Skip to main content

Showing 1–12 of 12 results for author: Busch, F

.
  1. arXiv:2402.12298  [pdf, other

    cs.CL cs.AI

    Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports

    Authors: Felix J. Dorfner, Liv Jürgensen, Leonhard Donle, Fares Al Mohamad, Tobias R. Bodenmann, Mason C. Cleveland, Felix Busch, Lisa C. Adams, James Sato, Thomas Schultz, Albert E. Kim, Jameson Merkow, Keno K. Bressem, Christopher P. Bridge

    Abstract: Introduction: With the rapid advances in large language models (LLMs), there have been numerous new open source as well as commercial models. While recent publications have explored GPT-4 in its application to extracting information of interest from radiology reports, there has not been a real-world comparison of GPT-4 to different leading open-source models. Materials and Methods: Two different… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  2. arXiv:2402.06434  [pdf, other

    cs.LG stat.ML

    Where is the Truth? The Risk of Getting Confounded in a Continual World

    Authors: Florian Peter Busch, Roshni Kamath, Rupert Mitchell, Wolfgang Stammer, Kristian Kersting, Martin Mundt

    Abstract: A dataset is confounded if it is most easily solved via a spurious correlation, which fails to generalize to new data. In this work, we show that, in a continual learning setting where confounders may vary in time across tasks, the challenge of mitigating the effect of confounders far exceeds the standard forgetting problem normally considered. In particular, we provide a formal description of suc… ▽ More

    Submitted 15 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  3. arXiv:2401.14490  [pdf, other

    cs.CL

    LongHealth: A Question Answering Benchmark with Long Clinical Documents

    Authors: Lisa Adams, Felix Busch, Tianyu Han, Jean-Baptiste Excoffier, Matthieu Ortala, Alexander Löser, Hugo JWL. Aerts, Jakob Nikolas Kather, Daniel Truhn, Keno Bressem

    Abstract: Background: Recent advancements in large language models (LLMs) offer potential benefits in healthcare, particularly in processing extensive patient records. However, existing benchmarks do not fully assess LLMs' capability in handling real-world, lengthy clinical data. Methods: We present the LongHealth benchmark, comprising 20 detailed fictional patient cases across various diseases, with each… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 11 pages, 3 figures, 5 tables

  4. arXiv:2311.14777  [pdf, other

    eess.IV cs.CV

    From Text to Image: Exploring GPT-4Vision's Potential in Advanced Radiological Analysis across Subspecialties

    Authors: Felix Busch, Tianyu Han, Marcus Makowski, Daniel Truhn, Keno Bressem, Lisa Adams

    Abstract: The study evaluates and compares GPT-4 and GPT-4Vision for radiological tasks, suggesting GPT-4Vision may recognize radiological features from images, thereby enhancing its diagnostic potential over text-based descriptions.

    Submitted 24 November, 2023; originally announced November 2023.

    Journal ref: J Med Internet Res 2024;26:e54948

  5. MEDBERT.de: A Comprehensive German BERT Model for the Medical Domain

    Authors: Keno K. Bressem, Jens-Michalis Papaioannou, Paul Grundmann, Florian Borchert, Lisa C. Adams, Leonhard Liu, Felix Busch, Lina Xu, Jan P. Loyen, Stefan M. Niehues, Moritz Augustin, Lennart Grosser, Marcus R. Makowski, Hugo JWL. Aerts, Alexander Löser

    Abstract: This paper presents medBERTde, a pre-trained German BERT model specifically designed for the German medical domain. The model has been trained on a large corpus of 4.7 Million German medical documents and has been shown to achieve new state-of-the-art performance on eight different medical benchmarks covering a wide range of disciplines and medical document types. In addition to evaluating the ove… ▽ More

    Submitted 24 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Keno K. Bressem and Jens-Michalis Papaioannou and Paul Grundmann contributed equally

    Journal ref: Expert Systems with Applications 2024;237(21):121598

  6. arXiv:2212.12575  [pdf, other

    cs.AI

    Continual Causal Abstractions

    Authors: Matej Zečević, Moritz Willig, Jonas Seng, Florian Peter Busch

    Abstract: This short paper discusses continually updated causal abstractions as a potential direction of future research. The key idea is to revise the existing level of causal abstraction to a different level of detail that is both consistent with the history of observed data and more effective in solving a given task.

    Submitted 6 January, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: Main paper: 3 pages, 1 figure. References: 1 page

  7. arXiv:2209.13696  [pdf, other

    cs.CV cs.AI eess.IV

    What Does DALL-E 2 Know About Radiology?

    Authors: Lisa C. Adams, Felix Busch, Daniel Truhn, Marcus R. Makowski, Hugo JWL. Aerts, Keno K. Bressem

    Abstract: Generative models such as DALL-E 2 could represent a promising future tool for image generation, augmentation, and manipulation for artificial intelligence research in radiology provided that these models have sufficient medical domain knowledge. Here we show that DALL-E 2 has learned relevant representations of X-ray images with promising capabilities in terms of zero-shot text-to-image generatio… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 4 Figures

    Journal ref: J Med Internet Res 2023;25:e43110

  8. Combining Predictions under Uncertainty: The Case of Random Decision Trees

    Authors: Florian Busch, Moritz Kulessa, Eneldo Loza Mencía, Hendrik Blockeel

    Abstract: A common approach to aggregate classification estimates in an ensemble of decision trees is to either use voting or to average the probabilities for each class. The latter takes uncertainty into account, but not the reliability of the uncertainty estimates (so to say, the "uncertainty about the uncertainty"). More generally, much remains unknown about how to best combine probabilistic estimates fr… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Preprint version. Appeared in: Discovery Science. 24th International Conference, DS 2021, Halifax, NS, Canada, October 11 to 13, 2021, Proceedings. See https://ds2021.cs.dal.ca/ for further information

    Journal ref: In International Conference on Discovery Science (pp. 78-93). Springer, Cham (2021)

  9. arXiv:2206.07203  [pdf, other

    cs.LG

    Attributions Beyond Neural Networks: The Linear Program Case

    Authors: Florian Peter Busch, Matej Zečević, Kristian Kersting, Devendra Singh Dhami

    Abstract: Linear Programs (LPs) have been one of the building blocks in machine learning and have championed recent strides in differentiable optimizers for learning systems. While there exist solvers for even high-dimensional LPs, understanding said high-dimensional solutions poses an orthogonal and unresolved problem. We introduce an approach where we consider neural encodings for LPs that justify the app… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Main paper: 9.5 pages, References: 2 pages, Supplement: 2.5 pages. Main paper: 5 figures, 2 tables, Supplement: 1 figure

  10. arXiv:2205.04718  [pdf, other

    cs.MA eess.SY

    Integrating Parcel Deliveries into a Ride-Pooling Service -- An Agent-Based Simulation Study

    Authors: Fabian Fehn, Roman Engelhardt, Florian Dandl, Klaus Bogenberger, Fritz Busch

    Abstract: This paper examines the integration of freight delivery into the passenger transport of an on-demand ride-pooling service. The goal of this research is to use existing passenger trips for logistics services and thus reduce additional vehicle kilometers for freight delivery and the total number of vehicles on the road network. This is achieved by merging the need for two separate fleets into a sing… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

  11. arXiv:2204.12533  [pdf, other

    cs.RO eess.SY

    A Gaussian Process Model for Opponent Prediction in Autonomous Racing

    Authors: Edward L. Zhu, Finn Lukas Busch, Jake Johnson, Francesco Borrelli

    Abstract: In head-to-head racing, an accurate model of interactive behavior of the opposing target vehicle (TV) is required to perform tightly constrained, but highly rewarding maneuvers such as overtaking. However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In parti… ▽ More

    Submitted 1 March, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Submitted to IROS 2023

  12. arXiv:2203.15274  [pdf, other

    cs.AI

    Finding Structure and Causality in Linear Programs

    Authors: Matej Zečević, Florian Peter Busch, Devendra Singh Dhami, Kristian Kersting

    Abstract: Linear Programs (LP) are celebrated widely, particularly so in machine learning where they have allowed for effectively solving probabilistic inference tasks or imposing structure on end-to-end learning systems. Their potential might seem depleted but we propose a foundational, causal perspective that reveals intriguing intra- and inter-structure relations for LP components. We conduct a systemati… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Main paper: 5 pages, References: 2 pages, Appendix: 1 page. Figures: 8 main, 1 appendix. Tables: 1 appendix