Showing 1–4 of 4 results for author: Fanconi, C

  1. arXiv:2406.08414

    cs.LG

    Discovering Preference Optimization Algorithms with and for Large Language Models

    Authors: Chris Lu, Samuel Holt, Claudio Fanconi, Alex J. Chan, Jakob Foerster, Mihaela van der Schaar, Robert Tjarko Lange

    Abstract: Offline preference optimization is a key method for enhancing and controlling the quality of Large Language Model (LLM) outputs. Typically, preference optimization is approached as an offline supervised learning task using manually-crafted convex loss functions. While these methods are based on theoretical insights, they are inherently constrained by human creativity, so the large search space of…

    Submitted 12 June, 2024; originally announced June 2024.

  2. This Reads Like That: Deep Learning for Interpretable Natural Language Processing

    Authors: Claudio Fanconi, Moritz Vandenhirtz, Severin Husmann, Julia E. Vogt

    Abstract: Prototype learning, a popular machine learning method designed for inherently interpretable decisions, leverages similarities to learned prototypes for classifying new data. While it is mainly applied in computer vision, in this work, we build upon prior research and further explore the extension of prototypical networks to natural language processing. We introduce a learned weighted similarity me…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 10 pages, 1 figure, 5 tables

    Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

  3. arXiv:2209.13860

    cs.CL cs.LG

    Natural Language Processing Methods to Identify Oncology Patients at High Risk for Acute Care with Clinical Notes

    Authors: Claudio Fanconi, Marieke van Buchem, Tina Hernandez-Boussard

    Abstract: Clinical notes are an essential component of a health record. This paper evaluates how natural language processing (NLP) can be used to identify the risk of acute care use (ACU) in oncology patients, once chemotherapy starts. Risk prediction using structured health data (SHD) is now standard, but predictions using free-text formats are complex. This paper explores the use of free-text notes for th…

    Submitted 16 March, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 11 pages, 6 figures, 2 tables

    Journal ref: AMIA Informatics Summit 2023

  4. arXiv:2105.02968

    cs.CV cs.AI cs.LG

    This Looks Like That... Does it? Shortcomings of Latent Space Prototype Interpretability in Deep Networks

    Authors: Adrian Hoffmann, Claudio Fanconi, Rahul Rade, Jonas Kohler

    Abstract: Deep neural networks that yield human interpretable decisions by architectural design have lately become an increasingly popular alternative to post hoc interpretation of traditional black-box models. Among these networks, the arguably most widespread approach is so-called prototype learning, where similarities to learned latent prototypes serve as the basis of classifying an unseen data point. In…

    Submitted 23 June, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Journal ref: ICML 2021 Workshop on Theoretic Foundation, Criticism, and Application Trend of Explainable AI