Skip to main content

Showing 1–3 of 3 results for author: Thede, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09384  [pdf, other

    cs.LG cs.CV

    Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models

    Authors: Lukas Thede, Karsten Roth, Olivier J. Hénaff, Matthias Bethge, Zeynep Akata

    Abstract: With the advent and recent ubiquity of foundation models, continual learning (CL) has recently shifted from continual training from scratch to the continual adaptation of pretrained models, seeing particular success on rehearsal-free CL benchmarks (RFCL). To achieve this, most proposed methods adapt and restructure parameter-efficient finetuning techniques (PEFT) to suit the continual nature of th… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 3rd Conference on Lifelong Learning Agents (CoLLAs) 2024

  2. arXiv:2310.17653  [pdf, other

    cs.LG cs.CV

    Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model

    Authors: Karsten Roth, Lukas Thede, Almut Sophia Koepke, Oriol Vinyals, Olivier Hénaff, Zeynep Akata

    Abstract: Training deep networks requires various design decisions regarding for instance their architecture, data augmentation, or optimization. In this work, we find these training variations to result in networks learning unique feature sets from the data. Using public model libraries comprising thousands of models trained on canonical datasets like ImageNet, we observe that for arbitrary pairings of pre… ▽ More

    Submitted 26 February, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (spotlight)

  3. arXiv:2304.07306  [pdf, other

    cs.LG cs.AI cs.HC

    Learning to Defer with Limited Expert Predictions

    Authors: Patrick Hemmer, Lukas Thede, Michael Vössing, Johannes Jakubik, Niklas Kühl

    Abstract: Recent research suggests that combining AI models with a human expert can exceed the performance of either alone. The combination of their capabilities is often realized by learning to defer algorithms that enable the AI to learn to decide whether to make a prediction for a particular instance or defer it to the human expert. However, to accurately learn which instances should be deferred to the h… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 37th AAAI Conference on Artificial Intelligence (AAAI-23)