Skip to main content

Showing 1–8 of 8 results for author: Kayser, M

.
  1. arXiv:2303.04958  [pdf, other

    cs.CV cs.AI

    NIFF: Alleviating Forgetting in Generalized Few-Shot Object Detection via Neural Instance Feature Forging

    Authors: Karim Guirguis, Johannes Meier, George Eskandar, Matthias Kayser, Bin Yang, Juergen Beyerer

    Abstract: Privacy and memory are two recurring themes in a broad conversation about the societal impact of AI. These concerns arise from the need for huge amounts of data to train deep neural networks. A promise of Generalized Few-shot Object Detection (G-FSOD), a learning paradigm in AI, is to alleviate the need for collecting abundant training samples of novel classes we wish to detect by leveraging prior… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  2. arXiv:2210.05783  [pdf, other

    cs.CV cs.AI

    Towards Discriminative and Transferable One-Stage Few-Shot Object Detectors

    Authors: Karim Guirguis, Mohamed Abdelsamad, George Eskandar, Ahmed Hendawy, Matthias Kayser, Bin Yang, Juergen Beyerer

    Abstract: Recent object detection models require large amounts of annotated data for training a new classes of objects. Few-shot object detection (FSOD) aims to address this problem by learning novel classes given only a few samples. While competitive results have been achieved using two-stage FSOD detectors, typically one-stage FSODs underperform compared to them. We make the observation that the large gap… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  3. arXiv:2207.04343  [pdf, other

    cs.CV cs.AI cs.CL

    Explaining Chest X-ray Pathologies in Natural Language

    Authors: Maxime Kayser, Cornelius Emde, Oana-Maria Camburu, Guy Parsons, Bartlomiej Papiez, Thomas Lukasiewicz

    Abstract: Most deep learning algorithms lack explanations for their predictions, which limits their deployment in clinical practice. Approaches to improve explainability, especially in medical imaging, have often been shown to convey limited information, be overly reassuring, or lack robustness. In this work, we introduce the task of generating natural language explanations (NLEs) to justify predictions mad… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

    Journal ref: MICCAI 2022

  4. arXiv:2204.05220  [pdf, other

    cs.CV

    CFA: Constraint-based Finetuning Approach for Generalized Few-Shot Object Detection

    Authors: Karim Guirguis, Ahmed Hendawy, George Eskandar, Mohamed Abdelsamad, Matthias Kayser, Juergen Beyerer

    Abstract: Few-shot object detection (FSOD) seeks to detect novel categories with limited data by leveraging prior knowledge from abundant base data. Generalized few-shot object detection (G-FSOD) aims to tackle FSOD without forgetting previously seen base classes and, thus, accounts for a more realistic scenario, where both classes are encountered during test time. While current FSOD methods suffer from cat… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

  5. arXiv:2204.05072  [pdf, other

    cs.CV

    Few-Shot Object Detection in Unseen Domains

    Authors: Karim Guirguis, George Eskandar, Matthias Kayser, Bin Yang, Juergen Beyerer

    Abstract: Few-shot object detection (FSOD) has thrived in recent years to learn novel object classes with limited data by transferring knowledge gained on abundant base classes. FSOD approaches commonly assume that both the scarcely provided examples of novel classes and test-time data belong to the same domain. However, this assumption does not hold in various industrial and robotics applications, where a… ▽ More

    Submitted 19 September, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

  6. arXiv:2105.03761  [pdf, other

    cs.CV cs.CL cs.LG

    e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks

    Authors: Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz

    Abstract: Recently, there has been an increasing number of efforts to introduce models capable of generating natural language explanations (NLEs) for their predictions on vision-language (VL) tasks. Such models are appealing, because they can provide human-friendly and comprehensive explanations. However, there is a lack of comparison between existing methods, which is due to a lack of re-usable evaluation… ▽ More

    Submitted 18 August, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at ICCV 2021 (camera-ready version)

  7. arXiv:2010.05002  [pdf, other

    cs.CL

    Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings

    Authors: Prafull Prakash, Saurabh Kumar Shashidhar, Wenlong Zhao, Subendhu Rongali, Haidar Khan, Michael Kayser

    Abstract: The current state-of-the-art task-oriented semantic parsing models use BERT or RoBERTa as pretrained encoders; these models have huge memory footprints. This poses a challenge to their deployment for voice assistants such as Amazon Alexa and Google Assistant on edge devices with limited memory budgets. We propose to learn compositional code embeddings to greatly reduce the sizes of BERT-base and R… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: Accepted at EMNLP 2020 (Findings); 7 Pages

  8. arXiv:2002.02883  [pdf, other

    cs.LG eess.IV stat.ML

    Understanding the effects of artifacts on automated polyp detection and incorporating that knowledge via learning without forgetting

    Authors: Maxime Kayser, Roger D. Soberanis-Mukul, Anna-Maria Zvereva, Peter Klare, Nassir Navab, Shadi Albarqouni

    Abstract: Survival rates for colorectal cancer are higher when polyps are detected at an early stage and can be removed before they develop into malignant tumors. Automated polyp detection, which is dominated by deep learning based methods, seeks to improve early detection of polyps. However, current efforts rely heavily on the size and quality of the training datasets. The quality of these datasets often s… ▽ More

    Submitted 22 August, 2020; v1 submitted 7 February, 2020; originally announced February 2020.