Showing 1–2 of 2 results for author: Moenck, K

  1. arXiv:2406.09637  [pdf, other]

    cs.CV

    Industrial Language-Image Dataset (ILID): Adapting Vision Foundation Models for Industrial Settings

    Authors: Keno Moenck, Duc Trung Thieu, Julian Koch, Thorsten Schüppstuhl

    Abstract: In recent years, the upstream of Large Language Models (LLM) has also encouraged the computer vision community to work on substantial multimodal datasets and train models on a scale in a self-/semi-supervised manner, resulting in Vision Foundation Models (VFM), as, e.g., Contrastive Language-Image Pre-training (CLIP). The models generalize well and perform outstandingly on everyday objects or scen…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Dataset at https://github.com/kenomo/ilid; training- and evaluation-related code at https://github.com/kenomo/industrial-clip

  2. arXiv:2307.12674  [pdf, other]

    cs.CV

    Industrial Segment Anything -- a Case Study in Aircraft Manufacturing, Intralogistics, Maintenance, Repair, and Overhaul

    Authors: Keno Moenck, Arne Wendt, Philipp Prünte, Julian Koch, Arne Sahrhage, Johann Gierecker, Ole Schmedemann, Falko Kähler, Dirk Holst, Martin Gomse, Thorsten Schüppstuhl, Daniel Schoepflin

    Abstract: Deploying deep learning-based applications in specialized domains like the aircraft production industry typically suffers from the training data availability problem. Only a few datasets represent non-everyday objects, situations, and tasks. Recent advances in research around Vision Foundation Models (VFM) opened a new area of tasks and models with high generalization capabilities in non-semanti…

    Submitted 24 July, 2023; originally announced July 2023.