Showing 1–7 of 7 results for author: Springstein, M

  1. arXiv:2207.02976  [pdf, other]

    cs.CV cs.IR

    Semi-supervised Human Pose Estimation in Art-historical Images

    Authors: Matthias Springstein, Stefanie Schneider, Christian Althaus, Ralph Ewerth

    Abstract: Gesture as language of non-verbal communication has been theoretically established since the 17th century. However, its relevance for the visual arts has been expressed only sporadically. This may be primarily due to the sheer overwhelming amount of data that traditionally had to be processed by hand. With the steady progress of digitization, though, a growing number of historical artifacts have b…

    Submitted 15 August, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted at ACM MM 2022 as a conference paper

  2. iART: A Search Engine for Art-Historical Images to Support Research in the Humanities

    Authors: Matthias Springstein, Stefanie Schneider, Javad Rahnama, Eyke Hüllermeier, Hubertus Kohle, Ralph Ewerth

    Abstract: In this paper, we introduce iART: an open Web platform for art-historical research that facilitates the process of comparative vision. The system integrates various machine learning techniques for keyword- and content-based image retrieval as well as category formation via clustering. An intuitive GUI supports users to define queries and explore results. By using a state-of-the-art cross-modal dee…

    Submitted 3 August, 2021; originally announced August 2021.

    Journal ref: ACM Multimedia Conference 2021

  3. arXiv:2106.09432  [pdf, other]

    cs.CV cs.LG

    Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

    Authors: Matthias Springstein, Eric Müller-Budack, Ralph Ewerth

    Abstract: The recognition of handwritten mathematical expressions in images and video frames is a difficult and still unsolved problem. Deep convolutional neural networks are a promising approach in principle, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system tha…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted for publication in: ACM International Conference on Multimedia Retrieval (ICMR) Workshop 2021

  4. arXiv:2104.13748  [pdf, other]

    cs.IR cs.MM

    QuTI! Quantifying Text-Image Consistency in Multimodal Documents

    Authors: Matthias Springstein, Eric Müller-Budack, Ralph Ewerth

    Abstract: The World Wide Web and social media platforms have become popular sources for news and information. Typically, multimodal information, e.g., image and text, is used to convey information more effectively and to attract attention. While in most cases image content is decorative or depicts additional information, it has also been leveraged to spread misinformation and rumors in recent years. In this…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in: International ACM SIGIR Conference on Research and Development in Information Retrieval 2021

  5. arXiv:2011.04714  [pdf, other]

    cs.CV

    Ontology-driven Event Type Classification in Images

    Authors: Eric Müller-Budack, Matthias Springstein, Sherzod Hakimov, Kevin Mrutzek, Ralph Ewerth

    Abstract: Event classification can add valuable information for semantic search and the increasingly important topic of fact validation in news. So far, only a few approaches address image classification for newsworthy event types such as natural disasters, sports events, or elections. Previous work distinguishes only between a limited number of event types and relies on rather small datasets for training. In…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted for publication in: IEEE Winter Conference on Applications of Computer Vision (WACV) 2021

  6. Understanding, Categorizing and Predicting Semantic Image-Text Relations

    Authors: Christian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth

    Abstract: Two modalities are often used to convey information in a complementary and beneficial manner, e.g., in online news, videos, educational resources, or scientific publications. The automatic understanding of semantic correlations between text and associated images as well as their interplay has a great potential for enhanced multimodal web search and recommender systems. However, automatic understan…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 8 pages, 8 Figures, 5 tables

    Journal ref: In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). ACM, New York, NY, USA, 168-176

  7. arXiv:1806.06796  [pdf, other]

    cs.DL

    TIB-arXiv: An Alternative Search Portal for the arXiv Pre-print Server

    Authors: Matthias Springstein, Huu Hung Nguyen, Anett Hoppe, Ralph Ewerth

    Abstract: arXiv is a popular pre-print server focusing on natural science disciplines (e.g. physics, computer science, quantitative biology). As a platform with a focus on easy publishing services, it does not provide enhanced search functionality, but offers programming interfaces which allow external parties to add these services. This paper presents extensions of the open source framework arXiv Sanity Pre…

    Submitted 18 June, 2018; originally announced June 2018.