Showing 1–8 of 8 results for author: Rottensteiner, F

  1. Multimodal Metadata Assignment for Cultural Heritage Artifacts

    Authors: Luis Rei, Dunja Mladenić, Mareike Dorozynski, Franz Rottensteiner, Thomas Schleider, Raphaël Troncy, Jorge Sebastián Lozano, Mar Gaitán Salvatella

    Abstract: We develop a multimodal classifier for the cultural heritage domain using a late fusion approach and introduce a novel dataset. The three modalities are Image, Text, and Tabular data. We based the image classifier on a ResNet convolutional neural network architecture and the text classifier on a multilingual transformer architecture (XLM-RoBERTa). Both are trained as multitask classifiers and use…

    Submitted 1 June, 2024; originally announced June 2024.

    Journal ref: Multimedia Systems 29 (2023) 847-869

  2. arXiv:2405.10947  [pdf, other]

    cs.CV

    Depth-aware Panoptic Segmentation

    Authors: Tuan Nguyen, Max Mehltretter, Franz Rottensteiner

    Abstract: Panoptic segmentation unifies semantic and instance segmentation and thus delivers a semantic class label and, for so-called thing classes, also an instance label per pixel. The differentiation of distinct objects of the same class with a similar appearance is particularly challenging and frequently causes such objects to be incorrectly assigned to a single instance. In the present work, we demons…

    Submitted 21 March, 2024; originally announced May 2024.

  3. arXiv:2108.07779  [pdf, other]

    cs.CV

    Appearance Based Deep Domain Adaptation for the Classification of Aerial Images

    Authors: Dennis Wittich, Franz Rottensteiner

    Abstract: This paper addresses domain adaptation for the pixel-wise classification of remotely sensed data using deep neural networks (DNN) as a strategy to reduce the requirements of DNN with respect to the availability of training data. We focus on the setting in which labelled data are only available in a source domain DS, but not in a target domain DT. Our method is based on adversarial training of an a…

    Submitted 17 August, 2021; originally announced August 2021.

  4. Pose Estimation and 3D Reconstruction of Vehicles from Stereo-Images Using a Subcategory-Aware Shape Prior

    Authors: Max Coenen, Franz Rottensteiner

    Abstract: The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane…

    Submitted 22 July, 2021; originally announced July 2021.

  5. arXiv:2104.06991  [pdf]

    cs.CV

    A hierarchical deep learning framework for the consistent classification of land use objects in geospatial databases

    Authors: Chun Yang, Franz Rottensteiner, Christian Heipke

    Abstract: Land use as contained in geospatial databases constitutes an essential input for different applications such as urban management, regional planning and environmental monitoring. In this paper, a hierarchical deep learning framework is proposed to verify the land use information. For this purpose, a two-step strategy is applied. First, given high-resolution aerial images, the land cover informatio…

    Submitted 14 April, 2021; originally announced April 2021.

  6. Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

    Authors: Max Coenen, Franz Rottensteiner

    Abstract: The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common approach to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or i…

    Submitted 21 February, 2021; originally announced February 2021.

    Comments: 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)

  7. The Hessigheim 3D (H3D) Benchmark on Semantic Segmentation of High-Resolution 3D Point Clouds and Textured Meshes from UAV LiDAR and Multi-View-Stereo

    Authors: Michael Kölle, Dominik Laupheimer, Stefan Schmohl, Norbert Haala, Franz Rottensteiner, Jan Dirk Wegner, Hugo Ledoux

    Abstract: Automated semantic segmentation and object detection are of great importance in geospatial data analysis. However, supervised machine learning systems such as convolutional neural networks require large corpora of annotated training data. Especially in the geospatial domain, such datasets are quite scarce. Within this paper, we aim to alleviate this issue by introducing a new annotated 3D dataset…

    Submitted 25 February, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: H3D can be retrieved from https://ifpwww.ifp.uni-stuttgart.de/benchmark/hessigheim/default.aspx

  8. arXiv:1307.3043  [pdf, other]

    cs.CV

    A two-layer Conditional Random Field for the classification of partially occluded objects

    Authors: Sergey Kosov, Pushmeet Kohli, Franz Rottensteiner, Christian Heipke

    Abstract: Conditional Random Fields (CRF) are among the most popular techniques for image labelling because of their flexibility in modelling dependencies between the labels and the image features. This paper proposes a novel CRF framework for image labelling problems which is capable of classifying partially occluded objects. Our approach is evaluated on aerial near-vertical images as well as on urban street-v…

    Submitted 13 September, 2013; v1 submitted 11 July, 2013; originally announced July 2013.

    Comments: Conference Submission