Skip to main content

Showing 1–3 of 3 results for author: Harb, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.08199  [pdf, other

    eess.IV cs.CV cs.LG

    Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale

    Authors: Robert Harb, Thomas Pock, Heimo Müller

    Abstract: We present a novel diffusion-based approach to generate synthetic histopathological Whole Slide Images (WSIs) at an unprecedented gigapixel scale. Synthetic WSIs have many potential applications: They can augment training datasets to enhance the performance of many computational pathology applications. They allow the creation of synthesized copies of datasets that can be shared without violating p… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    ACM Class: I.4.9; I.5.4; I.2.10

  2. arXiv:2110.03477  [pdf, ps, other

    cs.CV

    InfoSeg: Unsupervised Semantic Image Segmentation with Mutual Information Maximization

    Authors: Robert Harb, Patrick Knöbelreiter

    Abstract: We propose a novel method for unsupervised semantic image segmentation based on mutual information maximization between local and global high-level image features. The core idea of our work is to leverage recent progress in self-supervised image representation learning. Representation learning methods compute a single high-level feature capturing an entire image. In contrast, we compute multiple h… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: GCPR 2021 - Best Paper

  3. arXiv:1810.06897  [pdf, other

    cs.SD eess.AS

    Sound event detection using weakly-labeled semi-supervised data with GCRNNS, VAT and Self-Adaptive Label Refinement

    Authors: Robert Harb, Franz Pernkopf

    Abstract: In this paper, we present a gated convolutional recurrent neural network based approach to solve task 4, large-scale weakly labelled semi-supervised sound event detection in domestic environments, of the DCASE 2018 challenge. Gated linear units and a temporal attention layer are used to predict the onset and offset of sound events in 10s long audio clips. Whereby for training only weakly-labelled… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: Accepted at DCASE 2018 Workshop for oral presentation