Skip to main content

Showing 1–34 of 34 results for author: Gevers, T

.
  1. arXiv:2406.09126  [pdf, other

    cs.CV

    Auto-Vocabulary Segmentation for LiDAR Points

    Authors: Weijie Wei, Osman Ülger, Fatemeh Karimi Najadasl, Theo Gevers, Martin R. Oswald

    Abstract: Existing perception methods for autonomous driving fall short of recognizing unknown entities not covered in the training data. Open-vocabulary methods offer promising capabilities in detecting any object but are limited by user-specified queries representing target classes. We propose AutoVoc3D, a framework for automatic object class recognition and open-ended segmentation. Evaluation on nuScenes… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR 2024 OpenSun3D Workshop

  2. arXiv:2403.20092  [pdf, other

    cs.CV

    Modeling Weather Uncertainty for Multi-weather Co-Presence Estimation

    Authors: Qi Bi, Shaodi You, Theo Gevers

    Abstract: Images from outdoor scenes may be taken under various weather conditions. It is well studied that weather impacts the performance of computer vision algorithms and needs to be handled properly. However, existing algorithms model weather condition as a discrete status and estimate it using multi-label classification. The fact is that, physically, specifically in meteorology, weather are modeled as… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Work in progress

  3. arXiv:2312.10217  [pdf, other

    cs.CV

    T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

    Authors: Weijie Wei, Fatemeh Karimi Nejadasl, Theo Gevers, Martin R. Oswald

    Abstract: The scarcity of annotated data in LiDAR point cloud understanding hinders effective representation learning. Consequently, scholars have been actively investigating efficacious self-supervised pre-training paradigms. Nevertheless, temporal information, which is inherent in the LiDAR point cloud sequence, is consistently disregarded. To better utilize this property, we propose an effective pre-trai… ▽ More

    Submitted 21 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Under review

  4. arXiv:2312.10070  [pdf, other

    cs.CV cs.RO

    Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

    Authors: Vladimir Yugay, Yue Li, Theo Gevers, Martin R. Oswald

    Abstract: We present a dense simultaneous localization and map** (SLAM) method that uses 3D Gaussians as a scene representation. Our approach enables interactive-time reconstruction and photo-realistic rendering from real-world single-camera RGBD videos. To this end, we propose a novel effective strategy for seeding new Gaussians for newly explored areas and their effective online optimization that is ind… ▽ More

    Submitted 22 March, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

  5. arXiv:2310.07669  [pdf, other

    cs.CV cs.AI

    HaarNet: Large-scale Linear-Morphological Hybrid Network for RGB-D Semantic Segmentation

    Authors: Rick Groenendijk, Leo Dorst, Theo Gevers

    Abstract: Signals from different modalities each have their own combination algebra which affects their sampling processing. RGB is mostly linear; depth is a geometric signal following the operations of mathematical morphology. If a network obtaining RGB-D input has both kinds of operators available in its layers, it should be able to give effective output with fewer parameters. In this paper, morphological… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    ACM Class: I.4.6; I.2.6

  6. arXiv:2310.07573  [pdf, other

    cs.CV

    Relational Prior Knowledge Graphs for Detection and Instance Segmentation

    Authors: Osman Ülger, Yu Wang, Ysbrand Galama, Sezer Karaoglu, Theo Gevers, Martin R. Oswald

    Abstract: Humans have a remarkable ability to perceive and reason about the world around them by understanding the relationships between objects. In this paper, we investigate the effectiveness of using such relationships for object detection and instance segmentation. To this end, we propose a Relational Prior-based Feature Enhancement Model (RP-FEM), a graph transformer that enhances object proposal featu… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Published in ICCV2023 SG2RL Workshop

  7. arXiv:2309.17162  [pdf, other

    cs.CV

    APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds

    Authors: Weijie Wei, Martin R. Oswald, Fatemeh Karimi Nejadasl, Theo Gevers

    Abstract: In this paper, we focus on semantic segmentation method for point clouds of urban scenes. Our fundamental concept revolves around the collaborative utilization of diverse scene representations to benefit from different context information and network architectures. To this end, the proposed network architecture, called APNet, is split into two branches: a point cloud branch and an aerial image bra… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV Workshop 2023 and selected as an oral

  8. arXiv:2307.10924  [pdf, other

    cs.CV

    Intrinsic Image Decomposition Using Point Cloud Representation

    Authors: Xiaoyan Xing, Konrad Groh, Sezer Karaoglu, Theo Gevers

    Abstract: The purpose of intrinsic decomposition is to separate an image into its albedo (reflective properties) and shading components (illumination properties). This is challenging because it's an ill-posed problem. Conventional approaches primarily concentrate on 2D imagery and fail to fully exploit the capabilities of 3D data representation. 3D point clouds offer a more comprehensive format for represen… ▽ More

    Submitted 28 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: Code: https://github.com/xyxingx/PoInt-Net

  9. arXiv:2307.00371  [pdf, other

    cs.CV

    Learning Content-enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation

    Authors: Qi Bi, Shaodi You, Theo Gevers

    Abstract: Domain-generalized urban-scene semantic segmentation (USSS) aims to learn generalized semantic predictions across diverse urban-scene styles. Unlike domain gap challenges, USSS is unique in that the semantic categories are often similar in different urban scenes, while the styles can vary significantly due to changes in urban landscapes, weather conditions, lighting, and other factors. Existing ap… ▽ More

    Submitted 17 December, 2023; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: Accepted by AAAI 2024. Camera-ready version with available source code

  10. arXiv:2211.14037  [pdf, other

    cs.CV cs.LG

    MorphPool: Efficient Non-linear Pooling & Unpooling in CNNs

    Authors: Rick Groenendijk, Leo Dorst, Theo Gevers

    Abstract: Pooling is essentially an operation from the field of Mathematical Morphology, with max pooling as a limited special case. The more general setting of MorphPooling greatly extends the tool set for building neural networks. In addition to pooling operations, encoder-decoder networks used for pixel-level predictions also require unpooling. It is common to combine unpooling with convolution or deconv… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted paper at the British Machine Vision Conference (BMVC) 2022

    ACM Class: I.4.10; I.3.5; I.4.6

  11. arXiv:2208.14369  [pdf, other

    cs.CV

    SIGNet: Intrinsic Image Decomposition by a Semantic and Invariant Gradient Driven Network for Indoor Scenes

    Authors: Partha Das, Sezer Karaoglu, Arjan Gijsenij, Theo Gevers

    Abstract: Intrinsic image decomposition (IID) is an under-constrained problem. Therefore, traditional approaches use hand crafted priors to constrain the problem. However, these constraints are limited when co** with complex scenes. Deep learning-based approaches learn these constraints implicitly through the data, but they often suffer from dataset biases (due to not being able to include all possible im… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

  12. Invariant Descriptors for Intrinsic Reflectance Optimization

    Authors: Anil S. Baslamisli, Theo Gevers

    Abstract: Intrinsic image decomposition aims to factorize an image into albedo (reflectance) and shading (illumination) sub-components. Being ill-posed and under-constrained, it is a very challenging computer vision problem. There are infinite pairs of reflectance and shading images that can reconstruct the same input. To address the problem, Intrinsic Images in the Wild provides an optimization framework b… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Journal ref: Journal of the Optical Society of America A, Vol. 38, Issue 6, pp. 887-896 (2021)

  13. arXiv:2203.16670  [pdf, other

    cs.CV

    PIE-Net: Photometric Invariant Edge Guided Network for Intrinsic Image Decomposition

    Authors: Partha Das, Sezer Karaoglu, Theo Gevers

    Abstract: Intrinsic image decomposition is the process of recovering the image formation components (reflectance and shading) from an image. Previous methods employ either explicit priors to constrain the problem or implicit constraints as formulated by their losses (deep learning). These methods can be negatively influenced by strong illumination conditions causing shading-reflectance leakages. Therefore… ▽ More

    Submitted 2 May, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

  14. arXiv:2109.00863  [pdf, other

    cs.CV

    Generative Models for Multi-Illumination Color Constancy

    Authors: Partha Das, Yang Liu, Sezer Karaoglu, Theo Gevers

    Abstract: In this paper, the aim is multi-illumination color constancy. However, most of the existing color constancy methods are designed for single light sources. Furthermore, datasets for learning multiple illumination color constancy are largely missing. We propose a seed (physics driven) based multi-illumination color constancy method. GANs are exploited to model the illumination estimation problem as… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted in International Conference on Computer Vision Workshop (ICCVW) 2021

  15. arXiv:2011.04389  [pdf, other

    cs.CV

    EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes

    Authors: Hoang-An Le, Thomas Mensink, Partha Das, Sezer Karaoglu, Theo Gevers

    Abstract: Multimodal large-scale datasets for outdoor scenes are mostly designed for urban driving problems. The scenes are highly structured and semantically different from scenarios seen in nature-centered scenes such as gardens or parks. To promote machine learning methods for nature-oriented applications, such as agriculture and gardening, we propose the multimodal synthetic dataset for Enclosed garDEN… ▽ More

    Submitted 10 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted for publishing at WACV 2021

  16. arXiv:2010.11844  [pdf, other

    cs.CV

    Spatio-temporal Features for Generalized Detection of Deepfake Videos

    Authors: Ipek Ganiyusufoglu, L. Minh Ngô, Nedko Savov, Sezer Karaoglu, Theo Gevers

    Abstract: For deepfake detection, video-level detectors have not been explored as extensively as image-level detectors, which do not exploit temporal data. In this paper, we empirically show that existing approaches on image and sequence classifiers generalize poorly to new manipulation techniques. To this end, we propose spatio-temporal features, modeled by 3D CNNs, to extend the generalization capabilitie… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Submitted to Computer Vision and Image Understanding (CVIU)

  17. arXiv:2009.08321  [pdf, other

    cs.CV

    Novel View Synthesis from Single Images via Point Cloud Transformation

    Authors: Hoang-An Le, Thomas Mensink, Partha Das, Theo Gevers

    Abstract: In this paper the argument is made that for true novel view synthesis of objects, where the object can be synthesized from any viewpoint, an explicit 3D shape representation isdesired. Our method estimates point clouds to capture the geometry of the object, which can be freely rotated into the desired view and then projected into a new image. This image, however, is sparse by nature and hence this… ▽ More

    Submitted 18 September, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted at British Machine Vision Conference 2020

  18. arXiv:2009.01717  [pdf, other

    cs.CV cs.AI

    Multi-Loss Weighting with Coefficient of Variations

    Authors: Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink

    Abstract: Many interesting tasks in machine learning and computer vision are learned by optimising an objective function defined as a weighted linear combination of multiple losses. The final performance is sensitive to choosing the correct (relative) weights for these losses. Finding a good set of weights is often done by adopting them into the set of hyper-parameters, which are set using an extensive grid… ▽ More

    Submitted 10 November, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: Paper was accepted at the IEEE Winter Conference on Applications of Computer Vision 2021 (WACV2021)

    MSC Class: 68T45 ACM Class: I.4

  19. arXiv:2009.01540  [pdf, other

    cs.CV

    Physics-based Shading Reconstruction for Intrinsic Image Decomposition

    Authors: Anil S. Baslamisli, Yang Liu, Sezer Karaoglu, Theo Gevers

    Abstract: We investigate the use of photometric invariance and deep learning to compute intrinsic images (albedo and shading). We propose albedo and shading gradient descriptors which are derived from physics-based models. Using the descriptors, albedo transitions are masked out and an initial sparse shading map is calculated directly from the corresponding RGB image gradients in a learning-free unsupervise… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Submitted to Computer Vision and Image Understanding (CVIU)

  20. arXiv:2004.06382  [pdf, other

    cs.CV

    Kinship Identification through Joint Learning Using Kinship Verification Ensembles

    Authors: Wei Wang, Shaodi You, Sezer Karaoglu, Theo Gevers

    Abstract: Kinship verification is a well-explored task: identifying whether or not two persons are kin. In contrast, kinship identification has been largely ignored so far. Kinship identification aims to further identify the particular type of kinship. An extension to kinship verification run short to properly obtain identification, because existing verification networks are individually trained on specific… ▽ More

    Submitted 24 August, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

    Comments: 14 pages, 7 figures

  21. arXiv:1912.04023  [pdf, other

    cs.CV

    ShadingNet: Image Intrinsics by Fine-Grained Shading Decomposition

    Authors: Anil S. Baslamisli, Partha Das, Hoang-An Le, Sezer Karaoglu, Theo Gevers

    Abstract: In general, intrinsic image decomposition algorithms interpret shading as one unified component including all photometric effects. As shading transitions are generally smoother than reflectance (albedo) changes, these methods may fail in distinguishing strong photometric effects from reflectance variations. Therefore, in this paper, we propose to decompose the shading component into direct (illumi… ▽ More

    Submitted 21 January, 2021; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: Submitted to International Journal of Computer Vision (IJCV)

  22. On the Benefit of Adversarial Training for Monocular Depth Estimation

    Authors: Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink

    Abstract: In this paper we address the benefit of adding adversarial training to the task of monocular depth estimation. A model can be trained in a self-supervised setting on stereo pairs of images, where depth (disparities) are an intermediate result in a right-to-left image reconstruction pipeline. For the quality of the image reconstruction and disparity prediction, a combination of different losses is… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 11 pages, 8 tables, 5 figures, accepted at CVIU

    MSC Class: 68T45

  23. arXiv:1812.10558  [pdf, other

    cs.CV

    Deception Detection by 2D-to-3D Face Reconstruction from Videos

    Authors: Minh Ngô, Burak Mandira, Selim Fırat Yılmaz, Ward Heij, Sezer Karaoglu, Henri Bouma, Hamdi Dibeklioglu, Theo Gevers

    Abstract: Lies and deception are common phenomena in society, both in our private and professional lives. However, humans are notoriously bad at accurate deception detection. Based on the literature, human accuracy of distinguishing between lies and truthful statements is 54% on average, in other words it is slightly better than a random guess. While people do not much care about this issue, in high-stakes… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: 9 pages, 3 figures

  24. arXiv:1812.07363  [pdf, other

    cs.CV

    Improving Face Detection Performance with 3D-Rendered Synthetic Data

    Authors: Jian Han, Sezer Karaoglu, Hoang-An Le, Theo Gevers

    Abstract: In this paper, we provide a synthetic data generator methodology with fully controlled, multifaceted variations based on a new 3D face dataset (3DU-Face). We customized synthetic datasets to address specific types of variations (scale, pose, occlusion, blur, etc.), and systematically investigate the influence of different variations on face detection performances. We examine whether and how these… ▽ More

    Submitted 27 November, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: 11 pages. Submitted to Pattern Recognition Letters

  25. arXiv:1812.03085  [pdf, other

    cs.CV

    Color Constancy by GANs: An Experimental Survey

    Authors: Partha Das, Anil S. Baslamisli, Yang Liu, Sezer Karaoglu, Theo Gevers

    Abstract: In this paper, we formulate the color constancy task as an image-to-image translation problem using GANs. By conducting a large set of experiments on different datasets, an experimental survey is provided on the use of different types of GANs to solve for color constancy i.e. CC-GANs (Color Constancy GANs). Based on the experimental review, recommendations are given for the design of CC-GAN archit… ▽ More

    Submitted 7 December, 2018; originally announced December 2018.

  26. Automatic Generation of Dense Non-rigid Optical Flow

    Authors: Hoàng-Ân Lê, Tushar Nimbhorkar, Thomas Mensink, Anil S. Baslamisli, Sezer Karaoglu, Theo Gevers

    Abstract: There hardly exists any large-scale datasets with dense optical flow of non-rigid motion from real-world imagery as of today. The reason lies mainly in the required setup to derive ground truth optical flows: a series of images with known camera poses along its trajectory, and an accurate 3D model from a textured scene. Human annotation is not only too tedious for large databases, it can simply ha… ▽ More

    Submitted 7 September, 2021; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: The paper is accepted for publication for Computer Vision and Image Understanding (CVIU)

    Journal ref: Volume 212, November 2021, 103274

  27. arXiv:1812.01402  [pdf, other

    cs.CV

    Inferring Point Clouds from Single Monocular Images by Depth Intermediation

    Authors: Wei Zeng, Sezer Karaoglu, Theo Gevers

    Abstract: In this paper, we propose a pipeline to generate 3D point cloud of an object from a single-view RGB image. Most previous work predict the 3D point coordinates from single RGB images directly. We decompose this problem into depth estimation from single images and point cloud completion from partial point clouds. Our method sequentially predicts the depth maps from images and then infers the compl… ▽ More

    Submitted 26 October, 2020; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: Statement: This paper is under consideration at Computer Vision and Image Understanding

  28. arXiv:1807.11857  [pdf, other

    cs.CV

    Joint Learning of Intrinsic Images and Semantic Segmentation

    Authors: Anil S. Baslamisli, Thomas T. Groenestege, Partha Das, Hoang-An Le, Sezer Karaoglu, Theo Gevers

    Abstract: Semantic segmentation of outdoor scenes is problematic when there are variations in imaging conditions. It is known that albedo (reflectance) is invariant to all kinds of illumination effects. Thus, using reflectance images for semantic segmentation task can be favorable. Additionally, not only segmentation may benefit from reflectance, but also segmentation may be useful for reflectance computati… ▽ More

    Submitted 31 July, 2018; originally announced July 2018.

    Comments: ECCV 2018

  29. arXiv:1807.07473  [pdf, other

    cs.CV

    Three for one and one for three: Flow, Segmentation, and Surface Normals

    Authors: Hoang-An Le, Anil S. Baslamisli, Thomas Mensink, Theo Gevers

    Abstract: Optical flow, semantic segmentation, and surface normals represent different information modalities, yet together they bring better cues for scene understanding problems. In this paper, we study the influence between the three modalities: how one impacts on the others and their efficiency in combination. We employ a modular approach using a convolutional refinement network which is trained supervi… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: BMVC 2018

  30. arXiv:1804.01792  [pdf, other

    cs.RO

    TrimBot2020: an outdoor robot for automatic gardening

    Authors: Nicola Strisciuglio, Radim Tylecek, Michael Blaich, Nicolai Petkov, Peter Bieber, Jochen Hemming, Eldert van Henten, Torsten Sattler, Marc Pollefeys, Theo Gevers, Thomas Brox, Robert B. Fisher

    Abstract: Robots are increasingly present in modern industry and also in everyday life. Their applications range from health-related situations, for assistance to elderly people or in surgical operations, to automatic and driver-less vehicles (on wheels or flying) or for driving assistance. Recently, an interest towards robotics applied in agriculture and gardening has arisen, with applications to automatic… ▽ More

    Submitted 15 May, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: Accepted for publication at International Sympsium on Robotics 2018

  31. arXiv:1712.01056  [pdf, other

    cs.CV

    CNN based Learning using Reflection and Retinex Models for Intrinsic Image Decomposition

    Authors: Anil S. Baslamisli, Hoang-An Le, Theo Gevers

    Abstract: Most of the traditional work on intrinsic image decomposition rely on deriving priors about scene characteristics. On the other hand, recent research use deep learning models as in-and-out black box and do not consider the well-established, traditional image formation process as the basis of their intrinsic learning process. As a consequence, although current deep learning approaches show superior… ▽ More

    Submitted 3 April, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: CVPR 2018

  32. arXiv:1711.11379  [pdf, other

    cs.CV

    3DContextNet: K-d Tree Guided Hierarchical Learning of Point Clouds Using Local and Global Contextual Cues

    Authors: Wei Zeng, Theo Gevers

    Abstract: Classification and segmentation of 3D point clouds are important tasks in computer vision. Because of the irregular nature of point clouds, most of the existing methods convert point clouds into regular 3D voxel grids before they are used as input for ConvNets. Unfortunately, voxel representations are highly insensitive to the geometrical nature of 3D data. More recent methods encode point clouds… ▽ More

    Submitted 4 December, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: 15 pages, 5 figures

  33. Detect2Rank : Combining Object Detectors Using Learning to Rank

    Authors: Sezer Karaoglu, Yang Liu, Theo Gevers

    Abstract: Object detection is an important research area in the field of computer vision. Many detection algorithms have been proposed. However, each object detector relies on specific assumptions of the object appearance and imaging conditions. As a consequence, no algorithm can be considered as universal. With the large variety of object detectors, the subsequent question is how to select and combine them… ▽ More

    Submitted 26 December, 2014; originally announced December 2014.

  34. arXiv:1412.3506  [pdf, other

    cs.CV

    Road Detection by One-Class Color Classification: Dataset and Experiments

    Authors: Jose M. Alvarez, Theo Gevers, Antonio M. Lopez

    Abstract: Detecting traversable road areas ahead a moving vehicle is a key process for modern autonomous driving systems. A common approach to road detection consists of exploiting color features to classify pixels as road or background. These algorithms reduce the effect of lighting variations and weather conditions by exploiting the discriminant/invariant properties of different color representations. Fur… ▽ More

    Submitted 17 December, 2014; v1 submitted 10 December, 2014; originally announced December 2014.

    Comments: 10 pages