Skip to main content

Showing 1–50 of 115 results for author: Schindler, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11202  [pdf, other

    cs.CV cs.GR

    Consistency^2: Consistent and Fast 3D Painting with Latent Consistency Models

    Authors: Tianfu Wang, Anton Obukhov, Konrad Schindler

    Abstract: Generative 3D Painting is among the top productivity boosters in high-resolution 3D asset management and recycling. Ever since text-to-image models became accessible for inference on consumer hardware, the performance of 3D Painting methods has consistently improved and is currently close to plateauing. At the core of most such models lies denoising diffusion in the latent space, an inherently tim… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.04928  [pdf, other

    cs.CV cs.LG eess.IV

    AGBD: A Global-scale Biomass Dataset

    Authors: Ghjulia Sialelli, Torben Peters, Jan D. Wegner, Konrad Schindler

    Abstract: Accurate estimates of Above Ground Biomass (AGB) are essential in addressing two of humanity's biggest challenges, climate change and biodiversity loss. Existing datasets for AGB estimation from satellite imagery are limited. Either they focus on specific, local regions at high resolution, or they offer global coverage at low resolution. There is a need for a machine learning-ready, globally repre… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.02506  [pdf, other

    cs.CV

    An Open-Source Tool for Map** War Destruction at Scale in Ukraine using Sentinel-1 Time Series

    Authors: Olivier Dietrich, Torben Peters, Vivien Sainte Fare Garnot, Valerie Sticher, Thao Ton-That Whelan, Konrad Schindler, Jan Dirk Wegner

    Abstract: Access to detailed war impact assessments is crucial for humanitarian organizations to effectively assist populations most affected by armed conflicts. However, maintaining a comprehensive understanding of the situation on the ground is challenging, especially in conflicts that cover vast territories and extend over long periods. This study presents a scalable and transferable method for estimatin… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2405.18087  [pdf, other

    cs.CV

    FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms

    Authors: Lea Bogensperger, Dominik Narnhofer, Alexander Falk, Konrad Schindler, Thomas Pock

    Abstract: Medical image segmentation is a crucial task that relies on the ability to accurately identify and isolate regions of interest in medical images. Thereby, generative approaches allow to capture the statistical properties of segmentation masks that are dependent on the respective structures. In this work we propose FlowSDF, an image-guided conditional flow matching framework to represent the signed… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2405.14438  [pdf, other

    cs.LG

    LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks

    Authors: Michelle Halbheer, Dominik J. Mühlematter, Alexander Becker, Dominik Narnhofer, Helge Aasen, Konrad Schindler, Mehmet Ozgur Turkoglu

    Abstract: Numerous crucial tasks in real-world decision-making rely on machine learning algorithms with calibrated uncertainty estimates. However, modern methods often yield overconfident and uncalibrated predictions. Various approaches involve training an ensemble of separate models to quantify the uncertainty related to the model itself, known as epistemic uncertainty. In an explicit implementation, the e… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: under review

  6. arXiv:2404.02838  [pdf, other

    cs.AI

    I-Design: Personalized LLM Interior Designer

    Authors: Ata Çelen, Guo Han, Konrad Schindler, Luc Van Gool, Iro Armeni, Anton Obukhov, Xi Wang

    Abstract: Interior design allows us to be who we are and live how we want - each design is as unique as our distinct personality. However, it is not trivial for non-professionals to express and materialize this since it requires aligning functional and visual expectations with the constraints of physical space; this renders interior design a luxury. To make it more accessible, we present I-Design, a persona… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  7. arXiv:2403.20142  [pdf, other

    cs.CV eess.IV

    StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation

    Authors: Sidi Wu, Yizi Chen, Samuel Mermet, Lorenz Hurni, Konrad Schindler, Nicolas Gonthier, Loic Landrieu

    Abstract: Most image-to-image translation models postulate that a unique correspondence exists between the semantic classes of the source and target domains. However, this assumption does not always hold in real-world scenarios due to divergent distributions, different class sets, and asymmetrical information representation. As conventional GANs attempt to generate images that match the distribution of the… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  8. arXiv:2403.02136  [pdf, other

    cs.CV

    Point2Building: Reconstructing Buildings from Airborne LiDAR Point Clouds

    Authors: Yujia Liu, Anton Obukhov, Jan Dirk Wegner, Konrad Schindler

    Abstract: We present a learning-based approach to reconstruct buildings as 3D polygonal meshes from airborne LiDAR point clouds. What makes 3D building reconstruction from airborne LiDAR hard is the large diversity of building designs and especially roof shapes, the low and varying point density across the scene, and the often incomplete coverage of building facades due to occlusions by vegetation or to the… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2402.10130  [pdf, other

    cs.LG cs.AI cs.CV

    Is Continual Learning Ready for Real-world Challenges?

    Authors: Theodora Kontogianni, Yuanwen Yue, Siyu Tang, Konrad Schindler

    Abstract: Despite continual learning's long and well-established academic history, its application in real-world scenarios remains rather limited. This paper contends that this gap is attributable to a misalignment between the actual challenges of continual learning and the evaluation protocols in use, rendering proposed solutions ineffective for addressing the complexities of real-world setups. We validate… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  10. arXiv:2401.15739  [pdf

    cs.CV cs.LG

    SegmentAnyTree: A sensor and platform agnostic deep learning model for tree segmentation using laser scanning data

    Authors: Maciej Wielgosz, Stefano Puliti, Binbin Xiang, Konrad Schindler, Rasmus Astrup

    Abstract: This research advances individual tree crown (ITC) segmentation in lidar data, using a deep learning model applicable to various laser scanning types: airborne (ULS), terrestrial (TLS), and mobile (MLS). It addresses the challenge of transferability across different data characteristics in 3D forest scene analysis. The study evaluates the model's performance based on platform (ULS, MLS) and data d… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  11. arXiv:2312.15084  [pdf, other

    cs.CV

    Automated forest inventory: analysis of high-density airborne LiDAR point clouds with 3D deep learning

    Authors: Binbin Xiang, Maciej Wielgosz, Theodora Kontogianni, Torben Peters, Stefano Puliti, Rasmus Astrup, Konrad Schindler

    Abstract: Detailed forest inventories are critical for sustainable and flexible management of forest resources, to conserve various ecosystem services. Modern airborne laser scanners deliver high-density point clouds with great potential for fine-scale forest inventory and analysis, but automatically partitioning those point clouds into meaningful entities like individual trees or tree components remains a… ▽ More

    Submitted 23 February, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  12. arXiv:2312.09138  [pdf, other

    cs.CV

    Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments

    Authors: Liyuan Zhu, Shengyu Huang, Konrad Schindler, Iro Armeni

    Abstract: Research into dynamic 3D scene understanding has primarily focused on short-term change tracking from dense observations, while little attention has been paid to long-term changes with sparse observations. We address this gap with MoRE, a novel approach for multi-object relocalization and reconstruction in evolving environments. We view these environments as "living scenes" and consider the proble… ▽ More

    Submitted 26 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 camera-ready

  13. arXiv:2312.05247  [pdf, other

    cs.CV

    Dynamic LiDAR Re-simulation using Compositional Neural Fields

    Authors: Hanfeng Wu, Xingxing Zuo, Stefan Leutenegger, Or Litany, Konrad Schindler, Shengyu Huang

    Abstract: We introduce DyNFL, a novel neural field-based approach for high-fidelity re-simulation of LiDAR scans in dynamic driving scenes. DyNFL processes LiDAR measurements from dynamic environments, accompanied by bounding boxes of moving objects, to construct an editable neural field. This field, comprising separately reconstructed static background and dynamic objects, allows users to modify viewpoints… ▽ More

    Submitted 3 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Project page: https://shengyuh.github.io/dynfl

  14. arXiv:2312.04962  [pdf, other

    cs.CV

    Point2CAD: Reverse Engineering CAD Models from 3D Point Clouds

    Authors: Yujia Liu, Anton Obukhov, Jan Dirk Wegner, Konrad Schindler

    Abstract: Computer-Aided Design (CAD) model reconstruction from point clouds is an important problem at the intersection of computer vision, graphics, and machine learning; it saves the designer significant time when iterating on in-the-wild objects. Recent advancements in this direction achieve relatively reliable semantic segmentation but still struggle to produce an adequate topology of the CAD model. In… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  15. arXiv:2312.03048  [pdf, other

    cs.CV

    DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control

    Authors: Yuru Jia, Lukas Hoyer, Shengyu Huang, Tianfu Wang, Luc Van Gool, Konrad Schindler, Anton Obukhov

    Abstract: Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability to generate creative content, specialize to user data through few-shot fine-tuning, and condition their output on other modalities, such as semantic maps. However, are they usable as large-scale data generators, e.g., to improve tasks in the perception stack, like semantic segmentation? We investigate this… ▽ More

    Submitted 8 April, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

  16. arXiv:2312.02145  [pdf, other

    cs.CV

    Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

    Authors: Bingxin Ke, Anton Obukhov, Shengyu Huang, Nando Metzger, Rodrigo Caye Daudt, Konrad Schindler

    Abstract: Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth from a single image is geometrically ill-posed and requires scene understanding, so it is not surprising that the rise of deep learning has led to a breakthrough. The impressive progress of monocular depth estimators has mirrored the growth in model capacity, from relatively modest CNNs to large Transformer archi… ▽ More

    Submitted 3 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 camera ready

  17. arXiv:2311.17643  [pdf, other

    cs.CV

    Neural Fields with Thermal Activations for Arbitrary-Scale Super-Resolution

    Authors: Alexander Becker, Rodrigo Caye Daudt, Nando Metzger, Jan Dirk Wegner, Konrad Schindler

    Abstract: Recent approaches for arbitrary-scale single image super-resolution (ASSR) have used local neural fields to represent continuous signals that can be sampled at arbitrary rates. However, the point-wise query of the neural field does not naturally match the point spread function (PSF) of a given pixel, which may cause aliasing in the super-resolved image. We present a novel way to design neural fiel… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  18. arXiv:2311.14006  [pdf, other

    cs.CV

    High-resolution Population Maps Derived from Sentinel-1 and Sentinel-2

    Authors: Nando Metzger, Rodrigo Caye Daudt, Devis Tuia, Konrad Schindler

    Abstract: Detailed population maps play an important role in diverse fields ranging from humanitarian action to urban planning. Generating such maps in a timely and scalable manner presents a challenge, especially in data-scarce regions. To address it we have developed POPCORN, a population map** method whose only inputs are free, globally available satellite images from Sentinel-1 and Sentinel-2; and a s… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 17 pages, 10 tables, 7 Figures

  19. arXiv:2311.09346  [pdf, other

    cs.CV cs.LG cs.RO

    Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change

    Authors: Tao Sun, Yan Hao, Shengyu Huang, Silvio Savarese, Konrad Schindler, Marc Pollefeys, Iro Armeni

    Abstract: Building 3D geometric maps of man-made spaces is a well-established and active field that is fundamental to computer vision and robotics. However, considering the evolving nature of built environments, it is essential to question the capabilities of current map** efforts in handling temporal changes. In addition, spatiotemporal map** holds significant potential for achieving sustainability and… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 27 pages, 29 figures. For the project page, see http://nothing-stands-still.com

  20. Cross-attention Spatio-temporal Context Transformer for Semantic Segmentation of Historical Maps

    Authors: Sidi Wu, Yizi Chen, Konrad Schindler, Lorenz Hurni

    Abstract: Historical maps provide useful spatio-temporal information on the Earth's surface before modern earth observation techniques came into being. To extract information from maps, neural networks, which gain wide popularity in recent years, have replaced hand-crafted map processing methods and tedious manual labor. However, aleatoric uncertainty, known as data-dependent uncertainty, inherent in the dr… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  21. arXiv:2309.11248  [pdf, other

    cs.CV

    Box2Poly: Memory-Efficient Polygon Prediction of Arbitrarily Shaped and Rotated Text

    Authors: Xuyang Chen, Dong Wang, Konrad Schindler, Mingwei Sun, Yongliang Wang, Nicolo Savioli, Liqiu Meng

    Abstract: Recently, Transformer-based text detection techniques have sought to predict polygons by encoding the coordinates of individual boundary vertices using distinct query features. However, this approach incurs a significant memory overhead and struggles to effectively capture the intricate relationships between vertices belonging to the same instance. Consequently, irregular text layouts often lead t… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  22. arXiv:2309.08523  [pdf, other

    cs.CV cs.GR

    Breathing New Life into 3D Assets with Generative Repainting

    Authors: Tianfu Wang, Menelaos Kanakis, Konrad Schindler, Luc Van Gool, Anton Obukhov

    Abstract: Diffusion-based text-to-image models ignited immense attention from the vision community, artists, and content creators. Broad adoption of these models is due to significant improvement in the quality of generations and efficient conditioning on various modalities, not just text. However, lifting the rich generative priors of these 2D models into 3D is challenging. Recent works have proposed vario… ▽ More

    Submitted 18 October, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  23. arXiv:2309.01797  [pdf, other

    cs.CV eess.IV

    Accuracy and Consistency of Space-based Vegetation Height Maps for Forest Dynamics in Alpine Terrain

    Authors: Yuchang Jiang, Marius Rüetschi, Vivien Sainte Fare Garnot, Mauro Marty, Konrad Schindler, Christian Ginzler, Jan D. Wegner

    Abstract: Monitoring and understanding forest dynamics is essential for environmental conservation and management. This is why the Swiss National Forest Inventory (NFI) provides countrywide vegetation height maps at a spatial resolution of 0.5 m. Its long update time of 6 years, however, limits the temporal analysis of forest dynamics. This can be improved by using spaceborne remote sensing and deep learnin… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  24. arXiv:2307.02877  [pdf, other

    cs.CV

    Towards accurate instance segmentation in large-scale LiDAR point clouds

    Authors: Binbin Xiang, Torben Peters, Theodora Kontogianni, Frawa Vetterli, Stefano Puliti, Rasmus Astrup, Konrad Schindler

    Abstract: Panoptic segmentation is the combination of semantic and instance segmentation: assign the points in a 3D point cloud to semantic categories and partition them into distinct object instances. It has many obvious applications for outdoor scene understanding, from city map** to forest management. Existing methods struggle to segment nearby instances of the same semantic category, like adjacent pie… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  25. arXiv:2306.04385  [pdf, other

    cs.CV

    SF-FSDA: Source-Free Few-Shot Domain Adaptive Object Detection with Efficient Labeled Data Factory

    Authors: Han Sun, Rui Gong, Konrad Schindler, Luc Van Gool

    Abstract: Domain adaptive object detection aims to leverage the knowledge learned from a labeled source domain to improve the performance on an unlabeled target domain. Prior works typically require the access to the source domain data for adaptation, and the availability of sufficient data on the target domain. However, these assumptions may not hold due to data privacy and rare data collection. In this pa… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

  26. arXiv:2306.00977  [pdf, other

    cs.CV cs.HC

    AGILE3D: Attention Guided Interactive Multi-object 3D Segmentation

    Authors: Yuanwen Yue, Sabarinath Mahadevan, Jonas Schult, Francis Engelmann, Bastian Leibe, Konrad Schindler, Theodora Kontogianni

    Abstract: During interactive segmentation, a model and a user work together to delineate objects of interest in a 3D point cloud. In an iterative process, the model assigns each data point to an object (or the background), while the user corrects errors in the resulting segmentation and feeds them back into the model. The current best practice formulates the problem as binary classification and segments obj… ▽ More

    Submitted 10 April, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 camera-ready. Project page: https://ywyue.github.io/AGILE3D

  27. arXiv:2305.15178  [pdf, other

    cs.LG

    Mixture of Experts with Uncertainty Voting for Imbalanced Deep Regression Problems

    Authors: Yuchang Jiang, Vivien Sainte Fare Garnot, Konrad Schindler, Jan Dirk Wegner

    Abstract: Data imbalance is ubiquitous when applying machine learning to real-world problems, particularly regression problems. If training data are imbalanced, the learning is dominated by the densely covered regions of the target distribution, consequently, the learned regressor tends to exhibit poor performance in sparsely covered regions. Beyond standard measures like over-sampling or re-weighting, ther… ▽ More

    Submitted 30 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  28. U-TILISE: A Sequence-to-sequence Model for Cloud Removal in Optical Satellite Time Series

    Authors: Corinne Stucker, Vivien Sainte Fare Garnot, Konrad Schindler

    Abstract: Satellite image time series in the optical and infrared spectrum suffer from frequent data gaps due to cloud cover, cloud shadows, and temporary sensor outages. It has been a long-standing problem of remote sensing research how to best reconstruct the missing pixel values and obtain complete, cloud-free image sequences. We approach that problem from the perspective of representation learning and d… ▽ More

    Submitted 4 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Published in the IEEE Transactions on Geoscience and Remote Sensing

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, Vol. 61, 2023

  29. arXiv:2305.08413  [pdf, other

    cs.CV eess.IV stat.AP

    Artificial intelligence to advance Earth observation: a perspective

    Authors: Devis Tuia, Konrad Schindler, Begüm Demir, Gustau Camps-Valls, Xiao Xiang Zhu, Mrinalini Kochupillai, Sašo Džeroski, Jan N. van Rijn, Holger H. Hoos, Fabio Del Frate, Mihai Datcu, Jorge-Arnulfo Quiané-Ruiz, Volker Markl, Bertrand Le Saux, Rochelle Schneider

    Abstract: Earth observation (EO) is a prime instrument for monitoring land and ocean processes, studying the dynamics at work, and taking the pulse of our planet. This article gives a bird's eye view of the essential scientific tools and approaches informing and supporting the transition from raw EO data to usable EO-based information. The promises, as well as the current challenges of these developments, a… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  30. arXiv:2305.01643  [pdf, other

    cs.CV

    Neural LiDAR Fields for Novel View Synthesis

    Authors: Shengyu Huang, Zan Gojcic, Zian Wang, Francis Williams, Yoni Kasten, Sanja Fidler, Konrad Schindler, Or Litany

    Abstract: We present Neural Fields for LiDAR (NFL), a method to optimise a neural field scene representation from LiDAR measurements, with the goal of synthesizing realistic LiDAR scans from novel viewpoints. NFL combines the rendering power of neural fields with a detailed, physically motivated model of the LiDAR sensing process, thus enabling it to accurately reproduce key sensor behaviors like beam diver… ▽ More

    Submitted 13 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICCV 2023 - camera ready. Project page: https://research.nvidia.com/labs/toronto-ai/nfl/

  31. arXiv:2304.13980  [pdf, other

    cs.CV

    A Review of Panoptic Segmentation for Mobile Map** Point Clouds

    Authors: Binbin Xiang, Yuanwen Yue, Torben Peters, Konrad Schindler

    Abstract: 3D point cloud panoptic segmentation is the combined task to (i) assign each point to a semantic class and (ii) separate the points in each class into object instances. Recently there has been an increased interest in such comprehensive 3D scene understanding, building on the rapid advances of semantic segmentation due to the advent of deep 3D neural networks. Yet, to date there is very little wor… ▽ More

    Submitted 17 August, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

  32. arXiv:2304.02569  [pdf, other

    cs.CV

    DEFLOW: Self-supervised 3D Motion Estimation of Debris Flow

    Authors: Liyuan Zhu, Yuru Jia, Shengyu Huang, Nicholas Meyer, Andreas Wieser, Konrad Schindler, Jordan Aaron

    Abstract: Existing work on scene flow estimation focuses on autonomous driving and mobile robotics, while automated solutions are lacking for motion in nature, such as that exhibited by debris flows. We propose DEFLOW, a model for 3D motion estimation of debris flows, together with a newly captured dataset. We adopt a novel multi-level sensor fusion architecture and self-supervision to incorporate the induc… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Photogrammetric Computer Vision Workshop, CVPRW 2023, camera ready

  33. arXiv:2212.01953  [pdf, other

    physics.soc-ph cs.LG

    Context-aware multi-head self-attentional neural network model for next location prediction

    Authors: Ye Hong, Yatao Zhang, Konrad Schindler, Martin Raubal

    Abstract: Accurate activity location prediction is a crucial component of many mobility applications and is particularly required to develop personalized, sustainable transportation systems. Despite the widespread adoption of deep learning models, next location prediction models lack a comprehensive discussion and integration of mobility-related spatio-temporal contexts. Here, we utilize a multi-head self-a… ▽ More

    Submitted 21 August, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: updated Discussion section; accepted by Transportation Research Part C

  34. arXiv:2211.15658  [pdf, other

    cs.CV

    Connecting the Dots: Floorplan Reconstruction Using Two-Level Queries

    Authors: Yuanwen Yue, Theodora Kontogianni, Konrad Schindler, Francis Engelmann

    Abstract: We address 2D floorplan reconstruction from 3D scans. Existing approaches typically employ heuristically designed multi-stage pipelines. Instead, we formulate floorplan reconstruction as a single-stage structured prediction task: find a variable-size set of polygons, which in turn are variable-length sequences of ordered vertices. To solve it we develop a novel Transformer architecture that genera… ▽ More

    Submitted 27 March, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: CVPR 2023 camera-ready. Project page: https://ywyue.github.io/RoomFormer

  35. arXiv:2211.13220  [pdf, other

    cs.CV cs.GR cs.LG

    TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation

    Authors: Nikolai Kalischek, Torben Peters, Jan D. Wegner, Konrad Schindler

    Abstract: Probabilistic denoising diffusion models (DDMs) have set a new standard for 2D image generation. Extending DDMs for 3D content creation is an active field of research. Here, we propose TetraDiffusion, a diffusion model that operates on a tetrahedral partitioning of 3D space to enable efficient, high-resolution 3D shape generation. Our model introduces operators for convolution and transpose convol… ▽ More

    Submitted 7 December, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: This version introduces a substantial update of arXiv:2211.13220v1 with significant changes in the framework and entirely new results. Project page https://tetradiffusion.github.io/

  36. arXiv:2211.13190  [pdf, other

    cs.CV cs.LG

    BiasBed -- Rigorous Texture Bias Evaluation

    Authors: Nikolai Kalischek, Rodrigo C. Daudt, Torben Peters, Reinhard Furrer, Jan D. Wegner, Konrad Schindler

    Abstract: The well-documented presence of texture bias in modern convolutional neural networks has led to a plethora of algorithms that promote an emphasis on shape cues, often to support generalization to new domains. Yet, common datasets, benchmarks and general model selection strategies are missing, and there is no agreed, rigorous evaluation protocol. In this paper, we investigate difficulties and limit… ▽ More

    Submitted 24 March, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  37. arXiv:2211.11592  [pdf, other

    cs.CV cs.AI

    Guided Depth Super-Resolution by Deep Anisotropic Diffusion

    Authors: Nando Metzger, Rodrigo Caye Daudt, Konrad Schindler

    Abstract: Performing super-resolution of a depth image using the guidance from an RGB image is a problem that concerns several fields, such as robotics, medical imaging, and remote sensing. While deep learning methods have achieved good results in this problem, recent work highlighted the value of combining modern methods with more formal frameworks. In this work, we propose a novel approach which combines… ▽ More

    Submitted 28 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: 8 main pages, Accepted to CVPR2023

  38. arXiv:2211.04039  [pdf, other

    cs.LG cs.CV stat.AP

    Fine-grained Population Map** from Coarse Census Counts and Open Geodata

    Authors: Nando Metzger, John E. Vargas-Muñoz, Rodrigo C. Daudt, Benjamin Kellenberger, Thao Ton-That Whelan, Ferda Ofli, Muhammad Imran, Konrad Schindler, Devis Tuia

    Abstract: Fine-grained population maps are needed in several domains, like urban planning, environmental monitoring, public health, and humanitarian operations. Unfortunately, in many countries only aggregate census counts over large spatial units are collected, moreover, these are not always up-to-date. We present POMELO, a deep learning model that employs coarse census counts and open geodata to estimate… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

  39. arXiv:2209.15529  [pdf, other

    cs.LG cs.CV stat.ML

    TT-NF: Tensor Train Neural Fields

    Authors: Anton Obukhov, Mikhail Usvyatsov, Christos Sakaridis, Konrad Schindler, Luc Van Gool

    Abstract: Learning neural fields has been an active topic in deep learning research, focusing, among other issues, on finding more compact and easy-to-fit representations. In this paper, we introduce a novel low-rank representation termed Tensor Train Neural Fields (TT-NF) for learning neural fields on dense regular grids and efficient methods for sampling from them. Our representation is a TT parameterizat… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: Preprint, under review

  40. arXiv:2208.01421  [pdf, other

    cs.CV

    T4DT: Tensorizing Time for Learning Temporal 3D Visual Data

    Authors: Mikhail Usvyatsov, Rafael Ballester-Rippoll, Lina Bashaeva, Konrad Schindler, Gonzalo Ferrer, Ivan Oseledets

    Abstract: Unlike 2D raster images, there is no single dominant representation for 3D visual data processing. Different formats like point clouds, meshes, or implicit functions each have their strengths and weaknesses. Still, grid representations such as signed distance functions have attractive properties also in 3D. In particular, they offer constant-time random access and are eminently suitable for modern… ▽ More

    Submitted 5 October, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

  41. arXiv:2207.12394  [pdf, other

    cs.CV

    Dynamic 3D Scene Analysis by Point Cloud Accumulation

    Authors: Shengyu Huang, Zan Gojcic, Jiahui Huang, Andreas Wieser, Konrad Schindler

    Abstract: Multi-beam LiDAR sensors, as used on autonomous vehicles and mobile robots, acquire sequences of 3D range scans ("frames"). Each frame covers the scene sparsely, due to limited angular scanning resolution and occlusion. The sparsity restricts the performance of downstream processes like semantic segmentation or surface reconstruction. Luckily, when the sensor moves, frames are captured from a sequ… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: ECCV 2022, camera ready

  42. arXiv:2206.11128  [pdf, other

    cs.LG cs.MS

    tntorch: Tensor Network Learning with PyTorch

    Authors: Mikhail Usvyatsov, Rafael Ballester-Ripoll, Konrad Schindler

    Abstract: We present tntorch, a tensor learning framework that supports multiple decompositions (including Candecomp/Parafac, Tucker, and Tensor Train) under a unified interface. With our library, the user can learn and handle low-rank tensors with automatic differentiation, seamless GPU support, and the convenience of PyTorch's API. Besides decomposition algorithms, tntorch implements differentiable tensor… ▽ More

    Submitted 21 September, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Journal ref: JMLR (2022) 23-208

  43. arXiv:2206.06119  [pdf, other

    cs.CV cs.AI

    Satellite-based high-resolution maps of cocoa planted area for Côte d'Ivoire and Ghana

    Authors: Nikolai Kalischek, Nico Lang, Cécile Renier, Rodrigo Caye Daudt, Thomas Addoah, William Thompson, Wilma J. Blaser-Hart, Rachael Garrett, Konrad Schindler, Jan D. Wegner

    Abstract: Côte d'Ivoire and Ghana, the world's largest producers of cocoa, account for two thirds of the global cocoa production. In both countries, cocoa is the primary perennial crop, providing income to almost two million farmers. Yet precise maps of cocoa planted area are missing, hindering accurate quantification of expansion in protected areas, production and yields, and limiting information available… ▽ More

    Submitted 9 May, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

  44. arXiv:2206.01466  [pdf, other

    cs.CV cs.LG

    Recognition of Unseen Bird Species by Learning from Field Guides

    Authors: Andrés C. Rodríguez, Stefano D'Aronco, Rodrigo Caye Daudt, Jan D. Wegner, Konrad Schindler

    Abstract: We exploit field guides to learn bird species recognition, in particular zero-shot recognition of unseen species. Illustrations contained in field guides deliberately focus on discriminative properties of each species, and can serve as side information to transfer knowledge from seen to unseen bird species. We study two approaches: (1) a contrastive encoding of illustrations, which can be fed into… ▽ More

    Submitted 2 November, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Accepted to WACV2024

  45. arXiv:2206.00050  [pdf, other

    cs.LG

    FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation

    Authors: Mehmet Ozgur Turkoglu, Alexander Becker, Hüseyin Anil Gündüz, Mina Rezaei, Bernd Bischl, Rodrigo Caye Daudt, Stefano D'Aronco, Jan Dirk Wegner, Konrad Schindler

    Abstract: The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods often produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computatio… ▽ More

    Submitted 19 December, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: accepted at NeurIPS 2022

  46. arXiv:2204.12875  [pdf, other

    cs.CV

    Urban Change Forecasting from Satellite Images

    Authors: Nando Metzger, Mehmet Özgür Türkoglu, Rodrigo Caye Daudt, Jan Dirk Wegner, Konrad Schindler

    Abstract: Forecasting where and when new buildings will emerge is a rather unexplored topic, but one that is very useful in many disciplines such as urban planning, agriculture, resource management, and even autonomous flying. In the present work, we present a method that accomplishes this task with a deep neural network and a custom pretraining procedure. In Stage 1, a U-Net backbone is pretrained within a… ▽ More

    Submitted 18 September, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: PFG 2023, accepted,

  47. arXiv:2204.08322  [pdf, other

    cs.CV cs.LG eess.IV

    A high-resolution canopy height model of the Earth

    Authors: Nico Lang, Walter Jetz, Konrad Schindler, Jan Dirk Wegner

    Abstract: The worldwide variation in vegetation height is fundamental to the global carbon cycle and central to the functioning of ecosystems and their biodiversity. Geospatially explicit and, ideally, highly resolved information is required to manage terrestrial ecosystems, mitigate climate change, and prevent biodiversity loss. Here, we present the first global, wall-to-wall canopy height map at 10 m grou… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

  48. arXiv:2204.07183  [pdf, other

    cs.CV

    Interactive Object Segmentation in 3D Point Clouds

    Authors: Theodora Kontogianni, Ekin Celikkan, Siyu Tang, Konrad Schindler

    Abstract: We propose an interactive approach for 3D instance segmentation, where users can iteratively collaborate with a deep learning model to segment objects in a 3D point cloud directly. Current methods for 3D instance segmentation are generally trained in a fully-supervised fashion, which requires large amounts of costly training labels, and does not generalize well to classes unseen during training. F… ▽ More

    Submitted 23 January, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  49. arXiv:2203.15536  [pdf, other

    cs.CV cs.AI cs.LG

    BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information

    Authors: Nadine Rueegg, Silvia Zuffi, Konrad Schindler, Michael J. Black

    Abstract: Our goal is to recover the 3D shape and pose of dogs from a single image. This is a challenging task because dogs exhibit a wide range of shapes and appearances, and are highly articulated. Recent work has proposed to directly regress the SMAL animal model, with additional limb scale parameters, from images. Our method, called BARC (Breed-Augmented Regression using Classification), goes beyond pri… ▽ More

    Submitted 18 June, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: accepted for publication at CVPR 2022

    ACM Class: I.4; I.2

  50. arXiv:2203.14297  [pdf, other

    cs.CV

    Learning Graph Regularisation for Guided Super-Resolution

    Authors: Riccardo de Lutio, Alexander Becker, Stefano D'Aronco, Stefania Russo, Jan D. Wegner, Konrad Schindler

    Abstract: We introduce a novel formulation for guided super-resolution. Its core is a differentiable optimisation layer that operates on a learned affinity graph. The learned graph potentials make it possible to leverage rich contextual information from the guide image, while the explicit graph optimisation within the architecture guarantees rigorous fidelity of the high-resolution target to the low-resolut… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

    Comments: CVPR 2022