Skip to main content

Showing 1–9 of 9 results for author: Kulhánek, J

.
  1. arXiv:2407.08447  [pdf, other

    cs.CV

    WildGaussians: 3D Gaussian Splatting in the Wild

    Authors: Jonas Kulhanek, Songyou Peng, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler

    Abstract: While the field of 3D scene reconstruction is dominated by NeRFs due to their photorealistic quality, 3D Gaussian Splatting (3DGS) has recently emerged, offering similar quality with real-time rendering speeds. However, both methods primarily excel with well-controlled 3D scenes, while in-the-wild data - characterized by occlusions, dynamic objects, and varying illumination - remains challenging.… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: https://wild-gaussians.github.io/

  2. arXiv:2406.17345  [pdf, other

    cs.CV

    NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods

    Authors: Jonas Kulhanek, Torsten Sattler

    Abstract: Novel view synthesis is an important problem with many applications, including AR/VR, gaming, and simulations for robotics. With the recent rapid development of Neural Radiance Fields (NeRFs) and 3D Gaussian Splatting (3DGS) methods, it is becoming difficult to keep track of the current state of the art (SoTA) due to methods using different evaluation protocols, codebases being difficult to instal… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Web: https://jkulhanek.com/nerfbaselines

  3. arXiv:2406.03175  [pdf, other

    cs.CV

    Dynamic 3D Gaussian Fields for Urban Areas

    Authors: Tobias Fischer, Jonas Kulhanek, Samuel Rota Bulò, Lorenzo Porzi, Marc Pollefeys, Peter Kontschieder

    Abstract: We present an efficient neural 3D scene representation for novel-view synthesis (NVS) in large-scale, dynamic urban areas. Existing works are not well suited for applications like mixed-reality or closed-loop simulation due to their limited visual quality and non-interactive rendering speeds. Recently, rasterization-based approaches have achieved high-quality NVS at impressive speeds. However, the… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Project page is available at https://tobiasfshr.github.io/pub/4dgf/

  4. arXiv:2304.09987  [pdf, other

    cs.CV cs.GR cs.LG

    Tetra-NeRF: Representing Neural Radiance Fields Using Tetrahedra

    Authors: Jonas Kulhanek, Torsten Sattler

    Abstract: Neural Radiance Fields (NeRFs) are a very recent and very popular approach for the problems of novel view synthesis and 3D reconstruction. A popular scene representation used by NeRFs is to combine a uniform, voxel-based subdivision of the scene with an MLP. Based on the observation that a (sparse) point cloud of the scene is often available, this paper proposes to use an adaptive representation b… ▽ More

    Submitted 20 August, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: ICCV 2023, Web: https://jkulhanek.com/tetra-nerf

  5. arXiv:2205.15764  [pdf, other

    cs.LG cs.CV cs.NE

    SymFormer: End-to-end symbolic regression using transformer-based architecture

    Authors: Martin Vastl, Jonáš Kulhánek, Jiří Kubalík, Erik Derner, Robert Babuška

    Abstract: Many real-world problems can be naturally described by mathematical formulas. The task of finding formulas from a set of observed inputs and outputs is called symbolic regression. Recently, neural networks have been applied to symbolic regression, among which the transformer-based ones seem to be the most promising. After training the transformer on a large number of formulas (in the order of days… ▽ More

    Submitted 20 October, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

  6. arXiv:2203.10157  [pdf, other

    cs.CV cs.LG

    ViewFormer: NeRF-free Neural Rendering from Few Images Using Transformers

    Authors: Jonáš Kulhánek, Erik Derner, Torsten Sattler, Robert Babuška

    Abstract: Novel view synthesis is a long-standing problem. In this work, we consider a variant of the problem where we are given only a few context views sparsely covering a scene or an object. The goal is to predict novel viewpoints in the scene, which requires learning priors. The current state of the art is based on Neural Radiance Field (NeRF), and while achieving impressive results, the methods suffer… ▽ More

    Submitted 21 July, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: ECCV 2022 poster

  7. AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models

    Authors: Jonáš Kulhánek, Vojtěch Hudeček, Tomáš Nekvinda, Ondřej Dušek

    Abstract: Attention-based pre-trained language models such as GPT-2 brought considerable progress to end-to-end dialogue modelling. However, they also present considerable risks for task-oriented dialogue, such as lack of knowledge grounding or diversity. To address these issues, we introduce modified training objectives for language model finetuning, and we employ massive data augmentation via back-transla… ▽ More

    Submitted 14 January, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Journal ref: Proceedings of the 3rd Workshop on Natural Language Processing for Conversational AI (2021), 198-210

  8. arXiv:2010.10903  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning

    Authors: Jonáš Kulhánek, Erik Derner, Robert Babuška

    Abstract: Visual navigation is essential for many applications in robotics, from manipulation, through mobile robotics to automated driving. Deep reinforcement learning (DRL) provides an elegant map-free approach integrating image processing, localization, and planning in one module, which can be trained and therefore optimized for a given environment. However, to date, DRL-based visual navigation was valid… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  9. arXiv:1908.03627  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Vision-based Navigation Using Deep Reinforcement Learning

    Authors: Jonáš Kulhánek, Erik Derner, Tim de Bruin, Robert Babuška

    Abstract: Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning architecture capable of navigating an agent, e.g. a mobile robot, to a target given by an image. To achieve this, we have extended the batched A2C algorithm with… ▽ More

    Submitted 9 November, 2019; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: ECMR 2019: European Conference on Mobile Robots

    Journal ref: 2019 European Conference on Mobile Robots (ECMR), 2019, p.1-8