Skip to main content

Showing 1–12 of 12 results for author: Boss, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.12008  [pdf, other

    cs.CV

    SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

    Authors: Vikram Voleti, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitry Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani

    Abstract: We present Stable Video 3D (SV3D) -- a latent video diffusion model for high-resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent work on 3D generation propose techniques to adapt 2D generative models for novel view synthesis (NVS) and 3D optimization. However, these methods have several disadvantages due to either limited views or inconsistent NVS, thereby affec… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: https://sv3d.github.io/

  2. arXiv:2402.05919  [pdf, other

    cs.CV cs.GR

    Collaborative Control for Geometry-Conditioned PBR Image Generation

    Authors: Shimon Vainer, Mark Boss, Mathias Parger, Konstantin Kutsy, Dante De Nigris, Ciara Rowles, Nicolas Perony, Simon Donné

    Abstract: Current 3D content generation approaches build on diffusion models that output RGB images. Modern graphics pipelines, however, require physically-based rendering (PBR) material properties. We propose to model the PBR image distribution directly, avoiding photometric inaccuracies in RGB generation and the inherent ambiguity in extracting PBR from RGB. Existing paradigms for cross-modal fine-tuning… ▽ More

    Submitted 20 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 19 pages, 10 figures; Project page: https://unity-research.github.io/holo-gen/

    ACM Class: I.4.0

  3. arXiv:2401.10171  [pdf, other

    cs.CV cs.GR

    SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild

    Authors: Andreas Engelhardt, Amit Raj, Mark Boss, Yunzhi Zhang, Abhishek Kar, Yuanzhen Li, Deqing Sun, Ricardo Martin Brualla, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani

    Abstract: We present SHINOBI, an end-to-end framework for the reconstruction of shape, material, and illumination from object images captured with varying lighting, pose, and background. Inverse rendering of an object based on unconstrained image collections is a long-standing challenge in computer vision and graphics and requires a joint optimization over shape, radiance, and pose. We show that an implicit… ▽ More

    Submitted 29 March, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024). Updated supplementary material and acknowledgements

  4. Video object detection for privacy-preserving patient monitoring in intensive care

    Authors: Raphael Emberger, Jens Michael Boss, Daniel Baumann, Marko Seric, Shufan Huo, Lukas Tuggener, Emanuela Keller, Thilo Stadelmann

    Abstract: Patient monitoring in intensive care units, although assisted by biosensors, needs continuous supervision of staff. To reduce the burden on staff members, IT infrastructures are built to record monitoring data and develop clinical decision support systems. These systems, however, are vulnerable to artifacts (e.g. muscle movement due to ongoing treatment), which are often indistinguishable from rea… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 4 pages, 3 figures, 2023 10th Swiss Conference on Data Science (SDS), code available at https://github.com/raember/yolov5r_autodidact and https://github.com/raember/VideoProc

    ACM Class: I.2.10

  5. arXiv:2211.09084  [pdf, ps, other

    cs.SE cs.AI

    Technical Report on Neural Language Models and Few-Shot Learning for Systematic Requirements Processing in MDSE

    Authors: Vincent Bertram, Miriam Boß, Evgeny Kusmenko, Imke Helene Nachmann, Bernhard Rumpe, Danilo Trotta, Louis Wachtmeister

    Abstract: Systems engineering, in particular in the automotive domain, needs to cope with the massively increasing numbers of requirements that arise during the development process. To guarantee a high product quality and make sure that functional safety standards such as ISO26262 are fulfilled, the exploitation of potentials of model-driven systems engineering in the form of automatic analyses, consistency… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    ACM Class: D.2.1; I.2.7

  6. arXiv:2205.15768  [pdf, other

    cs.CV cs.GR cs.LG

    SAMURAI: Shape And Material from Unconstrained Real-world Arbitrary Image collections

    Authors: Mark Boss, Andreas Engelhardt, Abhishek Kar, Yuanzhen Li, Deqing Sun, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani

    Abstract: Inverse rendering of an object under entirely unknown capture conditions is a fundamental challenge in computer vision and graphics. Neural approaches such as NeRF have achieved photorealistic results on novel view synthesis, but they require known camera poses. Solving this problem with unknown camera poses is highly challenging as it requires joint optimization over shape, radiance, and pose. Th… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  7. TC-Driver: Trajectory Conditioned Driving for Robust Autonomous Racing -- A Reinforcement Learning Approach

    Authors: Edoardo Ghignone, Nicolas Baumann, Mike Boss, Michele Magno

    Abstract: Autonomous racing is becoming popular for academic and industry researchers as a test for general autonomous driving by pushing perception, planning, and control algorithms to their limits. While traditional control methods such as MPC are capable of generating an optimal control sequence at the edge of the vehicles physical controllability, these methods are sensitive to the accuracy of the model… ▽ More

    Submitted 6 July, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 6 pages, 4 figures, 3 tables, ICRA, OPPORTUNITIES AND CHALLENGES WITH AUTONOMOUS RACING, IEEE

    Journal ref: Field Robotics 2023

  8. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer , et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc… ▽ More

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  9. arXiv:2110.14373  [pdf, other

    cs.CV cs.GR cs.LG

    Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition

    Authors: Mark Boss, Varun Jampani, Raphael Braun, Ce Liu, Jonathan T. Barron, Hendrik P. A. Lensch

    Abstract: Decomposing a scene into its shape, reflectance and illumination is a fundamental problem in computer vision and graphics. Neural approaches such as NeRF have achieved remarkable success in view synthesis, but do not explicitly perform decomposition and instead operate exclusively on radiance (the product of reflectance and illumination). Extensions to NeRF, such as NeRD, can perform decomposition… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Project page: https://markboss.me/publication/2021-neural-pil/ Video: https://youtu.be/AsdAR5u3vQ8 - Accepted at NeurIPS 2021

  10. arXiv:2012.03918  [pdf, other

    cs.CV cs.GR cs.LG

    NeRD: Neural Reflectance Decomposition from Image Collections

    Authors: Mark Boss, Raphael Braun, Varun Jampani, Jonathan T. Barron, Ce Liu, Hendrik P. A. Lensch

    Abstract: Decomposing a scene into its shape, reflectance, and illumination is a challenging but important problem in computer vision and graphics. This problem is inherently more challenging when the illumination is not a single light source under laboratory conditions but is instead an unconstrained environmental illumination. Though recent work has shown that implicit representations can be used to model… ▽ More

    Submitted 26 August, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Accepted at ICCV 2021

  11. Two-shot Spatially-varying BRDF and Shape Estimation

    Authors: Mark Boss, Varun Jampani, Kihwan Kim, Hendrik P. A. Lensch, Jan Kautz

    Abstract: Capturing the shape and spatially-varying appearance (SVBRDF) of an object from images is a challenging task that has applications in both computer vision and graphics. Traditional optimization-based approaches often need a large number of images taken from multiple views in a controlled environment. Newer deep learning-based approaches require only a few input images, but the reconstruction quali… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

  12. arXiv:1910.05148  [pdf, other

    cs.GR cs.CV cs.LG eess.IV

    Single Image BRDF Parameter Estimation with a Conditional Adversarial Network

    Authors: Mark Boss, Hendrik P. A. Lensch

    Abstract: Creating plausible surfaces is an essential component in achieving a high degree of realism in rendering. To relieve artists, who create these surfaces in a time-consuming, manual process, automated retrieval of the spatially-varying Bidirectional Reflectance Distribution Function (SVBRDF) from a single mobile phone image is desirable. By leveraging a deep neural network, this casual capturing met… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.