Skip to main content

Showing 1–50 of 100 results for author: Mitra, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00609  [pdf, other

    cs.CV cs.AI

    SuperGaussian: Repurposing Video Models for 3D Super Resolution

    Authors: Yuan Shen, Duygu Ceylan, Paul Guerrero, Zexiang Xu, Niloy J. Mitra, Shenlong Wang, Anna Frühstück

    Abstract: We present a simple, modular, and generic method that upsamples coarse 3D models by adding geometric and appearance details. While generative 3D models now exist, they do not yet match the quality of their counterparts in image and video domains. We demonstrate that it is possible to directly repurpose existing (pretrained) video models for 3D super-resolution and thus sidestep the problem of the… ▽ More

    Submitted 1 July, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted at ECCV 2024, project website with interactive demo: https://supergaussian.github.io

  2. arXiv:2403.17103  [pdf, other

    cs.CV

    Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos

    Authors: Remy Sabathier, Niloy J. Mitra, David Novotny

    Abstract: We present a method to build animatable dog avatars from monocular videos. This is challenging as animals display a range of (unpredictable) non-rigid movements and have a variety of appearance details (e.g., fur, spots, tails). We develop an approach that links the video frames via a 4D solution that jointly solves for animal's pose variation, and its appearance (in a canonical pose). To this end… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2312.08744  [pdf, other

    cs.CV cs.GR

    GOEnFusion: Gradient Origin Encodings for 3D Forward Diffusion Models

    Authors: Animesh Karnewar, Andrea Vedaldi, Niloy J. Mitra, David Novotny

    Abstract: The recently introduced Forward-Diffusion method allows to train a 3D diffusion model using only 2D images for supervision. However, it does not easily generalise to different 3D representations and requires a computationally expensive auto-regressive sampling process to generate the underlying 3D scenes. In this paper, we propose GOEn: Gradient Origin Encoding (pronounced "gone"). GOEn can encode… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: project page at: https://holodiffusion.github.io/goenfusion

  4. arXiv:2312.03079  [pdf, other

    cs.CV cs.GR

    LooseControl: Lifting ControlNet for Generalized Depth Conditioning

    Authors: Shariq Farooq Bhat, Niloy J. Mitra, Peter Wonka

    Abstract: We present LooseControl to allow generalized depth conditioning for diffusion-based image generation. ControlNet, the SOTA for depth-conditioned image generation, produces remarkable results but relies on having access to detailed depth maps for guidance. Creating such exact depth maps, in many scenarios, is challenging. This paper introduces a generalized version of depth conditioning that enable… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2312.02190  [pdf, other

    cs.CV cs.GR

    Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D

    Authors: Karran Pandey, Paul Guerrero, Matheus Gadelha, Yannick Hold-Geoffroy, Karan Singh, Niloy Mitra

    Abstract: Diffusion Handles is a novel approach to enabling 3D object edits on diffusion images. We accomplish these edits using existing pre-trained diffusion models, and 2D image depth estimation, without any fine-tuning or 3D object retrieval. The edited results remain plausible, photo-real, and preserve object identity. Diffusion Handles address a critically missing facet of generative image based creat… ▽ More

    Submitted 6 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Project Webpage: https://diffusionhandles.github.io/

  6. arXiv:2311.17851  [pdf, other

    cs.CV

    Leveraging VLM-Based Pipelines to Annotate 3D Objects

    Authors: Rishabh Kabra, Loic Matthey, Alexander Lerchner, Niloy J. Mitra

    Abstract: Pretrained vision language models (VLMs) present an opportunity to caption unlabeled 3D objects at scale. The leading approach to summarize VLM descriptions from different views of an object (Luo et al., 2023) relies on a language model (GPT4) to produce the final output. This text-based aggregation is susceptible to hallucinations as it merges potentially contradictory descriptions. We propose an… ▽ More

    Submitted 17 June, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  7. arXiv:2311.17024  [pdf, other

    cs.CV cs.GR

    Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features

    Authors: Niladri Shekhar Dutt, Sanjeev Muralikrishnan, Niloy J. Mitra

    Abstract: We present Diff3F as a simple, robust, and class-agnostic feature descriptor that can be computed for untextured input shapes (meshes or point clouds). Our method distills diffusion features from image foundational models onto input shapes. Specifically, we use the input shapes to produce depth and normal maps as guidance for conditional image synthesis. In the process, we produce (diffusion) feat… ▽ More

    Submitted 2 April, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted at CVPR'24

  8. arXiv:2311.16703  [pdf, other

    cs.CV cs.GR

    CADTalk: An Algorithm and Benchmark for Semantic Commenting of CAD Programs

    Authors: Haocheng Yuan, **g Xu, Hao Pan, Adrien Bousseau, Niloy J. Mitra, Changjian Li

    Abstract: CAD programs are a popular way to compactly encode shapes as a sequence of operations that are easy to parametrically modify. However, without sufficient semantic comments and structure, such programs can be challenging to understand, let alone modify. We introduce the problem of semantic commenting CAD programs, wherein the goal is to segment the input program into code blocks corresponding to se… ▽ More

    Submitted 25 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  9. arXiv:2311.14468  [pdf, other

    cs.LG

    Efficient Gradient Estimation via Adaptive Sampling and Importance Sampling

    Authors: Corentin Salaün, Xingchang Huang, Iliyan Georgiev, Niloy J. Mitra, Gurprit Singh

    Abstract: Machine learning problems rely heavily on stochastic gradient descent (SGD) for optimization. The effectiveness of SGD is contingent upon accurately estimating gradients from a mini-batch of data samples. Instead of the commonly used uniform sampling, adaptive or importance sampling reduces noise in gradient estimation by forming mini-batches that prioritize crucial data points. Previous research… ▽ More

    Submitted 27 November, 2023; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: 15 pages, 10 figures

  10. arXiv:2310.09965  [pdf, other

    cs.CV cs.GR

    ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context

    Authors: Binglun Wang, Niladri Shekhar Dutt, Niloy J. Mitra

    Abstract: Neural Radiance Fields (NeRFs) have recently emerged as a popular option for photo-realistic object capture due to their ability to faithfully capture high-fidelity volumetric content even from handheld video input. Although much research has been devoted to efficient optimization leading to real-time training and rendering, options for interactive editing NeRFs remain limited. We present a very s… ▽ More

    Submitted 23 April, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Accepted at I3D'24 (ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES)

  11. arXiv:2309.04836  [pdf, other

    cs.CV cs.GR

    Neural Semantic Surface Maps

    Authors: Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra

    Abstract: We present an automated technique for computing a map between two genus-zero shapes, which matches semantically corresponding regions to one another. Lack of annotated data prohibits direct inference of 3D semantic priors; instead, current State-of-the-art methods predominantly optimize geometric properties or require varying amounts of manual annotation. To overcome the lack of annotated training… ▽ More

    Submitted 8 March, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted at Eurographics 2024

  12. arXiv:2309.01765  [pdf, other

    cs.CV

    BLiSS: Bootstrapped Linear Shape Space

    Authors: Sanjeev Muralikrishnan, Chun-Hao Paul Huang, Duygu Ceylan, Niloy J. Mitra

    Abstract: Morphable models are fundamental to numerous human-centered processes as they offer a simple yet expressive shape space. Creating such morphable models, however, is both tedious and expensive. The main challenge is establishing dense correspondences across raw scans that capture sufficient shape variation. This is often addressed using a mix of significant manual intervention and non-rigid registr… ▽ More

    Submitted 9 February, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 12 pages, 10 figures

  13. arXiv:2308.14244  [pdf, other

    cs.CV cs.GR

    HoloFusion: Towards Photo-realistic 3D Generative Modeling

    Authors: Animesh Karnewar, Niloy J. Mitra, Andrea Vedaldi, David Novotny

    Abstract: Diffusion-based image generators can now produce high-quality and diverse samples, but their success has yet to fully translate to 3D generation: existing diffusion methods can either generate low-resolution but 3D consistent outputs, or detailed 2D views of 3D objects but with potential structural defects and lacking view consistency or realism. We present HoloFusion, a method that combines the b… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: ICCV 2023 conference; project page at: https://holodiffusion.github.io/holofusion

  14. arXiv:2306.02489  [pdf, other

    cs.HC

    A Comparative Evaluation of Visual Summarization Techniques for Event Sequences

    Authors: Kazi Tasnim Zinat, **hua Yang, Arjun Gandhi, Nistha Mitra, Zhicheng Liu

    Abstract: Real-world event sequences are often complex and heterogeneous, making it difficult to create meaningful visualizations using simple data aggregation and visual encoding techniques. Consequently, visualization researchers have developed numerous visual summarization techniques to generate concise overviews of sequential data. These techniques vary widely in terms of summary structures and contents… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

  15. arXiv:2305.05661  [pdf, other

    cs.GR cs.AI cs.CV cs.LG cs.PL

    ShapeCoder: Discovering Abstractions for Visual Programs from Unstructured Primitives

    Authors: R. Kenny Jones, Paul Guerrero, Niloy J. Mitra, Daniel Ritchie

    Abstract: Programs are an increasingly popular representation for visual data, exposing compact, interpretable structure that supports manipulation. Visual programs are usually written in domain-specific languages (DSLs). Finding "good" programs, that only expose meaningful degrees of freedom, requires access to a DSL with a "good" library of functions, both of which are typically authored by domain experts… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH 2023

  16. arXiv:2304.10950  [pdf, other

    cs.CV

    Factored Neural Representation for Scene Understanding

    Authors: Yu-Shiang Wong, Niloy J. Mitra

    Abstract: A long-standing goal in scene understanding is to obtain interpretable and editable representations that can be directly constructed from a raw monocular RGB-D video, without requiring specialized hardware setup or priors. The problem is significantly more challenging in the presence of multiple moving and/or deforming objects. Traditional methods have approached the setup with a mix of simplifica… ▽ More

    Submitted 20 June, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  17. arXiv:2304.10320  [pdf, other

    cs.GR

    Neurosymbolic Models for Computer Graphics

    Authors: Daniel Ritchie, Paul Guerrero, R. Kenny Jones, Niloy J. Mitra, Adriana Schulz, Karl D. D. Willis, Jiajun Wu

    Abstract: Procedural models (i.e. symbolic programs that output visual data) are a historically-popular method for representing graphics content: vegetation, buildings, textures, etc. They offer many advantages: interpretable design parameters, stochastic variations, high-quality outputs, compact representation, and more. But they also have some limitations, such as the difficulty of authoring a procedural… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Eurographics 2023 State-of-the-art report (STAR)

  18. arXiv:2303.16509  [pdf, other

    cs.CV cs.GR

    HoloDiffusion: Training a 3D Diffusion Model using 2D Images

    Authors: Animesh Karnewar, Andrea Vedaldi, David Novotny, Niloy Mitra

    Abstract: Diffusion models have emerged as the best approach for generative modeling of 2D images. Part of their success is due to the possibility of training them on millions if not billions of images with a stable learning objective. However, extending these models to 3D remains difficult for two reasons. First, finding a large quantity of 3D training data is much more complex than for 2D images. Second,… ▽ More

    Submitted 21 May, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 conference; project page at: https://holodiffusion.github.io/

  19. arXiv:2303.12688  [pdf, other

    cs.CV

    Pix2Video: Video Editing using Image Diffusion

    Authors: Duygu Ceylan, Chun-Hao Paul Huang, Niloy J. Mitra

    Abstract: Image diffusion models, trained on massive image collections, have emerged as the most versatile image generator model in terms of quality and diversity. They support inverting real images and conditional (e.g., text) generation, making them attractive for high-quality image editing applications. We investigate how to use such pre-trained image models for text-guided video editing. The critical ch… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  20. arXiv:2303.08639  [pdf, other

    cs.CV

    Blowing in the Wind: CycleNet for Human Cinemagraphs from Still Images

    Authors: Hugo Bertiche, Niloy J. Mitra, Kuldeep Kulkarni, Chun-Hao Paul Huang, Tuanfeng Y. Wang, Meysam Madadi, Sergio Escalera, Duygu Ceylan

    Abstract: Cinemagraphs are short loo** videos created by adding subtle motions to a static image. This kind of media is popular and engaging. However, automatic generation of cinemagraphs is an underexplored area and current solutions require tedious low-level manual authoring by artists. In this paper, we present an automatic method that allows generating human cinemagraphs from single RGB images. We inv… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  21. arXiv:2211.14902  [pdf, other

    cs.CV cs.GR

    3inGAN: Learning a 3D Generative Model from Images of a Self-similar Scene

    Authors: Animesh Karnewar, Oliver Wang, Tobias Ritschel, Niloy Mitra

    Abstract: We introduce 3inGAN, an unconditional 3D generative model trained from 2D images of a single self-similar 3D scene. Such a model can be used to produce 3D "remixes" of a given scene, by map** spatial latent codes into a 3D volumetric representation, which can subsequently be rendered from arbitrary views using physically based volume rendering. By construction, the generated scenes remain view-c… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Conference accept at 3DV 2022

  22. arXiv:2211.09869  [pdf, other

    cs.CV cs.LG

    RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation

    Authors: Titas Anciukevičius, Zexiang Xu, Matthew Fisher, Paul Henderson, Hakan Bilen, Niloy J. Mitra, Paul Guerrero

    Abstract: Diffusion models currently achieve state-of-the-art performance for both conditional and unconditional image generation. However, so far, image diffusion models do not support tasks required for 3D understanding, such as view-consistent 3D generation or single-view object reconstruction. In this paper, we present RenderDiffusion, the first diffusion model for 3D generation and inference, trained u… ▽ More

    Submitted 20 February, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted at CVPR 2023. Project page: https://github.com/Anciukevicius/RenderDiffusion

  23. arXiv:2210.14808  [pdf, other

    cs.CV

    Search for Concepts: Discovering Visual Concepts Using Direct Optimization

    Authors: Pradyumna Reddy, Paul Guerrero, Niloy J. Mitra

    Abstract: Finding an unsupervised decomposition of an image into individual objects is a key step to leverage compositionality and to perform symbolic reasoning. Traditionally, this problem is solved using amortized inference, which does not generalize beyond the scope of the training data, may sometimes miss correct decompositions, and requires large amounts of training data. We propose finding a decomposi… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  24. Motion Guided Deep Dynamic 3D Garments

    Authors: Meng Zhang, Duygu Ceylan, Niloy J. Mitra

    Abstract: Realistic dynamic garments on animated characters have many AR/VR applications. While authoring such dynamic garment geometry is still a challenging task, data-driven simulation provides an attractive alternative, especially if it can be controlled simply using the motion of the underlying character. In this work, we focus on motion guided dynamic 3D garments, especially for loose garments. In a d… ▽ More

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: 11 pages

  25. arXiv:2207.08890  [pdf, other

    cs.CV cs.GR cs.LG

    NeuForm: Adaptive Overfitting for Neural Shape Editing

    Authors: Connor Z. Lin, Niloy J. Mitra, Gordon Wetzstein, Leonidas Guibas, Paul Guerrero

    Abstract: Neural representations are popular for representing shapes, as they can be learned form sensor data and used for data cleanup, model completion, shape editing, and shape synthesis. Current neural representations can be categorized as either overfitting to a single object instance, or representing a collection of objects. However, neither allows accurate editing of neural scene representations: on… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  26. MatFormer: A Generative Model for Procedural Materials

    Authors: Paul Guerrero, Miloš Hašan, Kalyan Sunkavalli, Radomír Měch, Tamy Boubekeur, Niloy J. Mitra

    Abstract: Procedural material graphs are a compact, parameteric, and resolution-independent representation that are a popular choice for material authoring. However, designing procedural materials requires significant expertise and publicly accessible libraries contain only a few thousand such graphs. We present MatFormer, a generative model that can produce a diverse set of high-quality procedural material… ▽ More

    Submitted 15 August, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

    Journal ref: ACM Transactions on Graphics, Volume 41, Issue 4 (Proceedings of Siggraph 2022)

  27. arXiv:2205.14657  [pdf, other

    cs.CV cs.GR cs.LG

    COFS: Controllable Furniture layout Synthesis

    Authors: Wamiq Reyaz Para, Paul Guerrero, Niloy Mitra, Peter Wonka

    Abstract: Scalable generation of furniture layouts is essential for many applications in virtual reality, augmented reality, game development and synthetic data generation. Many existing methods tackle this problem as a sequence generation problem which imposes a specific ordering on the elements of the layout making such methods impractical for interactive editing or scene completion. Additionally, most me… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: Initial Version

  28. arXiv:2205.13996  [pdf, other

    cs.CV cs.GR

    Video2StyleGAN: Disentangling Local and Global Variations in a Video

    Authors: Rameen Abdal, Peihao Zhu, Niloy J. Mitra, Peter Wonka

    Abstract: Image editing using a pretrained StyleGAN generator has emerged as a powerful paradigm for facial editing, providing disentangled controls over age, expression, illumination, etc. However, the approach cannot be directly adopted for video manipulations. We hypothesize that the main missing ingredient is the lack of fine-grained and disentangled control over face location, face pose, and local faci… ▽ More

    Submitted 30 May, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Video : https://youtu.be/oUeXFyfdE1A

  29. ReLU Fields: The Little Non-linearity That Could

    Authors: Animesh Karnewar, Tobias Ritschel, Oliver Wang, Niloy J. Mitra

    Abstract: In many recent works, multi-layer perceptions (MLPs) have been shown to be suitable for modeling complex spatially-varying functions including images and 3D scenes. Although the MLPs are able to represent complex scenes with unprecedented quality and memory footprint, this expressive power of the MLPs, however, comes at the cost of long training and inference times. On the other hand, bilinear/tri… ▽ More

    Submitted 2 July, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Published at SIGGRAPH 2022

  30. arXiv:2204.02289  [pdf, other

    cs.CV cs.GR

    Neural Convolutional Surfaces

    Authors: Luca Morreale, Noam Aigerman, Paul Guerrero, Vladimir G. Kim, Niloy J. Mitra

    Abstract: This work is concerned with a representation of shapes that disentangles fine, local and possibly repeating geometry, from global, coarse structures. Achieving such disentanglement leads to two unrelated advantages: i) a significant compression in the number of parameters required to represent a given geometry; ii) the ability to manipulate either global geometry, or local details, without harming… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Journal ref: CVPR 2022

  31. arXiv:2203.07293  [pdf, other

    cs.CV cs.GR cs.LG

    InsetGAN for Full-Body Image Generation

    Authors: Anna Frühstück, Krishna Kumar Singh, Eli Shechtman, Niloy J. Mitra, Peter Wonka, **gwan Lu

    Abstract: While GANs can produce photo-realistic images in ideal conditions for certain domains, the generation of full-body human images remains difficult due to the diversity of identities, hairstyles, clothing, and the variance in pose. Instead of modeling this complex domain with a single GAN, we propose a novel method to combine multiple pretrained GANs, where one GAN generates a global canvas (e.g., h… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: Project webpage and video available at http://afruehstueck.github.io/insetgan

  32. arXiv:2201.10326  [pdf, other

    cs.CV cs.GR cs.LG

    ShapeFormer: Transformer-based Shape Completion via Sparse Representation

    Authors: Xingguang Yan, Liqiang Lin, Niloy J. Mitra, Dani Lischinski, Daniel Cohen-Or, Hui Huang

    Abstract: We present ShapeFormer, a transformer-based network that produces a distribution of object completions, conditioned on incomplete, and possibly noisy, point clouds. The resultant distribution can then be sampled to generate likely completions, each exhibiting plausible shape details while being faithful to the input. To facilitate the use of transformers for 3D, we introduce a compact 3D represent… ▽ More

    Submitted 22 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Project page: https://shapeformer.github.io/

  33. arXiv:2112.05219  [pdf, other

    cs.CV cs.GR

    CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions

    Authors: Rameen Abdal, Peihao Zhu, John Femiani, Niloy J. Mitra, Peter Wonka

    Abstract: The success of StyleGAN has enabled unprecedented semantic editing capabilities, on both synthesized and real images. However, such editing operations are either trained with semantic supervision or described using human guidance. In another development, the CLIP architecture has been trained with internet-scale image and text pairings and has been shown to be useful in several zero-shot learning… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  34. arXiv:2111.05916  [pdf, other

    cs.CV

    Dance In the Wild: Monocular Human Animation with Neural Dynamic Appearance Synthesis

    Authors: Tuanfeng Y. Wang, Duygu Ceylan, Krishna Kumar Singh, Niloy J. Mitra

    Abstract: Synthesizing dynamic appearances of humans in motion plays a central role in applications such as AR/VR and video editing. While many recent methods have been proposed to tackle this problem, handling loose garments with complex textures and high dynamic motion still remains challenging. In this paper, we propose a video based appearance synthesis method that tackles such challenges and demonstrat… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

  35. arXiv:2109.10695  [pdf, other

    cs.CV cs.GR

    Differentiable Surface Triangulation

    Authors: Marie-Julie Rakotosaona, Noam Aigerman, Niloy Mitra, Maks Ovsjanikov, Paul Guerrero

    Abstract: Triangle meshes remain the most popular data representation for surface geometry. This ubiquitous representation is essentially a hybrid one that decouples continuous vertex locations from the discrete topological triangulation. Unfortunately, the combinatorial nature of the triangulation prevents taking derivatives over the space of possible meshings of any given surface. As a result, to date, me… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

  36. arXiv:2109.00113  [pdf, other

    cs.CV

    CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point Clouds

    Authors: Eric-Tuan Lê, Minhyuk Sung, Duygu Ceylan, Radomir Mech, Tamy Boubekeur, Niloy J. Mitra

    Abstract: Representing human-made objects as a collection of base primitives has a long history in computer vision and reverse engineering. In the case of high-resolution point cloud scans, the challenge is to be able to detect both large primitives as well as those explaining the detailed parts. While the classical RANSAC approach requires case-specific parameter tuning, state-of-the-art networks are limit… ▽ More

    Submitted 6 September, 2021; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: ICCV 2021: 15 pages, 8 figures

    Journal ref: ICCV 2021

  37. arXiv:2108.03225  [pdf, other

    cs.CV

    GLASS: Geometric Latent Augmentation for Shape Spaces

    Authors: Sanjeev Muralikrishnan, Siddhartha Chaudhuri, Noam Aigerman, Vladimir Kim, Matthew Fisher, Niloy Mitra

    Abstract: We investigate the problem of training generative models on a very sparse collection of 3D models. We use geometrically motivated energies to augment and thus boost a sparse collection of example (training) models. We analyze the Hessian of the as-rigid-as-possible (ARAP) energy to sample from and project to the underlying (local) shape space, and use the augmented dataset to train a variational a… ▽ More

    Submitted 29 April, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: CVPR 2022

  38. arXiv:2107.05284  [pdf, other

    cs.GR

    CurveFusion: Reconstructing Thin Structures from RGBD Sequences

    Authors: Lingjie Liu, Nenglun Chen, Duygu Ceylan, Christian Theobalt, Wen** Wang, Niloy J. Mitra

    Abstract: We introduce CurveFusion, the first approach for high quality scanning of thin structures at interactive rates using a handheld RGBD camera. Thin filament-like structures are mathematically just 1D curves embedded in R^3, and integration-based reconstruction works best when depth sequences (from the thin structure parts) are fused using the object's (unknown) curve skeleton. Thus, using the comple… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  39. arXiv:2106.06866  [pdf, other

    cs.CV cs.GR

    A Multi-Implicit Neural Representation for Fonts

    Authors: Pradyumna Reddy, Zhifei Zhang, Matthew Fisher, Hailin **, Zhaowen Wang, Niloy J. Mitra

    Abstract: Fonts are ubiquitous across documents and come in a variety of styles. They are either represented in a native vector format or rasterized to produce fixed resolution images. In the first case, the non-standard representation prevents benefiting from latest network architectures for neural representations; while, in the latter case, the rasterized representation, when encoded via networks, results… ▽ More

    Submitted 9 January, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  40. arXiv:2106.04477  [pdf, other

    cs.CV

    MoCo-Flow: Neural Motion Consensus Flow for Dynamic Humans in Stationary Monocular Cameras

    Authors: Xuelin Chen, Weiyu Li, Daniel Cohen-Or, Niloy J. Mitra, Baoquan Chen

    Abstract: Synthesizing novel views of dynamic humans from stationary monocular cameras is a specialized but desirable setup. This is particularly attractive as it does not require static scenes, controlled environments, or specialized capture hardware. In contrast to techniques that exploit multi-view observations, the problem of modeling a dynamic scene from a single view is significantly more under-constr… ▽ More

    Submitted 7 February, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

  41. arXiv:2106.02711  [pdf, other

    cs.LG cs.AI cs.CV cs.GR

    SketchGen: Generating Constrained CAD Sketches

    Authors: Wamiq Reyaz Para, Shariq Farooq Bhat, Paul Guerrero, Tom Kelly, Niloy Mitra, Leonidas Guibas, Peter Wonka

    Abstract: Computer-aided design (CAD) is the most widely used modeling approach for technical design. The typical starting point in these designs is 2D sketches which can later be extruded and combined to obtain complex three-dimensional assemblies. Such sketches are typically composed of parametric primitives, such as points, lines, and circular arcs, augmented with geometric constraints linking the primit… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 21 pages, 12 figures, 8 tables

  42. arXiv:2106.00329  [pdf, other

    cs.CV

    Consistent Two-Flow Network for Tele-Registration of Point Clouds

    Authors: Zihao Yan, Zimu Yi, Ruizhen Hu, Niloy J. Mitra, Daniel Cohen-Or, Hui Huang

    Abstract: Rigid registration of partial observations is a fundamental problem in various applied fields. In computer graphics, special attention has been given to the registration between two partial point clouds generated by scanning devices. State-of-the-art registration techniques still struggle when the overlap region between the two point clouds is small, and completely fail if there is no overlap betw… ▽ More

    Submitted 10 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: Accepted to IEEE TVCG 2021, project page at https://vcc.tech/research/2021/CTFNet

  43. arXiv:2105.14548  [pdf, other

    cs.GR cs.CV cs.LG

    Z2P: Instant Visualization of Point Clouds

    Authors: Gal Metzer, Rana Hanocka, Raja Giryes, Niloy J. Mitra, Daniel Cohen-Or

    Abstract: We present a technique for visualizing point clouds using a neural network. Our technique allows for an instant preview of any point cloud, and bypasses the notoriously difficult surface reconstruction problem or the need to estimate oriented normals for splat-based rendering. We cast the preview problem as a conditional image-to-image translation task, and design a neural network that translates… ▽ More

    Submitted 21 February, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Eurographics 2022

  44. arXiv:2104.06392  [pdf, other

    cs.GR cs.AI cs.CV cs.LG

    ShapeMOD: Macro Operation Discovery for 3D Shape Programs

    Authors: R. Kenny Jones, David Charatan, Paul Guerrero, Niloy J. Mitra, Daniel Ritchie

    Abstract: A popular way to create detailed yet easily controllable 3D shapes is via procedural modeling, i.e. generating geometry using programs. Such programs consist of a series of instructions along with their associated parameter values. To fully realize the benefits of this representation, a shape program should be compact and only expose degrees of freedom that allow for meaningful manipulation of out… ▽ More

    Submitted 22 March, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: SIGGRAPH 2021. Project Page: https://rkjones4.github.io/shapeMOD.html

  45. arXiv:2103.16942  [pdf, other

    cs.CV cs.GR

    Neural Surface Maps

    Authors: Luca Morreale, Noam Aigerman, Vladimir Kim, Niloy J. Mitra

    Abstract: Maps are arguably one of the most fundamental concepts used to define and operate on manifold surfaces in differentiable geometry. Accordingly, in geometry processing, maps are ubiquitous and are used in many core applications, such as paramterization, shape analysis, remeshing, and deformation. Unfortunately, most computational representations of surface maps do not lend themselves to manipulatio… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: project page: http://geometry.cs.ucl.ac.uk/projects/2021/neuralmaps/

  46. arXiv:2103.14968  [pdf, other

    cs.CV

    Labels4Free: Unsupervised Segmentation using StyleGAN

    Authors: Rameen Abdal, Peihao Zhu, Niloy Mitra, Peter Wonka

    Abstract: We propose an unsupervised segmentation framework for StyleGAN generated objects. We build on two main observations. First, the features generated by StyleGAN hold valuable information that can be utilized towards training segmentation networks. Second, the foreground and background can often be treated to be largely independent and be composited in different ways. For our solution, we propose to… ▽ More

    Submitted 27 March, 2021; originally announced March 2021.

    Comments: "Project Page: https://rameenabdal.github.io/Labels4Free/"

    Journal ref: ICCV 2021

  47. arXiv:2103.00262  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Walk2Map: Extracting Floor Plans from Indoor Walk Trajectories

    Authors: Claudio Mura, Renato Pajarola, Konrad Schindler, Niloy Mitra

    Abstract: Recent years have seen a proliferation of new digital products for the efficient management of indoor spaces, with important applications like emergency management, virtual property showcasing and interior design. These products rely on accurate 3D models of the environments considered, including information on both architectural and non-permanent elements. These models must be created from measur… ▽ More

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: To be published in Computer Graphics Forum (Proc. Eurographics 2021)

  48. arXiv:2102.11861  [pdf, other

    cs.GR cs.CV cs.LG

    Generative Modelling of BRDF Textures from Flash Images

    Authors: Philipp Henzler, Valentin Deschaintre, Niloy J. Mitra, Tobias Ritschel

    Abstract: We learn a latent space for easy capture, consistent interpolation, and efficient reproduction of visual material appearance. When users provide a photo of a stationary natural material captured under flashlight illumination, first it is converted into a latent material code. Then, in the second step, conditioned on the material code, our method produces an infinite and diverse spatial field of BR… ▽ More

    Submitted 10 September, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

  49. arXiv:2102.11811  [pdf, other

    cs.CV

    Dynamic Neural Garments

    Authors: Meng Zhang, Duygu Ceylan, Tuanfeng Wang, Niloy J. Mitra

    Abstract: A vital task of the wider digital human effort is the creation of realistic garments on digital avatars, both in the form of characteristic fold patterns and wrinkles in static frames as well as richness of garment dynamics under avatars' motion. Existing workflow of modeling, simulation, and rendering closely replicates the physics behind real garments, but is tedious and requires repeating most… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: 13 pages

  50. arXiv:2102.02798  [pdf, other

    cs.CV cs.GR

    Im2Vec: Synthesizing Vector Graphics without Vector Supervision

    Authors: Pradyumna Reddy, Michael Gharbi, Michal Lukac, Niloy J. Mitra

    Abstract: Vector graphics are widely used to represent fonts, logos, digital artworks, and graphic designs. But, while a vast body of work has focused on generative algorithms for raster images, only a handful of options exists for vector graphics. One can always rasterize the input graphic and resort to image-based generative approaches, but this negates the advantages of the vector representation. The cur… ▽ More

    Submitted 1 April, 2021; v1 submitted 4 February, 2021; originally announced February 2021.