Showing 1–43 of 43 results for author: Aubry, M

Searching in archive cs.
  1. arXiv:2403.08721

    cs.CV

    Historical Astronomical Diagrams Decomposition in Geometric Primitives

    Authors: Syrine Kalleli, Scott Trigg, Ségolène Albouy, Mathieu Husson, Mathieu Aubry

    Abstract: Automatically extracting the geometric content from the hundreds of thousands of diagrams drawn in historical manuscripts would enable historians to study the diffusion of astronomical knowledge on a global scale. However, state-of-the-art vectorization methods, often designed to tackle modern data, are not adapted to the complexity and diversity of historical astronomical diagrams. Our contributi…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Code and dataset are available in http://imagine.enpc.fr/~kallelis/icdar2024/

  2. arXiv:2312.02985

    cs.CV

    FocalPose++: Focal Length and Object Pose Estimation via Render and Compare

    Authors: Martin Cífka, Georgy Ponimatkin, Yann Labbé, Bryan Russell, Mathieu Aubry, Vladimir Petrik, Josef Sivic

    Abstract: We introduce FocalPose++, a neural render-and-compare method for jointly estimating the camera-object 6D pose and camera focal length given a single RGB input image depicting a known object. The contributions of this work are threefold. First, we derive a focal length update rule that extends an existing state-of-the-art render-and-compare 6D pose estimator to address the joint estimation task. Se…

    Submitted 15 November, 2023; originally announced December 2023.

    Comments: 21 pages, 18 figures. arXiv admin note: substantial text overlap with arXiv:2204.05145

  3. arXiv:2307.05473

    cs.CV

    Differentiable Blocks World: Qualitative 3D Decomposition by Rendering Primitives

    Authors: Tom Monnier, Jake Austin, Angjoo Kanazawa, Alexei A. Efros, Mathieu Aubry

    Abstract: Given a set of calibrated images of a scene, we present an approach that produces a simple, compact, and actionable 3D world representation by means of 3D primitives. While many approaches focus on recovering high-fidelity 3D scenes, we focus on parsing a scene into mid-level 3D representations made of a small set of textured primitives. Such representations are interpretable, easy to manipulate a…

    Submitted 26 December, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Project webpage with code and videos: https://www.tmonnier.com/DBW. V2 update includes comparisons based on NeuS, hyperparameter analysis and failure cases

  4. arXiv:2304.09704

    cs.CV

    Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans

    Authors: Romain Loiseau, Elliot Vincent, Mathieu Aubry, Loic Landrieu

    Abstract: We propose an unsupervised method for parsing large 3D scans of real-world scenes with easily-interpretable shapes. This work aims to provide a practical tool for analyzing 3D scenes in the context of aerial surveying and mapping, without the need for user annotations. Our approach is based on a probabilistic reconstruction model that decomposes an input 3D point cloud into a small set of learned…

    Submitted 28 March, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

  5. arXiv:2303.12533

    cs.CV

    Pixel-wise Agricultural Image Time Series Classification: Comparisons and a Deformable Prototype-based Approach

    Authors: Elliot Vincent, Jean Ponce, Mathieu Aubry

    Abstract: Improvements in Earth observation by satellites allow for imagery of ever higher temporal and spatial resolution. Leveraging this data for agricultural monitoring is key for addressing environmental and economic challenges. Current methods for crop segmentation using temporal data either rely on annotated data or are heavily engineered to compensate for the lack of supervision. In this paper, we prese…

    Submitted 22 March, 2023; originally announced March 2023.

  6. arXiv:2302.01660

    cs.CV

    The Learnable Typewriter: A Generative Approach to Text Analysis

    Authors: Ioannis Siglidis, Nicolas Gonthier, Julien Gaubil, Tom Monnier, Mathieu Aubry

    Abstract: We present a generative document-specific approach to character analysis and recognition in text lines. Our main idea is to build on unsupervised multi-object segmentation methods and in particular those that reconstruct images based on a limited amount of visual elements, called sprites. Taking as input a set of text lines with similar font or handwriting, our approach can learn a large number of…

    Submitted 14 April, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: For the code and a quick-overview visit the project webpage at http://imagine.enpc.fr/~siglidii/learnable-typewriter

  7. arXiv:2212.06870

    cs.CV cs.RO

    MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare

    Authors: Yann Labbé, Lucas Manuelli, Arsalan Mousavian, Stephen Tyree, Stan Birchfield, Jonathan Tremblay, Justin Carpentier, Mathieu Aubry, Dieter Fox, Josef Sivic

    Abstract: We introduce MegaPose, a method to estimate the 6D pose of novel objects, that is, objects unseen during training. At inference time, the method only assumes knowledge of (i) a region of interest displaying the object in the image and (ii) a CAD model of the observed object. The contributions of this work are threefold. First, we present a 6D pose refiner based on a render&compare strategy which c…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: CoRL 2022

  8. arXiv:2208.12625

    cs.LG cs.CV

    Take One Gram of Neural Features, Get Enhanced Group Robustness

    Authors: Simon Roburin, Charles Corbière, Gilles Puy, Nicolas Thome, Matthieu Aubry, Renaud Marlet, Patrick Pérez

    Abstract: Predictive performance of machine learning models trained with empirical risk minimization (ERM) can degrade considerably under distribution shifts. The presence of spurious correlations in training datasets leads ERM-trained models to display high loss when evaluated on minority groups not presenting such correlations. Extensive attempts have been made to develop methods improving worst-group rob…

    Submitted 7 February, 2023; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: Long version (Previous version: OOD-CV Workshop @ ECCV 2022)

  9. arXiv:2208.03311

    cs.SD eess.AS

    A Model You Can Hear: Audio Identification with Playable Prototypes

    Authors: Romain Loiseau, Baptiste Bouvier, Yann Teytaut, Elliot Vincent, Mathieu Aubry, Loic Landrieu

    Abstract: Machine learning techniques have proved useful for classifying and analyzing audio content. However, recent methods typically rely on abstract and high-dimensional representations that are difficult to interpret. Inspired by transformation-invariant approaches developed for image and 3D data, we propose an audio identification model based on learnable spectral prototypes. Equipped with dedicated t…

    Submitted 5 August, 2022; originally announced August 2022.

  10. arXiv:2206.08194

    cs.CV

    Online Segmentation of LiDAR Sequences: Dataset and Algorithm

    Authors: Romain Loiseau, Mathieu Aubry, Loïc Landrieu

    Abstract: Roof-mounted spinning LiDAR sensors are widely used by autonomous vehicles. However, most semantic datasets and algorithms used for LiDAR sequence segmentation operate on $360^\circ$ frames, causing an acquisition latency incompatible with real-time applications. To address this issue, we first introduce HelixNet, a $10$ billion point dataset with fine-grained labels, timestamps, and sensor rotati…

    Submitted 21 July, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Code and data are available at: https://romainloiseau.fr/helixnet

  11. arXiv:2206.06273

    cs.CG cs.CV

    Learning Joint Surface Atlases

    Authors: Theo Deprelle, Thibault Groueix, Noam Aigerman, Vladimir G. Kim, Mathieu Aubry

    Abstract: This paper describes new techniques for learning atlas-like representations of 3D surfaces, i.e. homeomorphic transformations from a 2D domain to surfaces. Compared to prior work, we propose two major contributions. First, instead of mapping a fixed 2D domain, such as a set of square patches, to the surface, we learn a continuous 2D domain with arbitrary topology by optimizing a point sampling dis…

    Submitted 13 June, 2022; originally announced June 2022.

  12. arXiv:2204.10310

    cs.CV cs.GR

    Share With Thy Neighbors: Single-View Reconstruction by Cross-Instance Consistency

    Authors: Tom Monnier, Matthew Fisher, Alexei A. Efros, Mathieu Aubry

    Abstract: Approaches for single-view reconstruction typically rely on viewpoint annotations, silhouettes, the absence of background, multiple views of the same instance, a template shape, or symmetry. We avoid all such supervision and assumptions by explicitly leveraging the consistency between images of different object instances. As a result, our method can learn from large collections of unlabelled image…

    Submitted 25 July, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: ECCV 2022. Project webpage with code and videos: http://imagine.enpc.fr/~monniert/UNICORN/

  13. arXiv:2204.05145

    cs.CV

    Focal Length and Object Pose Estimation via Render and Compare

    Authors: Georgy Ponimatkin, Yann Labbé, Bryan Russell, Mathieu Aubry, Josef Sivic

    Abstract: We introduce FocalPose, a neural render-and-compare method for jointly estimating the camera-object 6D pose and camera focal length given a single RGB input image depicting a known object. The contributions of this work are twofold. First, we derive a focal length update rule that extends an existing state-of-the-art render-and-compare 6D pose estimator to address the joint estimation task. Second…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR2022. Code available at http://github.com/ponimatkin/focalpose

  14. arXiv:2112.09648

    cs.CV

    Improving neural implicit surfaces geometry with patch warping

    Authors: François Darmon, Bénédicte Bascle, Jean-Clément Devaux, Pascal Monasse, Mathieu Aubry

    Abstract: Neural implicit surfaces have become an important technique for multi-view 3D reconstruction but their accuracy remains limited. In this paper, we argue that this comes from the difficulty to learn and render high frequency textures with neural networks. We thus propose to add to the standard neural rendering optimization a direct photo-consistency term across the different views. Intuitively, we…

    Submitted 8 May, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Accepted at CVPR2022. Project webpage: http://imagine.enpc.fr/~darmonf/

  15. arXiv:2110.15904

    cs.CV

    Learning Co-segmentation by Segment Swapping for Retrieval and Discovery

    Authors: Xi Shen, Alexei A. Efros, Armand Joulin, Mathieu Aubry

    Abstract: The goal of this work is to efficiently identify visually similar patterns in images, e.g. identifying an artwork detail copied between an engraving and an oil painting, or recognizing parts of a night-time photograph visible in its daytime counterpart. Lack of training data is a key challenge for this co-segmentation task. We present a simple yet surprisingly effective approach to overcome this d…

    Submitted 27 March, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: add results of unsupervised saliency detection

  16. arXiv:2109.01605

    cs.CV

    Representing Shape Collections with Alignment-Aware Linear Models

    Authors: Romain Loiseau, Tom Monnier, Mathieu Aubry, Loïc Landrieu

    Abstract: In this paper, we revisit the classical representation of 3D point clouds as linear shape models. Our key insight is to leverage deep learning to represent a collection of shapes as affine transformations of low-dimensional linear shape models. Each linear model is characterized by a shape prototype, a low-dimensional shape basis and two neural networks. The networks take as input a point cloud an…

    Submitted 17 December, 2021; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Accepted to 3DV 2021. 17 pages, 10 figures. Code and data are available at: https://romainloiseau.github.io/deep-linear-shapes

  17. arXiv:2108.08109

    cs.CV

    Image Collation: Matching illustrations in manuscripts

    Authors: Ryad Kaoua, Xi Shen, Alexandra Durr, Stavros Lazaris, David Picard, Mathieu Aubry

    Abstract: Illustrations are an essential transmission instrument. For an historian, the first step in studying their evolution in a corpus of similar manuscripts is to identify which ones correspond to each other. This image collation task is daunting for manuscripts separated by many lost copies, spreading over centuries, which might have been completely re-organized and greatly modified to adapt to novel…

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: accepted to ICDAR 2021

  18. Deep Multi-View Stereo gone wild

    Authors: François Darmon, Bénédicte Bascle, Jean-Clément Devaux, Pascal Monasse, Mathieu Aubry

    Abstract: Deep multi-view stereo (MVS) methods have been developed and extensively compared on simple datasets, where they now outperform classical approaches. In this paper, we ask whether the conclusions reached in controlled scenarios are still valid when working with Internet photo collections. We propose a methodology for evaluation and explore the influence of three aspects of deep MVS methods: networ…

    Submitted 2 December, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted to 3DV2021

  19. arXiv:2104.14575

    cs.CV

    Unsupervised Layered Image Decomposition into Object Prototypes

    Authors: Tom Monnier, Elliot Vincent, Jean Ponce, Mathieu Aubry

    Abstract: We present an unsupervised learning framework for decomposing images into layers of automatically discovered object models. Contrary to recent approaches that model image layers with autoencoder networks, we represent them as explicit transformations of a small set of prototypical images. Our model has three main components: (i) a set of object prototypes in the form of learnable images with a tra…

    Submitted 23 August, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted at ICCV 2021. Project webpage: https://imagine.enpc.fr/~monniert/DTI-Sprites

  20. arXiv:2104.09359

    cs.CV cs.RO

    Single-view robot pose and joint angle estimation via render & compare

    Authors: Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic

    Abstract: We introduce RoboPose, a method to estimate the joint angles and the 6D camera-to-robot pose of a known articulated robot from a single RGB image. This is an important problem to grant mobile and itinerant autonomous systems the ability to interact with other robots using only visual information in non-instrumented environments, especially in the context of collaborative robotics. It is also chall…

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Accepted at CVPR 2021 (Oral)

  21. docExtractor: An off-the-shelf historical document element extraction

    Authors: Tom Monnier, Mathieu Aubry

    Abstract: We present docExtractor, a generic approach for extracting visual elements such as text lines or illustrations from historical documents without requiring any real data annotation. We demonstrate it provides high-quality performances as an off-the-shelf system across a wide variety of datasets and leads to results on par with state-of-the-art when fine-tuned. We argue that the performance obtained…

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: Accepted at 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR) (oral). Project webpage: http://imagine.enpc.fr/~monniert/docExtractor/

  22. Learning to Guide Local Feature Matches

    Authors: François Darmon, Mathieu Aubry, Pascal Monasse

    Abstract: We tackle the problem of finding accurate and robust keypoint correspondences between images. We propose a learning-based approach to guide local feature matches via a learned approximate image matching. Our approach can boost the results of SIFT to a level similar to state-of-the-art deep descriptors, such as Superpoint, ContextDesc, or D2-Net and can improve performance for these descriptors. We…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted to 3DV 2020

  23. arXiv:2008.08465

    cs.CV

    CosyPose: Consistent multi-view multi-object 6D pose estimation

    Authors: Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic

    Abstract: We introduce an approach for recovering the 6D pose of multiple known objects in a scene captured by a set of input images with unknown camera viewpoints. First, we present a single-view single-object 6D pose estimation method, which we use to generate 6D object pose hypotheses. Second, we develop a robust method for matching individual 6D object pose hypotheses across different input images in or…

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  24. arXiv:2007.08872

    cs.CV

    Impact of base dataset design on few-shot image classification

    Authors: Othman Sbai, Camille Couprie, Mathieu Aubry

    Abstract: The quality and generality of deep image features is crucially determined by the data they have been trained on, but little is known about this often overlooked effect. In this paper, we systematically study the effect of variations in the training data by evaluating deep features trained on different image sets in a few-shot classification setting. The experimental protocol we define allows to ex…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 23 pages, 11 figures, to appear in ECCV 2020

  25. arXiv:2006.13382

    cs.LG stat.ML

    Spherical Perspective on Learning with Normalization Layers

    Authors: Simon Roburin, Yann de Mont-Marin, Andrei Bursuc, Renaud Marlet, Patrick Pérez, Mathieu Aubry

    Abstract: Normalization Layers (NLs) are widely used in modern deep-learning architectures. Despite their apparent simplicity, their effect on optimization is not yet fully understood. This paper introduces a spherical framework to study the optimization of neural networks with NLs from a geometric perspective. Concretely, the radial invariance of groups of parameters, such as filters for convolutional neur…

    Submitted 19 May, 2022; v1 submitted 23 June, 2020; originally announced June 2020.

  26. arXiv:2006.11132

    cs.CV cs.LG stat.ML

    Deep Transformation-Invariant Clustering

    Authors: Tom Monnier, Thibault Groueix, Mathieu Aubry

    Abstract: Recent advances in image clustering typically focus on learning better deep representations. In contrast, we present an orthogonal approach that does not rely on abstract features but instead learns to predict image transformations and performs clustering directly in image space. This learning process naturally fits in the gradient-based training of K-means and Gaussian mixture model, without requ…

    Submitted 27 October, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: Accepted at NeurIPS 2020 (oral). Project webpage: http://imagine.enpc.fr/~monniert/DTIClustering/

  27. arXiv:2004.01526

    cs.CV

    RANSAC-Flow: generic two-stage image alignment

    Authors: Xi Shen, François Darmon, Alexei A. Efros, Mathieu Aubry

    Abstract: This paper considers the generic problem of dense alignment between two images, whether they be two frames of a video, two widely different views of a scene, two paintings depicting similar content, etc. Whereas each such task is typically addressed with a domain-specific solution, we show that a simple unsupervised approach performs surprisingly well across a range of tasks. Our main insight is t…

    Submitted 17 July, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: Accepted to ECCV 2020 as a spotlight. Project page: http://imagine.enpc.fr/~shenx/RANSAC-Flow/

  28. arXiv:1908.10254

    cs.CV

    Large-Scale Historical Watermark Recognition: dataset and a new consistency-based approach

    Authors: Xi Shen, Ilaria Pastrolin, Oumayma Bounou, Spyros Gidaris, Marc Smith, Olivier Poncet, Mathieu Aubry

    Abstract: Historical watermark recognition is a highly practical, yet unsolved challenge for archivists and historians. With a large number of well-defined classes, cluttered and noisy samples, different types of representations, both subtle differences between classes and high intra-class variation, historical watermarks are also challenging for pattern recognition. In this paper, overcoming the difficulty…

    Submitted 27 August, 2019; originally announced August 2019.

  29. arXiv:1908.04725

    cs.CV cs.AI

    Learning elementary structures for 3D shape generation and matching

    Authors: Theo Deprelle, Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, Mathieu Aubry

    Abstract: We propose to represent shapes as the deformation and combination of learnable elementary 3D structures, which are primitives resulting from training over a collection of shapes. We demonstrate that the learned elementary 3D structures lead to clear improvements in 3D shape generation and matching. More precisely, we present two complementary approaches for learning elementary structures: (i) patch…

    Submitted 14 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

  30. arXiv:1907.03165

    cs.CV

    Unsupervised cycle-consistent deformation for shape matching

    Authors: Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, Mathieu Aubry

    Abstract: We propose a self-supervised approach to deep surface deformation. Given a pair of shapes, our algorithm directly predicts a parametric transformation from one shape to the other respecting correspondences. Our insight is to use cycle-consistency to define a notion of good correspondences in groups of objects and use it as a supervisory signal to train our network. Our method does not rely on a te…

    Submitted 6 July, 2019; originally announced July 2019.

  31. arXiv:1906.05105

    cs.CV

    Pose from Shape: Deep Pose Estimation for Arbitrary 3D Objects

    Authors: Yang Xiao, Xuchong Qiu, Pierre-Alain Langlois, Mathieu Aubry, Renaud Marlet

    Abstract: Most deep pose estimation methods need to be trained for specific object instances or categories. In this work we propose a completely generic deep pose estimation approach, which does not require the network to have been trained on relevant categories, nor objects in a category to have a canonical pose. We believe this is a crucial step to design robotic systems that can interact with new objects…

    Submitted 5 August, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

  32. arXiv:1904.10348

    cs.RO cs.CV

    Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning

    Authors: Yann Labbé, Sergey Zagoruyko, Igor Kalevatykh, Ivan Laptev, Justin Carpentier, Mathieu Aubry, Josef Sivic

    Abstract: We address the problem of visually guided rearrangement planning with many movable objects, i.e., finding a sequence of actions to move a set of objects from an initial arrangement to a desired one, while relying on visual inputs coming from an RGB camera. To do so, we introduce a complete pipeline relying on two key contributions. First, we introduce an efficient and scalable rearrangement planni…

    Submitted 1 April, 2020; v1 submitted 23 April, 2019; originally announced April 2019.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters (RA-L)

  33. arXiv:1903.02678

    cs.CV

    Discovering Visual Patterns in Art Collections with Spatially-consistent Feature Learning

    Authors: Xi Shen, Alexei A. Efros, Mathieu Aubry

    Abstract: Our goal in this paper is to discover near duplicate patterns in large collections of artworks. This is harder than standard instance mining due to differences in the artistic media (oil, pastel, drawing, etc), and imperfections inherent in the copying process. The key technical insight is to adapt a standard deep feature to this task by fine-tuning it on the specific art collection using self-sup…

    Submitted 8 March, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

  34. Virtual Training for a Real Application: Accurate Object-Robot Relative Localization without Calibration

    Authors: Vianney Loing, Renaud Marlet, Mathieu Aubry

    Abstract: Localizing an object accurately with respect to a robot is a key step for autonomous robotic manipulation. In this work, we propose to tackle this task knowing only 3D models of the robot and object in the particular case where the scene is viewed from uncalibrated cameras -- a situation which would be typical in an uncontrolled environment, e.g., on a construction site. We demonstrate that this l…

    Submitted 7 February, 2019; originally announced February 2019.

    Journal ref: Int J Comput Vis (2018) 126: 1045

  35. arXiv:1812.05484

    cs.CV

    Unsupervised Image Decomposition in Vector Layers

    Authors: Othman Sbai, Camille Couprie, Mathieu Aubry

    Abstract: Deep image generation is becoming a tool to enhance artists' and designers' creativity potential. In this paper, we aim at making the generation process more structured and easier to interact with. Inspired by vector graphics systems, we propose a new deep image reconstruction paradigm where the outputs are composed from simple layers, defined by their color and a vector transparency mask. This pres…

    Submitted 7 July, 2019; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: 10 pages, 10 figures

  36. arXiv:1806.05228

    cs.CV

    3D-CODED : 3D Correspondences by Deep Deformation

    Authors: Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, Mathieu Aubry

    Abstract: We present a new deep learning approach for matching deformable shapes by introducing {\it Shape Deformation Networks} which jointly encode 3D shapes and correspondences. This is achieved by factoring the surface representation into (i) a template, that parameterizes the surface, and (ii) a learnt global feature vector that parameterizes the transformation of the template into the input surface. B…

    Submitted 27 July, 2018; v1 submitted 13 June, 2018; originally announced June 2018.

  37. arXiv:1802.05384

    cs.CV

    AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation

    Authors: Thibault Groueix, Matthew Fisher, Vladimir G. Kim, Bryan C. Russell, Mathieu Aubry

    Abstract: We introduce a method for learning to generate the surface of 3D shapes. Our approach represents a 3D shape as a collection of parametric surface elements and, in contrast to methods generating voxel grids or point clouds, naturally infers a surface representation of the shape. Beyond its novelty, our new shape generation framework, AtlasNet, comes with significant advantages, such as improved pre…

    Submitted 20 July, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

  38. arXiv:1707.08390

    cs.GR cs.CV

    3D Sketching using Multi-View Deep Volumetric Prediction

    Authors: Johanna Delanoy, Mathieu Aubry, Phillip Isola, Alexei A. Efros, Adrien Bousseau

    Abstract: Sketch-based modeling strives to bring the ease and immediacy of drawing to the 3D world. However, while drawings are easy for humans to create, they are very challenging for computers to interpret due to their sparsity and ambiguity. We propose a data-driven approach that tackles this challenge by learning to reconstruct 3D shapes from one or more drawings. At the core of our approach is a deep c…

    Submitted 19 June, 2018; v1 submitted 26 July, 2017; originally announced July 2017.

    Comments: See our accompanying video on https://youtu.be/DGIYzmlm2pQ, networks and databases available at https://ns.inria.fr/d3/3DSketching/. To appear in PACMCGIT

    ACM Class: I.3.5

  39. arXiv:1609.03894

    cs.CV cs.LG cs.NE

    Crafting a multi-task CNN for viewpoint estimation

    Authors: Francisco Massa, Renaud Marlet, Mathieu Aubry

    Abstract: Convolutional Neural Networks (CNNs) were recently shown to provide state-of-the-art results for object category viewpoint estimation. However different ways of formulating this problem have been proposed and the competing approaches have been explored with very different design choices. This paper presents a comparison of these approaches in a unified setting as well as a detailed analysis of the…

    Submitted 13 September, 2016; originally announced September 2016.

    Comments: To appear in BMVC 2016

  40. arXiv:1604.05383

    cs.CV

    Learning Dense Correspondence via 3D-guided Cycle Consistency

    Authors: Tinghui Zhou, Philipp Krähenbühl, Mathieu Aubry, Qixing Huang, Alexei A. Efros

    Abstract: Discriminative deep learning approaches have shown impressive results for problems where human-labeled ground truth is plentiful, but what about tasks where labels are difficult or impossible to obtain? This paper tackles one such problem: establishing dense visual correspondence across different object instances. For this task, although we do not know what the ground-truth is, we know it should b…

    Submitted 18 April, 2016; originally announced April 2016.

    Comments: To appear in CVPR 2016 (oral presentation)

  41. arXiv:1512.02497

    cs.CV cs.LG cs.NE

    Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views

    Authors: Francisco Massa, Bryan Russell, Mathieu Aubry

    Abstract: This paper presents an end-to-end convolutional neural network (CNN) for 2D-3D exemplar detection. We demonstrate that the ability to adapt the features of natural images to better align with those of CAD rendered views is critical to the success of our technique. We show that the adaptation can be learned by compositing rendered views of textured object models on natural images. Our approach can…

    Submitted 18 April, 2016; v1 submitted 8 December, 2015; originally announced December 2015.

    Comments: To appear in CVPR 2016

  42. arXiv:1506.01151

    cs.CV

    Understanding deep features with computer-generated imagery

    Authors: Mathieu Aubry, Bryan Russell

    Abstract: We introduce an approach for analyzing the variation of features generated by convolutional neural networks (CNNs) with respect to scene factors that occur in natural images. Such factors may include object style, 3D viewpoint, color, and scene lighting configuration. Our approach analyzes CNN feature responses corresponding to different scene factors by controlling for them via rendering using a…

    Submitted 3 June, 2015; originally announced June 2015.

  43. arXiv:1412.7190

    cs.CV cs.LG cs.NE

    Convolutional Neural Networks for joint object detection and pose estimation: A comparative study

    Authors: Francisco Massa, Mathieu Aubry, Renaud Marlet

    Abstract: In this paper we study the application of convolutional neural networks for jointly detecting objects depicted in still images and estimating their 3D pose. We identify different feature representations of oriented objects, and energies that lead a network to learn these representations. The choice of the representation is crucial since the pose of an object has a natural, continuous structure whil…

    Submitted 28 February, 2015; v1 submitted 22 December, 2014; originally announced December 2014.