Showing 1–8 of 8 results for author: Luvizon, D C

  1. arXiv:2110.09380  [pdf, other]

    cs.CV

    Learning multiplane images from single views with self-supervision

    Authors: Gustavo Sutter P. Carvalho, Diogo C. Luvizon, Antonio Joia Neto, Andre G. C. Pacheco, Otavio A. B. Penatti

    Abstract: Generating static novel views from an already captured image is a hard task in computer vision and graphics, in particular when the single input image has dynamic parts such as persons or moving objects. In this paper, we tackle this problem by proposing a new framework, called CycleMPI, that is capable of learning a multiplane image representation from single images through a cyclic training stra…

    Submitted 19 October, 2021; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: To appear at BMVC 2021

  2. arXiv:2102.11771  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Improving Deep Learning Sound Events Classifiers using Gram Matrix Feature-wise Correlations

    Authors: Antonio Joia Neto, Andre G C Pacheco, Diogo C Luvizon

    Abstract: In this paper, we propose a new Sound Event Classification (SEC) method which is inspired by recent works on out-of-distribution detection. In our method, we analyse all the activations of a generic CNN in order to produce feature representations using Gram Matrices. The similarity metrics are evaluated considering all possible classes, and the final prediction is defined as the class that minimi…

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: To appear at ICASSP 2021

  3. arXiv:2011.13317  [pdf, other]

    cs.CV

    Adaptive Multiplane Image Generation from a Single Internet Picture

    Authors: Diogo C. Luvizon, Gustavo Sutter P. Carvalho, Andreza A. dos Santos, Jhonatas S. Conceicao, Jose L. Flores-Campana, Luis G. L. Decker, Marcos R. Souza, Helio Pedrini, Antonio Joia, Otavio A. B. Penatti

    Abstract: In the last few years, several works have tackled the problem of novel view synthesis from stereo images or even from a single picture. However, previous methods are computationally expensive, especially for high-resolution images. In this paper, we address the problem of generating a multiplane image (MPI) from a single high-resolution picture. We present the adaptive-MPI representation, which all…

    Submitted 26 November, 2020; originally announced November 2020.

  4. Parallax Motion Effect Generation Through Instance Segmentation And Depth Estimation

    Authors: Allan Pinto, Manuel A. Córdova, Luis G. L. Decker, Jose L. Flores-Campana, Marcos R. Souza, Andreza A. dos Santos, Jhonatas S. Conceição, Henrique F. Gagliardi, Diogo C. Luvizon, Ricardo da S. Torres, Helio Pedrini

    Abstract: Stereo vision is a growing topic in computer vision due to the innumerable opportunities and applications this technology offers for the development of modern solutions, such as virtual and augmented reality applications. To enhance the user's experience in three-dimensional virtual environments, the motion parallax estimation is a promising technique to achieve this objective. In this paper, we p…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 2020 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 2020, pp. 1621-1625

  5. Multi-task Deep Learning for Real-Time 3D Human Pose Estimation and Action Recognition

    Authors: Diogo C Luvizon, Hedi Tabia, David Picard

    Abstract: Human pose estimation and action recognition are related tasks since both problems are strongly dependent on the human body representation and analysis. Nonetheless, most recent methods in the literature handle the two problems separately. In this work, we propose a multi-task framework for jointly estimating 2D or 3D human poses from monocular color images and classifying human actions from video…

    Submitted 3 March, 2020; v1 submitted 14 December, 2019; originally announced December 2019.

    Comments: Accepted to TPAMI. arXiv admin note: text overlap with arXiv:1802.09232

  6. arXiv:1911.09245  [pdf, other]

    cs.CV

    Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates

    Authors: Diogo C Luvizon, Hedi Tabia, David Picard

    Abstract: 3D human pose estimation is frequently seen as the task of estimating 3D poses relative to the root body joint. Alternatively, we propose a 3D human pose estimation method in camera coordinates, which allows effective combination of 2D annotated data and 3D poses and a straightforward multi-view generalization. To that end, we cast the problem as a view frustum space pose estimation, where absolut…

    Submitted 20 August, 2021; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: Source code is available at https://github.com/dluvizon/3d-pose-consensus

  7. arXiv:1802.09232  [pdf, other]

    cs.CV

    2D/3D Pose Estimation and Action Recognition using Multitask Deep Learning

    Authors: Diogo C. Luvizon, David Picard, Hedi Tabia

    Abstract: Action recognition and human pose estimation are closely related, but both problems are generally handled as distinct tasks in the literature. In this work, we propose a multitask framework for joint 2D and 3D pose estimation from still images and human action recognition from video sequences. We show that a single architecture can be used to solve the two problems in an efficient way and still a…

    Submitted 21 March, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

    Comments: To appear in CVPR 2018

  8. arXiv:1710.02322  [pdf, other]

    cs.CV

    Human Pose Regression by Combining Indirect Part Detection and Contextual Information

    Authors: Diogo C. Luvizon, Hedi Tabia, David Picard

    Abstract: In this paper, we propose an end-to-end trainable regression approach for human pose estimation from still images. We use the proposed Soft-argmax function to convert feature maps directly to joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat map representations indirectly, without additional steps of artificial ground truth generation. Consequently…

    Submitted 6 October, 2017; originally announced October 2017.