
Showing 1–17 of 17 results for author: Papandreou, G

Searching in archive cs.
  1. arXiv:2406.10180  [pdf, other]

    cs.CV

    MeshPose: Unifying DensePose and 3D Body Mesh reconstruction

    Authors: Eric-Tuan Lê, Antonis Kakolyris, Petros Koutras, Himmy Tam, Efstratios Skordos, George Papandreou, Rıza Alp Güler, Iasonas Kokkinos

    Abstract: DensePose provides a pixel-accurate association of images with 3D mesh coordinates, but does not provide a 3D mesh, while Human Mesh Reconstruction (HMR) systems have high 2D reprojection error, as measured by DensePose localization metrics. In this work we introduce MeshPose to jointly tackle DensePose and HMR. For this we first introduce new losses that allow us to use weak DensePose supervision…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

    MSC Class: 68; ACM Class: I.2.10

    Journal ref: CVPR 2024

  2. arXiv:1905.12162  [pdf, other]

    cs.CV

    Volumetric Capture of Humans with a Single RGBD Camera via Semi-Parametric Learning

    Authors: Rohit Pandey, Anastasia Tkach, Shuoran Yang, Pavel Pidlypenskyi, Jonathan Taylor, Ricardo Martin-Brualla, Andrea Tagliasacchi, George Papandreou, Philip Davidson, Cem Keskin, Shahram Izadi, Sean Fanello

    Abstract: Volumetric (4D) performance capture is fundamental for AR/VR content generation. Whereas previous work in 4D performance capture has shown impressive results in studio settings, the technology is still far from being accessible to a typical consumer who, at best, might own a single RGBD sensor. Thus, in this work, we propose a method to synthesize free viewpoint renderings using a single RGBD came…

    Submitted 28 May, 2019; originally announced May 2019.

  3. arXiv:1902.05093  [pdf, other]

    cs.CV

    DeeperLab: Single-Shot Image Parser

    Authors: Tien-Ju Yang, Maxwell D. Collins, Yukun Zhu, Jyh-Jing Hwang, Ting Liu, Xiao Zhang, Vivienne Sze, George Papandreou, Liang-Chieh Chen

    Abstract: We present a single-shot, bottom-up approach for whole image parsing. Whole image parsing, also known as Panoptic Segmentation, generalizes the tasks of semantic segmentation for 'stuff' classes and instance segmentation for 'thing' classes, assigning both semantic and instance labels to every pixel in an image. Recent approaches to whole image parsing typically employ separate standalone modules…

    Submitted 12 March, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: 20 pages. The code of the proposed Parsing Covering metric is available at http://deeperlab.mit.edu

  4. arXiv:1809.04184  [pdf, other]

    cs.CV cs.LG stat.ML

    Searching for Efficient Multi-Scale Architectures for Dense Image Prediction

    Authors: Liang-Chieh Chen, Maxwell D. Collins, Yukun Zhu, George Papandreou, Barret Zoph, Florian Schroff, Hartwig Adam, Jonathon Shlens

    Abstract: The design of neural network architectures is an important component for achieving state-of-the-art performance with machine learning systems across a broad array of tasks. Much work has endeavored to design and build architectures automatically through clever construction of a search space paired with simple learning algorithms. Recent progress has demonstrated that such meta-learning methods may…

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: Accepted by NIPS 2018

  5. arXiv:1803.08225  [pdf, other]

    cs.CV

    PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

    Authors: George Papandreou, Tyler Zhu, Liang-Chieh Chen, Spyros Gidaris, Jonathan Tompson, Kevin Murphy

    Abstract: We present a box-free bottom-up approach for the tasks of pose estimation and instance segmentation of people in multi-person images using an efficient single-shot model. The proposed PersonLab model tackles both semantic-level reasoning and object-part associations using part-based modeling. Our model employs a convolutional network which learns to detect individual keypoints and predict their re…

    Submitted 22 March, 2018; originally announced March 2018.

    Comments: Person detection and pose estimation, segmentation and grouping

  6. arXiv:1802.02611  [pdf, other]

    cs.CV

    Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

    Authors: Liang-Chieh Chen, Yukun Zhu, George Papandreou, Florian Schroff, Hartwig Adam

    Abstract: Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view, while the latter networks can capture sharper object boundaries by gradually…

    Submitted 22 August, 2018; v1 submitted 7 February, 2018; originally announced February 2018.

    Comments: ECCV 2018 camera ready

  7. arXiv:1712.04837  [pdf, other]

    cs.CV

    MaskLab: Instance Segmentation by Refining Object Detection with Semantic and Direction Features

    Authors: Liang-Chieh Chen, Alexander Hermans, George Papandreou, Florian Schroff, Peng Wang, Hartwig Adam

    Abstract: In this work, we tackle the problem of instance segmentation, the task of simultaneously solving object detection and semantic segmentation. Towards this goal, we present a model, called MaskLab, which produces three outputs: box detection, semantic segmentation, and direction prediction. Building on top of the Faster-RCNN object detector, the predicted boxes provide accurate localization of objec…

    Submitted 13 December, 2017; originally announced December 2017.

    Comments: 10 pages including reference

  8. arXiv:1706.05587  [pdf, other]

    cs.CV

    Rethinking Atrous Convolution for Semantic Image Segmentation

    Authors: Liang-Chieh Chen, George Papandreou, Florian Schroff, Hartwig Adam

    Abstract: In this work, we revisit atrous convolution, a powerful tool to explicitly adjust filter's field-of-view as well as control the resolution of feature responses computed by Deep Convolutional Neural Networks, in the application of semantic image segmentation. To handle the problem of segmenting objects at multiple scales, we design modules which employ atrous convolution in cascade or in parallel t…

    Submitted 5 December, 2017; v1 submitted 17 June, 2017; originally announced June 2017.

    Comments: Add more experimental results

  9. arXiv:1701.01779  [pdf, other]

    cs.CV

    Towards Accurate Multi-person Pose Estimation in the Wild

    Authors: George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

    Abstract: We propose a method for multi-person detection and 2-D pose estimation that achieves state-of-art results on the challenging COCO keypoints task. It is a simple, yet powerful, top-down approach consisting of two stages. In the first stage, we predict the location and scale of boxes which are likely to contain people; for this we use the Faster RCNN detector. In the second stage, we estimate the…

    Submitted 14 April, 2017; v1 submitted 6 January, 2017; originally announced January 2017.

    Comments: Paper describing an improved version of the G-RMI entry to the 2016 COCO keypoints challenge (http://image-net.org/challenges/ilsvrc+coco2016). Camera ready version to appear in the Proceedings of CVPR 2017

  10. arXiv:1606.00915  [pdf, other]

    cs.CV

    DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

    Authors: Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, Alan L. Yuille

    Abstract: In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature respon…

    Submitted 11 May, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: Accepted by TPAMI

  11. arXiv:1511.03328  [pdf, other]

    cs.CV

    Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform

    Authors: Liang-Chieh Chen, Jonathan T. Barron, George Papandreou, Kevin Murphy, Alan L. Yuille

    Abstract: Deep convolutional neural networks (CNNs) are the backbone of state-of-art semantic image segmentation systems. Recent work has shown that complementing CNNs with fully-connected conditional random fields (CRFs) can significantly enhance their object localization accuracy, yet dense CRF inference is computationally expensive. We propose replacing the fully-connected CRF with domain transform (DT),…

    Submitted 1 June, 2016; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: 14 pages. Accepted to appear at CVPR 2016

  12. arXiv:1505.02438  [pdf, other]

    cs.CV

    Deep Learning for Semantic Part Segmentation with High-Level Guidance

    Authors: S. Tsogkas, I. Kokkinos, G. Papandreou, A. Vedaldi

    Abstract: In this work we address the task of segmenting an object into its parts, or semantic part segmentation. We start by adapting a state-of-the-art semantic segmentation system to this task, and show that a combination of a fully-convolutional Deep CNN system coupled with Dense CRF labelling provides excellent results for a broad range of object categories. Still, this approach remains agnostic to hig…

    Submitted 24 November, 2015; v1 submitted 10 May, 2015; originally announced May 2015.

    Comments: 11 pages (including references), 3 figures, 2 tables

  13. arXiv:1502.02734  [pdf, other]

    cs.CV

    Weakly- and Semi-Supervised Learning of a DCNN for Semantic Image Segmentation

    Authors: George Papandreou, Liang-Chieh Chen, Kevin Murphy, Alan L. Yuille

    Abstract: Deep convolutional neural networks (DCNNs) trained on a large number of images with strong pixel-level annotations have recently significantly pushed the state-of-art in semantic image segmentation. We study the more challenging problem of learning DCNNs for semantic image segmentation from either (1) weakly annotated training data such as bounding boxes or image-level labels or (2) a combination…

    Submitted 5 October, 2015; v1 submitted 9 February, 2015; originally announced February 2015.

    Comments: Accepted to ICCV 2015

  14. arXiv:1412.7062  [pdf, other]

    cs.CV cs.LG cs.NE

    Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

    Authors: Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, Alan L. Yuille

    Abstract: Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together methods from DCNNs and probabilistic graphical models for addressing the task of pixel-level classification (also called "semantic image segmentation"). We show that responses at the final layer of DCNNs…

    Submitted 7 June, 2016; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: 14 pages. Updated related work

  15. arXiv:1412.0296  [pdf, ps, other]

    cs.CV

    Untangling Local and Global Deformations in Deep Convolutional Networks for Image Classification and Sliding Window Detection

    Authors: George Papandreou, Iasonas Kokkinos, Pierre-André Savalle

    Abstract: Deep Convolutional Neural Networks (DCNNs) commonly use generic `max-pooling' (MP) layers to extract deformation-invariant features, but we argue in favor of a more refined treatment. First, we introduce epitomic convolution as a building block alternative to the common convolution-MP cascade of DCNNs; while having identical complexity to MP, Epitomic Convolution allows for parameter sharing acros…

    Submitted 30 November, 2014; originally announced December 2014.

    Comments: 13 pages, 7 figures, 5 tables. arXiv admin note: substantial text overlap with arXiv:1406.2732

  16. arXiv:1406.2732  [pdf, ps, other]

    cs.CV cs.LG

    Deep Epitomic Convolutional Neural Networks

    Authors: George Papandreou

    Abstract: Deep convolutional neural networks have recently proven extremely competitive in challenging image recognition tasks. This paper proposes the epitomic convolution as a new building block for deep neural networks. An epitomic convolution layer replaces a pair of consecutive convolution and max-pooling layers found in standard deep convolutional neural networks. The main version of the proposed mode…

    Submitted 10 June, 2014; originally announced June 2014.

    Comments: 9 pages

  17. arXiv:1107.4637  [pdf, ps, other]

    cs.CV cs.IT stat.ML

    Efficient variational inference in large-scale Bayesian compressed sensing

    Authors: George Papandreou, Alan Yuille

    Abstract: We study linear models under heavy-tailed priors from a probabilistic viewpoint. Instead of computing a single sparse most probable (MAP) solution as in standard deterministic approaches, the focus in the Bayesian compressed sensing framework shifts towards capturing the full posterior distribution on the latent variables, which allows quantifying the estimation uncertainty and learning model para…

    Submitted 4 September, 2011; v1 submitted 22 July, 2011; originally announced July 2011.

    Comments: 8 pages, 3 figures, appears in Proc. IEEE Workshop on Information Theory in Computer Vision and Pattern Recognition (in conjunction with ICCV-11), Barcelona, Spain, Nov. 2011

    Journal ref: Proc. IEEE Workshop on Information Theory in Computer Vision and Pattern Recognition (in conjunction with ICCV-11), pp. 1332-1339, Barcelona, Spain, Nov. 2011