Skip to main content

Showing 1–39 of 39 results for author: Kokkinos, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10180  [pdf, other

    cs.CV

    MeshPose: Unifying DensePose and 3D Body Mesh reconstruction

    Authors: Eric-Tuan Lê, Antonis Kakolyris, Petros Koutras, Himmy Tam, Efstratios Skordos, George Papandreou, Rıza Alp Güler, Iasonas Kokkinos

    Abstract: DensePose provides a pixel-accurate association of images with 3D mesh coordinates, but does not provide a 3D mesh, while Human Mesh Reconstruction (HMR) systems have high 2D reprojection error, as measured by DensePose localization metrics. In this work we introduce MeshPose to jointly tackle DensePose and HMR. For this we first introduce new losses that allow us to use weak DensePose supervision… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

    MSC Class: 68 ACM Class: I.2.10

    Journal ref: CVPR 2024

  2. arXiv:2210.09446  [pdf, other

    cs.CV cs.AI cs.LG q-bio.NC

    Deformably-Scaled Transposed Convolution

    Authors: Stefano B. Blumberg, Daniele Raví, Mou-Cheng Xu, Matteo Figini, Iasonas Kokkinos, Daniel C. Alexander

    Abstract: Transposed convolution is crucial for generating high-resolution outputs, yet has received little attention compared to convolution layers. In this work we revisit transposed convolution and introduce a novel layer that allows us to place information in the image selectively and choose the `stroke breadth' at which the image is synthesized, whilst incurring a small additional parameter cost. For t… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  3. arXiv:2202.07778  [pdf, other

    cs.CV cs.LG

    Beyond Deterministic Translation for Unsupervised Domain Adaptation

    Authors: Eleni Chiou, Eleftheria Panagiotaki, Iasonas Kokkinos

    Abstract: In this work we challenge the common approach of using a one-to-one map** ('translation') between the source and target domains in unsupervised domain adaptation (UDA). Instead, we rely on stochastic translation to capture inherent translation ambiguities. This allows us to (i) train more accurate target networks by generating multiple outputs conditioned on the same source image, leveraging bot… ▽ More

    Submitted 20 November, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Accepted at BMVC 2022. Code is available at https://github.com/elchiou/Beyond-deterministic-translation-for-UDA

  4. arXiv:2109.09736  [pdf, other

    eess.IV cs.CV cs.LG

    Unsupervised Domain Adaptation with Semantic Consistency across Heterogeneous Modalities for MRI Prostate Lesion Segmentation

    Authors: Eleni Chiou, Francesco Giganti, Shonit Punwani, Iasonas Kokkinos, Eleftheria Panagiotaki

    Abstract: Any novel medical imaging modality that differs from previous protocols e.g. in the number of imaging channels, introduces a new domain that is heterogeneous from previous ones. This common medical imaging scenario is rarely considered in the domain adaptation literature, which handles shifts across domains of the same dimensionality. In our work we rely on stochastic generative modeling to transl… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: Accepted at MICCAI 2021 Workshop on Domain Adaptation and Representation Transfer (DART). arXiv admin note: text overlap with arXiv:2010.07411

  5. arXiv:2106.05662  [pdf, other

    cs.CV

    To The Point: Correspondence-driven monocular 3D category reconstruction

    Authors: Filippos Kokkinos, Iasonas Kokkinos

    Abstract: We present To The Point (TTP), a method for reconstructing 3D objects from a single image using 2D to 3D correspondences learned from weak supervision. We recover a 3D shape from a 2D image by first regressing the 2D positions corresponding to the 3D template vertices and then jointly estimating a rigid camera transform and non-rigid template deformation that optimally explain the 2D positions thr… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

  6. arXiv:2103.16352  [pdf, other

    cs.CV cs.GR cs.LG

    Learning monocular 3D reconstruction of articulated categories from motion

    Authors: Filippos Kokkinos, Iasonas Kokkinos

    Abstract: Monocular 3D reconstruction of articulated object categories is challenging due to the lack of training data and the inherent ill-posedness of the problem. In this work we use video self-supervision, forcing the consistency of consecutive 3D reconstructions by a motion-based cycle loss. This largely improves both optimization-based and learning-based 3D mesh reconstruction. We further introduce an… ▽ More

    Submitted 27 April, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR2021. For project website see https://fkokkinos.github.io/video_3d_reconstruction/

  7. arXiv:2010.07411  [pdf, other

    cs.CV cs.LG eess.IV

    Harnessing Uncertainty in Domain Adaptation for MRI Prostate Lesion Segmentation

    Authors: Eleni Chiou, Francesco Giganti, Shonit Punwani, Iasonas Kokkinos, Eleftheria Panagiotaki

    Abstract: The need for training data can impede the adoption of novel imaging modalities for learning-based medical image analysis. Domain adaptation methods partially mitigate this problem by translating training data from a related source domain to a novel target domain, but typically assume that a one-to-one translation is possible. Our work addresses the challenge of adapting to a more informative targe… ▽ More

    Submitted 18 January, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted at MICCAI 2020. Code is available at https://github.com/elchiou/DA

  8. arXiv:2008.10041  [pdf, other

    cs.CV cs.LG

    Holistic Multi-View Building Analysis in the Wild with Projection Pooling

    Authors: Zbigniew Wojna, Krzysztof Maziarz, Łukasz Jocz, Robert Pałuba, Robert Kozikowski, Iasonas Kokkinos

    Abstract: We address six different classification tasks related to fine-grained building attributes: construction type, number of floors, pitch and geometry of the roof, facade material, and occupancy class. Tackling such a remote building analysis problem became possible only recently due to growing large-scale datasets of urban scenes. To this end, we introduce a new benchmarking dataset, consisting of 49… ▽ More

    Submitted 19 December, 2020; v1 submitted 23 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at the 35th AAAI Conference on Artificial Intelligence (AAAI 2021)

  9. arXiv:2004.01946  [pdf, other

    cs.CV

    Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

    Authors: Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos, Michael Bronstein, Stefanos Zafeiriou

    Abstract: We introduce a simple and effective network architecture for monocular 3D hand pose estimation consisting of an image encoder followed by a mesh convolutional decoder that is trained through a direct 3D hand mesh reconstruction loss. We train our network by gathering a large-scale dataset of hand action in YouTube videos and use it as a source of weak supervision. Our weakly-supervised mesh convol… ▽ More

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2020). Additional resources: https://arielai.com/mesh_hands

  10. arXiv:1907.00960  [pdf, other

    cs.CV cs.GR

    Going Deeper with Lean Point Networks

    Authors: Eric-Tuan Le, Iasonas Kokkinos, Niloy J. Mitra

    Abstract: In this work we introduce Lean Point Networks (LPNs) to train deeper and more accurate point processing networks by relying on three novel point processing blocks that improve memory consumption, inference time, and accuracy: a convolution-type block for point sets that blends neighborhood information in a memory-efficient manner; a crosslink block that efficiently shares information across low- a… ▽ More

    Submitted 16 June, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: 16 pages, 11 figures, 9 tables

    MSC Class: 68T45 ACM Class: I.2.10; I.3.0; I.4.8

  11. arXiv:1906.07496  [pdf

    eess.IV cs.CV

    Deep Learning Enhanced Extended Depth-of-Field for Thick Blood-Film Malaria High-Throughput Microscopy

    Authors: Petru Manescu, Lydia Neary- Zajiczek, Michael J. Shaw, Muna Elmi, Remy Claveau, Vijay Pawar, John Shawe-Taylor, Iasonas Kokkinos, Mandayam A. Srinivasan, Ikeoluwa Lagunju, Olugbemiro Sodeinde, Biobele J. Brown, Delmiro Fernandez-Reyes

    Abstract: Fast accurate diagnosis of malaria is still a global health challenge for which automated digital-pathology approaches could provide scalable solutions amenable to be deployed in low-to-middle income countries. Here we address the problem of Extended Depth-of-Field (EDoF) in thick blood film microscopy for rapid automated malaria diagnosis. High magnification oil-objectives (100x) with large numer… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: 10 pages, 4 figures

  12. arXiv:1906.05706  [pdf, other

    cs.CV

    Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues

    Authors: Natalia Neverova, James Thewlis, Rıza Alp Güler, Iasonas Kokkinos, Andrea Vedaldi

    Abstract: DensePose supersedes traditional landmark detectors by densely map** image pixels to body surface coordinates. This power, however, comes at a greatly increased annotation time, as supervising the model requires to manually label hundreds of points per pose instance. In this work, we thus seek methods to significantly slim down the DensePose annotations, proposing more efficient data collection… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: CVPR 2019

  13. arXiv:1904.11960  [pdf, other

    cs.CV

    Lifting AutoEncoders: Unsupervised Learning of a Fully-Disentangled 3D Morphable Model using Deep Non-Rigid Structure from Motion

    Authors: Mihir Sahasrabudhe, Zhixin Shu, Edward Bartrum, Riza Alp Guler, Dimitris Samaras, Iasonas Kokkinos

    Abstract: In this work we introduce Lifting Autoencoders, a generative 3D surface-based model of object categories. We bring together ideas from non-rigid structure from motion, image formation, and morphable models to learn a controllable, geometric model of 3D categories in an entirely unsupervised manner from an unstructured set of images. We exploit the 3D geometric nature of our model and use normal in… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: 19 pages; 12 figures; code will be released; Project page: https://msahasrabudhe.github.io/projects/lae/

  14. arXiv:1904.08918  [pdf, other

    cs.CV

    Attentive Single-Tasking of Multiple Tasks

    Authors: Kevis-Kokitsi Maninis, Ilija Radosavovic, Iasonas Kokkinos

    Abstract: In this work we address task interference in universal networks by considering that a network is trained on multiple tasks, but performs one task at a time, an approach we refer to as "single-tasking multiple tasks". The network thus modifies its behaviour through task-dependent feature adaptation, or task attention. This gives the network the ability to accentuate the features that are adapted to… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

    Comments: CVPR 2019 Camera Ready

  15. arXiv:1902.05509  [pdf, other

    cs.CV

    MultiGrain: a unified image embedding for classes and instances

    Authors: Maxim Berman, Hervé Jégou, Andrea Vedaldi, Iasonas Kokkinos, Matthijs Douze

    Abstract: MultiGrain is a network architecture producing compact vector representations that are suited both for image classification and particular object retrieval. It builds on a standard classification trunk. The top of the network produces an embedding containing coarse and fine-grained information, so that images can be recognized based on the object class, particular object, or if they are distorted… ▽ More

    Submitted 3 April, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

  16. arXiv:1809.01995  [pdf, other

    cs.CV

    Dense Pose Transfer

    Authors: Natalia Neverova, Riza Alp Guler, Iasonas Kokkinos

    Abstract: In this work we integrate ideas from surface-based modeling with neural synthesis: we propose a combination of surface-based pose estimation and deep generative models that allows us to perform accurate pose transfer, i.e. synthesize a new image of a person based on a single image of that person and the image of a pose donor. We use a dense pose estimation system that maps pixels from both images… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: ECCV 2018

  17. arXiv:1808.05577  [pdf, other

    cs.CV cs.AI cs.LG q-bio.NC

    Deeper Image Quality Transfer: Training Low-Memory Neural Networks for 3D Images

    Authors: Stefano B. Blumberg, Ryutaro Tanno, Iasonas Kokkinos, Daniel C. Alexander

    Abstract: In this paper we address the memory demands that come with the processing of 3-dimensional, high-resolution, multi-channeled medical images in deep learning. We exploit memory-efficient backpropagation techniques, to reduce the memory complexity of network training from being linear in the network's depth, to being roughly constant $ - $ permitting us to elongate deep architectures with negligible… ▽ More

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: Accepted in: MICCAI 2018

  18. arXiv:1807.03148  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Spatio-Temporal Random Fields for Efficient Video Segmentation

    Authors: Siddhartha Chandra, Camille Couprie, Iasonas Kokkinos

    Abstract: In this work we introduce a time- and memory-efficient method for structured prediction that couples neuron decisions across both space at time. We show that we are able to perform exact and efficient inference on a densely connected spatio-temporal graph by capitalizing on recent advances on deep Gaussian Conditional Random Fields (GCRFs). Our method, called VideoGCRF is (a) efficient, (b) has a… ▽ More

    Submitted 3 July, 2018; originally announced July 2018.

    Comments: CVPR 2018

  19. arXiv:1806.06503  [pdf, other

    cs.CV

    Deforming Autoencoders: Unsupervised Disentangling of Shape and Appearance

    Authors: Zhixin Shu, Mihir Sahasrabudhe, Alp Guler, Dimitris Samaras, Nikos Paragios, Iasonas Kokkinos

    Abstract: In this work we introduce Deforming Autoencoders, a generative model for images that disentangles shape from appearance in an unsupervised manner. As in the deformable template paradigm, shape is represented as a deformation between a canonical coordinate system (`template') and an observed image, while appearance is modeled in `canonical', template, coordinates, thus discarding variability due to… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 17 pages including references, plus 12 pages appendix. Video available at : https://youtu.be/Oi7pyxKkF1g Code will be made available soon

  20. arXiv:1803.02188  [pdf, other

    cs.CV

    DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

    Authors: Riza Alp Guler, Yuxiang Zhou, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos

    Abstract: In this work we use deep learning to establish dense correspondences between a 3D object model and an image "in the wild". We introduce "DenseReg", a fully-convolutional neural network (F-CNN) that densely regresses at every foreground pixel a pair of U-V template coordinates in a single feedforward pass. To train DenseReg we construct a supervision signal by combining 3D deformable model fitting… ▽ More

    Submitted 11 March, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1612.01202

  21. arXiv:1802.00434  [pdf, other

    cs.CV

    DensePose: Dense Human Pose Estimation In The Wild

    Authors: Rıza Alp Güler, Natalia Neverova, Iasonas Kokkinos

    Abstract: In this work, we establish dense correspondences between RGB image and a surface-based representation of the human body, a task we refer to as dense human pose estimation. We first gather dense correspondences for 50K persons appearing in the COCO dataset by introducing an efficient annotation pipeline. We then use our dataset to train CNN-based systems that deliver dense correspondence 'in the wi… ▽ More

    Submitted 1 February, 2018; originally announced February 2018.

  22. arXiv:1711.01161  [pdf, other

    cs.CL

    Learning Filterbanks from Raw Speech for Phone Recognition

    Authors: Neil Zeghidour, Nicolas Usunier, Iasonas Kokkinos, Thomas Schatz, Gabriel Synnaeve, Emmanuel Dupoux

    Abstract: We train a bank of complex filters that operates on the raw waveform and is fed into a convolutional neural network for end-to-end phone recognition. These time-domain filterbanks (TD-filterbanks) are initialized as an approximation of mel-filterbanks, and then fine-tuned jointly with the remaining convolutional architecture. We perform phone recognition experiments on TIMIT and show that for seve… ▽ More

    Submitted 4 April, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Accepted at ICASSP 2018

  23. arXiv:1708.04607  [pdf, other

    cs.CV

    Segmentation-Aware Convolutional Networks Using Local Attention Masks

    Authors: Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos

    Abstract: We introduce an approach to integrate segmentation information within a convolutional neural network (CNN). This counter-acts the tendency of CNNs to smooth information across regions and increases their spatial precision. To obtain segmentation information, we set up a CNN to provide an embedding space where region co-membership can be estimated based on Euclidean distance. We use these embedding… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

  24. arXiv:1708.03816  [pdf, other

    cs.CV

    Mass Displacement Networks

    Authors: Natalia Neverova, Iasonas Kokkinos

    Abstract: Despite the large improvements in performance attained by using deep learning in computer vision, one can often further improve results with some additional post-processing that exploits the geometric nature of the underlying task. This commonly involves displacing the posterior distribution of a CNN in a way that makes it more appropriate for the task at hand, e.g. better aligned with local image… ▽ More

    Submitted 12 August, 2017; originally announced August 2017.

    Comments: 12 pages, 4 figures

  25. arXiv:1612.01202  [pdf, other

    cs.CV

    DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild

    Authors: Rıza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos

    Abstract: In this paper we propose to learn a map** from image pixels into a dense template grid through a fully convolutional network. We formulate this task as a regression problem and train our network by leveraging upon manually annotated facial landmarks "in-the-wild". We use such landmarks to establish a dense correspondence field between a three-dimensional object template and the input image, whic… ▽ More

    Submitted 19 June, 2017; v1 submitted 4 December, 2016; originally announced December 2016.

    Comments: CVPR 2017

  26. arXiv:1611.09051  [pdf, other

    cs.CV

    Deep, Dense, and Low-Rank Gaussian Conditional Random Fields

    Authors: Siddhartha Chandra, Iasonas Kokkinos

    Abstract: In this work we introduce a fully-connected graph structure in the Deep Gaussian Conditional Random Field (G-CRF) model. For this we express the pairwise interactions between pixels as the inner-products of low-dimensional embeddings, delivered by a new subnetwork of a deep architecture. We efficiently minimize the resulting energy by solving the resulting low-rank linear system with conjugate gra… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

  27. arXiv:1609.02132  [pdf, other

    cs.CV cs.AI cs.LG

    UberNet: Training a `Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory

    Authors: Iasonas Kokkinos

    Abstract: In this work we introduce a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture that is trained end-to-end. Such a universal network can act like a `swiss knife' for vision tasks; we call this architecture an UberNet to indicate its overarching nature. We address two main technical challenges that emerge when broadening up the… ▽ More

    Submitted 7 September, 2016; originally announced September 2016.

  28. arXiv:1607.06787  [pdf, other

    cs.CV

    Prior-based Coregistration and Cosegmentation

    Authors: Mahsa Shakeri, Enzo Ferrante, Stavros Tsogkas, Sarah Lippe, Samuel Kadoury, Iasonas Kokkinos, Nikos Paragios

    Abstract: We propose a modular and scalable framework for dense coregistration and cosegmentation with two key characteristics: first, we substitute ground truth data with the semantic map output of a classifier; second, we combine this output with population deformable registration to improve both alignment and segmentation. Our approach deforms all volumes towards consensus, taking into account image simi… ▽ More

    Submitted 22 July, 2016; originally announced July 2016.

    Comments: The first two authors contributed equally

    Journal ref: MICCAI 2016

  29. arXiv:1606.00915  [pdf, other

    cs.CV

    DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

    Authors: Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, Alan L. Yuille

    Abstract: In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature respon… ▽ More

    Submitted 11 May, 2017; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: Accepted by TPAMI

  30. arXiv:1603.08358  [pdf, other

    cs.CV cs.LG

    Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

    Authors: Siddhartha Chandra, Iasonas Kokkinos

    Abstract: In this work we propose a structured prediction technique that combines the virtues of Gaussian Conditional Random Fields (G-CRF) with Deep Learning: (a) our structured prediction task has a unique global optimum that is obtained exactly from the solution of a linear system (b) the gradients of our model parameters are analytically computed using closed form expressions, in contrast to the memory-… ▽ More

    Submitted 29 November, 2016; v1 submitted 28 March, 2016; originally announced March 2016.

    Comments: Our code is available at https://github.com/siddharthachandra/gcrf

  31. arXiv:1602.02130  [pdf, other

    cs.CV

    Sub-cortical brain structure segmentation using F-CNN's

    Authors: Mahsa Shakeri, Stavros Tsogkas, Enzo Ferrante, Sarah Lippe, Samuel Kadoury, Nikos Paragios, Iasonas Kokkinos

    Abstract: In this paper we propose a deep learning approach for segmenting sub-cortical structures of the human brain in Magnetic Resonance (MR) image data. We draw inspiration from a state-of-the-art Fully-Convolutional Neural Network (F-CNN) architecture for semantic segmentation of objects in natural images, and adapt it to our task. Unlike previous CNN-based methods that operate on image patches, our mo… ▽ More

    Submitted 5 February, 2016; originally announced February 2016.

    Comments: ISBI 2016: International Symposium on Biomedical Imaging, Apr 2016, Prague, Czech Republic

  32. arXiv:1511.07386  [pdf, other

    cs.CV cs.LG

    Pushing the Boundaries of Boundary Detection using Deep Learning

    Authors: Iasonas Kokkinos

    Abstract: In this work we show that adapting Deep Convolutional Neural Network training to the task of boundary detection can result in substantial improvements over the current state-of-the-art in boundary detection. Our contributions consist firstly in combining a careful design of the loss for boundary detection training, a multi-resolution architecture and training with external data to improve the de… ▽ More

    Submitted 22 January, 2016; v1 submitted 23 November, 2015; originally announced November 2015.

    Comments: The previous version reported large improvements w.r.t. the LPO region proposal baseline, which turned out to be due to a wrong computation for the baseline. The improvements are currently less important, and are omitted. We are sorry if the reported results caused any confusion. We have also integrated reviewer feedback regarding human performance on the BSD benchmark

  33. arXiv:1511.04377  [pdf, other

    cs.CV

    Learning Dense Convolutional Embeddings for Semantic Segmentation

    Authors: Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos

    Abstract: This paper proposes a new deep convolutional neural network (DCNN) architecture that learns pixel embeddings, such that pairwise distances between the embeddings can be used to infer whether or not the pixels lie on the same region. That is, for any two pixels on the same object, the embeddings are trained to be similar; for any pair that straddles an object boundary, the embeddings are trained to… ▽ More

    Submitted 7 January, 2016; v1 submitted 13 November, 2015; originally announced November 2015.

  34. arXiv:1507.02620  [pdf, other

    cs.CV

    Deep filter banks for texture recognition, description, and segmentation

    Authors: Mircea Cimpoi, Subhransu Maji, Iasonas Kokkinos, Andrea Vedaldi

    Abstract: Visual textures have played a key role in image understanding because they convey important semantics of images, and because texture representations that pool local image descriptors in an orderless manner have had a tremendous impact in diverse applications. In this paper we make several contributions to texture understanding. First, instead of focusing on texture instance and material category r… ▽ More

    Submitted 18 November, 2015; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: 29 pages; 13 figures; 8 tables

  35. arXiv:1505.02438  [pdf, other

    cs.CV

    Deep Learning for Semantic Part Segmentation with High-Level Guidance

    Authors: S. Tsogkas, I. Kokkinos, G. Papandreou, A. Vedaldi

    Abstract: In this work we address the task of segmenting an object into its parts, or semantic part segmentation. We start by adapting a state-of-the-art semantic segmentation system to this task, and show that a combination of a fully-convolutional Deep CNN system coupled with Dense CRF labelling provides excellent results for a broad range of object categories. Still, this approach remains agnostic to hig… ▽ More

    Submitted 24 November, 2015; v1 submitted 10 May, 2015; originally announced May 2015.

    Comments: 11 pages (including references), 3 figures, 2 tables

  36. arXiv:1412.7062  [pdf, other

    cs.CV cs.LG cs.NE

    Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

    Authors: Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, Alan L. Yuille

    Abstract: Deep Convolutional Neural Networks (DCNNs) have recently shown state of the art performance in high level vision tasks, such as image classification and object detection. This work brings together methods from DCNNs and probabilistic graphical models for addressing the task of pixel-level classification (also called "semantic image segmentation"). We show that responses at the final layer of DCNNs… ▽ More

    Submitted 7 June, 2016; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: 14 pages. Updated related work

  37. arXiv:1412.6537  [pdf, other

    cs.CV

    Fracking Deep Convolutional Image Descriptors

    Authors: Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Francesc Moreno-Noguer

    Abstract: In this paper we propose a novel framework for learning local image descriptors in a discriminative manner. For this purpose we explore a siamese architecture of Deep Convolutional Neural Networks (CNN), with a Hinge embedding loss on the L2 distance between descriptors. Since a siamese architecture uses pairs rather than single image patches to train, there exist a large number of positive sample… ▽ More

    Submitted 25 February, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  38. arXiv:1412.0296  [pdf, ps, other

    cs.CV

    Untangling Local and Global Deformations in Deep Convolutional Networks for Image Classification and Sliding Window Detection

    Authors: George Papandreou, Iasonas Kokkinos, Pierre-André Savalle

    Abstract: Deep Convolutional Neural Networks (DCNNs) commonly use generic `max-pooling' (MP) layers to extract deformation-invariant features, but we argue in favor of a more refined treatment. First, we introduce epitomic convolution as a building block alternative to the common convolution-MP cascade of DCNNs; while having identical complexity to MP, Epitomic Convolution allows for parameter sharing acros… ▽ More

    Submitted 30 November, 2014; originally announced December 2014.

    Comments: 13 pages, 7 figures, 5 tables. arXiv admin note: substantial text overlap with arXiv:1406.2732

  39. arXiv:1311.3618  [pdf, other

    cs.CV

    Describing Textures in the Wild

    Authors: Mircea Cimpoi, Subhransu Maji, Iasonas Kokkinos, Sammy Mohamed, Andrea Vedaldi

    Abstract: Patterns and textures are defining characteristics of many natural objects: a shirt can be striped, the wings of a butterfly can be veined, and the skin of an animal can be scaly. Aiming at supporting this analytical dimension in image understanding, we address the challenging problem of describing textures with semantic attributes. We identify a rich vocabulary of forty-seven texture terms and us… ▽ More

    Submitted 15 November, 2013; v1 submitted 14 November, 2013; originally announced November 2013.

    Comments: 13 pages; 12 figures Fixed misplaced affiliation