Skip to main content

Showing 1–18 of 18 results for author: Tsogkas, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09081  [pdf, other

    cs.CV cs.LG

    Probabilistic Directed Distance Fields for Ray-Based Shape Representations

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

    Abstract: In modern computer vision, the optimal representation of 3D shape continues to be task-dependent. One fundamental operation applied to such representations is differentiable rendering, as it enables inverse graphics approaches in learning frameworks. Standard explicit shape representations (voxels, point clouds, or meshes) are often easily rendered, but can suffer from limited geometric fidelity,… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: Extension of arXiv:2112.05300

    ACM Class: I.2.10

  2. arXiv:2306.08132  [pdf, other

    cs.RO

    Fast-Grasp'D: Dexterous Multi-finger Grasp Generation Through Differentiable Simulation

    Authors: Dylan Turpin, Tao Zhong, Shutong Zhang, Guanglei Zhu, **gzhou Liu, Ritvik Singh, Eric Heiden, Miles Macklin, Stavros Tsogkas, Sven Dickinson, Animesh Garg

    Abstract: Multi-finger gras** relies on high quality training data, which is hard to obtain: human data is hard to transfer and synthetic data relies on simplifying assumptions that reduce grasp quality. By making grasp simulation differentiable, and contact dynamics amenable to gradient-based optimization, we accelerate the search for high-quality grasps with fewer limiting assumptions. We present Grasp'… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  3. arXiv:2301.10759  [pdf, other

    cs.CV

    Efficient Flow-Guided Multi-frame De-fencing

    Authors: Stavros Tsogkas, Fengjia Zhang, Allan Jepson, Alex Levinshtein

    Abstract: Taking photographs ''in-the-wild'' is often hindered by fence obstructions that stand between the camera user and the scene of interest, and which are hard or impossible to avoid. De-fencing is the algorithmic process of automatically removing such obstructions from images, revealing the invisible parts of the scene. While this problem can be formulated as a combination of fence segmentation and i… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 16 pages, 12 figures. Published at the Winter Conference on Application of Computer Vision (WACV) 2023

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023, pp. 1838-1847

  4. arXiv:2208.12250  [pdf, other

    cs.RO

    Grasp'D: Differentiable Contact-rich Grasp Synthesis for Multi-fingered Hands

    Authors: Dylan Turpin, Liquan Wang, Eric Heiden, Yun-Chun Chen, Miles Macklin, Stavros Tsogkas, Sven Dickinson, Animesh Garg

    Abstract: The study of hand-object interaction requires generating viable grasp poses for high-dimensional multi-finger models, often relying on analytic grasp synthesis which tends to produce brittle and unnatural results. This paper presents Grasp'D, an approach for grasp synthesis with a differentiable contact simulation from both known models as well as visual inputs. We use gradient-based methods as an… ▽ More

    Submitted 25 August, 2022; v1 submitted 25 August, 2022; originally announced August 2022.

  5. arXiv:2112.05300  [pdf, other

    cs.CV cs.LG

    Representing 3D Shapes with Probabilistic Directed Distance Fields

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

    Abstract: Differentiable rendering is an essential operation in modern vision, allowing inverse graphics approaches to 3D understanding to be utilized in modern machine learning frameworks. Explicit shape representations (voxels, point clouds, or meshes), while relatively easily rendered, often suffer from limited geometric fidelity or topological constraints. On the other hand, implicit representations (oc… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 22 pages

    ACM Class: I.2.6; I.2.10

  6. arXiv:2106.14973  [pdf, other

    cs.RO

    GIFT: Generalizable Interaction-aware Functional Tool Affordances without Labels

    Authors: Dylan Turpin, Liquan Wang, Stavros Tsogkas, Sven Dickinson, Animesh Garg

    Abstract: Tool use requires reasoning about the fit between an object's affordances and the demands of a task. Visual affordance learning can benefit from goal-directed interaction experience, but current techniques rely on human labels or expert demonstrations to generate this data. In this paper, we describe a method that grounds affordances in physical interactions instead, thus removing the need for hum… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: Qualitative results available at https://www.pair.toronto.edu/gift-tools-rss21

  7. arXiv:2106.06440  [pdf, other

    cs.CV cs.LG

    Learning Compositional Shape Priors for Few-Shot 3D Reconstruction

    Authors: Mateusz Michalkiewicz, Stavros Tsogkas, Sarah Parisot, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky

    Abstract: The impressive performance of deep convolutional neural networks in single-view 3D reconstruction suggests that these models perform non-trivial reasoning about the 3D structure of the output space. Recent work has challenged this belief, showing that, on standard benchmarks, complex encoder-decoder architectures perform similarly to nearest-neighbor baselines or simple linear decoder models that… ▽ More

    Submitted 16 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: 13 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2004.06302

  8. Disentangling Geometric Deformation Spaces in Generative Latent Shape Models

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

    Abstract: A complete representation of 3D objects requires characterizing the space of deformations in an interpretable manner, from articulations of a single instance to changes in shape across categories. In this work, we improve on a prior generative model of geometric disentanglement for 3D shapes, wherein the space of object geometry is factorized into rigid orientation, non-rigid pose, and intrinsic s… ▽ More

    Submitted 18 March, 2023; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: Accepted to IJCV

    ACM Class: I.2.10; I.5.4

  9. arXiv:2011.08026  [pdf, other

    cs.CV cs.LG

    Cycle-Consistent Generative Rendering for 2D-3D Modality Translation

    Authors: Tristan Aumentado-Armstrong, Alex Levinshtein, Stavros Tsogkas, Konstantinos G. Derpanis, Allan D. Jepson

    Abstract: For humans, visual understanding is inherently generative: given a 3D shape, we can postulate how it would look in the world; given a 2D image, we can infer the 3D structure that likely gave rise to it. We can thus translate between the 2D visual and 3D structural modalities of a given object. In the context of computer vision, this corresponds to a learnable module that serves two purposes: (i) g… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: 3DV 2020 (oral). Project page: https://ttaa9.github.io/genren/

    ACM Class: I.2.10; I.2.6

  10. arXiv:2009.06943  [pdf, other

    eess.IV cs.CV

    AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

    Authors: Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, **gwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin , et al. (60 additional authors not shown)

    Abstract: This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter co… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  11. arXiv:2004.06302  [pdf, other

    cs.CV cs.LG

    Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors

    Authors: Mateusz Michalkiewicz, Sarah Parisot, Stavros Tsogkas, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky

    Abstract: The impressive performance of deep convolutional neural networks in single-view 3D reconstruction suggests that these models perform non-trivial reasoning about the 3D structure of the output space. However, recent work has challenged this belief, showing that complex encoder-decoder architectures perform similarly to nearest-neighbor baselines or simple linear decoder models that exploit large am… ▽ More

    Submitted 2 May, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

  12. arXiv:2004.02677  [pdf, other

    cs.CV

    Appearance Shock Grammar for Fast Medial Axis Extraction from Real Images

    Authors: Charles-Olivier Dufresne Camaro, Morteza Rezanejad, Stavros Tsogkas, Kaleem Siddiqi, Sven Dickinson

    Abstract: We combine ideas from shock graph theory with more recent appearance-based methods for medial axis extraction from complex natural scenes, improving upon the present best unsupervised method, in terms of efficiency and performance. We make the following specific contributions: i) we extend the shock graph representation to the domain of real images, by generalizing the shock type definitions using… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted to CVPR 2020

  13. arXiv:1908.06386  [pdf, other

    cs.CV cs.LG eess.IV

    Geometric Disentanglement for Generative Latent Shape Models

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Allan Jepson, Sven Dickinson

    Abstract: Representing 3D shape is a fundamental problem in artificial intelligence, which has numerous applications within computer vision and graphics. One avenue that has recently begun to be explored is the use of latent representations of generative models. However, it remains an open problem to learn a generative model of shape that is interpretable and easily manipulated, particularly in the absence… ▽ More

    Submitted 18 August, 2019; originally announced August 2019.

    Comments: ICCV 2019

    ACM Class: I.2.10; I.5.4

  14. arXiv:1811.12608  [pdf, other

    cs.CV

    DeepFlux for Skeletons in the Wild

    Authors: Yukang Wang, Yongchao Xu, Stavros Tsogkas, Xiang Bai, Sven Dickinson, Kaleem Siddiqi

    Abstract: Computing object skeletons in natural images is challenging, owing to large variations in object appearance and scale, and the complexity of handling background clutter. Many recent methods frame object skeleton detection as a binary pixel classification problem, which is similar in spirit to learning-based edge detection, as well as to semantic segmentation methods. In the present article, we dep… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: 10 pages

  15. arXiv:1703.08628  [pdf, other

    cs.CV

    AMAT: Medial Axis Transform for Natural Images

    Authors: Stavros Tsogkas, Sven Dickinson

    Abstract: We introduce Appearance-MAT (AMAT), a generalization of the medial axis transform for natural images, that is framed as a weighted geometric set cover problem. We make the following contributions: i) we extend previous medial point detection methods for color images, by associating each medial point with a local scale; ii) inspired by the invertibility property of the binary MAT, we also associate… ▽ More

    Submitted 2 August, 2017; v1 submitted 24 March, 2017; originally announced March 2017.

    Comments: 10 pages (including references), 5 figures, accepted at ICCV 2017

  16. arXiv:1607.06787  [pdf, other

    cs.CV

    Prior-based Coregistration and Cosegmentation

    Authors: Mahsa Shakeri, Enzo Ferrante, Stavros Tsogkas, Sarah Lippe, Samuel Kadoury, Iasonas Kokkinos, Nikos Paragios

    Abstract: We propose a modular and scalable framework for dense coregistration and cosegmentation with two key characteristics: first, we substitute ground truth data with the semantic map output of a classifier; second, we combine this output with population deformable registration to improve both alignment and segmentation. Our approach deforms all volumes towards consensus, taking into account image simi… ▽ More

    Submitted 22 July, 2016; originally announced July 2016.

    Comments: The first two authors contributed equally

    Journal ref: MICCAI 2016

  17. arXiv:1602.02130  [pdf, other

    cs.CV

    Sub-cortical brain structure segmentation using F-CNN's

    Authors: Mahsa Shakeri, Stavros Tsogkas, Enzo Ferrante, Sarah Lippe, Samuel Kadoury, Nikos Paragios, Iasonas Kokkinos

    Abstract: In this paper we propose a deep learning approach for segmenting sub-cortical structures of the human brain in Magnetic Resonance (MR) image data. We draw inspiration from a state-of-the-art Fully-Convolutional Neural Network (F-CNN) architecture for semantic segmentation of objects in natural images, and adapt it to our task. Unlike previous CNN-based methods that operate on image patches, our mo… ▽ More

    Submitted 5 February, 2016; originally announced February 2016.

    Comments: ISBI 2016: International Symposium on Biomedical Imaging, Apr 2016, Prague, Czech Republic

  18. arXiv:1505.02438  [pdf, other

    cs.CV

    Deep Learning for Semantic Part Segmentation with High-Level Guidance

    Authors: S. Tsogkas, I. Kokkinos, G. Papandreou, A. Vedaldi

    Abstract: In this work we address the task of segmenting an object into its parts, or semantic part segmentation. We start by adapting a state-of-the-art semantic segmentation system to this task, and show that a combination of a fully-convolutional Deep CNN system coupled with Dense CRF labelling provides excellent results for a broad range of object categories. Still, this approach remains agnostic to hig… ▽ More

    Submitted 24 November, 2015; v1 submitted 10 May, 2015; originally announced May 2015.

    Comments: 11 pages (including references), 3 figures, 2 tables