Skip to main content

Showing 1–15 of 15 results for author: Dickinson, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09081  [pdf, other

    cs.CV cs.LG

    Probabilistic Directed Distance Fields for Ray-Based Shape Representations

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

    Abstract: In modern computer vision, the optimal representation of 3D shape continues to be task-dependent. One fundamental operation applied to such representations is differentiable rendering, as it enables inverse graphics approaches in learning frameworks. Standard explicit shape representations (voxels, point clouds, or meshes) are often easily rendered, but can suffer from limited geometric fidelity,… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: Extension of arXiv:2112.05300

    ACM Class: I.2.10

  2. arXiv:2306.08132  [pdf, other

    cs.RO

    Fast-Grasp'D: Dexterous Multi-finger Grasp Generation Through Differentiable Simulation

    Authors: Dylan Turpin, Tao Zhong, Shutong Zhang, Guanglei Zhu, **gzhou Liu, Ritvik Singh, Eric Heiden, Miles Macklin, Stavros Tsogkas, Sven Dickinson, Animesh Garg

    Abstract: Multi-finger gras** relies on high quality training data, which is hard to obtain: human data is hard to transfer and synthetic data relies on simplifying assumptions that reduce grasp quality. By making grasp simulation differentiable, and contact dynamics amenable to gradient-based optimization, we accelerate the search for high-quality grasps with fewer limiting assumptions. We present Grasp'… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  3. arXiv:2208.12250  [pdf, other

    cs.RO

    Grasp'D: Differentiable Contact-rich Grasp Synthesis for Multi-fingered Hands

    Authors: Dylan Turpin, Liquan Wang, Eric Heiden, Yun-Chun Chen, Miles Macklin, Stavros Tsogkas, Sven Dickinson, Animesh Garg

    Abstract: The study of hand-object interaction requires generating viable grasp poses for high-dimensional multi-finger models, often relying on analytic grasp synthesis which tends to produce brittle and unnatural results. This paper presents Grasp'D, an approach for grasp synthesis with a differentiable contact simulation from both known models as well as visual inputs. We use gradient-based methods as an… ▽ More

    Submitted 25 August, 2022; v1 submitted 25 August, 2022; originally announced August 2022.

  4. arXiv:2112.05300  [pdf, other

    cs.CV cs.LG

    Representing 3D Shapes with Probabilistic Directed Distance Fields

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

    Abstract: Differentiable rendering is an essential operation in modern vision, allowing inverse graphics approaches to 3D understanding to be utilized in modern machine learning frameworks. Explicit shape representations (voxels, point clouds, or meshes), while relatively easily rendered, often suffer from limited geometric fidelity or topological constraints. On the other hand, implicit representations (oc… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 22 pages

    ACM Class: I.2.6; I.2.10

  5. arXiv:2106.14973  [pdf, other

    cs.RO

    GIFT: Generalizable Interaction-aware Functional Tool Affordances without Labels

    Authors: Dylan Turpin, Liquan Wang, Stavros Tsogkas, Sven Dickinson, Animesh Garg

    Abstract: Tool use requires reasoning about the fit between an object's affordances and the demands of a task. Visual affordance learning can benefit from goal-directed interaction experience, but current techniques rely on human labels or expert demonstrations to generate this data. In this paper, we describe a method that grounds affordances in physical interactions instead, thus removing the need for hum… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: Qualitative results available at https://www.pair.toronto.edu/gift-tools-rss21

  6. Disentangling Geometric Deformation Spaces in Generative Latent Shape Models

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Sven Dickinson, Allan Jepson

    Abstract: A complete representation of 3D objects requires characterizing the space of deformations in an interpretable manner, from articulations of a single instance to changes in shape across categories. In this work, we improve on a prior generative model of geometric disentanglement for 3D shapes, wherein the space of object geometry is factorized into rigid orientation, non-rigid pose, and intrinsic s… ▽ More

    Submitted 18 March, 2023; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: Accepted to IJCV

    ACM Class: I.2.10; I.5.4

  7. arXiv:2004.02677  [pdf, other

    cs.CV

    Appearance Shock Grammar for Fast Medial Axis Extraction from Real Images

    Authors: Charles-Olivier Dufresne Camaro, Morteza Rezanejad, Stavros Tsogkas, Kaleem Siddiqi, Sven Dickinson

    Abstract: We combine ideas from shock graph theory with more recent appearance-based methods for medial axis extraction from complex natural scenes, improving upon the present best unsupervised method, in terms of efficiency and performance. We make the following specific contributions: i) we extend the shock graph representation to the domain of real images, by generalizing the shock type definitions using… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted to CVPR 2020

  8. arXiv:1908.06386  [pdf, other

    cs.CV cs.LG eess.IV

    Geometric Disentanglement for Generative Latent Shape Models

    Authors: Tristan Aumentado-Armstrong, Stavros Tsogkas, Allan Jepson, Sven Dickinson

    Abstract: Representing 3D shape is a fundamental problem in artificial intelligence, which has numerous applications within computer vision and graphics. One avenue that has recently begun to be explored is the use of latent representations of generative models. However, it remains an open problem to learn a generative model of shape that is interpretable and easily manipulated, particularly in the absence… ▽ More

    Submitted 18 August, 2019; originally announced August 2019.

    Comments: ICCV 2019

    ACM Class: I.2.10; I.5.4

  9. arXiv:1811.12608  [pdf, other

    cs.CV

    DeepFlux for Skeletons in the Wild

    Authors: Yukang Wang, Yongchao Xu, Stavros Tsogkas, Xiang Bai, Sven Dickinson, Kaleem Siddiqi

    Abstract: Computing object skeletons in natural images is challenging, owing to large variations in object appearance and scale, and the complexity of handling background clutter. Many recent methods frame object skeleton detection as a binary pixel classification problem, which is similar in spirit to learning-based edge detection, as well as to semantic segmentation methods. In the present article, we dep… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: 10 pages

  10. arXiv:1811.10524  [pdf, other

    cs.CV

    Scene Categorization from Contours: Medial Axis Based Salience Measures

    Authors: Morteza Rezanejad, Gabriel Downs, John Wilder, Dirk B. Walther, Allan Jepson, Sven Dickinson, Kaleem Siddiqi

    Abstract: The computer vision community has witnessed recent advances in scene categorization from images, with the state-of-the art systems now achieving impressive recognition rates on challenging benchmarks such as the Places365 dataset. Such systems have been trained on photographs which include color, texture and shading cues. The geometry of shapes and surfaces, as conveyed by scene contours, is not e… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

  11. arXiv:1703.08628  [pdf, other

    cs.CV

    AMAT: Medial Axis Transform for Natural Images

    Authors: Stavros Tsogkas, Sven Dickinson

    Abstract: We introduce Appearance-MAT (AMAT), a generalization of the medial axis transform for natural images, that is framed as a weighted geometric set cover problem. We make the following contributions: i) we extend previous medial point detection methods for color images, by associating each medial point with a local scale; ii) inspired by the invertibility property of the binary MAT, we also associate… ▽ More

    Submitted 2 August, 2017; v1 submitted 24 March, 2017; originally announced March 2017.

    Comments: 10 pages (including references), 5 figures, accepted at ICCV 2017

  12. arXiv:1502.01761  [pdf, other

    cs.CV

    A Framework for Symmetric Part Detection in Cluttered Scenes

    Authors: Tom Lee, Sanja Fidler, Alex Levinshtein, Cristian Sminchisescu, Sven Dickinson

    Abstract: The role of symmetry in computer vision has waxed and waned in importance during the evolution of the field from its earliest days. At first figuring prominently in support of bottom-up indexing, it fell out of favor as shape gave way to appearance and recognition gave way to detection. With a strong prior in the form of a target object, the role of the weaker priors offered by perceptual grou**… ▽ More

    Submitted 5 February, 2015; originally announced February 2015.

    Comments: 10 pages, 8 figures

  13. arXiv:1408.6418  [pdf

    cs.CV cs.CL cs.IR

    Video In Sentences Out

    Authors: Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, **lian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present a system that produces sentential descriptions of video: who did what to whom, and where and how they did it. Action class is rendered as a verb, participant objects as noun phrases, properties of those objects as adjectival modifiers in those noun phrases, spatial relations between those participants as prepositional phrases, and characteristics of the event as prepositional-phrase adj… ▽ More

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-102-112

  14. arXiv:1204.3616  [pdf, other

    cs.CV cs.AI

    Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction

    Authors: Andrei Barbu, Alexander Bridge, Dan Coroian, Sven Dickinson, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, **lian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present an approach to labeling short video clips with English verbs as event descriptions. A key distinguishing aspect of this work is that it labels videos with verbs that describe the spatiotemporal interaction between event participants, humans and objects interacting with each other, abstracting away all object-class information and fine-grained image characteristics, and relying solely on… ▽ More

    Submitted 16 April, 2012; originally announced April 2012.

  15. arXiv:1204.2742  [pdf, other

    cs.CV cs.AI

    Video In Sentences Out

    Authors: Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, **lian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present a system that produces sentential descriptions of video: who did what to whom, and where and how they did it. Action class is rendered as a verb, participant objects as noun phrases, properties of those objects as adjectival modifiers in those noun phrases,spatial relations between those participants as prepositional phrases, and characteristics of the event as prepositional-phrase adju… ▽ More

    Submitted 12 April, 2012; originally announced April 2012.