Showing 1–7 of 7 results for author: Choy, C B

Searching in archive cs.
  1. arXiv:1803.08495  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Text2Shape: Generating Shapes from Natural Language by Learning Joint Embeddings

    Authors: Kevin Chen, Christopher B. Choy, Manolis Savva, Angel X. Chang, Thomas Funkhouser, Silvio Savarese

    Abstract: We present a method for generating colored 3D shapes from natural language. To this end, we first learn joint embeddings of freeform text descriptions and colored 3D shapes. Our model combines and extends learning by association and metric learning approaches to learn implicit cross-modal connections, and produces a joint representation that captures the many-to-many relations between language and…

    Submitted 22 March, 2018; originally announced March 2018.

  2. arXiv:1710.07563  [pdf, other]

    cs.CV

    SEGCloud: Semantic Segmentation of 3D Point Clouds

    Authors: Lyne P. Tchapmi, Christopher B. Choy, Iro Armeni, JunYoung Gwak, Silvio Savarese

    Abstract: 3D semantic scene labeling is fundamental to agents operating in the real world. In particular, labeling raw 3D point sets from sensors provides fine-grained semantics. Recent works leverage the capabilities of Neural Networks (NNs), but are limited to coarse voxel predictions and do not explicitly enforce global consistency. We present SEGCloud, an end-to-end framework to obtain 3D point-level se…

    Submitted 20 October, 2017; originally announced October 2017.

    Comments: Accepted as a spotlight at the International Conference on 3D Vision (3DV 2017)

  3. arXiv:1705.10904  [pdf, other]

    cs.CV

    Weakly supervised 3D Reconstruction with Adversarial Constraint

    Authors: JunYoung Gwak, Christopher B. Choy, Animesh Garg, Manmohan Chandraker, Silvio Savarese

    Abstract: Supervised 3D reconstruction has witnessed significant progress through the use of deep neural networks. However, this increase in performance requires large-scale annotations of 2D/3D data. In this paper, we explore inexpensive 2D supervision as an alternative for expensive 3D CAD annotation. Specifically, we use foreground masks as weak supervision through a raytrace pooling layer that enables…

    Submitted 4 October, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

  4. arXiv:1704.04394  [pdf, other]

    cs.CV

    DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

    Authors: Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker

    Abstract: We introduce a Deep Stochastic IOC RNN Encoder-decoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes. DESIRE effectively predicts future locations of objects in multiple scenes by 1) accounting for the multi-modal nature of the future prediction (i.e., given the same context, the future may vary), 2) foreseeing the potential future outcomes and m…

    Submitted 14 April, 2017; originally announced April 2017.

    Comments: Accepted at CVPR 2017

  5. arXiv:1701.02426  [pdf, other]

    cs.CV

    Scene Graph Generation by Iterative Message Passing

    Authors: Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

    Abstract: Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their relationships using scene graphs, a visually-grounded graphical structure of an image. We propose a novel end-to-end model that generates such structured scene represent…

    Submitted 12 April, 2017; v1 submitted 9 January, 2017; originally announced January 2017.

    Comments: CVPR 2017

  6. arXiv:1606.03558  [pdf, other]

    cs.CV

    Universal Correspondence Network

    Authors: Christopher B. Choy, JunYoung Gwak, Silvio Savarese, Manmohan Chandraker

    Abstract: We present a deep learning framework for accurate visual correspondences and demonstrate its effectiveness for both geometric and semantic matching, spanning across rigid motions to intra-class shape or appearance variations. In contrast to previous CNN-based approaches that optimize a surrogate patch similarity objective, we use deep metric learning to directly learn a feature space that preserve…

    Submitted 31 October, 2016; v1 submitted 11 June, 2016; originally announced June 2016.

    Comments: To appear at NIPS 2016 as full oral presentation

  7. arXiv:1604.00449  [pdf, other]

    cs.CV cs.AI

    3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction

    Authors: Christopher B. Choy, Danfei Xu, JunYoung Gwak, Kevin Chen, Silvio Savarese

    Abstract: Inspired by the recent success of methods that employ shape priors to achieve robust 3D reconstructions, we propose a novel recurrent neural network architecture that we call the 3D Recurrent Reconstruction Neural Network (3D-R2N2). The network learns a mapping from images of objects to their underlying 3D shapes from a large collection of synthetic data. Our network takes in one or more images of…

    Submitted 1 April, 2016; originally announced April 2016.

    Comments: Appendix can be found at http://cvgl.stanford.edu/papers/choy_16_appendix.pdf