Skip to main content

Showing 1–9 of 9 results for author: Thai, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.14198  [pdf, other

    cs.CV

    ZeroShape: Regression-based Zero-shot Shape Reconstruction

    Authors: Zixuan Huang, Stefan Stojanov, Anh Thai, Varun Jampani, James M. Rehg

    Abstract: We study the problem of single-image zero-shot 3D shape reconstruction. Recent works learn zero-shot shape reconstruction through generative modeling of 3D assets, but these models are computationally expensive at train and inference time. In contrast, the traditional approach to this problem is regression-based, where deterministic models are trained to directly regress the object shape. Such reg… ▽ More

    Submitted 16 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Project page: https://zixuanh.com/projects/zeroshape.html

  2. arXiv:2312.03533  [pdf, other

    cs.CV

    Low-shot Object Learning with Mutual Exclusivity Bias

    Authors: Anh Thai, Ahmad Humayun, Stefan Stojanov, Zixuan Huang, Bikram Boote, James M. Rehg

    Abstract: This paper introduces Low-shot Object Learning with Mutual Exclusivity Bias (LSME), the first computational framing of mutual exclusivity bias, a phenomenon commonly observed in infants during word learning. We provide a novel dataset, comprehensive baselines, and a state-of-the-art method to enable the ML community to tackle this challenging learning task. The goal of LSME is to analyze an RGB im… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted at NeurIPS 2023, Datasets and Benchmarks Track. Project website https://ngailapdi.github.io/projects/lsme/

  3. arXiv:2304.06247  [pdf, other

    cs.CV

    ShapeClipper: Scalable 3D Shape Learning from Single-View Images via Geometric and CLIP-based Consistency

    Authors: Zixuan Huang, Varun Jampani, Anh Thai, Yuanzhen Li, Stefan Stojanov, James M. Rehg

    Abstract: We present ShapeClipper, a novel method that reconstructs 3D object shapes from real-world single-view RGB images. Instead of relying on laborious 3D, multi-view or camera pose annotation, ShapeClipper learns shape reconstruction from a set of single-view segmented images. The key idea is to facilitate shape learning via CLIP-based shape consistency, where we encourage objects with similar CLIP en… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023, project website at https://zixuanh.com/projects/shapeclipper.html

  4. arXiv:2211.15059  [pdf, other

    cs.CV

    Learning Dense Object Descriptors from Multiple Views for Low-shot Category Generalization

    Authors: Stefan Stojanov, Anh Thai, Zixuan Huang, James M. Rehg

    Abstract: A hallmark of the deep learning era for computer vision is the successful use of large-scale labeled datasets to train feature representations for tasks ranging from object recognition and semantic segmentation to optical flow estimation and novel view synthesis of 3D scenes. In this work, we aim to learn dense discriminative object representations for low-shot category recognition without requiri… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted at NeurIPS 2022. Code and data available at https://github.com/rehg-lab/dope_selfsup

  5. arXiv:2204.10235  [pdf, other

    cs.CV

    Planes vs. Chairs: Category-guided 3D shape learning without any 3D cues

    Authors: Zixuan Huang, Stefan Stojanov, Anh Thai, Varun Jampani, James M. Rehg

    Abstract: We present a novel 3D shape reconstruction method which learns to predict an implicit 3D shape representation from a single RGB image. Our approach uses a set of single-view images of multiple object categories without viewpoint annotation, forcing the model to learn across multiple object categories without 3D supervision. To facilitate learning with such minimal supervision, we use category labe… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Project page: https://zixuanh.com/multiclass3D

  6. arXiv:2101.07296  [pdf, other

    cs.CV cs.LG

    Using Shape to Categorize: Low-Shot Learning with an Explicit Shape Bias

    Authors: Stefan Stojanov, Anh Thai, James M. Rehg

    Abstract: It is widely accepted that reasoning about object shape is important for object recognition. However, the most powerful object recognition methods today do not explicitly make use of object shape during learning. In this work, motivated by recent developments in low-shot learning, findings in developmental psychology, and the increased use of synthetic data in computer vision research, we investig… ▽ More

    Submitted 20 June, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted at CVPR2021. Project page, code and data available at https://rehg-lab.github.io/publication-pages/lowshot-shapebias/

  7. arXiv:2101.07295  [pdf, other

    cs.LG cs.CV

    The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction

    Authors: Anh Thai, Stefan Stojanov, Zixuan Huang, Isaac Rehg, James M. Rehg

    Abstract: Continual learning has been extensively studied for classification tasks with methods developed to primarily avoid catastrophic forgetting, a phenomenon where earlier learned concepts are forgotten at the expense of more recent samples. In this work, we present a set of continual 3D object shape reconstruction tasks, including complete 3D shape reconstruction from different input modalities, as we… ▽ More

    Submitted 8 September, 2022; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted to 3DV 2022

  8. arXiv:2006.07752  [pdf, other

    cs.CV

    3D Reconstruction of Novel Object Shapes from Single Images

    Authors: Anh Thai, Stefan Stojanov, Vijay Upadhya, James M. Rehg

    Abstract: Accurately predicting the 3D shape of any arbitrary object in any pose from a single image is a key goal of computer vision research. This is challenging as it requires a model to learn a representation that can infer both the visible and occluded portions of any object using a limited training set. A training set that covers all possible object shapes is inherently infeasible. Such learning-based… ▽ More

    Submitted 1 September, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: First two authors contributed equally

  9. arXiv:1909.04518  [pdf

    eess.IV cs.CV q-bio.QM

    Virtual organelle self-coding for fluorescence imaging via adversarial learning

    Authors: Thanh Nguyen, Vy Bui, Anh Thai, Van Lam, Christopher B. Raub, Lin-Ching Chang, George Nehmetallah

    Abstract: Fluorescence microscopy plays a vital role in understanding the subcellular structures of living cells. However, it requires considerable effort in sample preparation related to chemical fixation, staining, cost, and time. To reduce those factors, we present a virtual fluorescence staining method based on deep neural networks (VirFluoNet) to transform fluorescence images of molecular labels into o… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: 20 pages, 9 figures

    MSC Class: 92B20