Showing 1–3 of 3 results for author: Yi, J S K

  1. arXiv:2208.00173  [pdf, other]

    cs.CV cs.AI cs.LG

    A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond

    Authors: Chaoning Zhang, Chenshuang Zhang, Junha Song, John Seon Keun Yi, Kang Zhang, In So Kweon

    Abstract: Masked autoencoders are scalable vision learners, as the title of MAE \cite{he2022masked}, which suggests that self-supervised learning (SSL) in vision might undertake a similar trajectory as in NLP. Specifically, generative pretext tasks with the masked prediction (e.g., BERT) have become a de facto standard SSL practice in NLP. By contrast, early attempts at generative methods in vision have bee…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: First survey on masked autoencoder (under progress)

  2. arXiv:2201.07459  [pdf, other]

    cs.CV

    PT4AL: Using Self-Supervised Pretext Tasks for Active Learning

    Authors: John Seon Keun Yi, Minseok Seo, Jongchan Park, Dong-Geol Choi

    Abstract: Labeling a large set of data is expensive. Active learning aims to tackle this problem by asking to annotate only the most informative data from the unlabeled set. We propose a novel active learning approach that utilizes self-supervised pretext tasks and a unique data sampler to select data that are both difficult and representative. We discover that the loss of a simple self-supervised pretext t…

    Submitted 26 July, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Code is available at https://github.com/johnsk95/PT4AL. Updated for ECCV 2022 submission.

  3. arXiv:2201.01901  [pdf, other]

    cs.CV cs.CL

    Incremental Object Grounding Using Scene Graphs

    Authors: John Seon Keun Yi, Yoonwoo Kim, Sonia Chernova

    Abstract: Object grounding tasks aim to locate the target object in an image through verbal communications. Understanding human command is an important process needed for effective human-robot communication. However, this is challenging because human commands can be ambiguous and erroneous. This paper aims to disambiguate the human's referring expressions by allowing the agent to ask relevant questions base…

    Submitted 13 November, 2022; v1 submitted 5 January, 2022; originally announced January 2022.