Showing 1–2 of 2 results for author: Shingyouchi, K

  1. LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation

    Authors: Shoya Matsumori, Yuki Abe, Kosuke Shingyouchi, Komei Sugiura, Michita Imai

    Abstract: Text-guided image manipulation tasks have recently gained attention in the vision-and-language community. While most of the prior studies focused on single-turn manipulation, our goal in this paper is to address the more challenging multi-turn image manipulation (MTIM) task. Previous models for this task successfully generate images iteratively, given a sequence of instructions and a previously ge…

    Submitted 2 June, 2022; v1 submitted 27 December, 2021; originally announced December 2021.

    Journal ref: IEEE Access, 9, 160521-160532 (2021)

  2. arXiv:2106.15550  [pdf, other]

    cs.CV

    Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue

    Authors: Shoya Matsumori, Kosuke Shingyouchi, Yuki Abe, Yosuke Fukuchi, Komei Sugiura, Michita Imai

    Abstract: Building an interactive artificial intelligence that can ask questions about the real world is one of the biggest challenges for vision and language problems. In particular, goal-oriented visual dialogue, where the aim of the agent is to seek information by asking questions during a turn-taking dialogue, has been gaining scholarly attention recently. While several existing models based on the Gues…

    Submitted 29 June, 2021; originally announced June 2021.