Showing 1–2 of 2 results for author: Umagami, R

  1. arXiv:2401.10005  [pdf, other]

    cs.CV cs.CL

    Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

    Authors: Kohei Uehara, Nabarun Goswami, Hanqin Wang, Toshiaki Baba, Kohtaro Tanaka, Tomohiro Hashimoto, Kai Wang, Rei Ito, Takagi Naoya, Ryo Umagami, Yingyi Wen, Tanachai Anakewat, Tatsuya Harada

    Abstract: The increasing demand for intelligent systems capable of interpreting and reasoning about visual content requires the development of Large Multi-Modal Models (LMMs) that are not only accurate but also have explicit reasoning capabilities. This paper presents a novel approach to imbue an LMM with the ability to conduct explicit reasoning based on visual content and textual instructions. We introduc…

    Submitted 18 January, 2024; originally announced January 2024.

  2. arXiv:2305.08073  [pdf, other]

    cs.LG

    HiPerformer: Hierarchically Permutation-Equivariant Transformer for Time Series Forecasting

    Authors: Ryo Umagami, Yu Ono, Yusuke Mukuta, Tatsuya Harada

    Abstract: It is imperative to discern the relationships between multiple time series for accurate forecasting. In particular, for stock prices, components are often divided into groups with the same characteristics, and a model that extracts relationships consistent with this group structure should be effective. Thus, we propose the concept of hierarchical permutation-equivariance, focusing on index swappin…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: 10 pages, 3 figures