Showing 1–2 of 2 results for author: Azaman, H

  1. arXiv:2309.06680  [pdf, other]

    cs.CV

    STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning

    Authors: Palaash Agrawal, Haidi Azaman, Cheston Tan

    Abstract: Understanding relations between objects is crucial for understanding the semantics of a visual scene. It is also an essential step in order to bridge visual and language models. However, current state-of-the-art computer vision models still lack the ability to perform spatial reasoning well. Existing datasets mostly cover a relatively small number of spatial relations, all of which are static rela…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Submitted to NeurIPS Dataset track. 24 pages including citations and appendix

  2. arXiv:2309.04504  [pdf, other]

    cs.LG cs.AI

    Compositional Learning of Visually-Grounded Concepts Using Reinforcement

    Authors: Zijun Lin, Haidi Azaman, M Ganesh Kumar, Cheston Tan

    Abstract: Children can rapidly generalize compositionally-constructed rules to unseen test sets. On the other hand, deep reinforcement learning (RL) agents need to be trained over millions of episodes, and their ability to generalize to unseen combinations remains unclear. Hence, we investigate the compositional abilities of RL agents, using the task of navigating to specified color-shape targets in synthet…

    Submitted 3 May, 2024; v1 submitted 8 September, 2023; originally announced September 2023.