Skip to main content

Showing 1–2 of 2 results for author: Eghbalzadeh, H

.
  1. arXiv:2307.05784  [pdf, other

    cs.CV cs.AI

    EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video

    Authors: Matthias De Lange, Hamid Eghbalzadeh, Reuben Tan, Michael Iuzzolino, Franziska Meier, Karl Ridgeway

    Abstract: In egocentric action recognition a single population model is typically trained and subsequently embodied on a head-mounted device, such as an augmented reality headset. While this model remains static for new users and environments, we introduce an adaptive paradigm of two phases, where after pretraining a population model, the model adapts on-device and online to the user's experience. This sett… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Preprint

  2. arXiv:2304.09179  [pdf, other

    cs.CV cs.AI

    Pretrained Language Models as Visual Planners for Human Assistance

    Authors: Dhruvesh Patel, Hamid Eghbalzadeh, Nitin Kamra, Michael Louis Iuzzolino, Unnat Jain, Ruta Desai

    Abstract: In our pursuit of advancing multi-modal AI assistants capable of guiding users to achieve complex multi-step goals, we propose the task of "Visual Planning for Assistance (VPA)". Given a succinct natural language goal, e.g., "make a shelf", and a video of the user's progress so far, the aim of VPA is to devise a plan, i.e., a sequence of actions such as "sand shelf", "paint shelf", etc. to realize… ▽ More

    Submitted 26 August, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted at ICCV 2023