Skip to main content

Showing 1–6 of 6 results for author: Ehsanpour, M

.
  1. arXiv:2404.05578  [pdf, other

    cs.CV

    Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning

    Authors: Mahsa Ehsanpour, Ian Reid, Hamid Rezatofighi

    Abstract: For a complete comprehension of multi-person scenes, it is essential to go beyond basic tasks like detection and tracking. Higher-level tasks, such as understanding the interactions and social activities among individuals, are also crucial. Progress towards models that can fully understand scenes involving multiple people is hindered by a lack of sufficient annotated data for such high-level tasks… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  2. arXiv:2106.08827  [pdf, other

    cs.CV

    JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection

    Authors: Mahsa Ehsanpour, Fatemeh Saleh, Silvio Savarese, Ian Reid, Hamid Rezatofighi

    Abstract: The availability of large-scale video action understanding datasets has facilitated advances in the interpretation of visual scenes containing people. However, learning to recognise human actions and their social interactions in an unconstrained real-world environment comprising numerous people, with potentially highly unbalanced and long-tailed distributed action labels from a stream of sensory d… ▽ More

    Submitted 23 November, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

  3. TRiPOD: Human Trajectory and Pose Dynamics Forecasting in the Wild

    Authors: Vida Adeli, Mahsa Ehsanpour, Ian Reid, Juan Carlos Niebles, Silvio Savarese, Ehsan Adeli, Hamid Rezatofighi

    Abstract: Joint forecasting of human trajectory and pose dynamics is a fundamental building block of various applications ranging from robotics and autonomous driving to surveillance systems. Predicting body dynamics requires capturing subtle information embedded in the humans' interactions with each other and with the objects present in the scene. In this paper, we propose a novel TRajectory and POse Dynam… ▽ More

    Submitted 27 August, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Journal ref: IEEE/CVF International Conference on Computer Vision, pp. 13390-13400. 2021

  4. arXiv:2103.14829  [pdf, other

    cs.CV

    Looking Beyond Two Frames: End-to-End Multi-Object Tracking Using Spatial and Temporal Transformers

    Authors: Tianyu Zhu, Markus Hiller, Mahsa Ehsanpour, Rongkai Ma, Tom Drummond, Ian Reid, Hamid Rezatofighi

    Abstract: Tracking a time-varying indefinite number of objects in a video sequence over time remains a challenge despite recent advances in the field. Most existing approaches are not able to properly handle multi-object tracking challenges such as occlusion, in part because they ignore long-term temporal information. To address these shortcomings, we present MO3TR: a truly end-to-end Transformer-based onli… ▽ More

    Submitted 7 October, 2022; v1 submitted 27 March, 2021; originally announced March 2021.

    Comments: This paper has been accepted as a Regular Paper in an upcoming issue of the Transactions on Pattern Analysis and Machine Intelligence (Tpami)

  5. arXiv:2007.07172  [pdf, other

    cs.LG cs.HC stat.ML

    Attend And Discriminate: Beyond the State-of-the-Art for Human Activity Recognition using Wearable Sensors

    Authors: Alireza Abedin, Mahsa Ehsanpour, Qinfeng Shi, Hamid Rezatofighi, Damith C. Ranasinghe

    Abstract: Wearables are fundamental to improving our understanding of human activities, especially for an increasing number of healthcare applications from rehabilitation to fine-grained gait analysis. Although our collective know-how to solve Human Activity Recognition (HAR) problems with wearables has progressed immensely with end-to-end deep learning paradigms, several fundamental opportunities remain ov… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 15 pages, 7 figures

  6. arXiv:2007.02632  [pdf, other

    cs.CV

    Joint Learning of Social Groups, Individuals Action and Sub-group Activities in Videos

    Authors: Mahsa Ehsanpour, Alireza Abedin, Fatemeh Saleh, Javen Shi, Ian Reid, Hamid Rezatofighi

    Abstract: The state-of-the art solutions for human activity understanding from a video stream formulate the task as a spatio-temporal problem which requires joint localization of all individuals in the scene and classification of their actions or group activity over time. Who is interacting with whom, e.g. not everyone in a queue is interacting with each other, is often not predicted. There are scenarios wh… ▽ More

    Submitted 27 July, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: Accepted in the European Conference On Computer Vision (ECCV) 2020