Showing 1–7 of 7 results for author: Souri, Y

  1. arXiv:2210.06501  [pdf, other]

    cs.CV

    Robust Action Segmentation from Timestamp Supervision

    Authors: Yaser Souri, Yazan Abu Farha, Emad Bahrami, Gianpiero Francesca, Juergen Gall

    Abstract: Action segmentation is the task of predicting an action label for each frame of an untrimmed video. As obtaining annotations to train an approach for action segmentation in a fully supervised way is expensive, various approaches have been proposed to train action segmentation models using different forms of weak supervision, e.g., action transcripts, action sets, or more recently timestamps. Times…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: BMVC 2022

  2. arXiv:2108.03894  [pdf, other]

    cs.CV cs.LG

    FIFA: Fast Inference Approximation for Action Segmentation

    Authors: Yaser Souri, Yazan Abu Farha, Fabien Despinoy, Gianpiero Francesca, Juergen Gall

    Abstract: We introduce FIFA, a fast approximate inference method for action segmentation and alignment. Unlike previous approaches, FIFA does not rely on expensive dynamic programming for inference. Instead, it uses an approximate differentiable energy function that can be minimized using gradient descent. FIFA is a general approach that can replace exact inference, improving its speed by more than 5 times w…

    Submitted 9 August, 2021; originally announced August 2021.

  3. arXiv:2101.08581  [pdf, other]

    cs.CV

    Hierarchical Graph-RNNs for Action Detection of Multiple Activities

    Authors: Sovan Biswas, Yaser Souri, Juergen Gall

    Abstract: In this paper, we propose an approach that spatially localizes the activities in a video frame where each person can perform multiple activities at the same time. Our approach takes the temporal scene context as well as the relations of the actions of detected persons into account. While the temporal context is modeled by a temporal recurrent neural network (RNN), the relations of the actions are…

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted at ICIP 2019

  4. arXiv:2005.09743  [pdf, ps, other]

    cs.CV

    On Evaluating Weakly Supervised Action Segmentation Methods

    Authors: Yaser Souri, Alexander Richard, Luca Minciullo, Juergen Gall

    Abstract: Action segmentation is the task of temporally segmenting every frame of an untrimmed video. Weakly supervised approaches to action segmentation, especially from transcripts, have been of considerable interest to the computer vision community. In this work, we focus on two aspects of the use and evaluation of weakly supervised action segmentation approaches that are often overlooked: the performance…

    Submitted 21 October, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: Technical Report

  5. arXiv:1904.03116  [pdf, other]

    cs.CV cs.LG

    Fast Weakly Supervised Action Segmentation Using Mutual Consistency

    Authors: Yaser Souri, Mohsen Fayyaz, Luca Minciullo, Gianpiero Francesca, Juergen Gall

    Abstract: Action segmentation is the task of predicting the actions for each frame of a video. As obtaining the full annotation of videos for action segmentation is expensive, weakly supervised approaches that can learn only from transcripts are appealing. In this paper, we propose a novel end-to-end approach for weakly supervised action segmentation based on a two-branch neural network. The two branches of…

    Submitted 10 June, 2021; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted for publication at TPAMI (IEEE Transactions on Pattern Analysis and Machine Intelligence) in 2021. First two authors contributed equally

  6. arXiv:1904.03000  [pdf, other]

    cs.CV cs.LG cs.RO

    What Object Should I Use? - Task Driven Object Detection

    Authors: Johann Sawatzky, Yaser Souri, Christian Grund, Juergen Gall

    Abstract: When humans have to solve everyday tasks, they simply pick the objects that are most suitable. While the question of which object one should use for a specific task sounds trivial for humans, it is very difficult to answer for robots or other autonomous systems. This issue, however, is not addressed by current benchmarks for object detection that focus on detecting object categories. We therefore int…

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: CVPR 2019. The first two authors contributed equally, ordered alphabetically

  7. arXiv:1512.04103  [pdf, other]

    cs.CV

    Deep Relative Attributes

    Authors: Yaser Souri, Erfan Noury, Ehsan Adeli

    Abstract: Visual attributes are a great means of describing images or scenes, in a way both humans and computers understand. In order to establish a correspondence between images and to be able to compare the strength of each property between images, relative attributes were introduced. However, since their introduction, hand-crafted and engineered features were used to learn increasingly complex models for t…

    Submitted 13 September, 2016; v1 submitted 13 December, 2015; originally announced December 2015.

    Comments: ACCV 2016