Skip to main content

Showing 1–2 of 2 results for author: Arzani, M M

.
  1. arXiv:1806.07754  [pdf, other

    cs.CV

    Spatio-Temporal Channel Correlation Networks for Action Classification

    Authors: Ali Diba, Mohsen Fayyaz, Vivek Sharma, M. Mahdi Arzani, Rahman Yousefzadeh, Juergen Gall, Luc Van Gool

    Abstract: The work in this paper is driven by the question if spatio-temporal correlations are enough for 3D convolutional neural networks (CNN)? Most of the traditional 3D networks use local spatio-temporal features. We introduce a new block that models correlations between channels of a 3D CNN with respect to temporal and spatial features. This new block can be added as a residual unit to different parts… ▽ More

    Submitted 7 February, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: Accepted in ECCV 2018. arXiv admin note: substantial text overlap with arXiv:1711.08200

  2. arXiv:1711.08200  [pdf, other

    cs.CV

    Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification

    Authors: Ali Diba, Mohsen Fayyaz, Vivek Sharma, Amir Hossein Karami, Mohammad Mahdi Arzani, Rahman Yousefzadeh, Luc Van Gool

    Abstract: The work in this paper is driven by the question how to exploit the temporal cues available in videos for their accurate classification, and for human action recognition in particular? Thus far, the vision community has focused on spatio-temporal approaches with fixed temporal convolution kernel depths. We introduce a new temporal layer that models variable temporal convolution kernel depths. We e… ▽ More

    Submitted 22 November, 2017; originally announced November 2017.