Showing 1–11 of 11 results for author: Nasrollahi, K

  1. arXiv:2407.06000  [pdf, other]

    cs.CV

    Bounding Boxes and Probabilistic Graphical Models: Video Anomaly Detection Simplified

    Authors: Mia Siemon, Thomas B. Moeslund, Barry Norton, Kamal Nasrollahi

    Abstract: In this study, we formulate the task of Video Anomaly Detection as a probabilistic analysis of object bounding boxes. We hypothesize that the representation of objects via their bounding boxes alone can be sufficient to successfully identify anomalous events in a scene. The implied value of this approach is increased object anonymization, faster model training and fewer computational resources. Th…

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2404.08088  [pdf, other]

    cs.CV

    Visual Context-Aware Person Fall Detection

    Authors: Aleksander Nagaj, Zenjie Li, Dim P. Papadopoulos, Kamal Nasrollahi

    Abstract: As the global population ages, the number of fall-related incidents is on the rise. Effective fall detection systems, specifically in the healthcare sector, are crucial to mitigate the risks associated with such events. This study evaluates the role of visual context, including background objects, on the accuracy of fall detection classifiers. We present a segmentation pipeline to semi-automatically s…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures, KES IDT-24 conference

  3. arXiv:2308.16572  [pdf, other]

    cs.CV cs.AI cs.LG

    CL-MAE: Curriculum-Learned Masked Autoencoders

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu

    Abstract: Masked image modeling has been demonstrated as a powerful pretext task for generating robust representations that can be effectively generalized across multiple downstream tasks. Typically, this approach involves randomly masking patches (tokens) in input images, with the masking strategy remaining unchanged during training. In this paper, we propose a curriculum learning approach that updates the…

    Submitted 28 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted at WACV 2024

  4. Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection

    Authors: Neelu Madan, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection has recently gained increasing attention in the field of computer vision, likely due to its broad set of applications ranging from product fault detection on industrial production lines and impending event detection in video surveillance to finding lesions in medical scans. Regardless of the domain, anomaly detection is typically framed as a one-class classification task, where t…

    Submitted 5 October, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  5. arXiv:2207.08003  [pdf, other]

    cs.CV cs.LG

    SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection

    Authors: Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: A self-supervised multi-task learning (SSMTL) framework for video anomaly detection was recently introduced in the literature. Due to its highly accurate results, the method attracted the attention of many researchers. In this work, we revisit the self-supervised multi-task learning framework, proposing several updates to the original method. First, we study various detection methods, e.g. based on de…

    Submitted 12 February, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: Accepted in Computer Vision and Image Understanding

  6. Video Transformers: A Survey

    Authors: Javier Selva, Anders S. Johansen, Sergio Escalera, Kamal Nasrollahi, Thomas B. Moeslund, Albert Clapés

    Abstract: Transformer models have shown great success handling long-range interactions, making them a promising tool for modeling video. However, they lack inductive biases and scale quadratically with input length. These limitations are further exacerbated when dealing with the high dimensionality introduced by the temporal dimension. While there are surveys analyzing the advances of Transformers for visio…

    Submitted 13 February, 2023; v1 submitted 16 January, 2022; originally announced January 2022.

  7. arXiv:2111.09099  [pdf, other]

    cs.CV cs.LG

    Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection

    Authors: Nicolae-Catalin Ristea, Neelu Madan, Radu Tudor Ionescu, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund, Mubarak Shah

    Abstract: Anomaly detection is commonly pursued as a one-class classification problem, where models can only learn from normal training samples, while being evaluated on both normal and abnormal test samples. Among the successful approaches for anomaly detection, a distinguished category of methods relies on predicting masked information (e.g. patches, future frames, etc.) and leveraging the reconstruction…

    Submitted 14 March, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: Accepted at CVPR 2022. Paper + supplementary (14 pages, 9 figures)

  8. arXiv:2102.03113  [pdf, other]

    cs.CV

    Real-World Super-Resolution of Face-Images from Surveillance Cameras

    Authors: Andreas Aakerberg, Kamal Nasrollahi, Thomas B. Moeslund

    Abstract: Most existing face image Super-Resolution (SR) methods assume that the Low-Resolution (LR) images were artificially downsampled from High-Resolution (HR) images with bicubic interpolation. This operation changes the natural image characteristics and reduces noise. Hence, SR methods trained on such data most often fail to produce good results when applied to real LR images. To solve this problem, w…

    Submitted 5 February, 2021; originally announced February 2021.

  9. arXiv:2011.13367  [pdf, other]

    cs.CV

    SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos

    Authors: Adrien Deliège, Anthony Cioppa, Silvio Giancola, Meisam J. Seikavandi, Jacob V. Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B. Moeslund, Marc Van Droogenbroeck

    Abstract: Understanding broadcast videos is a challenging task in computer vision, as it requires generic reasoning capabilities to appreciate the content offered by the video editing. In this work, we propose SoccerNet-v2, a novel large-scale corpus of manual annotations for the SoccerNet video dataset, along with open challenges to encourage more research in soccer understanding and broadcast production.…

    Submitted 19 April, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: Paper accepted for the CVsports workshop at CVPR2021. This document contains 8 pages + references + supplementary material

  10. arXiv:2004.01382  [pdf, other]

    cs.CV cs.LG eess.IV

    Effective Fusion of Deep Multitasking Representations for Robust Visual Tracking

    Authors: Seyed Mojtaba Marvasti-Zadeh, Hossein Ghanei-Yakhdan, Shohreh Kasaei, Kamal Nasrollahi, Thomas B. Moeslund

    Abstract: Visual object tracking remains an active research field in computer vision due to persisting challenges with various problem-specific factors in real-world scenes. Many existing tracking methods based on discriminative correlation filters (DCFs) employ feature extraction networks (FENs) to model the target appearance during the learning process. However, using deep feature maps extracted from FENs…

    Submitted 20 September, 2021; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: To appear in The Visual Computer (International Journal of Computer Graphics), Springer, 2021

  11. arXiv:1805.10078  [pdf]

    cs.CV

    A Double-Deep Spatio-Angular Learning Framework for Light Field based Face Recognition

    Authors: Alireza Sepas-Moghaddam, Mohammad A. Haque, Paulo Lobato Correia, Kamal Nasrollahi, Thomas B. Moeslund, Fernando Pereira

    Abstract: Face recognition has attracted increasing attention due to its wide range of applications, but it is still challenging when facing large variations in the biometric data characteristics. Lenslet light field cameras have recently come into prominence to capture rich spatio-angular information, thus offering new possibilities for advanced biometric recognition systems. This paper proposes a double-d…

    Submitted 24 April, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: Submitted to IEEE Transactions on Circuits and Systems for Video Technology