Skip to main content

Showing 1–50 of 105 results for author: Bennamoun, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19675  [pdf

    cs.CV

    Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey

    Authors: Uchitha Rajapaksha, Ferdous Sohel, Hamid Laga, Dean Diepeveen, Mohammed Bennamoun

    Abstract: Estimating depth from single RGB images and videos is of widespread interest due to its applications in many areas, including autonomous driving, 3D reconstruction, digital entertainment, and robotics. More than 500 deep learning-based papers have been published in the past 10 years, which indicates the growing interest in the task. This paper presents a comprehensive survey of the existing deep l… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 46 pages, 10 figures, The paper has been accepted for publication in ACM Computing Surveys 2024

    ACM Class: I.2.10; I.4; I.5.1; I.4.8

  2. arXiv:2406.06075  [pdf, other

    cs.NE astro-ph.IM

    Supervised Radio Frequency Interference Detection with SNNs

    Authors: Nicholas J. Pritchard, Andreas Wicenec, Mohammed Bennamoun, Richard Dodson

    Abstract: Radio Frequency Interference (RFI) poses a significant challenge in radio astronomy, arising from terrestrial and celestial sources, disrupting observations conducted by radio telescopes. Addressing RFI involves intricate heuristic algorithms, manual examination, and, increasingly, machine learning methods. Given the dynamic and temporal nature of radio astronomy observations, Spiking Neural Netwo… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 7 pages, 2 figures, 4 tables. International Conference on Neuromorphic Systems (ICONS) 2024, Accepted

  3. arXiv:2406.05205  [pdf, other

    cs.CV cs.CL cs.LG cs.MM eess.IV

    CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

    Authors: Sajid Javed, Arif Mahmood, Iyyakutti Iyappan Ganapathi, Fayaz Ali Dharejo, Naoufel Werghi, Mohammed Bennamoun

    Abstract: This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation. This methodology enriches vision-language models by leveraging extensive data without needing ground truth annotations. CPLIP involves constructing a pathology-specific… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2404.01591  [pdf, other

    cs.CV

    Language Model Guided Interpretable Video Action Reasoning

    Authors: Ning Wang, Guangming Zhu, HS Li, Liang Zhang, Syed Afaq Ali Shah, Mohammed Bennamoun

    Abstract: While neural networks have excelled in video action recognition tasks, their black-box nature often obscures the understanding of their decision-making processes. Recent approaches used inherently interpretable models to analyze video actions in a manner akin to human reasoning. These models, however, usually fall short in performance compared to their black-box counterparts. In this work, we pres… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  5. arXiv:2403.19407  [pdf, other

    cs.CV

    Towards Temporally Consistent Referring Video Object Segmentation

    Authors: Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Mubarak Shah, Ajmal Mian

    Abstract: Referring Video Object Segmentation (R-VOS) methods face challenges in maintaining consistent object segmentation due to temporal context variability and the presence of other visually similar objects. We propose an end-to-end R-VOS paradigm that explicitly models temporal instance consistency alongside the referring segmentation. Specifically, we introduce a novel hybrid memory that facilitates i… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  6. arXiv:2403.01156  [pdf, other

    cs.CV

    Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation

    Authors: Lian Xu, Mohammed Bennamoun, Farid Boussaid, Wanli Ouyang, Ferdous Sohel, Dan Xu

    Abstract: Most existing weakly supervised semantic segmentation (WSSS) methods rely on Class Activation Map** (CAM) to extract coarse class-specific localization maps using image-level labels. Prior works have commonly used an off-line heuristic thresholding process that combines the CAM maps with off-the-shelf saliency maps produced by a general pre-trained saliency model to produce more accurate pseudo-… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: Accepted at IEEE Transactions on Neural Networks and Learning Systems. arXiv admin note: substantial text overlap with arXiv:2107.11787

  7. arXiv:2402.17910  [pdf, other

    cs.CV

    Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models

    Authors: Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Hamid Laga, Farid Boussaid

    Abstract: While latent diffusion models (LDMs) excel at creating imaginative images, they often lack precision in semantic fidelity and spatial control over where objects are generated. To address these deficiencies, we introduce the Box-it-to-Bind-it (B2B) module - a novel, training-free approach for improving spatial control and semantic accuracy in text-to-image (T2I) diffusion models. B2B targets three… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  8. arXiv:2402.11141  [pdf, other

    cs.CV

    Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review

    Authors: Thang-Anh-Quan Nguyen, Amine Bourki, Mátyás Macudzinski, Anthony Brunel, Mohammed Bennamoun

    Abstract: This review thoroughly examines the role of semantically-aware Neural Radiance Fields (NeRFs) in visual scene understanding, covering an analysis of over 250 scholarly papers. It explores how NeRFs adeptly infer 3D representations for both stationary and dynamic objects in a scene. This capability is pivotal for generating high-quality new viewpoints, completing missing scene details (inpainting),… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  9. arXiv:2401.03742  [pdf, other

    cs.CV

    Flowmind2Digital: The First Comprehensive Flowmind Recognition and Conversion Approach

    Authors: Huanyu Liu, Jianfeng Cai, Tingjia Zhang, Hongsheng Li, Siyuan Wang, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun, Liang Zhang

    Abstract: Flowcharts and mind maps, collectively known as flowmind, are vital in daily activities, with hand-drawn versions facilitating real-time collaboration. However, there's a growing need to digitize them for efficient processing. Automated conversion methods are essential to overcome manual conversion challenges. Existing sketch recognition methods face limitations in practical situations, being fiel… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  10. arXiv:2311.14747  [pdf, other

    cs.CV

    HOMOE: A Memory-Based and Composition-Aware Framework for Zero-Shot Learning with Hopfield Network and Soft Mixture of Experts

    Authors: Do Huu Dat, Po Yuan Mao, Tien Hoang Nguyen, Wray Buntine, Mohammed Bennamoun

    Abstract: Compositional Zero-Shot Learning (CZSL) has emerged as an essential paradigm in machine learning, aiming to overcome the constraints of traditional zero-shot learning by incorporating compositional thinking into its methodology. Conventional zero-shot learning has difficulty managing unfamiliar combinations of seen and unseen classes because it depends on pre-defined class embeddings. In contrast,… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  11. arXiv:2311.14303  [pdf, other

    astro-ph.IM cs.NE

    RFI Detection with Spiking Neural Networks

    Authors: Nicholas J. Pritchard, Andreas Wicenec, Mohammed Bennamoun, Richard Dodson

    Abstract: Detecting and mitigating Radio Frequency Interference (RFI) is critical for enabling and maximising the scientific output of radio telescopes. The emergence of machine learning methods has led to their application in radio astronomy, and in RFI detection. Spiking Neural Networks (SNNs), inspired by biological systems, are well-suited for processing spatio-temporal data. This study introduces the f… ▽ More

    Submitted 22 March, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, 5 tables. Accepted for publication in PASA

  12. arXiv:2310.13263  [pdf, other

    cs.CV

    UE4-NeRF:Neural Radiance Field for Real-Time Rendering of Large-Scale Scene

    Authors: Jiaming Gu, Minchao Jiang, Hongsheng Li, Xiaoyuan Lu, Guangming Zhu, Syed Afaq Ali Shah, Liang Zhang, Mohammed Bennamoun

    Abstract: Neural Radiance Fields (NeRF) is a novel implicit 3D reconstruction method that shows immense potential and has been gaining increasing attention. It enables the reconstruction of 3D scenes solely from a set of photographs. However, its real-time rendering capability, especially for interactive real-time rendering of large-scale scenes, still has significant limitations. To address these challenge… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS2023

  13. arXiv:2309.04902  [pdf, other

    cs.CV

    Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art

    Authors: Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed bennamoun

    Abstract: Transformers have rapidly gained popularity in computer vision, especially in the field of object recognition and detection. Upon examining the outcomes of state-of-the-art object detection methods, we noticed that transformers consistently outperformed well-established CNN-based detectors in almost every video or image dataset. While transformer-based approaches remain at the forefront of small o… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

  14. arXiv:2309.00330  [pdf, other

    cs.LG cs.NE

    Multitask Deep Learning for Accurate Risk Stratification and Prediction of Next Steps for Coronary CT Angiography Patients

    Authors: Juan Lu, Mohammed Bennamoun, Jonathon Stewart, JasonK. Eshraghian, Yanbin Liu, Benjamin Chow, Frank M. Sanfilippo, Girish Dwivedi

    Abstract: Diagnostic investigation has an important role in risk stratification and clinical decision making of patients with suspected and documented Coronary Artery Disease (CAD). However, the majority of existing tools are primarily focused on the selection of gatekeeper tests, whereas only a handful of systems contain information regarding the downstream testing or treatment. We propose a multi-task dee… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  15. arXiv:2308.03005  [pdf, other

    cs.CV

    MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation

    Authors: Lian Xu, Mohammed Bennamoun, Farid Boussaid, Hamid Laga, Wanli Ouyang, Dan Xu

    Abstract: This paper proposes a novel transformer-based framework that aims to enhance weakly supervised semantic segmentation (WSSS) by generating accurate class-specific object localization maps as pseudo labels. Building upon the observation that the attended regions of the one-class token in the standard vision transformer can contribute to a class-agnostic localization map, we explore the potential of… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Journal extension for MCTformer

  16. arXiv:2307.13537  [pdf, other

    cs.CV cs.AI cs.MM

    Spectrum-guided Multi-granularity Referring Video Object Segmentation

    Authors: Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

    Abstract: Current referring video object segmentation (R-VOS) techniques extract conditional kernels from encoded (low-resolution) vision-language features to segment the decoded high-resolution features. We discovered that this causes significant feature drift, which the segmentation kernels struggle to perceive during the forward computation. This negatively affects the ability of segmentation kernels. To… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023, code is at https://github.com/bo-miao/SgMg

  17. arXiv:2304.06897  [pdf, other

    cs.NE

    A Bibliometric Review of Neuromorphic Computing and Spiking Neural Networks

    Authors: Nicholas J. Pritchard, Andreas Wicenec, Mohammed Bennamoun, Richard Dodson

    Abstract: Neuromorphic computing and spiking neural networks aim to leverage biological inspiration to achieve greater energy efficiency and computational power beyond traditional von Neumann architectured machines. In particular, spiking neural networks hold the potential to advance artificial intelligence as the basis of third-generation neural networks. Aided by developments in memristive and compute-in-… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 8 pages, 4 figures

    ACM Class: A.1

  18. arXiv:2303.06052  [pdf, other

    cs.LG cs.AI cs.CY q-bio.NC

    Analysis and Evaluation of Explainable Artificial Intelligence on Suicide Risk Assessment

    Authors: Hao Tang, Aref Miri Rekavandi, Dhar**der Rooprai, Girish Dwivedi, Frank Sanfilippo, Farid Boussaid, Mohammed Bennamoun

    Abstract: This study investigates the effectiveness of Explainable Artificial Intelligence (XAI) techniques in predicting suicide risks and identifying the dominant causes for such behaviours. Data augmentation techniques and ML models are utilized to predict the associated risk. Furthermore, SHapley Additive exPlanations (SHAP) and correlation analysis are used to rank the importance of variables in predic… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  19. arXiv:2211.15664  [pdf

    cs.LG cs.CE

    Utilising physics-guided deep learning to overcome data scarcity

    Authors: **shuai Bai, Laith Alzubaidi, Qingxia Wang, Ellen Kuhl, Mohammed Bennamoun, Yuantong Gu

    Abstract: Deep learning (DL) relies heavily on data, and the quality of data influences its performance significantly. However, obtaining high-quality, well-annotated datasets can be challenging or even impossible in many real-world applications, such as structural risk estimation and medical diagnosis. This presents a significant barrier to the practical implementation of DL in these fields. Physics-guided… ▽ More

    Submitted 8 January, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: 26 Pages, 4 figures, 2 tables, 116 references. This work is going to submit to Nature Machine Intelligence

  20. arXiv:2210.05952  [pdf, other

    eess.IV cs.CV

    3D Brain and Heart Volume Generative Models: A Survey

    Authors: Yanbin Liu, Girish Dwivedi, Farid Boussaid, Mohammed Bennamoun

    Abstract: Generative models such as generative adversarial networks and autoencoders have gained a great deal of attention in the medical field due to their excellent data generation capability. This paper provides a comprehensive survey of generative models for three-dimensional (3D) volumes, focusing on the brain and heart. A new and elaborate taxonomy of unconditional and conditional generative models is… ▽ More

    Submitted 5 December, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at ACM Computing Surveys (CSUR) 2023

    MSC Class: 92C55 (Primary); 68U10 (Secondary) ACM Class: I.4; J.3

  21. arXiv:2209.08305  [pdf, other

    cs.CV

    Active-Passive SimStereo -- Benchmarking the Cross-Generalization Capabilities of Deep Learning-based Stereo Methods

    Authors: Laurent Jospin, Allen Antony, Lian Xu, Hamid Laga, Farid Boussaid, Mohammed Bennamoun

    Abstract: In stereo vision, self-similar or bland regions can make it difficult to match patches between two images. Active stereo-based methods mitigate this problem by projecting a pseudo-random pattern on the scene so that each patch of an image pair can be identified without ambiguity. However, the projected pattern significantly alters the appearance of the image. If this pattern acts as a form of adve… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 22 pages, 12 figures, accepted in NeurIPS 2022 Datasets and Benchmarks Track

  22. arXiv:2209.05082  [pdf, other

    cs.CV

    Bayesian Learning for Disparity Map Refinement for Semi-Dense Active Stereo Vision

    Authors: Laurent Valentin Jospin, Hamid Laga, Farid Boussaid, Mohammed Bennamoun

    Abstract: A major focus of recent developments in stereo vision has been on how to obtain accurate dense disparity maps in passive stereo vision. Active vision systems enable more accurate estimations of dense disparity compared to passive stereo. However, subpixel-accurate disparity estimation remains an open problem that has received little attention. In this paper, we propose a new learning strategy to t… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 15 pages, 15 figures

  23. Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images

    Authors: Yanbin Liu, Girish Dwivedi, Farid Boussaid, Frank Sanfilippo, Makoto Yamada, Mohammed Bennamoun

    Abstract: The generation of three-dimensional (3D) medical images has great application potential since it takes into account the 3D anatomical structure. Two problems prevent effective training of a 3D medical generative model: (1) 3D medical images are expensive to acquire and annotate, resulting in an insufficient number of training images, and (2) a large number of parameters are involved in 3D convolut… ▽ More

    Submitted 5 December, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: Published at Computer Methods and Programs in Biomedicine (CMPB) 2023

    ACM Class: I.4; J.3

    Journal ref: Computer Methods and Programs in Biomedicine (2023): 107685

  24. arXiv:2207.13037  [pdf, other

    cs.CV

    Learning Resolution-Adaptive Representations for Cross-Resolution Person Re-Identification

    Authors: Lin Wu, Lingqiao Liu, Yang Wang, Zheng Zhang, Farid Boussaid, Mohammed Bennamoun

    Abstract: The cross-resolution person re-identification (CRReID) problem aims to match low-resolution (LR) query identity images against high resolution (HR) gallery images. It is a challenging and practical problem since the query images often suffer from resolution degradation due to the different capturing conditions from real-world cameras. To address this problem, state-of-the-art (SOTA) solutions eith… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: Under review

  25. arXiv:2207.13036  [pdf, other

    cs.LG cs.AI cs.CR

    Jacobian Norm with Selective Input Gradient Regularization for Improved and Interpretable Adversarial Defense

    Authors: Deyin Liu, Lin Wu, Haifeng Zhao, Farid Boussaid, Mohammed Bennamoun, Xianghua Xie

    Abstract: Deep neural networks (DNNs) are known to be vulnerable to adversarial examples that are crafted with imperceptible perturbations, i.e., a small change in an input image can induce a mis-classification, and thus threatens the reliability of deep learning based deployment systems. Adversarial training (AT) is often adopted to improve robustness through training a mixture of corrupted and clean data.… ▽ More

    Submitted 14 November, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: Under review

  26. arXiv:2207.13035  [pdf, other

    cs.CV

    Pseudo-Pair based Self-Similarity Learning for Unsupervised Person Re-identification

    Authors: Lin Wu, Deyin Liu, Wenying Zhang, Dapeng Chen, Zongyuan Ge, Farid Boussaid, Mohammed Bennamoun, Jialie Shen

    Abstract: Person re-identification (re-ID) is of great importance to video surveillance systems by estimating the similarity between a pair of cross-camera person shorts. Current methods for estimating such similarity require a large number of labeled samples for supervised training. In this paper, we present a pseudo-pair based self-similarity learning approach for unsupervised person re-ID without human a… ▽ More

    Submitted 9 July, 2022; originally announced July 2022.

    Comments: Under review

    Journal ref: IEEE Transactions on Image Processing 2022

  27. arXiv:2207.12926  [pdf, other

    cs.CV cs.LG

    A Guide to Image and Video based Small Object Detection using Deep Learning : Case Study of Maritime Surveillance

    Authors: Aref Miri Rekavandi, Lian Xu, Farid Boussaid, Abd-Krim Seghouane, Stephen Hoefs, Mohammed Bennamoun

    Abstract: Small object detection (SOD) in optical images and videos is a challenging problem that even state-of-the-art generic object detection methods fail to accurately localize and identify such objects. Typically, small objects appear in real-world due to large camera-object distance. Because small objects occupy only a small area in the input image (e.g., less than 10%), the information extracted from… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  28. arXiv:2207.11635  [pdf, other

    cs.CV

    Spatial-temporal Analysis for Automated Concrete Workability Estimation

    Authors: Litao Yu, Jian Zhang, Mohammed Bennamoun, Xiaojun Chang, Vute Sirivivatnanon, Ali Nezhad

    Abstract: Concrete workability measure is mostly determined based on subjective assessment of a certified assessor with visual inspections. The potential human error in measuring the workability and the resulting unnecessary adjustments for the workability is a major challenge faced by the construction industry, leading to significant costs, material waste and delay. In this paper, we try to apply computer… ▽ More

    Submitted 24 September, 2022; v1 submitted 23 July, 2022; originally announced July 2022.

    Comments: We have some significant changes in the experiment

  29. arXiv:2207.10258  [pdf, other

    cs.CV

    Region Aware Video Object Segmentation with Deep Motion Modeling

    Authors: Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

    Abstract: Current semi-supervised video object segmentation (VOS) methods usually leverage the entire features of one frame to predict object masks and update memory. This introduces significant redundant computations. To reduce redundancy, we present a Region Aware Video Object Segmentation (RAVOS) approach that predicts regions of interest (ROIs) for efficient object segmentation and memory storage. RAVOS… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

  30. arXiv:2206.06506  [pdf, other

    cs.CV

    Spiking Neural Networks for Frame-based and Event-based Single Object Localization

    Authors: Sami Barchid, José Mennesson, Jason Eshraghian, Chaabane Djéraba, Mohammed Bennamoun

    Abstract: Spiking neural networks have shown much promise as an energy-efficient alternative to artificial neural networks. However, understanding the impacts of sensor noises and input encodings on the network activity and performance remains difficult with common neuromorphic vision baselines like classification. Therefore, we propose a spiking neural network approach for single object localization traine… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 21 pages, 12 figures

  31. arXiv:2203.13387  [pdf, other

    cs.CV

    CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation

    Authors: Mohammed Hassanin, Abdelwahed Khamiss, Mohammed Bennamoun, Farid Boussaid, Ibrahim Radwan

    Abstract: 3D human pose estimation can be handled by encoding the geometric dependencies between the body parts and enforcing the kinematic constraints. Recently, Transformer has been adopted to encode the long-range dependencies between the joints in the spatial and temporal domains. While they had shown excellence in long-range dependencies, studies have noted the need for improving the locality of vision… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  32. arXiv:2203.02891  [pdf, other

    cs.CV

    Multi-class Token Transformer for Weakly Supervised Semantic Segmentation

    Authors: Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Dan Xu

    Abstract: This paper proposes a new transformer-based framework to learn class-specific object localization maps as pseudo labels for weakly supervised semantic segmentation (WSSS). Inspired by the fact that the attended regions of the one-class token in the standard vision transformer can be leveraged to form a class-agnostic localization map, we investigate if the transformer model can also effectively ca… ▽ More

    Submitted 6 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  33. arXiv:2202.02543  [pdf, other

    cs.CV cs.AI math.OC

    Unsupervised Learning on 3D Point Clouds by Clustering and Contrasting

    Authors: Guofeng Mei, Litao Yu, Qiang Wu, Jian Zhang, Mohammed Bennamoun

    Abstract: Learning from unlabeled or partially labeled data to alleviate human labeling remains a challenging research topic in 3D modeling. Along this line, unsupervised representation learning is a promising direction to auto-extract features without human intervention. This paper proposes a general unsupervised approach, named \textbf{ConClu}, to perform the learning of point-wise and global features by… ▽ More

    Submitted 14 February, 2022; v1 submitted 5 February, 2022; originally announced February 2022.

  34. arXiv:2202.01975  [pdf

    q-bio.QM cs.LG

    Performance of multilabel machine learning models and risk stratification schemas for predicting stroke and bleeding risk in patients with non-valvular atrial fibrillation

    Authors: Juan Lu, Rebecca Hutchens, Joseph Hung, Mohammed Bennamoun, Brendan McQuillan, Tom Briffa, Ferdous Sohel, Kevin Murray, Jonathon Stewart, Benjamin Chow, Frank Sanfilippo, Girish Dwivedi

    Abstract: Appropriate antithrombotic therapy for patients with atrial fibrillation (AF) requires assessment of ischemic stroke and bleeding risks. However, risk stratification schemas such as CHA2DS2-VASc and HAS-BLED have modest predictive capacity for patients with AF. Machine learning (ML) techniques may improve predictive performance and support decision-making for appropriate antithrombotic therapy. We… ▽ More

    Submitted 2 February, 2022; originally announced February 2022.

  35. Spatio-Temporal Graph Representation Learning for Fraudster Group Detection

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Motivated by potential financial gain, companies may hire fraudster groups to write fake reviews to either demote competitors or promote their own businesses. Such groups are considerably more successful in misleading customers, as people are more likely to be influenced by the opinion of a large group. To detect such groups, a common model is to represent fraudster groups' static networks, conseq… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

  36. arXiv:2201.02560  [pdf, other

    cs.CV

    A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items

    Authors: Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi

    Abstract: Screening cluttered and occluded contraband items from baggage X-ray scans is a cumbersome task even for the expert security staff. This paper presents a novel strategy that extends a conventional encoder-decoder architecture to perform instance-aware segmentation and extract merged instances of contraband items without using any additional sub-network or an object detector. The encoder-decoder ne… ▽ More

    Submitted 10 January, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Source code is available at https://github.com/taimurhassan/inc-inst-seg

    Journal ref: IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021

  37. arXiv:2201.00443  [pdf, other

    cs.CV

    Scene Graph Generation: A Comprehensive Survey

    Authors: Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

    Abstract: Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful semantic representation and applications to scene understanding. Scene Graph Generation (SGG) refers to the task of automatically map** an image into a semanti… ▽ More

    Submitted 22 June, 2022; v1 submitted 2 January, 2022; originally announced January 2022.

    Comments: Submitted to TPAMI

  38. arXiv:2112.14381  [pdf, other

    cs.CV

    COTReg:Coupled Optimal Transport based Point Cloud Registration

    Authors: Guofeng Mei, Xiaoshui Huang, Litao Yu, Jian Zhang, Mohammed Bennamoun

    Abstract: Generating a set of high-quality correspondences or matches is one of the most critical steps in point cloud registration. This paper proposes a learning framework COTReg by jointly considering the pointwise and structural matchings to predict correspondences of 3D point cloud registration. Specifically, we transform the two matchings into a Wasserstein distance-based and a Gromov-Wasserstein dist… ▽ More

    Submitted 7 October, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

  39. arXiv:2112.13210  [pdf, other

    q-bio.QM cs.AI cs.LG

    Explainable Artificial Intelligence for Pharmacovigilance: What Features Are Important When Predicting Adverse Outcomes?

    Authors: Isaac Ronald Ward, Ling Wang, Juan lu, Mohammed Bennamoun, Girish Dwivedi, Frank M Sanfilippo

    Abstract: Explainable Artificial Intelligence (XAI) has been identified as a viable method for determining the importance of features when making predictions using Machine Learning (ML) models. In this study, we created models that take an individual's health information (e.g. their drug history and comorbidities) as inputs, and predict the probability that the individual will have an Acute Coronary Syndrom… ▽ More

    Submitted 25 December, 2021; originally announced December 2021.

    Comments: Comput Methods Programs Biomed. 2021 Nov;212:106415. Epub 2021 Sep 26

  40. arXiv:2112.00941  [pdf, other

    cs.CV

    Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching

    Authors: Laurent Valentin Jospin, Farid Boussaid, Hamid Laga, Mohammed Bennamoun

    Abstract: Cost-based image patch matching is at the core of various techniques in computer vision, photogrammetry and remote sensing. When the subpixel disparity between the reference patch in the source and target images is required, either the cost function or the target image have to be interpolated. While cost-based interpolation is the easiest to implement, multiple works have shown that image based in… ▽ More

    Submitted 12 February, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 29 pages, 10 figures

    ACM Class: I.4.8

  41. arXiv:2111.05645  [pdf, other

    cs.LG cs.AI

    Social Fraud Detection Review: Methods, Challenges and Analysis

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Social reviews have dominated the web and become a plausible source of product information. People and businesses use such information for decision-making. Businesses also make use of social information to spread fake information using a single user, groups of users, or a bot trained to generate fraudulent content. Many studies proposed approaches based on user behaviors and review text to address… ▽ More

    Submitted 4 January, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

  42. arXiv:2110.04453  [pdf, other

    cs.IT

    A Novel Quantum Calculus-based Complex Least Mean Square Algorithm (q-CLMS)

    Authors: Alishba Sadiq, Imran Naseem, Shujaat Khan, Muhammad Moinuddin, Roberto Togneri, Mohammed Bennamoun

    Abstract: In this research, a novel adaptive filtering algorithm is proposed for complex domain signal processing. The proposed algorithm is based on Wirtinger calculus and is called as q-Complex Least Mean Square (q-CLMS) algorithm. The proposed algorithm could be considered as an extension of the q-LMS algorithm for the complex domain. Transient and steady-state analyses of the proposed q-CLMS algorithm a… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: 35 pages, 14 figures

  43. arXiv:2109.12894  [pdf, other

    cs.NE cs.ET cs.LG

    Training Spiking Neural Networks Using Lessons From Deep Learning

    Authors: Jason K. Eshraghian, Max Ward, Emre Neftci, Xinxin Wang, Gregor Lenz, Girish Dwivedi, Mohammed Bennamoun, Doo Seok Jeong, Wei D. Lu

    Abstract: The brain is the perfect place to look for inspiration to develop more efficient neural networks. The inner workings of our synapses and neurons provide a glimpse at what the future of deep learning might look like. This paper serves as a tutorial and perspective showing how to apply the lessons learnt from several decades of research in deep learning, gradient descent, backpropagation and neurosc… ▽ More

    Submitted 13 August, 2023; v1 submitted 27 September, 2021; originally announced September 2021.

  44. arXiv:2108.10217  [pdf, other

    cs.CV cs.AI

    Deep Bayesian Image Set Classification: A Defence Approach against Adversarial Attacks

    Authors: Nima Mirnateghi, Syed Afaq Ali Shah, Mohammed Bennamoun

    Abstract: Deep learning has become an integral part of various computer vision systems in recent years due to its outstanding achievements for object recognition, facial recognition, and scene understanding. However, deep neural networks (DNNs) are susceptible to be fooled with nearly high confidence by an adversary. In practice, the vulnerability of deep learning systems against carefully perturbed images,… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  45. arXiv:2108.09603  [pdf, other

    cs.CV

    Tensor Pooling Driven Instance Segmentation Framework for Baggage Threat Recognition

    Authors: Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi

    Abstract: Automated systems designed for screening contraband items from the X-ray imagery are still facing difficulties with high clutter, concealment, and extreme occlusion. In this paper, we addressed this challenge using a novel multi-scale contour instance segmentation framework that effectively identifies the cluttered contraband data within the baggage X-ray scans. Unlike standard models that employ… ▽ More

    Submitted 21 September, 2021; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: Accepted in Neural Computing and Applications. Source code is available at https://github.com/taimurhassan/tensorpooling

  46. arXiv:2107.12569  [pdf, other

    cs.CV

    Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

    Authors: Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

    Abstract: We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation. MAMP leverages the frame reconstruction task for training without the need for annotations. During inference, MAMP extracts high-resolution features from each frame to build a memory bank from the features as well as the predicted masks of selected past frames.… ▽ More

    Submitted 27 October, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

  47. arXiv:2107.11787  [pdf, other

    cs.CV

    Leveraging Auxiliary Tasks with Affinity Learning for Weakly Supervised Semantic Segmentation

    Authors: Lian Xu, Wanli Ouyang, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel, Dan Xu

    Abstract: Semantic segmentation is a challenging task in the absence of densely labelled data. Only relying on class activation maps (CAM) with image-level labels provides deficient segmentation supervision. Prior works thus consider pre-trained models to produce coarse saliency maps to guide the generation of pseudo segmentation labels. However, the commonly used off-line heuristic generation process canno… ▽ More

    Submitted 26 July, 2021; v1 submitted 25 July, 2021; originally announced July 2021.

    Comments: Accepted at ICCV 2021

  48. arXiv:2107.07333  [pdf, other

    cs.CV eess.IV

    Unsupervised Anomaly Instance Segmentation for Baggage Threat Recognition

    Authors: Taimur Hassan, Samet Akcay, Mohammed Bennamoun, Salman Khan, Naoufel Werghi

    Abstract: Identifying potential threats concealed within the baggage is of prime concern for the security staff. Many researchers have developed frameworks that can detect baggage threats from X-ray scans. However, to the best of our knowledge, all of these frameworks require extensive training on large-scale and well-annotated datasets, which are hard to procure in the real world. This paper presents a nov… ▽ More

    Submitted 16 July, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted in J-AIHC, Source Code is available at https://github.com/taimurhassan/anomaly

  49. arXiv:2106.12864  [pdf, other

    eess.IV cs.CV cs.LG

    A Systematic Collection of Medical Image Datasets for Deep Learning

    Authors: Johann Li, Guangming Zhu, Cong Hua, Mingtao Feng, BasheerBennamoun, ** Li, Xiaoyuan Lu, Juan Song, Peiyi Shen, Xu Xu, Lin Mei, Liang Zhang, Syed Afaq Ali Shah, Mohammed Bennamoun

    Abstract: The astounding success made by artificial intelligence (AI) in healthcare and other fields proves that AI can achieve human-like performance. However, success always comes with challenges. Deep learning algorithms are data-dependent and require large datasets for training. The lack of data in the medical imaging field creates a bottleneck for the application of deep learning to medical image analy… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: This paper has been submitted to one journal

  50. arXiv:2106.10649  [pdf, other

    cs.CV cs.AI cs.LG

    CAMERAS: Enhanced Resolution And Sanity preserving Class Activation Map** for image saliency

    Authors: Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

    Abstract: Backpropagation image saliency aims at explaining model predictions by estimating model-centric importance of individual pixels in the input. However, class-insensitivity of the earlier layers in a network only allows saliency computation with low resolution activation maps of the deeper layers, resulting in compromised image saliency. Remedifying this can lead to sanity failures. We propose CAMER… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

    Comments: IEEE CVPR 2021 paper