Showing 1–22 of 22 results for author: Laga, H

  1. arXiv:2406.19675

    cs.CV

    Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey

    Authors: Uchitha Rajapaksha, Ferdous Sohel, Hamid Laga, Dean Diepeveen, Mohammed Bennamoun

    Abstract: Estimating depth from single RGB images and videos is of widespread interest due to its applications in many areas, including autonomous driving, 3D reconstruction, digital entertainment, and robotics. More than 500 deep learning-based papers have been published in the past 10 years, which indicates the growing interest in the task. This paper presents a comprehensive survey of the existing deep l…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 46 pages, 10 figures, The paper has been accepted for publication in ACM Computing Surveys 2024

    ACM Class: I.2.10; I.4; I.5.1; I.4.8

  2. arXiv:2406.04861

    cs.CV cs.GR

    Normal-guided Detail-Preserving Neural Implicit Functions for High-Fidelity 3D Surface Reconstruction

    Authors: Aarya Patel, Hamid Laga, Ojaswa Sharma

    Abstract: Neural implicit representations have emerged as a powerful paradigm for 3D reconstruction. However, despite their success, existing methods fail to capture fine geometric details and thin structures, especially in scenarios where only sparse RGB views of the objects of interest are available. We hypothesize that current methods for learning neural implicit representations from RGB or RGBD images p…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Original version. Project page with images and code: https://sn-nir.github.io/

  3. arXiv:2402.17910

    cs.CV

    Box It to Bind It: Unified Layout Control and Attribute Binding in T2I Diffusion Models

    Authors: Ashkan Taghipour, Morteza Ghahremani, Mohammed Bennamoun, Aref Miri Rekavandi, Hamid Laga, Farid Boussaid

    Abstract: While latent diffusion models (LDMs) excel at creating imaginative images, they often lack precision in semantic fidelity and spatial control over where objects are generated. To address these deficiencies, we introduce the Box-it-to-Bind-it (B2B) module - a novel, training-free approach for improving spatial control and semantic accuracy in text-to-image (T2I) diffusion models. B2B targets three…

    Submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2308.03005

    cs.CV

    MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation

    Authors: Lian Xu, Mohammed Bennamoun, Farid Boussaid, Hamid Laga, Wanli Ouyang, Dan Xu

    Abstract: This paper proposes a novel transformer-based framework that aims to enhance weakly supervised semantic segmentation (WSSS) by generating accurate class-specific object localization maps as pseudo labels. Building upon the observation that the attended regions of the one-class token in the standard vision transformer can contribute to a class-agnostic localization map, we explore the potential of…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Journal extension for MCTformer

  5. arXiv:2209.08305

    cs.CV

    Active-Passive SimStereo -- Benchmarking the Cross-Generalization Capabilities of Deep Learning-based Stereo Methods

    Authors: Laurent Jospin, Allen Antony, Lian Xu, Hamid Laga, Farid Boussaid, Mohammed Bennamoun

    Abstract: In stereo vision, self-similar or bland regions can make it difficult to match patches between two images. Active stereo-based methods mitigate this problem by projecting a pseudo-random pattern on the scene so that each patch of an image pair can be identified without ambiguity. However, the projected pattern significantly alters the appearance of the image. If this pattern acts as a form of adve…

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 22 pages, 12 figures, accepted in NeurIPS 2022 Datasets and Benchmarks Track

  6. arXiv:2209.05082

    cs.CV

    Bayesian Learning for Disparity Map Refinement for Semi-Dense Active Stereo Vision

    Authors: Laurent Valentin Jospin, Hamid Laga, Farid Boussaid, Mohammed Bennamoun

    Abstract: A major focus of recent developments in stereo vision has been on how to obtain accurate dense disparity maps in passive stereo vision. Active vision systems enable more accurate estimations of dense disparity compared to passive stereo. However, subpixel-accurate disparity estimation remains an open problem that has received little attention. In this paper, we propose a new learning strategy to t…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 15 pages, 15 figures

  7. arXiv:2112.07819

    cs.CV cs.AI

    Weed Recognition using Deep Learning Techniques on Class-imbalanced Imagery

    Authors: A S M Mahmudul Hasan, Ferdous Sohel, Dean Diepeveen, Hamid Laga, Michael G. K. Jones

    Abstract: Most weed species can adversely impact agricultural productivity by competing for nutrients required by high-value crops. Manual weeding is not practical for large cropping areas. Many studies have been undertaken to develop automatic weed management systems for agricultural crops. In this process, one of the major tasks is to recognise the weeds from images. However, weed recognition is a challen…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: The paper is accepted by Crop and Pasture Science journal (https://www.publish.csiro.au/CP/justaccepted/CP21626)

  8. arXiv:2112.00941

    cs.CV

    Generalized Closed-form Formulae for Feature-based Subpixel Alignment in Patch-based Matching

    Authors: Laurent Valentin Jospin, Farid Boussaid, Hamid Laga, Mohammed Bennamoun

    Abstract: Cost-based image patch matching is at the core of various techniques in computer vision, photogrammetry and remote sensing. When the subpixel disparity between the reference patch in the source and target images is required, either the cost function or the target image have to be interpolated. While cost-based interpolation is the easiest to implement, multiple works have shown that image based in…

    Submitted 12 February, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 29 pages, 10 figures

    ACM Class: I.4.8

  9. arXiv:2110.08693

    cs.LG cs.CG cs.GR stat.ML

    Elastic Shape Analysis of Tree-like 3D Objects using Extended SRVF Representation

    Authors: Guan Wang, Hamid Laga, Anuj Srivastava

    Abstract: How can one analyze detailed 3D biological objects, such as neurons and botanical trees, that exhibit complex geometrical and topological variation? In this paper, we develop a novel mathematical framework for representing, comparing, and computing geodesic deformations between the shapes of such tree-like 3D objects. A hierarchical organization of subtrees characterizes these objects -- each subt…

    Submitted 26 November, 2023; v1 submitted 16 October, 2021; originally announced October 2021.

  10. arXiv:2109.11844

    cs.CV

    Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images

    Authors: Tarek Ben Charrada, Hedi Tabia, Aladine Chetouani, Hamid Laga

    Abstract: We propose a novel deep reinforcement learning-based approach for 3D object reconstruction from monocular images. Prior works that use mesh representations are template based. Thus, they are limited to the reconstruction of objects that have the same topology as the template. Methods that use volumetric grids as intermediate representations are computationally expensive, which limits their applica…

    Submitted 24 September, 2021; originally announced September 2021.

  11. arXiv:2103.01415

    cs.CV cs.LG

    A Survey of Deep Learning Techniques for Weed Detection from Images

    Authors: A S M Mahmudul Hasan, Ferdous Sohel, Dean Diepeveen, Hamid Laga, Michael G. K. Jones

    Abstract: The rapid advances in Deep Learning (DL) techniques have enabled rapid detection, localisation, and recognition of objects from images or videos. DL techniques are now being used in many applications related to agriculture and farming. Automatic detection and classification of weeds can play an important role in weed management and so contribute to higher yields. Weed detection in crops from image…

    Submitted 1 March, 2021; originally announced March 2021.

  12. arXiv:2101.09403

    cs.CV

    4D Atlas: Statistical Analysis of the Spatiotemporal Variability in Longitudinal 3D Shape Data

    Authors: Hamid Laga, Marcel Padilla, Ian H. Jermyn, Sebastian Kurtek, Mohammed Bennamoun, Anuj Srivastava

    Abstract: We propose a novel framework to learn the spatiotemporal variability in longitudinal 3D shape data sets, which contain observations of objects that evolve and deform over time. This problem is challenging since surfaces come with arbitrary parameterizations and thus, they need to be spatially registered. Also, different deforming objects, also called 4D surfaces, evolve at different speeds and thu…

    Submitted 20 August, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

  13. arXiv:2009.07532

    eess.IV cs.CV cs.LG

    RCNN for Region of Interest Detection in Whole Slide Images

    Authors: A Nugaliyadde, Kok Wai Wong, Jeremy Parry, Ferdous Sohel, Hamid Laga, Upeka V. Somaratne, Chris Yeomans, Orchid Foster

    Abstract: Digital pathology has attracted significant attention in recent years. Analysis of Whole Slide Images (WSIs) is challenging because they are very large, i.e., of Giga-pixel resolution. Identifying Regions of Interest (ROIs) is the first step for pathologists to analyse further the regions of diagnostic interest for cancer detection and other anomalies. In this paper, we investigate the use of RCNN…

    Submitted 17 September, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: This paper was accepted to the 27th International Conference on Neural Information Processing (ICONIP 2020) and will be published in the Springer CCIS Series

  14. Hands-on Bayesian Neural Networks -- a Tutorial for Deep Learning Users

    Authors: Laurent Valentin Jospin, Wray Buntine, Farid Boussaid, Hamid Laga, Mohammed Bennamoun

    Abstract: Modern deep learning methods constitute incredibly powerful tools to tackle a myriad of challenging problems. However, since deep learning methods operate as black boxes, the uncertainty associated with their predictions is often challenging to quantify. Bayesian statistics offer a formalism to understand and quantify the uncertainty associated with deep neural network predictions. This tutorial p…

    Submitted 3 January, 2022; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 20 pages, 13 figures

    MSC Class: 62-02 (Primary) ACM Class: G.3; I.2.6

    Journal ref: IEEE Computational Intelligence Magazine (Volume: 17, Issue: 2, May 2022)

  15. A Survey on Deep Learning Techniques for Stereo-based Depth Estimation

    Authors: Hamid Laga, Laurent Valentin Jospin, Farid Boussaid, Mohammed Bennamoun

    Abstract: Estimating depth from RGB images is a long-standing ill-posed problem, which has been explored for decades by the computer vision, graphics, and machine learning communities. Among the existing techniques, stereo matching remains one of the most widely used in the literature due to its strong connection to the human binocular system. Traditionally, stereo-based depth estimation has been addressed…

    Submitted 1 June, 2020; originally announced June 2020.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020

  16. arXiv:1907.09236

    cs.CV cs.LG

    RGB-D image-based Object Detection: from Traditional Methods to Deep Learning Techniques

    Authors: Isaac Ronald Ward, Hamid Laga, Mohammed Bennamoun

    Abstract: Object detection from RGB images is a long-standing problem in image processing and computer vision. It has applications in various domains including robotics, surveillance, human-computer interaction, and medical diagnosis. With the availability of low cost 3D scanners, a large number of RGB-D object detection approaches have been proposed in the past years. This chapter provides a comprehensive…

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Chapter in the book 'RGB-D Image Analysis and Processing' (Paul Rosin)

  17. arXiv:1906.06543

    cs.CV cs.CG cs.GR cs.LG

    Image-based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era

    Authors: Xian-Feng Han, Hamid Laga, Mohammed Bennamoun

    Abstract: 3D reconstruction is a longstanding ill-posed problem, which has been explored for decades by the computer vision, computer graphics, and machine learning communities. Since 2015, image-based 3D reconstruction using convolutional neural networks (CNN) has attracted increasing interest and demonstrated an impressive performance. Given this new era of rapid evolution, this article provides a compreh…

    Submitted 1 November, 2019; v1 submitted 15 June, 2019; originally announced June 2019.

    Comments: arXiv admin note: text overlap with arXiv:1806.06098, arXiv:1712.06584, arXiv:1804.10975 by other authors

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Nov. 2019

  18. arXiv:1906.06113

    cs.CV cs.GR cs.RO eess.IV

    A Survey on Deep Learning Architectures for Image-based Depth Reconstruction

    Authors: Hamid Laga

    Abstract: Estimating depth from RGB images is a long-standing ill-posed problem, which has been explored for decades by the computer vision, graphics, and machine learning communities. In this article, we provide a comprehensive survey of the recent developments in this field. We will focus on the works which use deep learning techniques to estimate depth from one or multiple images. Deep learning, coupled…

    Submitted 14 June, 2019; originally announced June 2019.

  19. Statistical Analysis and Modeling of the Geometry and Topology of Plant Roots

    Authors: Guan Wang, Hamid Laga, Jinyuan Jia, Stanley J. Miklavcic, Anuj Srivastava

    Abstract: The root is an important organ of a plant since it is responsible for water and nutrient uptake. Analyzing and modelling variabilities in the geometry and topology of roots can help in assessing the plant's health, understanding its growth patterns, and modeling relations between plant species and between plants and their environment. In this article, we develop a framework for the statistical ana…

    Submitted 15 October, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Journal ref: Journal of Theoretical Biology, 2020

  20. arXiv:1812.10111

    cs.GR cs.CG cs.CV

    A Survey on Non-rigid 3D Shape Analysis

    Authors: Hamid Laga

    Abstract: Shape is an important physical property of natural and manmade 3D objects that characterizes their external appearances. Understanding differences between shapes and modeling the variability within and across shape classes, hereinafter referred to as shape analysis, are fundamental problems to many applications, ranging from computer vision and computer graphics to biology and medicine. Thi…

    Submitted 25 December, 2018; originally announced December 2018.

  21. arXiv:1810.04020

    cs.CV cs.LG stat.ML

    A Comprehensive Survey of Deep Learning for Image Captioning

    Authors: Md. Zakir Hossain, Ferdous Sohel, Mohd Fairuz Shiratuddin, Hamid Laga

    Abstract: Generating a description of an image is called image captioning. Image captioning requires to recognize the important objects, their attributes and their relationships in an image. It also needs to generate syntactically and semantically correct sentences. Deep learning-based techniques are capable of handling the complexities and challenges of image captioning. In this survey paper, we aim to pre…

    Submitted 14 October, 2018; v1 submitted 6 October, 2018; originally announced October 2018.

    Comments: 36 Pages, Accepted as a Journal Paper in ACM Computing Surveys (October 2018)

  22. Numerical Inversion of SRNF Maps for Elastic Shape Analysis of Genus-Zero Surfaces

    Authors: Hamid Laga, Qian Xie, Ian H. Jermyn, Anuj Srivastava

    Abstract: Recent developments in elastic shape analysis (ESA) are motivated by the fact that it provides comprehensive frameworks for simultaneous registration, deformation, and comparison of shapes. These methods achieve computational efficiency using certain square-root representations that transform invariant elastic metrics into Euclidean metrics, allowing for applications of standard algorithms and sta…

    Submitted 14 October, 2016; originally announced October 2016.

    Report number: Volume: 39, Issue: 12

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017