Showing 1–50 of 60 results for author: Bilodeau, G

  1. arXiv:2403.08018  [pdf, other]

    cs.CV

    Learning Data Association for Multi-Object Tracking using Only Coordinates

    Authors: Mehdi Miah, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: We propose a novel Transformer-based module to address the data association problem for multi-object tracking. From detections obtained by a pretrained detector, this module uses only coordinates from bounding boxes to estimate an affinity score between pairs of tracks extracted from two distinct temporal windows. This module, named TWiX, is trained on sets of tracks with the objective of discrimi…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Preprint submitted to Pattern Recognition

  2. arXiv:2403.03296  [pdf, other]

    cs.CV

    CenterDisks: Real-time instance segmentation with disk covering

    Authors: Katia Jodogne-Del Litto, Guillaume-Alexandre Bilodeau

    Abstract: Increasing the accuracy of instance segmentation methods is often done at the expense of speed. Using coarser representations, we can reduce the number of parameters and thus obtain real-time masks. In this paper, we take inspiration from the set cover problem to predict mask approximations. Given ground-truth binary masks of objects of interest as training input, our method learns to predict the…

    Submitted 5 March, 2024; originally announced March 2024.

  3. arXiv:2402.18503  [pdf, other]

    cs.CV

    Detection of Micromobility Vehicles in Urban Traffic Videos

    Authors: Khalil Sabri, Célia Djilali, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Wassim Bouachir

    Abstract: Urban traffic environments present unique challenges for object detection, particularly with the increasing presence of micromobility vehicles like e-scooters and bikes. To address this object detection problem, this work introduces an adapted detection model that combines the accuracy and speed of single-frame object detection with the richer features offered by video object detection frameworks.…

    Submitted 28 February, 2024; originally announced February 2024.

  4. arXiv:2402.10752  [pdf, other]

    cs.CV

    STF: Spatio-Temporal Fusion Module for Improving Video Object Detection

    Authors: Noreen Anwar, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: Consecutive frames in a video contain redundancy, but they may also contain relevant complementary information for the detection task. The objective of our work is to leverage this complementary information to improve detection. Therefore, we propose a spatio-temporal fusion framework (STF). We first introduce multi-frame and single-frame attention modules that allow a neural network to share feat…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages, 3 figures

  5. arXiv:2312.06486  [pdf, other]

    cs.CV

    STDiff: Spatio-temporal Diffusion for Continuous Stochastic Video Prediction

    Authors: Xi Ye, Guillaume-Alexandre Bilodeau

    Abstract: Predicting future frames of a video is challenging because it is difficult to learn the uncertainty of the underlying factors influencing their contents. In this paper, we propose a novel video prediction model, which has infinite-dimensional latent variables over the spatio-temporal domain. Specifically, we first decompose the video motion and content information, then take a neural stochastic di…

    Submitted 11 December, 2023; originally announced December 2023.

    Journal ref: AAAI2024

  6. arXiv:2311.03177  [pdf, other]

    cs.CV cs.AI

    1D-Convolutional transformer for Parkinson disease diagnosis from gait

    Authors: Safwen Naimi, Wassim Bouachir, Guillaume-Alexandre Bilodeau

    Abstract: This paper presents an efficient deep neural network model for diagnosing Parkinson's disease from gait. More specifically, we introduce a hybrid ConvNet-Transformer architecture to accurately diagnose the disease by detecting the severity stage. The proposed architecture exploits the strengths of both Convolutional Neural Networks and Transformers in a single end-to-end model, where the former is…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 17 pages, 5 Figures, 6 Tables. Accepted for publication in Neural Computing and Applications (NCAA) 2023

  7. arXiv:2310.17080  [pdf, other]

    cs.CV cs.LG

    Automating lichen monitoring in ecological studies using instance segmentation of time-lapse images

    Authors: Safwen Naimi, Olfa Koubaa, Wassim Bouachir, Guillaume-Alexandre Bilodeau, Gregory Jeddore, Patricia Baines, David Correia, Andre Arsenault

    Abstract: Lichens are symbiotic organisms composed of fungi, algae, and/or cyanobacteria that thrive in a variety of environments. They play important roles in carbon and nitrogen cycling, and contribute directly and indirectly to biodiversity. Ecologists typically monitor lichens by using them as indicators to assess air quality and habitat conditions. In particular, epiphytic lichens, which live on trees,…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 6 pages, 3 Figures, 8 Tables, Accepted for publication in IEEE International Conference on Machine Learning and Applications (ICMLA), copyright IEEE

  8. arXiv:2310.17078  [pdf, other]

    cs.CV cs.LG

    HCT: Hybrid Convnet-Transformer for Parkinson's disease detection and severity prediction from gait

    Authors: Safwen Naimi, Wassim Bouachir, Guillaume-Alexandre Bilodeau

    Abstract: In this paper, we propose a novel deep learning method based on a new Hybrid ConvNet-Transformer architecture to detect and stage Parkinson's disease (PD) from gait data. We adopt a two-step approach by dividing the problem into two sub-problems. Our Hybrid ConvNet-Transformer model first distinguishes healthy versus parkinsonian patients. If the patient is parkinsonian, a multi-class Hybrid ConvN…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 6 pages, 6 figures, 3 tables, Accepted for publication in IEEE International Conference on Machine Learning and Applications (ICMLA), copyright IEEE

  9. arXiv:2305.05490  [pdf, other]

    cs.CV

    Real-time instance segmentation with polygons using an Intersection-over-Union loss

    Authors: Katia Jodogne-Del Litto, Guillaume-Alexandre Bilodeau

    Abstract: Predicting a binary mask for an object is more accurate but also more computationally expensive than a bounding box. Polygonal masks as developed in CenterPoly can be a good compromise. In this paper, we improve over CenterPoly by enhancing the classical regression L1 loss with a novel region-based loss and a novel order loss, as well as with a new training process for the vertices prediction head…

    Submitted 9 May, 2023; originally announced May 2023.

  10. arXiv:2304.11291  [pdf, other]

    cs.CV

    VisiTherS: Visible-thermal infrared stereo disparity estimation of human silhouette

    Authors: Noreen Anwar, Philippe Duplessis-Guindon, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: This paper presents a novel approach for visible-thermal infrared stereoscopy, focusing on the estimation of disparities of human silhouettes. Visible-thermal infrared stereo poses several challenges, including occlusions and differently textured matching regions in both spectra. Finding matches between two spectra with varying colors, textures, and shapes adds further complexity to the task. To a…

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 8 pages, 3 figures, CVPR workshop

  11. arXiv:2304.09012  [pdf, other]

    cs.CV

    GUILGET: GUI Layout GEneration with Transformer

    Authors: Andrey Sobolevsky, Guillaume-Alexandre Bilodeau, Jinghui Cheng, Jin L. C. Guo

    Abstract: Sketching out Graphical User Interface (GUI) layout is part of the pipeline of designing a GUI and a crucial task for the success of a software application. Arranging all components inside a GUI layout manually is a time-consuming task. In order to assist designers, we developed a method named GUILGET to automatically generate GUI layouts from positional constraints represented as GUI arrangement…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 12 pages, 5 figures, Canadian AI Conference 2023

  12. arXiv:2304.06114  [pdf, ps, other]

    cs.CV

    TopTrack: Tracking Objects By Their Top

    Authors: Jacob Meilleur, Guillaume-Alexandre Bilodeau

    Abstract: In recent years, the joint detection-and-tracking paradigm has been a very popular way of tackling the multi-object tracking (MOT) task. Many of the methods following this paradigm use the object center keypoint for detection. However, we argue that the center point is not optimal since it is often not visible in crowded scenarios, which results in many missed detections when the objects are parti…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 14 pages, 7 figures, submitted to Machine Vision and Applications

  13. arXiv:2212.06026  [pdf, other]

    cs.CV

    Video Prediction by Efficient Transformers

    Authors: Xi Ye, Guillaume-Alexandre Bilodeau

    Abstract: Video prediction is a challenging computer vision task that has a wide range of applications. In this work, we present a new family of Transformer-based models for video prediction. Firstly, an efficient local spatial-temporal separation attention mechanism is proposed to reduce the complexity of standard Transformers. Then, a full autoregressive model, a partial autoregressive model and a non-aut…

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: Accepted by Image and Vision Computing. arXiv admin note: text overlap with arXiv:2203.15836

  14. arXiv:2210.05810  [pdf, other]

    cs.CV

    A unified model for continuous conditional video prediction

    Authors: Xi Ye, Guillaume-Alexandre Bilodeau

    Abstract: Different conditional video prediction tasks, like video future frame prediction and video frame interpolation, are normally solved by task-related models even though they share many common underlying characteristics. Furthermore, almost all conditional video prediction models can only achieve discrete prediction. In this paper, we propose a unified model that addresses these two issues at the sam…

    Submitted 6 April, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted by CVPR2023 Workshop

  15. arXiv:2204.10677  [pdf, other]

    cs.CV

    Improving tracking with a tracklet associator

    Authors: Rémi Nahon, Guillaume-Alexandre Bilodeau, Gilles Pesant

    Abstract: Multiple object tracking (MOT) is a task in computer vision that aims to detect the position of various objects in videos and to associate them to a unique identity. We propose an approach based on Constraint Programming (CP) whose goal is to be grafted to any existing tracker in order to improve its object association results. We developed a modular algorithm divided into three independent phases…

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: 8 pages, 6 figures, CRV 2022

    ACM Class: I.4.8

  16. arXiv:2204.09089  [pdf, other]

    cs.CV

    4D-MultispectralNet: Multispectral Stereoscopic Disparity Estimation using Human Masks

    Authors: Philippe Duplessis-Guindon, Guillaume-Alexandre Bilodeau

    Abstract: Multispectral stereoscopy is an emerging field. A lot of work has been done in classical stereoscopy, but multispectral stereoscopy is not studied as frequently. This type of stereoscopy can be used in autonomous vehicles to complete the information given by RGB cameras. It helps to identify objects in the surroundings when the conditions are more difficult, such as in night scenes. This paper foc…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 6 pages, 2 figures

  17. arXiv:2204.08671  [pdf, other]

    cs.CV

    ActAR: Actor-Driven Pose Embeddings for Video Action Recognition

    Authors: Soufiane Lamghari, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: Human action recognition (HAR) in videos is one of the core tasks of video understanding. Based on video sequences, the goal is to recognize actions performed by humans. While HAR has received much attention in the visible spectrum, action recognition in infrared videos is little studied. Accurate recognition of human actions in the infrared domain is a highly challenging task because of the redun…

    Submitted 19 April, 2022; originally announced April 2022.

  18. arXiv:2204.00423  [pdf, other]

    cs.LG

    Transformers for 1D Signals in Parkinson's Disease Detection from Gait

    Authors: Duc Minh Dimitri Nguyen, Mehdi Miah, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: This paper focuses on the detection of Parkinson's disease based on the analysis of a patient's gait. The growing popularity and success of Transformer networks in natural language processing and image recognition motivated us to develop a novel method for this problem based on an automatic features extraction via Transformers. The use of Transformers in 1D signal is not really widespread yet, but…

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: International Conference on Pattern Recognition (ICPR 2022)

  19. arXiv:2203.15836  [pdf, other]

    cs.CV

    VPTR: Efficient Transformers for Video Prediction

    Authors: Xi Ye, Guillaume-Alexandre Bilodeau

    Abstract: In this paper, we propose a new Transformer block for video future frames prediction based on an efficient local spatial-temporal separation attention mechanism. Based on this new Transformer block, a fully autoregressive video future frames prediction Transformer is proposed. In addition, a non-autoregressive video prediction Transformer is also proposed to increase the inference speed and reduce…

    Submitted 29 March, 2022; originally announced March 2022.

  20. arXiv:2111.03715  [pdf, other]

    cs.CL cs.LG

    Leveraging Sentiment Analysis Knowledge to Solve Emotion Detection Tasks

    Authors: Maude Nguyen-The, Guillaume-Alexandre Bilodeau, Jan Rockemann

    Abstract: Identifying and understanding underlying sentiment or emotions in text is a key component of multiple natural language processing applications. While simple polarity sentiment analysis is a well-studied subject, fewer advances have been made in identifying more complex, finer-grained emotions using only textual data. In this paper, we present a Transformer-based model with a Fusion of Adapter laye…

    Submitted 5 November, 2021; originally announced November 2021.

  21. arXiv:2111.01606  [pdf, other]

    cs.CV

    PolyTrack: Tracking with Bounding Polygons

    Authors: Gaspar Faure, Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: In this paper, we present a novel method called PolyTrack for fast multi-object tracking and segmentation using bounding polygons. Polytrack detects objects by producing heatmaps of their center keypoint. For each of them, a rough segmentation is done by computing a bounding polygon over each instance instead of the traditional bounding box. Tracking is done by taking two consecutive frames as inp…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 Machine Learning for Autonomous Driving Workshop

  22. arXiv:2110.11284  [pdf, other]

    cs.CV

    Multi-Object Tracking and Segmentation with a Space-Time Memory Network

    Authors: Mehdi Miah, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: We propose a method for multi-object tracking and segmentation based on a novel memory-based mechanism to associate tracklets. The proposed tracker, MeNToS, addresses particularly the long-term data association problem, when objects are not observable for long time intervals. Indeed, the recently introduced HOTA metric (High Order Tracking Accuracy), which has a better alignment than the formerly…

    Submitted 15 May, 2023; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted at CRV 2023 (Conference on Robots and Vision). arXiv admin note: text overlap with arXiv:2107.07067

  23. arXiv:2109.12414  [pdf, other]

    cs.CV

    Vehicle Detection and Tracking From Surveillance Cameras in Urban Scenes

    Authors: Oumayma Messoussi, Felipe Gohring de Magalhaes, Francois Lamarre, Francis Perreault, Ibrahima Sogoba, Guillaume-Alexandre Bilodeau, Gabriela Nicolescu

    Abstract: Detecting and tracking vehicles in urban scenes is a crucial step in many traffic-related applications as it helps to improve road user safety among other benefits. Various challenges remain unresolved in multi-object tracking (MOT) including target information description, long-term occlusions and fast motion. We propose a multi-vehicle detection and tracking system following the tracking-by-dete…

    Submitted 25 September, 2021; originally announced September 2021.

  24. arXiv:2109.07298  [pdf, other]

    cs.CV

    FFAVOD: Feature Fusion Architecture for Video Object Detection

    Authors: Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Maguelonne Héritier

    Abstract: A significant amount of redundancy exists between consecutive frames of a video. Object detectors typically produce detections for one image at a time, without any capabilities for taking advantage of this redundancy. Meanwhile, many applications for object detection work with videos, including intelligent transportation systems, advanced driver assistance systems and video surveillance. Our work…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in Pattern Recognition Letters

  25. arXiv:2108.08923  [pdf, other]

    cs.CV

    CenterPoly: real-time instance segmentation using bounding polygons

    Authors: Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Maguelonne Héritier

    Abstract: We present a novel method, called CenterPoly, for real-time instance segmentation using bounding polygons. We apply it to detect road users in dense urban environments, making it suitable for applications in intelligent transportation systems like automated vehicles. CenterPoly detects objects by their center keypoint while predicting a fixed number of polygon vertices for each object, thus perfor…

    Submitted 15 September, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted to the 2nd Autonomous Vehicle Vision Workshop (AVVision)

  26. arXiv:2107.07067  [pdf, other]

    cs.CV

    MeNToS: Tracklets Association with a Space-Time Memory Network

    Authors: Mehdi Miah, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: We propose a method for multi-object tracking and segmentation (MOTS) that does not require fine-tuning or per benchmark hyperparameter selection. The proposed method addresses particularly the data association problem. Indeed, the recently introduced HOTA metric, that has a better alignment with the human visual assessment by evenly balancing detections and associations quality, has shown that im…

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: Presented at the "Robust Video Scene Understanding: Tracking and Video Segmentation" workshop (CVPR-W 2021)

  27. arXiv:2106.06059  [pdf, other]

    cs.CV

    Predicting Next Local Appearance for Video Anomaly Detection

    Authors: Pankaj Raj Roy, Guillaume-Alexandre Bilodeau, Lama Seoud

    Abstract: We present a local anomaly detection method in videos. As opposed to most existing methods that are computationally expensive and are not very generalizable across different video scenes, we propose an adversarial framework that learns the temporal local appearance variations by predicting the appearance of a normally behaving object in the next frame of a scene by only relying on its current and…

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Accepted as an oral presentation for MVA'2021

  28. arXiv:2103.01222  [pdf, other]

    cs.CV

    Multiple Convolutional Features in Siamese Networks for Object Tracking

    Authors: Zhenxi Li, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: Siamese trackers demonstrated high performance in object tracking due to their balance between accuracy and speed. Unlike classification-based CNNs, deep similarity networks are specifically designed to address the image similarity problem, and thus are inherently more appropriate for the tracking task. However, Siamese trackers mainly use the last convolutional layers for similarity analysis and…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Accepted for Machine Vision and Applications, 2021. arXiv admin note: substantial text overlap with arXiv:2103.00810

  29. arXiv:2103.00810  [pdf, other]

    cs.CV

    MFST: Multi-Features Siamese Tracker

    Authors: Zhenxi Li, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: Siamese trackers have recently achieved interesting results due to their balance between accuracy and speed. This success is mainly due to the fact that deep similarity networks were specifically designed to address the image similarity problem. Therefore, they are inherently more appropriate than classical CNNs for the tracking task. However, Siamese trackers rely on the last convolutional layers…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: ICPR 2021, Oral

  30. arXiv:2011.06722  [pdf, other]

    cs.CV cs.LG

    Local Anomaly Detection in Videos using Object-Centric Adversarial Learning

    Authors: Pankaj Raj Roy, Guillaume-Alexandre Bilodeau, Lama Seoud

    Abstract: We propose a novel unsupervised approach based on a two-stage object-centric adversarial framework that only needs object regions for detecting frame-level local anomalies in videos. The first stage consists in learning the correspondence between the current appearance and past gradient images of objects in scenes deemed normal, allowing us to either generate the past gradient from current appeara…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Accepted for The First International Workshop on Deep Learning for Human-Centric Activity Understanding (ICPR2020 workshop)

  31. arXiv:2010.08841  [pdf, other]

    cs.CV

    A Grid-based Representation for Human Action Recognition

    Authors: Soufiane Lamghari, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: Human action recognition (HAR) in videos is a fundamental research topic in computer vision. It consists mainly in understanding actions performed by humans based on a sequence of visual observations. In recent years, HAR have witnessed significant progress, especially with the emergence of deep learning models. However, most of existing approaches for action recognition rely on information that i…

    Submitted 29 October, 2020; v1 submitted 17 October, 2020; originally announced October 2020.

    Comments: Accepted on 25th International Conference on Pattern Recognition (ICPR 2020)

  32. arXiv:2010.07881  [pdf, other]

    cs.CV

    An Empirical Analysis of Visual Features for Multiple Object Tracking in Urban Scenes

    Authors: Mehdi Miah, Justine Pepin, Nicolas Saunier, Guillaume-Alexandre Bilodeau

    Abstract: This paper addresses the problem of selecting appearance features for multiple object tracking (MOT) in urban scenes. Over the years, a large number of features has been used for MOT. However, it is not clear whether some of them are better than others. Commonly used features are color histograms, histograms of oriented gradients, deep features from convolutional neural networks and re-identificat…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted on 25th International Conference on Pattern Recognition (ICPR 2020)

  33. arXiv:2007.15560  [pdf, other]

    cs.CV

    Unsupervised Disentanglement GAN for Domain Adaptive Person Re-Identification

    Authors: Yacine Khraimeche, Guillaume-Alexandre Bilodeau, David Steele, Harshad Mahadik

    Abstract: While recent person re-identification (ReID) methods achieve high accuracy in a supervised setting, their generalization to an unlabelled domain is still an open problem. In this paper, we introduce a novel unsupervised disentanglement generative adversarial network (UD-GAN) to address the domain adaptation issue of supervised person ReID. Our framework jointly trains a ReID network for discrimina…

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: 8 pages, 5 figures, submitted to ICPR 2020

  34. arXiv:2005.00088  [pdf, other]

    cs.CV

    Domain Siamese CNNs for Sparse Multispectral Disparity Estimation

    Authors: David-Alexandre Beaupre, Guillaume-Alexandre Bilodeau

    Abstract: Multispectral disparity estimation is a difficult task for many reasons: it has all the same challenges as traditional visible-visible disparity estimation (occlusions, repetitive patterns, textureless surfaces), in addition of having very few common visual information between images (e.g. color information vs. thermal information). In this paper, we propose a new CNN architecture able to do dispa…

    Submitted 30 April, 2020; originally announced May 2020.

  35. arXiv:2003.13644  [pdf, other]

    cs.CV

    Supervised and Unsupervised Detections for Multiple Object Tracking in Traffic Scenes: A Comparative Study

    Authors: Hui-Lee Ooi, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: In this paper, we propose a multiple object tracker, called MF-Tracker, that integrates multiple classical features (spatial distances and colours) and modern features (detection labels and re-identification features) in its tracking framework. Since our tracker can work with detections coming either from unsupervised and supervised object detectors, we also investigated the impact of supervised a…

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted for ICIAR 2020

  36. arXiv:2003.10898  [pdf, other]

    cs.CV

    RN-VID: A Feature Fusion Architecture for Video Object Detection

    Authors: Hughes Perreault, Maguelonne Héritier, Pierre Gravel, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: Consecutive frames in a video are highly redundant. Therefore, to perform the task of video object detection, executing single frame detectors on every frame without reusing any information is quite wasteful. It is with this idea in mind that we propose RN-VID (standing for RetinaNet-VIDeo), a novel approach to video object detection. Our contributions are twofold. First, we propose a new architec…

    Submitted 2 April, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

  37. arXiv:2003.04468  [pdf, other]

    cs.CV

    Tracking Road Users using Constraint Programming

    Authors: Alexandre Pineault, Guillaume-Alexandre Bilodeau, Gilles Pesant

    Abstract: In this paper, we aim at improving the tracking of road users in urban scenes. We present a constraint programming (CP) approach for the data association phase found in the tracking-by-detection paradigm of the multiple object tracking (MOT) problem. Such an approach can solve the data association problem more efficiently than graph-based methods and can handle better the combinatorial explosion o…

    Submitted 9 March, 2020; originally announced March 2020.

  38. arXiv:2002.05540  [pdf, other]

    cs.CV

    SpotNet: Self-Attention Multi-Task Network for Object Detection

    Authors: Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Maguelonne Héritier

    Abstract: Humans are very good at directing their visual attention toward relevant areas when they search for different types of objects. For instance, when we search for cars, we will look at the streets, not at the top of buildings. The motivation of this paper is to train a network to do the same via a multi-task learning approach. To train visual attention, we produce foreground/background segmentation…

    Submitted 11 June, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

  39. arXiv:1911.13114  [pdf, other]

    cs.CV

    Color inference from semantic labeling for person search in videos

    Authors: Jules Simon, Guillaume-Alexandre Bilodeau, David Steele, Harshad Mahadik

    Abstract: We propose an explainable model to generate semantic color labels for person search. In this context, persons are described from their semantic parts, such as hat, shirt, etc. Person search consists in looking for people based on these descriptions. In this work, we aim to improve the accuracy of color labels for people. Our goal is to handle the high variability of human perception. Existing solu…

    Submitted 6 April, 2020; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: 8 pages, 7 figures, ICIAR 2020

  40. arXiv:1910.11509  [pdf, other]

    cs.LG cs.CV eess.IV

    Deep 1D-Convnet for accurate Parkinson disease detection and severity prediction from gait

    Authors: Imanne El Maachi, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: Diagnosing Parkinson's disease is a complex task that requires the evaluation of several motor and non-motor symptoms. During diagnosis, gait abnormalities are among the important symptoms that physicians should consider. However, gait evaluation is challenging and relies on the expertise and subjectivity of clinicians. In this context, the use of an intelligent gait analysis algorithm may assist…

    Submitted 16 May, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Comments: Source code available at https://github.com/imanneelmaachi/Parkinson-disease-detection-and-severity-prediction-from-gait

    Journal ref: Expert Systems with Applications, 113075 (2019)

  41. arXiv:1905.06381  [pdf, other]

    cs.CV

    Tracking in Urban Traffic Scenes from Background Subtraction and Object Detection

    Authors: Hui-Lee Ooi, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: In this paper, we propose to combine detections from background subtraction and from a multiclass object detector for multiple object tracking (MOT) in urban traffic scenes. These objects are associated across frames using spatial, colour and class label information, and trajectory prediction is evaluated to yield the final MOT outputs. The proposed method was tested on the Urban tracker dataset a…

    Submitted 15 May, 2019; originally announced May 2019.

  42. arXiv:1903.12049  [pdf, other]

    cs.CV

    Road User Detection in Videos

    Authors: Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Pierre Gravel

    Abstract: Successive frames of a video are highly redundant, and the most popular object detection methods do not take advantage of this fact. Using multiple consecutive frames can improve detection of small objects or difficult examples and can improve speed and detection consistency in a video sequence, for instance by interpolating features between frames. In this work, a novel approach is introduced to…

    Submitted 28 March, 2019; originally announced March 2019.

  43. arXiv:1903.11040  [pdf, other]

    cs.LG stat.ML

    Adversarially Learned Abnormal Trajectory Classifier

    Authors: Pankaj Raj Roy, Guillaume-Alexandre Bilodeau

    Abstract: We address the problem of abnormal event detection from trajectory data. In this paper, a new adversarial approach is proposed for building a deep neural network binary classifier, trained in an unsupervised fashion, that can distinguish normal from abnormal trajectory-based events without the need for setting manual detection threshold. Inspired by the generative adversarial network (GAN) framewo…

    Submitted 3 April, 2019; v1 submitted 26 March, 2019; originally announced March 2019.

    Comments: Accepted for the 16th Conference on Computer and Robot Vision (CRV) 2019

  44. arXiv:1809.02851  [pdf, other]

    cs.CV

    Online Mutual Foreground Segmentation for Multispectral Stereo Videos

    Authors: Pierre-Luc St-Charles, Guillaume-Alexandre Bilodeau, Robert Bergevin

    Abstract: The segmentation of video sequences into foreground and background regions is a low-level process commonly used in video content analysis and smart surveillance applications. Using a multispectral camera setup can improve this process by providing more diverse data to help identify objects despite adverse imaging conditions. The registration of several data sources is however not trivial if the ap…

    Submitted 21 December, 2018; v1 submitted 8 September, 2018; originally announced September 2018.

    Comments: Preprint accepted for publication in IJCV (December 2018)

  45. arXiv:1809.02073  [pdf, other]

    cs.CV

    Multiple Object Tracking in Urban Traffic Scenes with a Multiclass Object Detector

    Authors: Hui-Lee Ooi, Guillaume-Alexandre Bilodeau, Nicolas Saunier, David-Alexandre Beaupré

    Abstract: Multiple object tracking (MOT) in urban traffic aims to produce the trajectories of the different road users that move across the field of view with different directions and speeds and that can have varying appearances and sizes. Occlusions and interactions among the different objects are expected and common due to the nature of urban road traffic. In this work, a tracking framework employing clas…

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: 13th International Symposium on Visual Computing (ISVC)

  46. arXiv:1809.00957  [pdf, other]

    cs.CV cs.LG stat.ML

    Road User Abnormal Trajectory Detection using a Deep Autoencoder

    Authors: Pankaj Raj Roy, Guillaume-Alexandre Bilodeau

    Abstract: In this paper, we focus on the development of a method that detects abnormal trajectories of road users at traffic intersections. The main difficulty with this is the fact that there are very few abnormal data and the normal ones are insufficient for the training of any kinds of machine learning model. To tackle these problems, we proposed the solution of using a deep autoencoder network trained s…

    Submitted 25 August, 2018; originally announced September 2018.

    Comments: This paper has been accepted for oral presentation at ISVC'18

  47. arXiv:1808.07349  [pdf, other]

    cs.CV

    Multi-Branch Siamese Networks with Online Selection for Object Tracking

    Authors: Zhenxi Li, Guillaume-Alexandre Bilodeau, Wassim Bouachir

    Abstract: In this paper, we propose a robust object tracking algorithm based on a branch selection mechanism to choose the most efficient object representations from multi-branch siamese networks. While most deep learning trackers use a single CNN for target representation, the proposed Multi-Branch Siamese Tracker (MBST) employs multiple branches of CNNs pre-trained for different tasks, and used for variou…

    Submitted 31 August, 2018; v1 submitted 22 August, 2018; originally announced August 2018.

    Comments: ISVC2018, oral presentation

  48. arXiv:1801.09646  [pdf, other]

    cs.CV

    Improving Multiple Object Tracking with Optical Flow and Edge Preprocessing

    Authors: David-Alexandre Beaupré, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: In this paper, we present a new method for detecting road users in an urban environment which leads to an improvement in multiple object tracking. Our method takes as an input a foreground image and improves the object detection and segmentation. This new image can be used as an input to trackers that use foreground blobs from background subtraction. The first step is to create foreground images f…

    Submitted 29 January, 2018; originally announced January 2018.

  49. arXiv:1801.03551  [pdf, other]

    cs.CV

    From Superpixel to Human Shape Modelling for Carried Object Detection

    Authors: Farnoosh Ghadiri, Robert Bergevin, Guillaume-Alexandre Bilodeau

    Abstract: Detecting carried objects is one of the requirements for developing systems to reason about activities involving people and objects. We present an approach to detect carried objects from a single video frame with a novel method that incorporates features from multiple scales. Initially, a foreground mask in a video frame is segmented into multi-scale superpixels. Then the human-like regions in the…

    Submitted 10 January, 2018; originally announced January 2018.

  50. Domain-Specific Face Synthesis for Video Face Recognition from a Single Sample Per Person

    Authors: Fania Mokhayeri, Eric Granger, Guillaume-Alexandre Bilodeau

    Abstract: The performance of still-to-video FR systems can decline significantly because faces captured in unconstrained operational domain (OD) over multiple video cameras have a different underlying data distribution compared to faces captured under controlled conditions in the enrollment domain (ED) with a still camera. This is particularly true when individuals are enrolled to the system using a single…

    Submitted 1 October, 2018; v1 submitted 6 January, 2018; originally announced January 2018.

    Journal ref: Transactions on Information Forensics and Security, Vol. 14, Issue 3, pp. 757-772, 2018