Skip to main content

Showing 1–14 of 14 results for author: Granger, E

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.09168  [pdf, other

    eess.IV cs.CV cs.LG

    SR-CACO-2: A Dataset for Confocal Fluorescence Microscopy Image Super-Resolution

    Authors: Soufiane Belharbi, Mara KM Whitford, Phuong Hoang, Shakeeb Murtaza, Luke McCaffrey, Eric Granger

    Abstract: Confocal fluorescence microscopy is one of the most accessible and widely used imaging techniques for the study of biological processes. Scanning confocal microscopy allows the capture of high-quality images from 3D samples, yet suffers from well-known limitations such as photobleaching and phototoxicity of specimens caused by intense light exposure, which limits its use in some applications, espe… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 13 figures

  2. arXiv:2403.10488  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    Joint Multimodal Transformer for Emotion Recognition in the Wild

    Authors: Paul Waligora, Haseeb Aslam, Osama Zeeshan, Soufiane Belharbi, Alessandro Lameiras Koerich, Marco Pedersoli, Simon Bacon, Eric Granger

    Abstract: Multimodal emotion recognition (MMER) systems typically outperform unimodal systems by leveraging the inter- and intra-modal relationships between, e.g., visual, textual, physiological, and auditory modalities. This paper proposes an MMER method that relies on a joint multimodal transformer (JMT) for fusion with key-based cross-attention. This framework can exploit the complementary nature of dive… ▽ More

    Submitted 20 April, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures, 6 tables, CVPRw 2024

  3. arXiv:2304.07958  [pdf, other

    cs.CV cs.SD eess.AS

    Recursive Joint Attention for Audio-Visual Fusion in Regression based Emotion Recognition

    Authors: R Gnana Praveen, Eric Granger, Patrick Cardinal

    Abstract: In video-based emotion recognition (ER), it is important to effectively leverage the complementary relationship among audio (A) and visual (V) modalities, while retaining the intra-modal characteristics of individual modalities. In this paper, a recursive joint attention model is proposed along with long short-term memory (LSTM) modules for the fusion of vocal and facial expressions in regression-… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

  4. arXiv:2205.05841  [pdf, other

    eess.IV cs.CV cs.LG

    Leveraging Uncertainty for Deep Interpretable Classification and Weakly-Supervised Segmentation of Histology Images

    Authors: Soufiane Belharbi, Jérôme Rony, Jose Dolz, Ismail Ben Ayed, Luke McCaffrey, Eric Granger

    Abstract: Trained using only image class label, deep weakly supervised methods allow image classification and ROI segmentation for interpretability. Despite their success on natural images, they face several challenges over histology data where ROI are visually similar to background making models vulnerable to high pixel-wise false positives. These methods lack mechanisms for modeling explicitly non-discrim… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: 4 pages, 4 figures

  5. arXiv:2203.14779  [pdf, other

    cs.CV cs.HC cs.SD eess.AS

    A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition

    Authors: Gnana Praveen Rajasekar, Wheidima Carneiro de Melo, Nasib Ullah, Haseeb Aslam, Osama Zeeshan, Théo Denorme, Marco Pedersoli, Alessandro Koerich, Simon Bacon, Patrick Cardinal, Eric Granger

    Abstract: Multimodal emotion recognition has recently gained much attention since it can leverage diverse and complementary relationships over multiple modalities (e.g., audio, visual, biosignals, etc.), and can provide some robustness to noisy modalities. Most state-of-the-art methods for audio-visual (A-V) fusion rely on recurrent networks or conventional attention mechanisms that do not effectively lever… ▽ More

    Submitted 20 April, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:2111.05222

  6. arXiv:2201.02445  [pdf, other

    eess.IV cs.CV cs.LG

    Negative Evidence Matters in Interpretable Histology Image Classification

    Authors: Soufiane Belharbi, Marco Pedersoli, Ismail Ben Ayed, Luke McCaffrey, Eric Granger

    Abstract: Using only global image-class labels, weakly-supervised learning methods, such as class activation map**, allow training CNNs to jointly classify an image, and locate regions of interest associated with the predicted class. However, without any guidance at the pixel level, such methods may yield inaccurate regions. This problem is known to be more challenging with histology images than with natu… ▽ More

    Submitted 5 May, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: 9 figures

  7. arXiv:2111.05222  [pdf, other

    cs.CV cs.SD eess.AS eess.IV

    Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition

    Authors: Gnana Praveen R, Eric Granger, Patrick Cardinal

    Abstract: Multimodal analysis has recently drawn much interest in affective computing, since it can improve the overall accuracy of emotion recognition over isolated uni-modal approaches. The most effective techniques for multimodal emotion recognition efficiently leverage diverse and complimentary sources of information, such as facial, vocal, and physiological modalities, to provide comprehensive feature… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted in FG2021

  8. arXiv:2012.13736  [pdf, other

    cs.CV eess.IV

    Image Synthesis with Adversarial Networks: a Comprehensive Survey and Case Studies

    Authors: Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, Huiyu Zhou, Ruili Wang, M. Emre Celebi, Jie Yang

    Abstract: Generative Adversarial Networks (GANs) have been extremely successful in various application domains such as computer vision, medicine, and natural language processing. Moreover, transforming an object or person to a desired shape become a well-studied research in the GANs. GANs are powerful models for learning complex distributions to synthesize semantically meaningful samples. However, there is… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

  9. arXiv:2002.04206  [pdf, other

    cs.CV eess.IV

    Dual-Triplet Metric Learning for Unsupervised Domain Adaptation in Video-Based Face Recognition

    Authors: George Ekladious, Hugo Lemoine, Eric Granger, Kaveh Kamali, Salim Moudache

    Abstract: The scalability and complexity of deep learning models remains a key issue in many of visual recognition applications like, e.g., video surveillance, where fine tuning with labeled image data from each new camera is required to reduce the domain shift between videos captured from the source domain, e.g., a laboratory setting, and the target domain, i.e, an operational environment. In many video su… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: Submitted too IJCNN2020

  10. arXiv:1910.14552  [pdf, other

    cs.CV eess.IV

    On the Interaction Between Deep Detectors and Siamese Trackers in Video Surveillance

    Authors: Madhu Kiran, Vivek Tiwari, Le Thanh Nguyen-Meidine, Eric Granger

    Abstract: Visual object tracking is an important function in many real-time video surveillance applications, such as localization and spatio-temporal recognition of persons. In real-world applications, an object detector and tracker must interact on a periodic basis to discover new objects, and thereby to initiate tracks. Periodic interactions with the detector can also allow the tracker to validate and/or… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: Presented in AVSS-2019 Conference

  11. arXiv:1909.03354  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Weakly-Supervised Learning Methods for Classification and Localization in Histology Images: A Survey

    Authors: Jérôme Rony, Soufiane Belharbi, Jose Dolz, Ismail Ben Ayed, Luke McCaffrey, Eric Granger

    Abstract: Using deep learning models to diagnose cancer from histology data presents several challenges. Cancer grading and localization of regions of interest (ROIs) in these images normally relies on both image- and pixel-level labels, the latter requiring a costly annotation process. Deep weakly-supervised object localization (WSOL) methods provide different strategies for low-cost training of deep learn… ▽ More

    Submitted 3 March, 2023; v1 submitted 7 September, 2019; originally announced September 2019.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2023:004

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2023)

  12. arXiv:1907.03196  [pdf, other

    cs.CV eess.AS eess.IV

    Multimodal Fusion with Deep Neural Networks for Audio-Video Emotion Recognition

    Authors: Juan D. S. Ortega, Mohammed Senoussaoui, Eric Granger, Marco Pedersoli, Patrick Cardinal, Alessandro L. Koerich

    Abstract: This paper presents a novel deep neural network (DNN) for multimodal fusion of audio, video and text modalities for emotion recognition. The proposed DNN architecture has independent and shared layers which aim to learn the representation for each modality, as well as the best combined representation to achieve the best prediction. Experimental results on the AVEC Sentiment Analysis in the Wild da… ▽ More

    Submitted 6 July, 2019; originally announced July 2019.

  13. Boundary loss for highly unbalanced segmentation

    Authors: Hoel Kervadec, Jihene Bouchtiba, Christian Desrosiers, Eric Granger, Jose Dolz, Ismail Ben Ayed

    Abstract: Widely used loss functions for CNN segmentation, e.g., Dice or cross-entropy, are based on integrals over the segmentation regions. Unfortunately, for highly unbalanced segmentations, such regional summations have values that differ by several orders of magnitude across classes, which affects training performance and stability. We propose a boundary loss, which takes the form of a distance metric… ▽ More

    Submitted 17 October, 2020; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: Runner-up for best paper award at MIDL 2019 [PMLR 102:285-296], invited for MedIA deep learning special issue (Volume 67, January 2021)

    Journal ref: MIDL 2019, PMLR 102:285-296 -- MedIA Volume 67, January 2021, 101851

  14. arXiv:1810.11641  [pdf, other

    cs.CV eess.IV

    Cross-Modal Distillation for RGB-Depth Person Re-Identification

    Authors: Frank Hafner, Amran Bhuiyan, Julian F. P. Kooij, Eric Granger

    Abstract: Person re-identification is a key challenge for surveillance across multiple sensors. Prompted by the advent of powerful deep learning models for visual recognition, and inexpensive RGB-D cameras and sensor-rich mobile robotic platforms, e.g. self-driving vehicles, we investigate the relatively unexplored problem of cross-modal re-identification of persons between RGB (color) and depth images. The… ▽ More

    Submitted 12 February, 2022; v1 submitted 27 October, 2018; originally announced October 2018.

    Journal ref: Computer Vision and Image Understanding, 103352 (2022)