Showing 1–18 of 18 results for author: Hérault, R

  1. arXiv:2312.07370  [pdf, other]

    cs.CV

    Adversarial Semi-Supervised Domain Adaptation for Semantic Segmentation: A New Role for Labeled Target Samples

    Authors: Marwa Kechaou, Mokhtar Z. Alaya, Romain Hérault, Gilles Gasso

    Abstract: Adversarial learning baselines for domain adaptation (DA) approaches in the context of semantic segmentation are underexplored in the semi-supervised framework. These baselines involve solely the available labeled target samples in the supervision loss. In this work, we propose to enhance their usefulness on both semantic segmentation and the single domain classifier neural networks. We design new tr…

    Submitted 12 December, 2023; originally announced December 2023.

  2. arXiv:2309.14394  [pdf, other]

    cs.CL cs.AI cs.LG

    Multiple Noises in Diffusion Model for Semi-Supervised Multi-Domain Translation

    Authors: Tsiry Mayet, Simon Bernard, Clement Chatelain, Romain Herault

    Abstract: Domain-to-domain translation involves generating a target domain sample given a condition in the source domain. Most existing methods focus on fixed input and output domains, i.e. they only work for specific configurations (i.e. for two domains, either $D_1\rightarrow{}D_2$ or $D_2\rightarrow{}D_1$). This paper proposes Multi-Domain Diffusion (MDD), a conditional diffusion framework for multi-doma…

    Submitted 25 September, 2023; originally announced September 2023.

  3. arXiv:2309.06006  [pdf, ps, other]

    cs.CV cs.AI

    SoccerNet 2023 Challenges Results

    Authors: Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, et al. (77 additional authors not shown)

    Abstract: The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, fo…

    Submitted 12 September, 2023; originally announced September 2023.

  4. arXiv:2309.01270  [pdf, other]

    cs.CV cs.AI cs.LG

    COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers

    Authors: Julien Denize, Mykola Liashuha, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault

    Abstract: We present COMEDIAN, a novel pipeline to initialize spatiotemporal transformers for action spotting, which involves self-supervised learning and knowledge distillation. Action spotting is a timestamp-level temporal action detection task. Our pipeline consists of three steps, with two initialization stages. First, we perform self-supervised initialization of a spatial transformer using short videos…

    Submitted 26 October, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: Source code is available here: https://github.com/juliendenize/eztorch

  5. Similarity Contrastive Estimation for Image and Video Soft Contrastive Self-Supervised Learning

    Authors: Julien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault

    Abstract: Contrastive representation learning has proven to be an effective self-supervised learning method for images and videos. Most successful approaches are based on Noise Contrastive Estimation (NCE) and use different views of an instance as positives that should be contrasted with other instances, called negatives, that are considered as noise. However, several instances in a dataset are drawn from t…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Extended version of our WACV 2023 paper to video self-supervised learning

  6. arXiv:2212.03361  [pdf, other]

    cs.LG cs.CV

    Domain Translation via Latent Space Mapping

    Authors: Tsiry Mayet, Simon Bernard, Clement Chatelain, Romain Herault

    Abstract: In this paper, we investigate the problem of multi-domain translation: given an element $a$ of domain $A$, we would like to generate a corresponding $b$ sample in another domain $B$, and vice versa. Acquiring supervision in multiple domains can be a tedious task, also we propose to learn this translation from one domain to another when supervision is available as a pair $(a,b)\sim A\times B$ and l…

    Submitted 6 December, 2022; originally announced December 2022.

  7. arXiv:2206.07431  [pdf, other]

    cs.CV cs.AI

    Physically-admissible polarimetric data augmentation for road-scene analysis

    Authors: Cyprien Ruffino, Rachel Blin, Samia Ainouz, Gilles Gasso, Romain Hérault, Fabrice Meriaudeau, Stéphane Canu

    Abstract: Polarimetric imaging, along with deep learning, has shown improved performances on different tasks including scene analysis. However, its robustness may be questioned because of the small size of the training datasets. Though the issue could be solved by data augmentation, polarization modalities are subject to physical feasibility constraints unaddressed by classical data augmentation techniques.…

    Submitted 15 June, 2022; originally announced June 2022.

  8. arXiv:2111.14585  [pdf, other]

    cs.CV cs.AI cs.LG

    Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning

    Authors: Julien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault, Stéphane Canu

    Abstract: Contrastive representation learning has proven to be an effective self-supervised learning method. Most successful approaches are based on Noise Contrastive Estimation (NCE) and use different views of an instance as positives that should be contrasted with other instances, called negatives, that are considered as noise. However, several instances in a dataset are drawn from the same distribution a…

    Submitted 29 September, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: Accepted to IEEE Winter Conference on Applications of Computer Vision (WACV) 2023

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  9. arXiv:2010.01045  [pdf, other]

    cs.LG stat.ML

    Open Set Domain Adaptation using Optimal Transport

    Authors: Marwa Kechaou, Romain Hérault, Mokhtar Z. Alaya, Gilles Gasso

    Abstract: We present a 2-step optimal transport approach that performs a mapping from a source distribution to a target distribution. Here, the target has the particularity to present new classes not present in the source domain. The first step of the approach aims at rejecting the samples issued from these new classes using an optimal transport plan. The second step solves the target (class ratio) shift st…

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: Accepted at ECML-PKDD 2020, Acknowledgements added

  10. arXiv:2002.01281  [pdf, other]

    eess.IV cs.CV cs.LG

    Pixel-wise Conditioned Generative Adversarial Networks for Image Synthesis and Completion

    Authors: Cyprien Ruffino, Romain Hérault, Eric Laloy, Gilles Gasso

    Abstract: Generative Adversarial Networks (GANs) have proven successful for unsupervised image generation. Several works have extended GANs to image inpainting by conditioning the generation with parts of the image to be reconstructed. Despite their success, these methods have limitations in settings where only a small subset of the image pixels is known beforehand. In this paper we investigate the effectiv…

    Submitted 4 February, 2020; originally announced February 2020.

  11. arXiv:1911.00689  [pdf, other]

    cs.CV cs.LG eess.IV

    Pixel-wise Conditioning of Generative Adversarial Networks

    Authors: Cyprien Ruffino, Romain Hérault, Eric Laloy, Gilles Gasso

    Abstract: Generative Adversarial Networks (GANs) have proven successful for unsupervised image generation. Several works extended GANs to image inpainting by conditioning the generation with parts of the image one wants to reconstruct. However, these methods have limitations in settings where only a small subset of the image pixels is known beforehand. In this paper, we study the effectiveness of conditioni…

    Submitted 2 November, 2019; originally announced November 2019.

  12. arXiv:1905.08613  [pdf, other]

    cs.CV cs.LG eess.IV

    Dilated Spatial Generative Adversarial Networks for Ergodic Image Generation

    Authors: Cyprien Ruffino, Romain Hérault, Eric Laloy, Gilles Gasso

    Abstract: Generative models have recently received renewed attention as a result of adversarial learning. Generative adversarial networks consist of a sample generation model and a discrimination model able to distinguish between genuine and synthetic samples. In combination with convolutional (for the discriminator) and de-convolutional (for the generator) layers, they are particularly suitable for image ge…

    Submitted 15 May, 2019; originally announced May 2019.

    Journal ref: Conférence sur l'Apprentissage Automatique, Jun 2018, Rouen, France

  13. arXiv:1812.04748  [pdf, other]

    cs.CV

    An efficient supervised dictionary learning method for audio signal recognition

    Authors: Imad Rida, Romain Hérault, Gilles Gasso

    Abstract: Machine hearing or listening represents an emerging area. Conventional approaches rely on the design of handcrafted features that are specialized to a specific audio task and can hardly be generalized to other audio fields. For example, Mel-Frequency Cepstral Coefficients (MFCCs) and their variants were successfully applied to computational auditory scene recognition while Chroma vectors are good at music…

    Submitted 11 December, 2018; originally announced December 2018.

  14. arXiv:1709.01867  [pdf, other]

    cs.LG stat.ML

    Neural Networks Regularization Through Class-wise Invariant Representation Learning

    Authors: Soufiane Belharbi, Clément Chatelain, Romain Hérault, Sébastien Adam

    Abstract: Training deep neural networks is known to require a large number of training samples. However, in many applications only few training samples are available. In this work, we tackle the issue of training neural networks for classification task when few training samples are available. We attempt to solve this issue by proposing a new regularization term that constrains the hidden layers of a network…

    Submitted 22 December, 2017; v1 submitted 6 September, 2017; originally announced September 2017.

    Comments: Submitted to ELSEVIER, 13 pages, 5 figures

  15. arXiv:1708.04975  [pdf, other]

    stat.ML cs.CV physics.geo-ph

    Training-image based geostatistical inversion using a spatial generative adversarial neural network

    Authors: Eric Laloy, Romain Hérault, Diederik Jacques, Niklas Linde

    Abstract: Probabilistic inversion within a multiple-point statistics framework is often computationally prohibitive for high-dimensional problems. To partly address this, we introduce and evaluate a new training-image based inversion approach for complex geologic media. Our approach relies on a deep neural network of the generative adversarial network (GAN) type. After training using a training image (TI),…

    Submitted 8 January, 2019; v1 submitted 16 August, 2017; originally announced August 2017.

    Journal ref: Water Resources Research, 54, 381-406, 2018

  16. arXiv:1508.04153  [pdf, ps, other]

    stat.AP cs.HC

    Automatic sensor-based detection and classification of climbing activities

    Authors: Jérémie Boulanger, Ludovic Seifert, Romain Hérault, Jean-Francois Coeurjolly

    Abstract: This article presents a method to automatically detect and classify climbing activities using inertial measurement units (IMUs) attached to the wrists, feet and pelvis of the climber. The IMUs record limb acceleration and angular velocity. Detection requires a learning phase with manual annotation to construct the statistical models used in the cusum algorithm. Full-body activity is then classifie…

    Submitted 23 June, 2015; originally announced August 2015.

  17. arXiv:1504.07550  [pdf, other]

    cs.LG stat.ML

    Deep Neural Networks Regularization for Structured Output Prediction

    Authors: Soufiane Belharbi, Romain Hérault, Clément Chatelain, Sébastien Adam

    Abstract: A deep neural network model is a powerful framework for learning representations. Usually, it is used to learn the relation $x \to y$ by exploiting the regularities in the input $x$. In structured output prediction problems, $y$ is multi-dimensional and structural relations often exist between the dimensions. The motivation of this work is to learn the output dependencies that may lie in the outpu…

    Submitted 30 October, 2017; v1 submitted 28 April, 2015; originally announced April 2015.

    Comments: Submitted to Neurocomputing, 8 figures

  18. arXiv:1401.1489  [pdf, other]

    stat.ML cs.CV cs.LG physics.data-an stat.AP

    Key point selection and clustering of swimmer coordination through Sparse Fisher-EM

    Authors: John Komar, Romain Hérault, Ludovic Seifert

    Abstract: To answer the existence of optimal swimmer learning/teaching strategies, this work introduces a two-level clustering in order to analyze temporal dynamics of motor learning in breaststroke swimming. Each level has been performed through Sparse Fisher-EM, an unsupervised framework which can be applied efficiently on large and correlated datasets. The induced sparsity selects key points of the coord…

    Submitted 7 January, 2014; originally announced January 2014.

    Comments: Presented at ECML/PKDD 2013 Workshop on Machine Learning and Data Mining for Sports Analytics (MLSA2013)