Skip to main content

Showing 1–9 of 9 results for author: Vaufreydaz, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2209.00383  [pdf, other

    cs.CV stat.ML

    TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Yuan Yuan, Yuming Du, Maomao Li, Shell Xu Hu, James L Crowley, Dominique Vaufreydaz

    Abstract: In this paper, we describe a graph-based algorithm that uses the features obtained by a self-supervised transformer to detect and segment salient objects in images and videos. With this approach, the image patches that compose an image or video are organised into a fully connected graph, where the edge between each pair of patches is labeled with a similarity score between patches using features l… ▽ More

    Submitted 5 December, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.11539

  2. arXiv:2202.11539  [pdf, other

    cs.CV stat.ML

    Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Shell Hu, Yuan Yuan, James Crowley, Dominique Vaufreydaz

    Abstract: Transformers trained with self-supervised learning using self-distillation loss (DINO) have been shown to produce attention maps that highlight salient foreground objects. In this paper, we demonstrate a graph-based approach that uses the self-supervised transformer features to discover an object from an image. Visual tokens are viewed as nodes in a weighted graph with edges representing a connect… ▽ More

    Submitted 24 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Journal ref: CVPR 2022 - Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States

  3. arXiv:2110.05205  [pdf, other

    cs.RO stat.ML

    Navigation In Urban Environments Amongst Pedestrians Using Multi-Objective Deep Reinforcement Learning

    Authors: Niranjan Deshpande, Dominique Vaufreydaz, Anne Spalanzani

    Abstract: Urban autonomous driving in the presence of pedestrians as vulnerable road users is still a challenging and less examined research problem. This work formulates navigation in urban environments as a multi objective reinforcement learning problem. A deep learning variant of thresholded lexicographic Q-learning is presented for autonomous navigation amongst pedestrians. The multi objective DQN agent… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Journal ref: 24th IEEE International Conference on Intelligent Transportation Systems - ITSC2021, Sep 2021, Indianapolis, United States

  4. SocialInteractionGAN: Multi-person Interaction Sequence Generation

    Authors: Louis Airale, Dominique Vaufreydaz, Xavier Alameda-Pineda

    Abstract: Prediction of human actions in social interactions has important applications in the design of social robots or artificial avatars. In this paper, we focus on a unimodal representation of interactions and propose to tackle interaction generation in a data-driven fashion. In particular, we model human interaction generation as a discrete multi-sequence generation problem and present SocialInteracti… ▽ More

    Submitted 12 September, 2022; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: IEEE Transactions on Affective Computing, Institute of Electrical and Electronics Engineers, 2022

  5. arXiv:2010.13407  [pdf, other

    cs.NE cs.RO stat.ML

    Behavioral decision-making for urban autonomous driving in the presence of pedestrians using Deep Recurrent Q-Network

    Authors: Niranjan Deshpande, Dominique Vaufreydaz, Anne Spalanzani

    Abstract: Decision making for autonomous driving in urban environments is challenging due to the complexity of the road structure and the uncertainty in the behavior of diverse road users. Traditional methods consist of manually designed rules as the driving policy, which require expert domain knowledge, are difficult to generalize and might give sub-optimal results as the environment gets complex. Whereas,… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Journal ref: 16th International Conference on Control, Automation, Robotics and Vision (ICARCV), Dec 2020, Shenzhen, China

  6. arXiv:2009.07013  [pdf, other

    cs.CV stat.ML

    Group-Level Emotion Recognition Using a Unimodal Privacy-Safe Non-Individual Approach

    Authors: Anastasia Petrova, Dominique Vaufreydaz, Philippe Dessus

    Abstract: This article presents our unimodal privacy-safe and non-individual proposal for the audio-video group emotion recognition subtask at the Emotion Recognition in the Wild (EmotiW) Challenge 2020 1. This sub challenge aims to classify in the wild videos into three categories: Positive, Neutral and Negative. Recent deep learning models have shown tremendous advances in analyzing interactions between p… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

    Journal ref: EmotiW2020 Challenge at the 22nd ACM International Conference on Multimodal Interaction (ICMI2020), Oct 2020, Utrecht, Netherlands

  7. arXiv:1904.08155  [pdf, other

    stat.ML cs.CV cs.LG

    Deep learning investigation for chess player attention prediction using eye-tracking and game data

    Authors: Justin Le Louedec, Thomas Guntz, James Crowley, Dominique Vaufreydaz

    Abstract: This article reports on an investigation of the use of convolutional neural networks to predict the visual attention of chess players. The visual attention model described in this article has been created to generate saliency maps that capture hierarchical and spatial features of chessboard, in order to predict the probability fixation for individual pixels Using a skip-layer architecture of an au… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

    Journal ref: ACM Symposium On Eye Tracking Research & Applications (ETRA 2019), Jun 2019, Denver, United States.

  8. arXiv:1809.06045  [pdf, other

    stat.ML cs.CV cs.LG

    Building Prior Knowledge: A Markov Based Pedestrian Prediction Model Using Urban Environmental Data

    Authors: Pavan Vasishta, Dominique Vaufreydaz, Anne Spalanzani

    Abstract: Autonomous Vehicles navigating in urban areas have a need to understand and predict future pedestrian behavior for safer navigation. This high level of situational awareness requires observing pedestrian behavior and extrapolating their positions to know future positions. While some work has been done in this field using Hidden Markov Models (HMMs), one of the few observed drawbacks of the method… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: 15 th International Conference on Control, Automation, Robotics and Vision (ICARCV 2018), Nov 2018, Singapore, Singapore

  9. arXiv:1710.04486  [pdf, other

    cs.HC cs.CV stat.ML

    Multimodal Observation and Interpretation of Subjects Engaged in Problem Solving

    Authors: Thomas Guntz, Raffaella Balzarini, Dominique Vaufreydaz, James L. Crowley

    Abstract: In this paper we present the first results of a pilot experiment in the capture and interpretation of multimodal signals of human experts engaged in solving challenging chess problems. Our goal is to investigate the extent to which observations of eye-gaze, posture, emotion and other physiological signals can be used to model the cognitive state of subjects, and to explore the integration of mult… ▽ More

    Submitted 12 October, 2017; originally announced October 2017.

    Journal ref: 1st Workshop on "Behavior, Emotion and Representation: Building Blocks of Interaction'', Oct 2017, Bielefeld, Germany. 2017