Skip to main content

Showing 1–8 of 8 results for author: Toisoul, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.02599  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Meta 3D Gen

    Authors: Raphael Bensadoun, Tom Monnier, Yanir Kleiman, Filippos Kokkinos, Yawar Siddiqui, Mahendra Kariya, Omri Harosh, Roman Shapovalov, Benjamin Graham, Emilien Garreau, Animesh Karnewar, Ang Cao, Idan Azuri, Iurii Makarov, Eric-Tuan Le, Antoine Toisoul, David Novotny, Oran Gafni, Natalia Neverova, Andrea Vedaldi

    Abstract: We introduce Meta 3D Gen (3DGen), a new state-of-the-art, fast pipeline for text-to-3D asset generation. 3DGen offers 3D asset creation with high prompt fidelity and high-quality 3D shapes and textures in under a minute. It supports physically-based rendering (PBR), necessary for 3D asset relighting in real-world applications. Additionally, 3DGen supports generative retexturing of previously gener… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2404.07178  [pdf, other

    cs.CV

    Move Anything with Layered Scene Diffusion

    Authors: Jiawei Ren, Mengmeng Xu, Jui-Chieh Wu, Ziwei Liu, Tao Xiang, Antoine Toisoul

    Abstract: Diffusion models generate images with an unprecedented level of quality, but how can we freely rearrange image layouts? Recent works generate controllable scenes via learning spatially disentangled latent codes, but these methods do not apply to diffusion models due to their fixed forward process. In this work, we propose SceneDiffusion to optimize a layered scene representation during the diffusi… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 camera-ready

  3. arXiv:2303.17688  [pdf, ps, other

    cs.CV

    Learning Garment DensePose for Robust War** in Virtual Try-On

    Authors: Aiyu Cui, Sen He, Tao Xiang, Antoine Toisoul

    Abstract: Virtual try-on, i.e making people virtually try new garments, is an active research area in computer vision with great commercial applications. Current virtual try-on methods usually work in a two-stage pipeline. First, the garment image is warped on the person's pose using a flow estimation network. Then in the second stage, the warped garment is fused with the person image to render a new try-on… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

    Comments: 6 pages

  4. arXiv:2101.08085  [pdf, other

    cs.CV

    Few-shot Action Recognition with Prototype-centered Attentive Learning

    Authors: Xiatian Zhu, Antoine Toisoul, Juan-Manuel Perez-Rua, Li Zhang, Brais Martinez, Tao Xiang

    Abstract: Few-shot action recognition aims to recognize action classes with few training samples. Most existing methods adopt a meta-learning approach with episodic training. In each episode, the few samples in a meta-training task are split into support and query sets. The former is used to build a classifier, which is then evaluated on the latter using a query-centered loss for model updating. There are h… ▽ More

    Submitted 28 March, 2021; v1 submitted 20 January, 2021; originally announced January 2021.

    Comments: 10 pages, 4 figures

    Journal ref: BMVC 2021

  5. arXiv:2007.01883  [pdf, other

    cs.CV

    Egocentric Action Recognition by Video Attention and Temporal Context

    Authors: Juan-Manuel Perez-Rua, Antoine Toisoul, Brais Martinez, Victor Escorcia, Li Zhang, Xiatian Zhu, Tao Xiang

    Abstract: We present the submission of Samsung AI Centre Cambridge to the CVPR2020 EPIC-Kitchens Action Recognition Challenge. In this challenge, action recognition is posed as the problem of simultaneously predicting a single `verb' and `noun' class label given an input trimmed video clip. That is, a `verb' and a `noun' together define a compositional `action' class. The challenging aspects of this real-li… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: EPIC-Kitchens challenges@CVPR 2020

  6. arXiv:2004.01278  [pdf, other

    cs.CV

    Knowing What, Where and When to Look: Efficient Video Action Modeling with Attention

    Authors: Juan-Manuel Perez-Rua, Brais Martinez, Xiatian Zhu, Antoine Toisoul, Victor Escorcia, Tao Xiang

    Abstract: Attentive video modeling is essential for action recognition in unconstrained videos due to their rich yet redundant information over space and time. However, introducing attention in a deep neural network for action recognition is challenging for two reasons. First, an effective attention module needs to learn what (objects and their local motion patterns), where (spatially), and when (temporally… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

  7. arXiv:1906.06196  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Factorized Higher-Order CNNs with an Application to Spatio-Temporal Emotion Estimation

    Authors: Jean Kossaifi, Antoine Toisoul, Adrian Bulat, Yannis Panagakis, Timothy Hospedales, Maja Pantic

    Abstract: Training deep neural networks with spatio-temporal (i.e., 3D) or multidimensional convolutions of higher-order is computationally challenging due to millions of unknown parameters across dozens of layers. To alleviate this, one approach is to apply low-rank tensor decompositions to convolution kernels in order to compress the network and reduce its number of parameters. Alternatively, new convolut… ▽ More

    Submitted 31 March, 2020; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: IEEE CVPR 2020

  8. SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild

    Authors: Jean Kossaifi, Robert Walecki, Yannis Panagakis, Jie Shen, Maximilian Schmitt, Fabien Ringeval, **g Han, Vedhas Pandit, Antoine Toisoul, Bjorn Schuller, Kam Star, Elnar Hajiyev, Maja Pantic

    Abstract: Natural human-computer interaction and audio-visual human behaviour sensing systems, which would achieve robust performance in-the-wild are more needed than ever as digital devices are increasingly becoming an indispensable part of our life. Accurately annotated real-world data are the crux in devising such systems. However, existing databases usually consider controlled settings, low demographic… ▽ More

    Submitted 18 November, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019