Skip to main content

Showing 1–14 of 14 results for author: Roussos, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.04104  [pdf, other

    cs.CV

    3D Facial Expressions through Analysis-by-Neural-Synthesis

    Authors: George Retsinas, Panagiotis P. Filntisis, Radek Danecek, Victoria F. Abrevaya, Anastasios Roussos, Timo Bolkart, Petros Maragos

    Abstract: While existing methods for 3D face reconstruction from in-the-wild images excel at recovering the overall face shape, they commonly miss subtle, extreme, asymmetric, or rarely observed expressions. We improve upon these methods with SMIRK (Spatial Modeling for Image-based Reconstruction of Kinesics), which faithfully reconstructs expressive 3D faces from images. We identify two key limitations in… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  2. arXiv:2312.06613  [pdf, other

    cs.CV cs.SD eess.AS

    Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism

    Authors: Georgios Milis, Panagiotis P. Filntisis, Anastasios Roussos, Petros Maragos

    Abstract: Recent advances in deep learning for sequential data have given rise to fast and powerful models that produce realistic videos of talking humans. The state of the art in talking face generation focuses mainly on lip-syncing, being conditioned on audio clips. However, having the ability to synthesize talking humans from text transcriptions rather than audio is particularly beneficial for many appli… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  3. arXiv:2209.13971  [pdf, other

    cs.GR cs.CV

    3D Neural Sculpting (3DNS): Editing Neural Signed Distance Functions

    Authors: Petros Tzathas, Petros Maragos, Anastasios Roussos

    Abstract: In recent years, implicit surface representations through neural networks that encode the signed distance have gained popularity and have achieved state-of-the-art results in various tasks (e.g. shape representation, shape reconstruction, and learning shape priors). However, in contrast to conventional shape representations such as polygon meshes, the implicit representations cannot be easily edit… ▽ More

    Submitted 27 January, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: 14 pages, 10 figures, 3 tables

  4. arXiv:2209.01470  [pdf, other

    cs.CV

    Neural Sign Reenactor: Deep Photorealistic Sign Language Retargeting

    Authors: Christina O. Tze, Panagiotis P. Filntisis, Athanasia-Lida Dimou, Anastasios Roussos, Petros Maragos

    Abstract: In this paper, we introduce a neural rendering pipeline for transferring the facial expressions, head pose, and body movements of one person in a source video to another in a target video. We apply our method to the challenging case of Sign Language videos: given a source video of a sign language user, we can faithfully transfer the performed manual (e.g., handshape, palm orientation, movement, lo… ▽ More

    Submitted 30 May, 2023; v1 submitted 3 September, 2022; originally announced September 2022.

    Comments: Accepted at AI4CC Workshop at CVPR 2023

  5. arXiv:2207.11094  [pdf, other

    cs.CV

    Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

    Authors: Panagiotis P. Filntisis, George Retsinas, Foivos Paraperas-Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos

    Abstract: The recent state of the art on monocular 3D face reconstruction from image data has made some impressive advancements, thanks to the advent of Deep Learning. However, it has mostly focused on input coming from a single RGB image, overlooking the following important factors: a) Nowadays, the vast majority of facial image data of interest do not originate from single images but rather from videos, w… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  6. arXiv:2112.00585  [pdf, other

    cs.CV

    Neural Emotion Director: Speech-preserving semantic control of facial expressions in "in-the-wild" videos

    Authors: Foivos Paraperas Papantoniou, Panagiotis P. Filntisis, Petros Maragos, Anastasios Roussos

    Abstract: In this paper, we introduce a novel deep learning method for photo-realistic manipulation of the emotional state of actors in "in-the-wild" videos. The proposed method is based on a parametric 3D face representation of the actor in the input scene that offers a reliable disentanglement of the facial identity from the head pose and facial expressions. It then uses a novel deep domain translation fr… ▽ More

    Submitted 30 March, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: CVPR 2022 (oral). Project page: https://foivospar.github.io/NED/

  7. arXiv:2111.07902  [pdf, other

    cs.CV cs.AI

    Deep Semantic Manipulation of Facial Videos

    Authors: Girish Kumar Solanki, Anastasios Roussos

    Abstract: Editing and manipulating facial features in videos is an interesting and important field of research with a plethora of applications, ranging from movie post-production and visual effects to realistic avatars for video games and virtual assistants. Our method supports semantic video manipulation based on neural rendering and 3D-based facial expression modelling. We focus on interactive manipulatio… ▽ More

    Submitted 17 October, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: 4th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), European Conference on Computer Vision (ECCV), Tel Aviv, Israel, October 2022

  8. arXiv:2008.03913  [pdf, other

    cs.CR

    NFCGate: Opening the Door for NFC Security Research with a Smartphone-Based Toolkit

    Authors: Steffen Klee, Alexandros Roussos, Max Maass, Matthias Hollick

    Abstract: Near-Field Communication (NFC) is being used in a variety of security-critical applications, from access control to payment systems. However, NFC protocol analysis typically requires expensive or conspicuous dedicated hardware, or is severely limited on smartphones. In 2015, the NFCGate proof of concept aimed at solving this issue by providing capabilities for NFC analysis employing off-the-shelf… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: Accepted to Usenix WOOT'20. Source Code and binaries available at https://github.com/nfcgate/nfcgate

  9. arXiv:2006.10500  [pdf, ps, other

    cs.CV

    ReenactNet: Real-time Full Head Reenactment

    Authors: Mohammad Rami Koujan, Michail Christos Doukas, Anastasios Roussos, Stefanos Zafeiriou

    Abstract: Video-to-video synthesis is a challenging problem aiming at learning a translation function between a sequence of semantic maps and a photo-realistic video depicting the characteristics of a driving video. We propose a head-to-head system of our own implementation capable of fully transferring the human head 3D pose, facial expressions and eye gaze from a source to a target actor, while preserving… ▽ More

    Submitted 21 May, 2020; originally announced June 2020.

    Comments: to be published in 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)

  10. arXiv:2006.10499  [pdf, other

    cs.CV

    Real-Time Monocular 4D Face Reconstruction using the LSFM models

    Authors: Mohammad Rami Koujan, Nikolai Dochev, Anastasios Roussos

    Abstract: 4D face reconstruction from a single camera is a challenging task, especially when it is required to be performed in real time. We demonstrate a system of our own implementation that solves this task accurately and runs in real time on a commodity laptop, using a webcam as the only input. Our system is interactive, allowing the user to freely move their head and show various expressions while stan… ▽ More

    Submitted 21 May, 2020; originally announced June 2020.

    Comments: Published in Proceedings of the 15th ACM SIGGRAPH European Conference on Visual Media Production

  11. arXiv:2006.10199  [pdf, other

    cs.CV cs.LG eess.IV

    Head2Head++: Deep Facial Attributes Re-Targeting

    Authors: Michail Christos Doukas, Mohammad Rami Koujan, Viktoriia Sharmanska, Anastasios Roussos

    Abstract: Facial video re-targeting is a challenging problem aiming to modify the facial attributes of a target subject in a seamless manner by a driving monocular sequence. We leverage the 3D geometry of faces and Generative Adversarial Networks (GANs) to design a novel deep learning architecture for the task of facial and head reenactment. Our method is different to purely 3D model-based approaches, or re… ▽ More

    Submitted 28 September, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Published in IEEE Transactions on Biometrics, Behavior, and Identity Science (Volume: 3, Issue: 1, Jan. 2021)

  12. arXiv:2005.10954  [pdf, other

    cs.CV cs.LG eess.IV

    Head2Head: Video-based Neural Head Synthesis

    Authors: Mohammad Rami Koujan, Michail Christos Doukas, Anastasios Roussos, Stefanos Zafeiriou

    Abstract: In this paper, we propose a novel machine learning architecture for facial reenactment. In particular, contrary to the model-based approaches or recent frame-based methods that use Deep Convolutional Neural Networks (DCNNs) to generate individual frames, we propose a novel method that (a) exploits the special structure of facial motion (paying particular attention to mouth motion) and (b) enforces… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: To be published in 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)

  13. arXiv:2005.07298  [pdf, other

    cs.CV

    DeepFaceFlow: In-the-wild Dense 3D Facial Motion Estimation

    Authors: Mohammad Rami Koujan, Anastasios Roussos, Stefanos Zafeiriou

    Abstract: Dense 3D facial motion capture from only monocular in-the-wild pairs of RGB images is a highly challenging problem with numerous applications, ranging from facial expression recognition to facial reenactment. In this work, we propose DeepFaceFlow, a robust, fast, and highly-accurate framework for the dense estimation of 3D non-rigid facial flow between pairs of monocular images. Our DeepFaceFlow f… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: to be published in the IEEE conference on Computer Vision and Pattern Recognition (CVPR). 2020

  14. arXiv:2005.05509  [pdf, other

    cs.CV cs.LG eess.IV

    Real-time Facial Expression Recognition "In The Wild'' by Disentangling 3D Expression from Identity

    Authors: Mohammad Rami Koujan, Luma Alharbawee, Giorgos Giannakakis, Nicolas Pugeault, Anastasios Roussos

    Abstract: Human emotions analysis has been the focus of many studies, especially in the field of Affective Computing, and is important for many applications, e.g. human-computer intelligent interaction, stress analysis, interactive games, animations, etc. Solutions for automatic emotion analysis have also benefited from the development of deep learning approaches and the availability of vast amount of visua… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: to be published in 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)