-
Predicting the generalization gap in neural networks using topological data analysis
Authors:
Rubén Ballester,
Xavier Arnal Clemente,
Carles Casacuberta,
Meysam Madadi,
Ciprian A. Corneanu,
Sergio Escalera
Abstract:
Understanding how neural networks generalize on unseen data is crucial for designing more robust and reliable models. In this paper, we study the generalization gap of neural networks using methods from topological data analysis. For this purpose, we compute homological persistence diagrams of weighted graphs constructed from neuron activation correlations after a training phase, aiming to capture…
▽ More
Understanding how neural networks generalize on unseen data is crucial for designing more robust and reliable models. In this paper, we study the generalization gap of neural networks using methods from topological data analysis. For this purpose, we compute homological persistence diagrams of weighted graphs constructed from neuron activation correlations after a training phase, aiming to capture patterns that are linked to the generalization capacity of the network. We compare the usefulness of different numerical summaries from persistence diagrams and show that a combination of some of them can accurately predict and partially explain the generalization gap without the need of a test set. Evaluation on two computer vision recognition tasks (CIFAR10 and SVHN) shows competitive generalization gap prediction when compared against state-of-the-art methods.
△ Less
Submitted 12 August, 2023; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Deep Structure Inference Network for Facial Action Unit Recognition
Authors:
Ciprian A. Corneanu,
Meysam Madadi,
Sergio Escalera
Abstract:
Facial expressions are combinations of basic components called Action Units (AU). Recognizing AUs is key for develo** general facial expression analysis. In recent years, most efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between Action Units. In this paper, we propose a deep neural architecture that tackles both…
▽ More
Facial expressions are combinations of basic components called Action Units (AU). Recognizing AUs is key for develo** general facial expression analysis. In recent years, most efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between Action Units. In this paper, we propose a deep neural architecture that tackles both problems by combining learned local and global features in its initial stages and replicating a message passing algorithm between classes similar to a graphical model inference approach in later stages. We show that by training the model end-to-end with increased supervision we improve state-of-the-art by 5.3% and 8.2% performance on BP4D and DISFA datasets, respectively.
△ Less
Submitted 23 March, 2018; v1 submitted 15 March, 2018;
originally announced March 2018.
-
Survey on Emotional Body Gesture Recognition
Authors:
Fatemeh Noroozi,
Ciprian Adrian Corneanu,
Dorota Kamińska,
Tomasz Sapiński,
Sergio Escalera,
Gholamreza Anbarjafari
Abstract:
Automatic emotion recognition has become a trending research topic in the past decade. While works based on facial expressions or speech abound, recognizing affect from body gestures remains a less explored topic. We present a new comprehensive survey ho** to boost research in the field. We first introduce emotional body gestures as a component of what is commonly known as "body language" and co…
▽ More
Automatic emotion recognition has become a trending research topic in the past decade. While works based on facial expressions or speech abound, recognizing affect from body gestures remains a less explored topic. We present a new comprehensive survey ho** to boost research in the field. We first introduce emotional body gestures as a component of what is commonly known as "body language" and comment general aspects as gender differences and culture dependence. We then define a complete framework for automatic emotional body gesture recognition. We introduce person detection and comment static and dynamic body pose estimation methods both in RGB and 3D. We then comment the recent literature related to representation learning and emotion recognition from images of emotionally expressive gestures. We also discuss multi-modal approaches that combine speech or face with body gestures for improved emotion recognition. While pre-processing methodologies (e.g. human detection and pose estimation) are nowadays mature technologies fully developed for robust large scale analysis, we show that for emotion recognition the quantity of labelled data is scarce, there is no agreement on clearly defined output spaces and the representations are shallow and largely based on naive geometrical representations.
△ Less
Submitted 23 January, 2018;
originally announced January 2018.
-
Automatic Recognition of Facial Displays of Unfelt Emotions
Authors:
Kaustubh Kulkarni,
Ciprian Adrian Corneanu,
Ikechukwu Ofodile,
Sergio Escalera,
Xavier Baro,
Sylwia Hyniewska,
Juri Allik,
Gholamreza Anbarjafari
Abstract:
Humans modify their facial expressions in order to communicate their internal states and sometimes to mislead observers regarding their true emotional states. Evidence in experimental psychology shows that discriminative facial responses are short and subtle. This suggests that such behavior would be easier to distinguish when captured in high resolution at an increased frame rate. We are proposin…
▽ More
Humans modify their facial expressions in order to communicate their internal states and sometimes to mislead observers regarding their true emotional states. Evidence in experimental psychology shows that discriminative facial responses are short and subtle. This suggests that such behavior would be easier to distinguish when captured in high resolution at an increased frame rate. We are proposing SASE-FE, the first dataset of facial expressions that are either congruent or incongruent with underlying emotion states. We show that overall the problem of recognizing whether facial movements are expressions of authentic emotions or not can be successfully addressed by learning spatio-temporal representations of the data. For this purpose, we propose a method that aggregates features along fiducial trajectories in a deeply learnt space. Performance of the proposed model shows that on average it is easier to distinguish among genuine facial expressions of emotion than among unfelt facial expressions of emotion and that certain emotion pairs such as contempt and disgust are more difficult to distinguish than the rest. Furthermore, the proposed methodology improves state of the art results on CK+ and OULU-CASIA datasets for video emotion recognition, and achieves competitive results when classifying facial action units on BP4D datase.
△ Less
Submitted 9 January, 2018; v1 submitted 13 July, 2017;
originally announced July 2017.