-
Predicting the generalization gap in neural networks using topological data analysis
Authors:
Rubén Ballester,
Xavier Arnal Clemente,
Carles Casacuberta,
Meysam Madadi,
Ciprian A. Corneanu,
Sergio Escalera
Abstract:
Understanding how neural networks generalize on unseen data is crucial for designing more robust and reliable models. In this paper, we study the generalization gap of neural networks using methods from topological data analysis. For this purpose, we compute homological persistence diagrams of weighted graphs constructed from neuron activation correlations after a training phase, aiming to capture patterns that are linked to the generalization capacity of the network. We compare the usefulness of different numerical summaries from persistence diagrams and show that a combination of some of them can accurately predict and partially explain the generalization gap without the need for a test set. Evaluation on two computer vision recognition tasks (CIFAR10 and SVHN) shows competitive generalization gap prediction when compared against state-of-the-art methods.
Submitted 12 August, 2023; v1 submitted 23 March, 2022;
originally announced March 2022.
-
Industry 4.0 and Prospects of Circular Economy: A Survey of Robotic Assembly and Disassembly
Authors:
Morteza Daneshmand,
Fatemeh Noroozi,
Ciprian Corneanu,
Fereshteh Mafakheri,
Paolo Fiorini
Abstract:
Despite their contributions to the financial efficiency and environmental sustainability of industrial processes, robotic assembly and disassembly have been understudied in the existing literature. This is in contradiction to their importance in realizing the Fourth Industrial Revolution. More specifically, although most of the literature has extensively discussed how to optimally assemble or disassemble given products, the role of other factors has been overlooked. For example, the types of robots involved in implementing the sequence plans should ideally be taken into account throughout the whole chain consisting of design, assembly, disassembly and reassembly. Isolating the foregoing operations from the rest of the components of the relevant ecosystems may lead to erroneous inferences about both the necessity and efficiency of the underlying procedures. In this paper we try to alleviate these shortcomings by comprehensively investigating the state of the art in robotic assembly and disassembly. We consider and review various aspects of manufacturing and remanufacturing frameworks while particularly focusing on their desirability for supporting a circular economy.
Submitted 14 June, 2021;
originally announced June 2021.
-
Computing the Testing Error without a Testing Set
Authors:
Ciprian Corneanu,
Meysam Madadi,
Sergio Escalera,
Aleix Martinez
Abstract:
Deep Neural Networks (DNNs) have revolutionized computer vision. We now have DNNs that achieve top (performance) results in many problems, including object recognition, facial expression analysis, and semantic segmentation, to name but a few. The design of the DNNs that achieve top results is, however, non-trivial and mostly done by trial and error. That is, typically, researchers will derive many DNN architectures (i.e., topologies) and then test them on multiple datasets. However, there are no guarantees that the selected DNN will perform well in the real world. One can use a testing set to estimate the performance gap between the training and testing sets, but avoiding overfitting to the testing data is almost impossible. Using a sequestered testing dataset may address this problem, but this requires a constant update of the dataset, a very expensive venture. Here, we derive an algorithm to estimate the performance gap between training and testing that does not require any testing dataset. Specifically, we derive a number of persistent topology measures that identify when a DNN is learning to generalize to unseen samples. This allows us to compute the DNN's testing error on unseen samples, even when we do not have access to them. We provide extensive experimental validation on multiple networks and datasets to demonstrate the feasibility of the proposed approach.
Submitted 1 May, 2020;
originally announced May 2020.
-
Deep Structure Inference Network for Facial Action Unit Recognition
Authors:
Ciprian A. Corneanu,
Meysam Madadi,
Sergio Escalera
Abstract:
Facial expressions are combinations of basic components called Action Units (AUs). Recognizing AUs is key for developing general facial expression analysis. In recent years, most efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between Action Units. In this paper, we propose a deep neural architecture that tackles both problems by combining learned local and global features in its initial stages and replicating a message passing algorithm between classes, similar to a graphical model inference approach, in later stages. We show that by training the model end-to-end with increased supervision we improve on the state of the art by 5.3% and 8.2% on the BP4D and DISFA datasets, respectively.
Submitted 23 March, 2018; v1 submitted 15 March, 2018;
originally announced March 2018.
-
Survey on Emotional Body Gesture Recognition
Authors:
Fatemeh Noroozi,
Ciprian Adrian Corneanu,
Dorota Kamińska,
Tomasz Sapiński,
Sergio Escalera,
Gholamreza Anbarjafari
Abstract:
Automatic emotion recognition has become a trending research topic in the past decade. While works based on facial expressions or speech abound, recognizing affect from body gestures remains a less explored topic. We present a new comprehensive survey hoping to boost research in the field. We first introduce emotional body gestures as a component of what is commonly known as "body language" and comment on general aspects such as gender differences and culture dependence. We then define a complete framework for automatic emotional body gesture recognition. We introduce person detection and comment on static and dynamic body pose estimation methods in both RGB and 3D. We then comment on the recent literature related to representation learning and emotion recognition from images of emotionally expressive gestures. We also discuss multi-modal approaches that combine speech or face with body gestures for improved emotion recognition. While pre-processing methodologies (e.g. human detection and pose estimation) are nowadays mature technologies fully developed for robust large scale analysis, we show that for emotion recognition the quantity of labelled data is scarce, there is no agreement on clearly defined output spaces and the representations are shallow and largely based on naive geometrical representations.
Submitted 23 January, 2018;
originally announced January 2018.
-
Automatic Recognition of Facial Displays of Unfelt Emotions
Authors:
Kaustubh Kulkarni,
Ciprian Adrian Corneanu,
Ikechukwu Ofodile,
Sergio Escalera,
Xavier Baro,
Sylwia Hyniewska,
Juri Allik,
Gholamreza Anbarjafari
Abstract:
Humans modify their facial expressions in order to communicate their internal states and sometimes to mislead observers regarding their true emotional states. Evidence in experimental psychology shows that discriminative facial responses are short and subtle. This suggests that such behavior would be easier to distinguish when captured in high resolution at an increased frame rate. We propose SASE-FE, the first dataset of facial expressions that are either congruent or incongruent with underlying emotion states. We show that overall the problem of recognizing whether facial movements are expressions of authentic emotions or not can be successfully addressed by learning spatio-temporal representations of the data. For this purpose, we propose a method that aggregates features along fiducial trajectories in a deeply learnt space. Performance of the proposed model shows that on average it is easier to distinguish among genuine facial expressions of emotion than among unfelt facial expressions of emotion and that certain emotion pairs such as contempt and disgust are more difficult to distinguish than the rest. Furthermore, the proposed methodology improves state-of-the-art results on the CK+ and OULU-CASIA datasets for video emotion recognition, and achieves competitive results when classifying facial action units on the BP4D dataset.
Submitted 9 January, 2018; v1 submitted 13 July, 2017;
originally announced July 2017.
-
XBadges. Identifying and training soft skills with commercial video games
Authors:
Sergio Alloza,
Flavio Escribano,
Sergi Delgado,
Ciprian Corneanu,
Sergio Escalera
Abstract:
XBadges is a research project based on the hypothesis that commercial video games (nonserious games) can train soft skills. We measure persistence, spatial reasoning and risk taking before and after subjects participate in controlled game playing sessions. In addition, we have developed an automatic facial expression recognition system capable of inferring their emotions while playing, allowing us to study the role of emotions in soft skills acquisition. We have used Flappy Bird, Pacman and Tetris for assessing changes in persistence, risk taking and spatial reasoning respectively. Results show how playing Tetris significantly improves spatial reasoning and how playing Pacman significantly improves prudence in certain areas of behavior. As for emotions, they reveal that being concentrated helps to improve performance and skills acquisition. Frustration is also shown as a key element. With the results obtained we are able to glimpse multiple applications in areas which need soft skills development.
Submitted 4 July, 2017;
originally announced July 2017.
-
Survey on RGB, 3D, Thermal, and Multimodal Approaches for Facial Expression Recognition: History, Trends, and Affect-related Applications
Authors:
Ciprian Corneanu,
Marc Oliu,
Jeffrey F. Cohn,
Sergio Escalera
Abstract:
Facial expressions are an important way through which humans interact socially. Building a system capable of automatically recognizing facial expressions from images and video has been an intense field of study in recent years. Interpreting such expressions remains challenging and much research is needed about the way they relate to human affect. This paper presents a general overview of automatic RGB, 3D, thermal and multimodal facial expression analysis. We define a new taxonomy for the field, encompassing all steps from face detection to facial expression recognition, and describe and classify the state-of-the-art methods accordingly. We also present the important datasets and the benchmarking of the most influential methods. We conclude with a general discussion about trends, important questions and future lines of research.
Submitted 10 June, 2016;
originally announced June 2016.