A Multi-Modal Explainability Approach for Human-Aware Robots in Multi-Party Conversation

Authors: Iveta Bečková, Štefan Pócoš, Giulia Belgiovine, Marco Matarese, Alessandra Sciutti, Carlo Mazzola

Abstract: The addressee estimation (understanding to whom somebody is talking) is a fundamental task for human activity recognition in multi-party conversation scenarios. Specifically, in the field of human-robot interaction, it becomes even more crucial to enable social robots to participate in such interactive contexts. However, it is usually implemented as a binary classification task, restricting the robot's capability to estimate whether it was addressed and limiting its interactive skills. For a social robot to gain the trust of humans, it is also important to manifest a certain level of transparency and explainability. Explainable artificial intelligence thus plays a significant role in the current machine learning applications and models, to provide explanations for their decisions besides excellent performance. In our work, we a) present an addressee estimation model with improved performance in comparison with the previous SOTA; b) further modify this model to include inherently explainable attention-based segments; c) implement the explainable addressee estimation as part of a modular cognitive architecture for multi-party conversation in an iCub robot; d) propose several ways to incorporate explainability and transparency in the aforementioned architecture; and e) perform a pilot user study to analyze the effect of various explanations on how human participants perceive the robot.

Submitted 20 May, 2024; originally announced July 2024.

Comments: 21pp (+7pp sup.mat.) Submitted to Computer Vision and Image Understanding Journal on May 13, 2024. This research received funding from the Horizon-Europe TERAIS project (G.A. 101079338) and the Slovak Research and Development Agency, project no. APVV-21-0105

ACM Class: I.4.8; I.2.10; I.2.9; I.2.11; J.4

arXiv:2311.05334 [pdf, other]

Real-time Addressee Estimation: Deployment of a Deep-Learning Model on the iCub Robot

Authors: Carlo Mazzola, Francesco Rea, Alessandra Sciutti

Abstract: Addressee Estimation is the ability to understand to whom a person is talking, a skill essential for social robots to interact smoothly with humans. In this sense, it is one of the problems that must be tackled to develop effective conversational agents in multi-party and unstructured scenarios. As humans, one of the channels that mainly lead us to such estimation is the non-verbal behavior of speakers: first of all, their gaze and body pose. Inspired by human perceptual skills, in the present work, a deep-learning model for Addressee Estimation relying on these two non-verbal features is designed, trained, and deployed on an iCub robot. The study presents the procedure of such implementation and the performance of the model deployed in real-time human-robot interaction compared to previous tests on the dataset used for the training.

Submitted 9 November, 2023; originally announced November 2023.

Comments: 4 pages, 3 figures, paper presented at IRIM-3D 2023 Conference, Funded by the Horizon-Widera-2021 European Twinning project TERAIS: G.A. n. 101079338

ACM Class: I.2.9; I.2.10; H.1.2

arXiv:2308.10757 [pdf, other]

doi 10.1109/IJCNN54540.2023.10191452

To Whom are You Talking? A Deep Learning Model to Endow Social Robots with Addressee Estimation Skills

Authors: Carlo Mazzola, Marta Romeo, Francesco Rea, Alessandra Sciutti, Angelo Cangelosi

Abstract: Communicating shapes our social world. For a robot to be considered social and consequently be integrated in our social environment, it is fundamental to understand some of the dynamics that rule human-human communication. In this work, we tackle the problem of Addressee Estimation, the ability to understand an utterance's addressee, by interpreting and exploiting non-verbal bodily cues from the speaker. We do so by implementing a hybrid deep learning model composed of convolutional layers and LSTM cells taking as input images portraying the face of the speaker and 2D vectors of the speaker's body posture. Our implementation choices were guided by the aim to develop a model that could be deployed on social robots and be efficient in ecological scenarios. We demonstrate that our model is able to solve the Addressee Estimation problem in terms of addressee localisation in space, from a robot ego-centric point of view.

Submitted 28 March, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

Comments: Accepted v. of IJCNN 2023 publication. Funded by the Horizon Europe project TERAIS (G.A. 101079338), the UKRI Node on Trust (EP/V026682/1), the EU projects TRAINCREASE and MUSAE, and the US project THRIVE++. Cite: https://doi.org/10.1109/IJCNN54540.2023.10191452 Code: https://zenodo.org/doi/10.5281/zenodo.10709857 Data: https://zenodo.org/doi/10.5281/zenodo.10711587 10 pages, 8 Figures, 3 Tables

MSC Class: 68T07; 68T40 ACM Class: I.2.6; I.2.9; I.2.10; J.7

Journal ref: 2023 International Joint Conference on Neural Networks (IJCNN), pp. 1-10

arXiv:2207.06943 [pdf]

doi 10.1109/TCDS.2022.3185100

Shared perception is different from individual perception: a new look on context dependency

Authors: Carlo Mazzola, Francesco Rea, Alessandra Sciutti

Abstract: Human perception is based on unconscious inference, where sensory input integrates with prior information. This phenomenon, known as context dependency, helps in facing the uncertainty of the external world with predictions built upon previous experience. On the other hand, human perceptual processes are inherently shaped by social interactions. However, how the mechanisms of context dependency are affected is to date unknown. If using previous experience - priors - is beneficial in individual settings, it could represent a problem in social scenarios where other agents might not have the same priors, causing a perceptual misalignment on the shared environment. The present study addresses this question. We studied context dependency in an interactive setting with a humanoid robot iCub that acted as a stimuli demonstrator. Participants reproduced the lengths shown by the robot in two conditions: one with iCub behaving socially and another with iCub acting as a mechanical arm. The different behavior of the robot significantly affected the use of priors in perception. Moreover, the social robot positively impacted perceptual performances by enhancing accuracy and reducing participants' overall perceptual errors. Finally, the observed phenomenon has been modelled following a Bayesian approach to deepen and explore a new concept of shared perception.

Submitted 14 July, 2022; originally announced July 2022.

Comments: 14 pages, 9 figures, 1 table. IEEE Transactions on Cognitive and Developmental Systems, 2022

arXiv:2203.01862 [pdf, other]

doi 10.1371/journal.pone.0273643

The world seems different in a social context: a neural network analysis of human experimental data

Authors: Maria Tsfasman, Anja Philippsen, Carlo Mazzola, Serge Thill, Alessandra Sciutti, Yukie Nagai

Abstract: Human perception and behavior are affected by the situational context, in particular during social interactions. A recent study demonstrated that humans perceive visual stimuli differently depending on whether they do the task by themselves or together with a robot. Specifically, it was found that the central tendency effect is stronger in social than in non-social task settings. The particular nature of such behavioral changes induced by social interaction, and their underlying cognitive processes in the human brain are, however, still not well understood. In this paper, we address this question by training an artificial neural network inspired by the predictive coding theory on the above behavioral data set. Using this computational model, we investigate whether the change in behavior that was caused by the situational context in the human experiment could be explained by continuous modifications of a parameter expressing how strongly sensory and prior information affect perception. We demonstrate that it is possible to replicate human behavioral data in both individual and social task settings by modifying the precision of prior and sensory signals, indicating that social and non-social task settings might in fact exist on a continuum. At the same time an analysis of the neural activation traces of the trained networks provides evidence that information is coded in fundamentally different ways in the network in the individual and in the social conditions. Our results emphasize the importance of computational replications of behavioral data for generating hypotheses on the underlying cognitive mechanisms of shared perception and may provide inspiration for follow-up studies in the field of neuroscience.

Submitted 3 March, 2022; originally announced March 2022.

arXiv:1703.06109 [pdf, ps, other]

Generalised Reichenbachian Common Cause Systems

Authors: Claudio Mazzola

Abstract: The principle of the common cause claims that if an improbable coincidence has occurred, there must exist a common cause. This is generally taken to mean that positive correlations between non-causally related events should disappear when conditioning on the action of some underlying common cause. The extended interpretation of the principle, by contrast, urges that common causes should be called for in order to explain positive deviations between the estimated correlation of two events and the expected value of their correlation. The aim of this paper is to provide the extended reading of the principle with a general probabilistic model, capturing the simultaneous action of a system of multiple common causes. To this end, two distinct models are elaborated, and the necessary and sufficient conditions for their existence are determined.

Submitted 16 March, 2017; originally announced March 2017.

arXiv:1703.00352 [pdf, ps, other]

doi 10.1007/s10701-017-0124-1

Do Reichenbachian Common Cause Systems of Arbitrary Finite Size Exist?

Authors: Claudio Mazzola, Peter Evans

Abstract: The principle of common cause asserts that positive correlations between causally unrelated events ought to be explained through the action of some shared causal factors. Reichenbachian common cause systems are probabilistic structures aimed at accounting for cases where correlations of the aforesaid sort cannot be explained through the action of a single common cause. The existence of Reichenbachian common cause systems of arbitrary finite size for each pair of non-causally correlated events was allegedly demonstrated by Hofer-Szabó and Rédei in 2006. This paper shows that their proof is logically deficient, and we propose an improved proof.

Submitted 28 February, 2017; originally announced March 2017.

Showing 1–7 of 7 results for author: Mazzola, C