Skip to main content

Showing 1–14 of 14 results for author: Beyan, C

.
  1. arXiv:2404.07560  [pdf, other

    cs.RO cs.AI

    Socially Pertinent Robots in Gerontological Healthcare

    Authors: Xavier Alameda-Pineda, Angus Addlesee, Daniel Hernández García, Chris Reinke, Soraya Arias, Federica Arrigoni, Alex Auternaud, Lauriane Blavette, Cigdem Beyan, Luis Gomez Camara, Ohad Cohen, Alessandro Conti, Sébastien Dacunha, Christian Dondrup, Yoav Ellinson, Francesco Ferro, Sharon Gannot, Florian Gras, Nancie Gunson, Radu Horaud, Moreno D'Incà, Imad Kimouche, Séverin Lemaignan, Oliver Lemon, Cyril Liotard , et al. (19 additional authors not shown)

    Abstract: Despite the many recent achievements in develo** and deploying social robotics, there are still many underexplored environments and applications for which systematic evaluation of such systems by end-users is necessary. While several robotic platforms have been used in gerontological healthcare, the question of whether or not a social interactive robot with multi-modal conversational capabilitie… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2308.08303  [pdf, other

    cs.CV

    Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: Objects are crucial for understanding human-object interactions. By identifying the relevant objects, one can also predict potential future interactions or actions that may occur with these objects. In this paper, we study the problem of Short-Term Object interaction anticipation (STA) and propose NAOGAT (Next-Active-Object Guided Anticipation Transformer), a multi-modal end-to-end transformer net… ▽ More

    Submitted 5 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted in WACV'24

  3. arXiv:2307.09662  [pdf, other

    cs.CV

    Object-aware Gaze Target Detection

    Authors: Francesco Tonini, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci

    Abstract: Gaze target detection aims to predict the image location where the person is looking and the probability that a gaze is out of the scene. Several works have tackled this task by regressing a gaze heatmap centered on the gaze location, however, they overlooked decoding the relationship between the people and the gazed objects. This paper proposes a Transformer-based architecture that automatically… ▽ More

    Submitted 27 September, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023. Code is available at https://github.com/francescotonini/object-aware-gaze-target-detection

  4. arXiv:2307.01533  [pdf, other

    cs.CV

    Unsupervised Video Anomaly Detection with Diffusion Models Conditioned on Compact Motion Representations

    Authors: Anil Osman Tur, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci

    Abstract: This paper aims to address the unsupervised video anomaly detection (VAD) problem, which involves classifying each frame in a video as normal or abnormal, without any access to labels. To accomplish this, the proposed method employs conditional diffusion models, where the input data is the spatiotemporal features extracted from a pre-trained network, and the condition is the features extracted fro… ▽ More

    Submitted 19 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted to ICIAP 2023

  5. arXiv:2305.16066  [pdf, other

    cs.CV

    Guided Attention for Next Active Object @ EGO4D STA Challenge

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: In this technical report, we describe the Guided-Attention mechanism based solution for the short-term anticipation (STA) challenge for the EGO4D challenge. It combines the object detections, and the spatiotemporal features extracted from video clips, enhancing the motion and contextual information, and further decoding the object-centric and motion-centric information to address the problem of ST… ▽ More

    Submitted 4 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Winner of CVPR@2023 Ego4D STA challenge. arXiv admin note: substantial text overlap with arXiv:2305.12953

  6. arXiv:2305.12953  [pdf, other

    cs.CV

    Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: Short-term action anticipation (STA) in first-person videos is a challenging task that involves understanding the next active object interactions and predicting future actions. Existing action anticipation methods have primarily focused on utilizing features extracted from video clips, but often overlooked the importance of objects and their interactions. To this end, we propose a novel approach t… ▽ More

    Submitted 23 June, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE ICIP 2023, see project page here : https://sanketsans.github.io/guided-attention-egocentric.html

  7. arXiv:2304.05841  [pdf, other

    cs.CV

    Exploring Diffusion Models for Unsupervised Video Anomaly Detection

    Authors: Anil Osman Tur, Nicola Dall'Asen, Cigdem Beyan, Elisa Ricci

    Abstract: This paper investigates the performance of diffusion models for video anomaly detection (VAD) within the most challenging but also the most operational scenario in which the data annotations are not used. As being sparse, diverse, contextual, and often ambiguous, detecting abnormal events precisely is a very ambitious task. To this end, we rely only on the information-rich spatio-temporal data, an… ▽ More

    Submitted 2 July, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE ICIP 2023

  8. arXiv:2302.06358  [pdf, other

    cs.CV

    Anticipating Next Active Objects for Egocentric Videos

    Authors: Sanket Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue

    Abstract: This paper addresses the problem of anticipating the next-active-object location in the future, for a given egocentric video clip where the contact might happen, before any action takes place. The problem is considerably hard, as we aim at estimating the position of such objects in a scenario where the observed clip and the action segment are separated by the so-called ``time to contact'' (TTC) se… ▽ More

    Submitted 1 May, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE ACCESS, this paper carries the Manuscript DOI: 10.1109/ACCESS.2024.3395282. The complete peer-reviewed version is available via this DOI, while the arXiv version is a post-author manuscript without peer-review

  9. arXiv:2208.10822  [pdf, other

    cs.CV cs.AI cs.HC

    Multimodal Across Domains Gaze Target Detection

    Authors: Francesco Tonini, Cigdem Beyan, Elisa Ricci

    Abstract: This paper addresses the gaze target detection problem in single images captured from the third-person perspective. We present a multimodal deep architecture to infer where a person in a scene is looking. This spatial model is trained on the head images of the person-of- interest, scene and depth maps representing rich context information. Our model, unlike several prior art, do not require superv… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted to 24th ACM International Conference on Multimodal Interaction (ICMI 2022)

  10. arXiv:2207.11482  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss

    Authors: Riccardo Franceschini, Enrico Fini, Cigdem Beyan, Alessandro Conti, Federica Arrigoni, Elisa Ricci

    Abstract: Emotion recognition is involved in several real-world applications. With an increase in available modalities, automatic understanding of emotions is being performed more accurately. The success in Multimodal Emotion Recognition (MER), primarily relies on the supervised learning paradigm. However, data annotation is expensive, time-consuming, and as emotion expression and perception depends on seve… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: Accepted to 26th International Conference on Pattern Recognition (ICPR) 2022

  11. arXiv:2207.10574  [pdf, other

    cs.HC cs.AI cs.CV cs.LG cs.MM

    Co-Located Human-Human Interaction Analysis using Nonverbal Cues: A Survey

    Authors: Cigdem Beyan, Alessandro Vinciarelli, Alessio Del Bue

    Abstract: Automated co-located human-human interaction analysis has been addressed by the use of nonverbal communication as measurable evidence of social and psychological phenomena. We survey the computing studies (since 2010) detecting phenomena related to social traits (e.g., leadership, dominance, personality traits), social roles/relations, and interaction dynamics (e.g., group cohesion, engagement, ra… ▽ More

    Submitted 4 October, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive version was published in ACM Computing Surveys, https://doi.org/10.1145/3626516

  12. arXiv:2204.10312  [pdf, other

    cs.CV

    Unsupervised Human Action Recognition with Skeletal Graph Laplacian and Self-Supervised Viewpoints Invariance

    Authors: Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue

    Abstract: This paper presents a novel end-to-end method for the problem of skeleton-based unsupervised human action recognition. We propose a new architecture with a convolutional autoencoder that uses graph Laplacian regularization to model the skeletal geometry across the temporal dynamics of actions. Our approach is robust towards viewpoint variations by including a self-supervised gradient reverse layer… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Journal ref: The 32nd British Machine Vision Conference (BMVC) 2021

  13. arXiv:2105.02636  [pdf, other

    cs.CV cs.MM

    Estimating Presentation Competence using Multimodal Nonverbal Behavioral Cues

    Authors: Ömer Sümer, Cigdem Beyan, Fabian Ruth, Olaf Kramer, Ulrich Trautwein, Enkelejda Kasneci

    Abstract: Public speaking and presentation competence plays an essential role in many areas of social interaction in our educational, professional, and everyday life. Since our intention during a speech can differ from what is actually understood by the audience, the ability to appropriately convey our message requires a complex set of skills. Presentation competence is cultivated in the early school years… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  14. Subspace Clustering for Action Recognition with Covariance Representations and Temporal Pruning

    Authors: Giancarlo Paoletti, Jacopo Cavazza, Cigdem Beyan, Alessio Del Bue

    Abstract: This paper tackles the problem of human action recognition, defined as classifying which action is displayed in a trimmed sequence, from skeletal data. Albeit state-of-the-art approaches designed for this application are all supervised, in this paper we pursue a more challenging direction: Solving the problem with unsupervised learning. To this end, we propose a novel subspace clustering method, w… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

    Journal ref: 25th International Conference on Pattern Recognition (ICPR) 2020