Skip to main content

Showing 1–50 of 62 results for author: Radeva, P

.
  1. arXiv:2407.09285  [pdf, other

    cs.CV

    MetaFood CVPR 2024 Challenge on Physically Informed 3D Food Reconstruction: Methods and Results

    Authors: Jiangpeng He, Yuhao Chen, Gautham Vinod, Talha Ibn Mahmud, Fengqing Zhu, Edward Delp, Alexander Wong, Pengcheng Xi, Ahmad AlMughrabi, Umair Haroon, Ricardo Marques, Petia Radeva, Jiadong Tang, Dianyi Yang, Yu Gao, Zhaoxiang Liang, Yawei Jueluo, Chengyu Shi, Pengyu Wang

    Abstract: The increasing interest in computer vision applications for nutrition and dietary monitoring has led to the development of advanced 3D reconstruction techniques for food items. However, the scarcity of high-quality data and limited collaboration between industry and academia have constrained progress in this field. Building on recent advancements in 3D reconstruction, we host the MetaFood Workshop… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Technical report for MetaFood CVPR 2024 Challenge on Physically Informed 3D Food Reconstruction. arXiv admin note: substantial text overlap with arXiv:2407.01717

  2. arXiv:2407.03463  [pdf, other

    cs.CV cs.AI

    Precision at Scale: Domain-Specific Datasets On-Demand

    Authors: Jesús M Rodríguez-de-Vera, Imanol G Estepa, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

    Abstract: In the realm of self-supervised learning (SSL), conventional wisdom has gravitated towards the utility of massive, general domain datasets for pretraining robust backbones. In this paper, we challenge this idea by exploring if it is possible to bridge the scale between general-domain datasets and (traditionally smaller) domain-specific datasets to reduce the current performance gap. More specifica… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    ACM Class: I.5.4; I.5.2; I.2.1; I.2.10

  3. arXiv:2407.02668  [pdf, other

    cs.CV

    MomentsNeRF: Leveraging Orthogonal Moments for Few-Shot Neural Rendering

    Authors: Ahmad AlMughrabi, Ricardo Marques, Petia Radeva

    Abstract: We propose MomentsNeRF, a novel framework for one- and few-shot neural rendering that predicts a neural representation of a 3D scene using Orthogonal Moments. Our architecture offers a new transfer learning method to train on multi-scenes and incorporate a per-scene optimization using one or a few images at test time. Our approach is the first to successfully harness features extracted from Gabor… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  4. arXiv:2407.01717  [pdf, other

    cs.CV

    VolETA: One- and Few-shot Food Volume Estimation

    Authors: Ahmad AlMughrabi, Umair Haroon, Ricardo Marques, Petia Radeva

    Abstract: Accurate food volume estimation is essential for dietary assessment, nutritional tracking, and portion control applications. We present VolETA, a sophisticated methodology for estimating food volume using 3D generative techniques. Our approach creates a scaled 3D mesh of food objects using one- or few-RGBD images. We start by selecting keyframes based on the RGB images and then segmenting the refe… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  5. arXiv:2406.13515  [pdf, other

    cs.CV

    MVSBoost: An Efficient Point Cloud-based 3D Reconstruction

    Authors: Umair Haroon, Ahmad AlMughrabi, Ricardo Marques, Petia Radeva

    Abstract: Efficient and accurate 3D reconstruction is crucial for various applications, including augmented and virtual reality, medical imaging, and cinematic special effects. While traditional Multi-View Stereo (MVS) systems have been fundamental in these applications, using neural implicit fields in implicit 3D scene modeling has introduced new possibilities for handling complex topologies and continuous… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: The work is under review

  6. arXiv:2310.11910  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-modal Medical Neurological Image Fusion using Wavelet Pooled Edge Preserving Autoencoder

    Authors: Manisha Das, Deep Gupta, Petia Radeva, Ashwini M Bakde

    Abstract: Medical image fusion integrates the complementary diagnostic information of the source image modalities for improved visualization and analysis of underlying anomalies. Recently, deep learning-based models have excelled the conventional fusion methods by executing feature extraction, feature selection, and feature fusion tasks, simultaneously. However, most of the existing convolutional neural net… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures, 6 tables

  7. arXiv:2310.11896  [pdf, other

    eess.IV cs.CV cs.LG

    A New Multimodal Medical Image Fusion based on Laplacian Autoencoder with Channel Attention

    Authors: Payal Wankhede, Manisha Das, Deep Gupta, Petia Radeva, Ashwini M Bakde

    Abstract: Medical image fusion combines the complementary information of multimodal medical images to assist medical professionals in the clinical diagnosis of patients' disorders and provide guidance during preoperative and intra-operative procedures. Deep learning (DL) models have achieved end-to-end image fusion with highly robust and accurate fusion performance. However, most DL-based fusion models perf… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 figures, % tables

  8. arXiv:2309.02995  [pdf, other

    cs.CV

    Continual Evidential Deep Learning for Out-of-Distribution Detection

    Authors: Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost Van de Weijer

    Abstract: Uncertainty-based deep learning models have attracted a great deal of interest for their ability to provide accurate and reliable predictions. Evidential deep learning stands out achieving remarkable performance in detecting out-of-distribution (OOD) data with a single deterministic neural network. Motivated by this fact, in this paper we propose the integration of an evidential deep learning meth… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at Visual Continual Learning workshop (ICCV2023)

  9. arXiv:2305.02012  [pdf, other

    stat.ML cs.AI cs.LG

    A Perspective on Explainable Artificial Intelligence Methods: SHAP and LIME

    Authors: Ahmed Salih, Zahra Raisi-Estabragh, Ilaria Boscolo Galazzo, Petia Radeva, Steffen E. Petersen, Gloria Menegaz, Karim Lekadir

    Abstract: eXplainable artificial intelligence (XAI) methods have emerged to convert the black box of machine learning (ML) models into a more digestible form. These methods help to communicate how the model works with the aim of making ML models more transparent and increasing the trust of end-users into their output. SHapley Additive exPlanations (SHAP) and Local Interpretable Model Agnostic Explanation (L… ▽ More

    Submitted 17 June, 2024; v1 submitted 3 May, 2023; originally announced May 2023.

  10. arXiv:2304.01717  [pdf, other

    stat.ML cs.LG stat.AP

    Characterizing the contribution of dependent features in XAI methods

    Authors: Ahmed Salih, Ilaria Boscolo Galazzo, Zahra Raisi-Estabragh, Steffen E. Petersen, Gloria Menegaz, Petia Radeva

    Abstract: Explainable Artificial Intelligence (XAI) provides tools to help understanding how the machine learning models work and reach a specific outcome. It helps to increase the interpretability of models and makes the models more trustworthy and transparent. In this context, many XAI methods were proposed being SHAP and LIME the most popular. However, the proposed methods assume that used predictors in… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 17 pages, 5 tables

  11. arXiv:2303.12234  [pdf, other

    cs.CV

    Pre-NeRF 360: Enriching Unbounded Appearances for Neural Radiance Fields

    Authors: Ahmad AlMughrabi, Umair Haroon, Ricardo Marques, Petia Radeva

    Abstract: Neural radiance fields (NeRF) appeared recently as a powerful tool to generate realistic views of objects and confined areas. Still, they face serious challenges with open scenes, where the camera has unrestricted movement and content can appear at any distance. In such scenarios, current NeRF-inspired models frequently yield hazy or pixelated outputs, suffer slow training times, and might display… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  12. arXiv:2303.09417  [pdf, other

    cs.CV

    All4One: Symbiotic Neighbour Contrastive Learning via Self-Attention and Redundancy Reduction

    Authors: Imanol G. Estepa, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

    Abstract: Nearest neighbour based methods have proved to be one of the most successful self-supervised learning (SSL) approaches due to their high generalization capabilities. However, their computational efficiency decreases when more than one neighbour is used. In this paper, we propose a novel contrastive SSL approach, which we call All4One, that reduces the distance between neighbour representations usi… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 14 pages, 9 figures

    ACM Class: I.5.4; I.5.1; I.2.10

  13. arXiv:2303.09269  [pdf, other

    cs.CV

    ELFIS: Expert Learning for Fine-grained Image Recognition Using Subsets

    Authors: Pablo Villacorta, Jesús M. Rodríguez-de-Vera, Marc Bolaños, Ignacio Sarasúa, Bhalaji Nagarajan, Petia Radeva

    Abstract: Fine-Grained Visual Recognition (FGVR) tackles the problem of distinguishing highly similar categories. One of the main approaches to FGVR, namely subset learning, tries to leverage information from existing class taxonomies to improve the performance of deep neural networks. However, these methods rely on the existence of handcrafted hierarchies that are not necessarily optimal for the models. In… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Pablo Villacorta and Jesús M. Rodríguez-de-Vera contributed equally to this work. 16 pages, 10 figures

    ACM Class: I.5.4; I.5.1; I.2.10

  14. arXiv:2203.12350  [pdf, other

    cs.CV

    Hyper-Spectral Imaging for Overlap** Plastic Flakes Segmentation

    Authors: Guillem Martinez, Maya Aghaei, Martin Dijkstra, Bhalaji Nagarajan, Femke Jaarsma, Jaap van de Loosdrecht, Petia Radeva, Klaas Dijkstra

    Abstract: Given the hyper-spectral imaging unique potentials in gras** the polymer characteristics of different materials, it is commonly used in sorting procedures. In a practical plastic sorting scenario, multiple plastic flakes may overlap which depending on their characteristics, the overlap can be reflected in their spectral signature. In this work, we use hyper-spectral imaging for the segmentation… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Submitted to ICIP2022

  15. arXiv:2203.08878  [pdf, other

    eess.IV cs.CV cs.LG

    Layer Ensembles: A Single-Pass Uncertainty Estimation in Deep Learning for Segmentation

    Authors: Kaisar Kushibar, Víctor Manuel Campello, Lidia Garrucho Moras, Akis Linardos, Petia Radeva, Karim Lekadir

    Abstract: Uncertainty estimation in deep learning has become a leading research field in medical image analysis due to the need for safe utilisation of AI algorithms in clinical practice. Most approaches for uncertainty estimation require sampling the network weights multiple times during testing or training multiple networks. This leads to higher training and testing costs in terms of time and computationa… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  16. arXiv:2104.11520  [pdf, other

    cs.CV

    Modeling long-term interactions to enhance action recognition

    Authors: Alejandro Cartas, Petia Radeva, Mariella Dimiccoli

    Abstract: In this paper, we propose a new approach to under-stand actions in egocentric videos that exploits the semantics of object interactions at both frame and temporal levels. At the frame level, we use a region-based approach that takes as input a primary region roughly corresponding to the user hands and a set of secondary regions potentially corresponding to the interacting objects and calculates th… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted to the 25th International Conference on Pattern Recognition (ICPR), 2021

  17. arXiv:2009.07646  [pdf, other

    cs.CV

    Eating Habits Discovery in Egocentric Photo-streams

    Authors: Estefania Talavera, Andreea Glavan, Alina Matei, Petia Radeva

    Abstract: Eating habits are learned throughout the early stages of our lives. However, it is not easy to be aware of how our food-related routine affects our healthy living. In this work, we address the unsupervised discovery of nutritional habits from egocentric photo-streams. We build a food-related behavioural pattern discovery model, which discloses nutritional routines from the activities performed thr… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

  18. Behavioural pattern discovery from collections of egocentric photo-streams

    Authors: Martin Menchon, Estefania Talavera, Jose M Massa, Petia Radeva

    Abstract: The automatic discovery of behaviour is of high importance when aiming to assess and improve the quality of life of people. Egocentric images offer a rich and objective description of the daily life of the camera wearer. This work proposes a new method to identify a person's patterns of behaviour from collected egocentric photo-streams. Our model characterizes time-frames based on the context (pla… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  19. arXiv:2006.02570  [pdf, other

    eess.IV cs.CV cs.LG

    Exploration of Interpretability Techniques for Deep COVID-19 Classification using Chest X-ray Images

    Authors: Soumick Chatterjee, Fatima Saad, Chompunuch Sarasaen, Suhita Ghosh, Valerie Krug, Rupali Khatun, Rahul Mishra, Nirja Desai, Petia Radeva, Georg Rose, Sebastian Stober, Oliver Speck, Andreas Nürnberger

    Abstract: The outbreak of COVID-19 has shocked the entire world with its fairly rapid spread and has challenged different sectors. One of the most effective ways to limit its spread is the early and accurate diagnosing infected patients. Medical imaging, such as X-ray and Computed Tomography (CT), combined with the potential of Artificial Intelligence (AI), plays an essential role in supporting medical pers… ▽ More

    Submitted 24 January, 2024; v1 submitted 3 June, 2020; originally announced June 2020.

    Journal ref: Journal of Imaging. 2024; 10(2):45

  20. arXiv:1910.11949  [pdf, other

    cs.MM cs.CL cs.CV

    Automatic Reminiscence Therapy for Dementia

    Authors: Mariona Caros, Maite Garolera, Petia Radeva, Xavier Giro-i-Nieto

    Abstract: With people living longer than ever, the number of cases with dementia such as Alzheimer's disease increases steadily. It affects more than 46 million people worldwide, and it is estimated that in 2050 more than 100 million will be affected. While there are not effective treatments for these terminal diseases, therapies such as reminiscence, that stimulate memories from the past are recommended. C… ▽ More

    Submitted 19 January, 2021; v1 submitted 25 October, 2019; originally announced October 2019.

    Comments: MSc thesis at TelecomBCN, Universitat Politecnica de Catalunya 2019

  21. arXiv:1910.06693  [pdf, other

    cs.CV cs.LG eess.AS

    Seeing and Hearing Egocentric Actions: How Much Can We Learn?

    Authors: Alejandro Cartas, Jordi Luque, Petia Radeva, Carlos Segura, Mariella Dimiccoli

    Abstract: Our interaction with the world is an inherently multimodal experience. However, the understanding of human-to-object interactions has historically been addressed focusing on a single modality. In particular, a limited number of works have considered to integrate the visual and audio modalities for this purpose. In this work, we propose a multimodal approach for egocentric action recognition in a k… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: Accepted for the Fifth International Workshop on Egocentric Perception, Interaction and Computing (EPIC) at the International Conference on Computer Vision (ICCV) 2019

  22. arXiv:1907.00856  [pdf, other

    eess.IV cs.CV

    SLSNet: Skin lesion segmentation using a lightweight generative adversarial network

    Authors: Md. Mostafa Kamal Sarker, Hatem A. Rashwan, Farhan Akram, Vivek Kumar Singh, Syeda Furruka Banu, Forhad U H Chowdhury, Kabir Ahmed Choudhury, Sylvie Chambon, Petia Radeva, Domenec Puig, Mohamed Abdel-Nasser

    Abstract: The determination of precise skin lesion boundaries in dermoscopic images using automated methods faces many challenges, most importantly, the presence of hair, inconspicuous lesion edges and low contrast in dermoscopic images, and variability in the color, texture and shapes of skin lesions. Existing deep learning-based skin lesion segmentation algorithms are expensive in terms of computational t… ▽ More

    Submitted 17 June, 2021; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: Accepted in Expert Systems with Applications

  23. arXiv:1906.00634  [pdf, other

    cs.CV cs.AI

    How Much Does Audio Matter to Recognize Egocentric Object Interactions?

    Authors: Alejandro Cartas, Jordi Luque, Petia Radeva, Carlos Segura, Mariella Dimiccoli

    Abstract: Sounds are an important source of information on our daily interactions with objects. For instance, a significant amount of people can discern the temperature of water that it is being poured just by using the sense of hearing. However, only a few works have explored the use of audio for the classification of object interactions in conjunction with vision or as single modality. In this preliminary… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: Accepted for presentation at EPIC@CVPR2019 workshop

  24. arXiv:1905.04734  [pdf, other

    cs.CV

    Social Relation Recognition in Egocentric Photostreams

    Authors: Emanuel Sanchez Aimar, Petia Radeva, Mariella Dimiccoli

    Abstract: This paper proposes an approach to automatically categorize the social interactions of a user wearing a photo-camera 2fpm, by relying solely on what the camera is seeing. The problem is challenging due to the overwhelming complexity of social life and the extreme intra-class variability of social interactions captured under unconstrained conditions. We adopt the formalization proposed in Bugental'… ▽ More

    Submitted 12 May, 2019; originally announced May 2019.

    Comments: Accepted at ICIP 2019

  25. arXiv:1905.04107  [pdf, other

    cs.CV

    Towards Emotion Retrieval in Egocentric PhotoStream

    Authors: Estefania Talavera, Petia Radeva, Nicolai Petkov

    Abstract: The availability and use of egocentric data are rapidly increasing due to the growing use of wearable cameras. Our aim is to study the effect (positive, neutral or negative) of egocentric images or events on an observer. Given egocentric photostreams capturing the wearer's days, we propose a method that aims to assign sentiment to events extracted from egocentric photostreams. Such moments can be… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

  26. arXiv:1905.04097  [pdf, other

    cs.CV

    Hierarchical approach to classify food scenes in egocentric photo-streams

    Authors: Estefania Talavera, Maria Leyva-Vallina, Md. Mostafa Kamal Sarker, Domenec Puig, Nicolai Petkov, Petia Radeva

    Abstract: Recent studies have shown that the environment where people eat can affect their nutritional behaviour. In this work, we provide automatic tools for a personalised analysis of a person's health habits by the examination of daily recorded egocentric photo-streams. Specifically, we propose a new automatic approach for the classification of food-related environments, that is able to classify up to 15… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

  27. arXiv:1905.04093  [pdf, other

    cs.CV

    Towards Unsupervised Familiar Scene Recognition in Egocentric Videos

    Authors: Estefania Talavera, Nicolai Petkov, Petia Radeva

    Abstract: Nowadays, there is an upsurge of interest in using lifelogging devices. Such devices generate huge amounts of image data; consequently, the need for automatic methods for analyzing and summarizing these data is drastically increasing. We present a new method for familiar scene recognition in egocentric videos, based on background pattern detection through automatically configurable COSFIRE filters… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

  28. arXiv:1905.04076  [pdf, other

    cs.CV

    Unsupervised routine discovery in egocentric photo-streams

    Authors: Estefania Talavera, Nicolai Petkov, Petia Radeva

    Abstract: The routine of a person is defined by the occurrence of activities throughout different days, and can directly affect the person's health. In this work, we address the recognition of routine related days. To do so, we rely on egocentric images, which are recorded by a wearable camera and allow to monitor the life of the user from a first-person view perspective. We propose an unsupervised model th… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

  29. arXiv:1905.04073  [pdf, other

    cs.CV

    Towards Egocentric Person Re-identification and Social Pattern Analysis

    Authors: Estefania Talavera, Alexandre Cola, Nicolai Petkov, Petia Radeva

    Abstract: Wearable cameras capture a first-person view of the daily activities of the camera wearer, offering a visual diary of the user behaviour. Detection of the appearance of people the camera user interacts with for social interactions analysis is of high interest. Generally speaking, social events, lifestyle and health are highly correlated, but there is a lack of tools to monitor and analyse them. We… ▽ More

    Submitted 10 May, 2019; originally announced May 2019.

  30. arXiv:1809.00402  [pdf, other

    cs.CV

    On the Role of Event Boundaries in Egocentric Activity Recognition from Photostreams

    Authors: Alejandro Cartas, Estefania Talavera, Petia Radeva, Mariella Dimiccoli

    Abstract: Event boundaries play a crucial role as a pre-processing step for detection, localization, and recognition tasks of human activities in videos. Typically, although their intrinsic subjectiveness, temporal bounds are provided manually as input for training action recognition algorithms. However, their role for activity recognition in the domain of egocentric photostreams has been so far neglected.… ▽ More

    Submitted 6 September, 2018; v1 submitted 2 September, 2018; originally announced September 2018.

    Comments: Presented as a short abstract in the EPIC workshop at ECCV 2018

  31. arXiv:1808.09829  [pdf, other

    cs.CV

    MACNet: Multi-scale Atrous Convolution Networks for Food Places Classification in Egocentric Photo-streams

    Authors: Md. Mostafa Kamal Sarker, Hatem A. Rashwan, Estefania Talavera, Syeda Furruka Banu, Petia Radeva, Domenec Puig

    Abstract: First-person (wearable) camera continually captures unscripted interactions of the camera user with objects, people, and scenes reflecting his personal and relational tendencies. One of the preferences of people is their interaction with food events. The regulation of food intake and its duration has a great importance to protect against diseases. Consequently, this work aims to develop a smart mo… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: 10 pages, accepted in ECCV at EPIC 2018

  32. arXiv:1805.12081  [pdf, other

    cs.CV

    CuisineNet: Food Attributes Classification using Multi-scale Convolution Network

    Authors: Md. Mostafa Kamal Sarker, Mohammed Jabreel, Hatem A. Rashwan, Syeda Furruka Banu, Antonio Moreno, Petia Radeva, Domenec Puig

    Abstract: Diversity of food and its attributes represents the culinary habits of peoples from different countries. Thus, this paper addresses the problem of identifying food culture of people around the world and its flavor by classifying two main food attributes, cuisine and flavor. A deep learning model based on multi-scale convotuional networks is proposed for extracting more accurate features from input… ▽ More

    Submitted 8 June, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: 8 pages, Submitted in CCIA 2018

  33. arXiv:1805.10241  [pdf, other

    cs.CV

    SLSDeep: Skin Lesion Segmentation Based on Dilated Residual and Pyramid Pooling Networks

    Authors: Md. Mostafa Kamal Sarker, Hatem A. Rashwan, Farhan Akram, Syeda Furruka Banu, Adel Saleh, Vivek Kumar Singh, Forhad U H Chowdhury, Saddam Abdulwahab, Santiago Romani, Petia Radeva, Domenec Puig

    Abstract: Skin lesion segmentation (SLS) in dermoscopic images is a crucial task for automated diagnosis of melanoma. In this paper, we present a robust deep learning SLS model, so-called SLSDeep, which is represented as an encoder-decoder network. The encoder network is constructed by dilated residual layers, in turn, a pyramid pooling network followed by three convolution layers is used for the decoder. U… ▽ More

    Submitted 30 May, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: Accepted in MICCAI 2018, 9 pages

  34. arXiv:1803.05940  [pdf, other

    cs.CV

    Smartphone picture organization: A hierarchical approach

    Authors: Stefan Lonn, Petia Radeva, Mariella Dimiccoli

    Abstract: We live in a society where the large majority of the population has a camera-equipped smartphone. In addition, hard drives and cloud storage are getting cheaper and cheaper, leading to a tremendous growth in stored personal photos. Unlike photo collections captured by a digital camera, which typically are pre-processed by the user who organizes them into event-related folders, smartphone pictures… ▽ More

    Submitted 6 September, 2019; v1 submitted 15 March, 2018; originally announced March 2018.

    Journal ref: Computer Vision and Image Understanding (CVIU), Volume 187, October 2019, 102789

  35. arXiv:1711.05128  [pdf, other

    cs.CV

    Grab, Pay and Eat: Semantic Food Detection for Smart Restaurants

    Authors: Eduardo Aguilar, Beatriz Remeseiro, Marc Bolaños, Petia Radeva

    Abstract: The increase in awareness of people towards their nutritional habits has drawn considerable attention to the field of automatic food analysis. Focusing on self-service restaurants environment, automatic food analysis is not only useful for extracting nutritional information from foods selected by customers, it is also of high interest to speed up the service solving the bottleneck produced at the… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

  36. Batch-based Activity Recognition from Egocentric Photo-Streams Revisited

    Authors: Alejandro Cartas, Juan Marin, Petia Radeva, Mariella Dimiccoli

    Abstract: Wearable cameras can gather large a\-mounts of image data that provide rich visual information about the daily activities of the wearer. Motivated by the large number of health applications that could be enabled by the automatic recognition of daily activities, such as lifestyle characterization for habit improvement, context-aware personal assistance and tele-rehabilitation services, we propose a… ▽ More

    Submitted 9 May, 2018; v1 submitted 11 October, 2017; originally announced October 2017.

    Journal ref: Cartas, A., Marin, J., Radeva, P. et al. Pattern Anal Applic (2018). https://doi.org/10.1007/s10044-018-0708-1

  37. arXiv:1709.05775  [pdf, other

    cs.CV

    Social Style Characterization from Egocentric Photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Cristian Canton Ferrer, Petia Radeva

    Abstract: This paper proposes a system for automatic social pattern characterization using a wearable photo-camera. The proposed pipeline consists of three major steps. First, detection of people with whom the camera wearer interacts and, second, categorization of the detected social interactions into formal and informal. These two steps act at event-level where each potential social event is modeled as a m… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: International Conference on Computer Vision (ICCV). Workshop on Egocentric Percetion, Interaction and Computing

  38. Food Recognition using Fusion of Classifiers based on CNNs

    Authors: Eduardo Aguilar, Marc Bolaños, Petia Radeva

    Abstract: With the arrival of convolutional neural networks, the complex problem of food recognition has experienced an important improvement in recent years. The best results have been obtained using methods based on very deep convolutional neural networks, which show that the deeper the model,the better the classification accuracy will be obtain. However, very deep neural networks may suffer from the over… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

    Journal ref: ICIAP 10485 (2017) 213-224

  39. Exploring Food Detection using CNNs

    Authors: Eduardo Aguilar, Marc Bolaños, Petia Radeva

    Abstract: One of the most common critical factors directly related to the cause of a chronic disease is unhealthy diet consumption. In this sense, building an automatic system for food analysis could allow a better understanding of the nutritional information with respect to the food eaten and thus it could help in taking corrective actions in order to consume a better diet. The Computer Vision community ha… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

    Journal ref: EUROCAST 2017 10672 (2018) 339-347

  40. arXiv:1709.02780  [pdf, other

    cs.CV

    Detecting Hands in Egocentric Videos: Towards Action Recognition

    Authors: Alejandro Cartas, Mariella Dimiccoli, Petia Radeva

    Abstract: Recently, there has been a growing interest in analyzing human daily activities from data collected by wearable cameras. Since the hands are involved in a vast set of daily tasks, detecting hands in egocentric images is an important step towards the recognition of a variety of egocentric actions. However, besides extreme illumination changes in egocentric images, hand detection is not a trivial ta… ▽ More

    Submitted 8 September, 2017; originally announced September 2017.

  41. arXiv:1709.01424  [pdf, other

    cs.CV

    Towards social pattern characterization in egocentric photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Cristian Canton Ferrer, Petia Radeva

    Abstract: Following the increasingly popular trend of social interaction analysis in egocentric vision, this manuscript presents a comprehensive study for automatic social pattern characterization of a wearable photo-camera user, by relying on the visual analysis of egocentric photo-streams. The proposed framework consists of three major steps. The first step is to detect social interactions of the user whe… ▽ More

    Submitted 9 January, 2018; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: 42 pages, 14 figures. Submitted to Elsevier, Computer Vision and Image Understanding (Under Review)

  42. arXiv:1708.07889  [pdf, other

    cs.CV

    Batch-Based Activity Recognition from Egocentric Photo-Streams

    Authors: Alejandro Cartas, Mariella Dimiccoli, Petia Radeva

    Abstract: Activity recognition from long unstructured egocentric photo-streams has several applications in assistive technology such as health monitoring and frailty detection, just to name a few. However, one of its main technical challenges is to deal with the low frame rate of wearable photo-cameras, which causes abrupt appearance changes between consecutive frames. In consequence, important discriminato… ▽ More

    Submitted 25 August, 2017; originally announced August 2017.

    Comments: 8 pages, 7 figures, 1 table. To appear at the ICCV 2017 workshop on Egocentric Perception, Interaction and Computing

  43. arXiv:1707.08816  [pdf, other

    cs.CV

    Food Ingredients Recognition through Multi-label Learning

    Authors: Marc Bolaños, Aina Ferrà, Petia Radeva

    Abstract: Automatically constructing a food diary that tracks the ingredients consumed can help people follow a healthy diet. We tackle the problem of food ingredients recognition as a multi-label learning problem. We propose a method for adapting a highly performing state of the art CNN in order to act as a multi-label predictor for learning recipes in terms of their list of ingredients. We prove that our… ▽ More

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: 8 pages

  44. arXiv:1704.04097  [pdf, other

    cs.CV

    Recognizing Activities of Daily Living from Egocentric Images

    Authors: Alejandro Cartas, Juan Marín, Petia Radeva, Mariella Dimiccoli

    Abstract: Recognizing Activities of Daily Living (ADLs) has a large number of health applications, such as characterize lifestyle for habit improvement, nursing and rehabilitation services. Wearable cameras can daily gather large amounts of image data that provide rich visual information about ADLs than using other wearable sensors. In this paper, we explore the classification of ADLs from images captured b… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: To appear in the Proceedings of IbPRIA 2017

  45. arXiv:1704.02809  [pdf, other

    cs.CV

    R-Clustering for Egocentric Video Segmentation

    Authors: Estefania Talavera, Mariella Dimiccoli, Marc Bolaños, Maedeh Aghaei, Petia Radeva

    Abstract: In this paper, we present a new method for egocentric video temporal segmentation based on integrating a statistical mean change detector and agglomerative clustering(AC) within an energy-minimization framework. Given the tendency of most AC methods to oversegment video sequences when clustering their frames, we combine the clustering with a concept drift detection technique (ADWIN) that has rigor… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

  46. arXiv:1704.02231  [pdf, other

    cs.CV

    Clothing and People - A Social Signal Processing Perspective

    Authors: Maedeh Aghaei, Federico Parezzan, Mariella Dimiccoli, Petia Radeva, Marco Cristani

    Abstract: In our society and century, clothing is not anymore used only as a means for body protection. Our paper builds upon the evidence, studied within the social sciences, that clothing brings a clear communicative message in terms of social signals, influencing the impression and behaviour of others towards a person. In fact, clothing correlates with personality traits, both in terms of self-assessment… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

    Comments: To appear in the 12th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2017)

  47. arXiv:1704.02163  [pdf, other

    cs.CV

    Egocentric Video Description based on Temporally-Linked Sequences

    Authors: Marc Bolaños, Álvaro Peris, Francisco Casacuberta, Sergi Soler, Petia Radeva

    Abstract: Egocentric vision consists in acquiring images along the day from a first person point-of-view using wearable cameras. The automatic analysis of this information allows to discover daily patterns for improving the quality of life of the user. A natural topic that arises in egocentric vision is storytelling, that is, how to understand and tell the story relying behind the pictures. In this paper, w… ▽ More

    Submitted 9 November, 2017; v1 submitted 7 April, 2017; originally announced April 2017.

    Comments: 19 pages, 10 figures, 3 tables. Submitted to Journal of Visual Communication and Image Representation

  48. arXiv:1703.09933  [pdf, other

    cs.CV

    Sentiment Recognition in Egocentric Photostreams

    Authors: Estefania Talavera, Nicola Strisciuglio, Nicolai Petkov, Petia Radeva

    Abstract: Lifelogging is a process of collecting rich source of information about daily life of people. In this paper, we introduce the problem of sentiment analysis in egocentric events focusing on the moments that compose the images recalling positive, neutral or negative feelings to the observer. We propose a method for the classification of the sentiments in egocentric pictures based on global and seman… ▽ More

    Submitted 29 March, 2017; originally announced March 2017.

  49. arXiv:1703.01790  [pdf, other

    cs.CV

    All the people around me: face discovery in egocentric photo-streams

    Authors: Maedeh Aghaei, Mariella Dimiccoli, Petia Radeva

    Abstract: Given an unconstrained stream of images captured by a wearable photo-camera (2fpm), we propose an unsupervised bottom-up approach for automatic clustering appearing faces into the individual identities present in these data. The problem is challenging since images are acquired under real world conditions; hence the visible appearance of the people in the images undergoes intensive variations. Our… ▽ More

    Submitted 12 May, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: 5 pages, 3 figures, accepted in IEEE International Conference on Image Processing (ICIP 2017)

  50. arXiv:1612.03628  [pdf, other

    cs.CV cs.CL

    VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering

    Authors: Marc Bolaños, Álvaro Peris, Francisco Casacuberta, Petia Radeva

    Abstract: In this paper, we address the problem of visual question answering by proposing a novel model, called VIBIKNet. Our model is based on integrating Kernelized Convolutional Neural Networks and Long-Short Term Memory units to generate an answer given a question about an image. We prove that VIBIKNet is an optimal trade-off between accuracy and computational load, in terms of memory and time consumpti… ▽ More

    Submitted 12 December, 2016; originally announced December 2016.

    Comments: Submitted to IbPRIA'17, 8 pages, 3 figures, 1 table