Skip to main content

Showing 1–11 of 11 results for author: Mohedano, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.06618  [pdf, other

    cs.CV cs.AI eess.IV physics.geo-ph physics.space-ph

    QMRNet: Quality Metric Regression for EO Image Quality Assessment and Super-Resolution

    Authors: David Berga, Pau Gallés, Katalin Takáts, Eva Mohedano, Laura Riordan-Chen, Clara Garcia-Moll, David Vilaseca, Javier Marín

    Abstract: Latest advances in Super-Resolution (SR) have been tested with general purpose images such as faces, landscapes and objects, mainly unused for the task of super-resolving Earth Observation (EO) images. In this research paper, we benchmark state-of-the-art SR algorithms for distinct EO datasets using both Full-Reference and No-Reference Image Quality Assessment (IQA) metrics. We also propose a nove… ▽ More

    Submitted 14 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: 29 pages, 13 figures, 9 tables

  2. arXiv:2012.10283  [pdf, other

    cs.CV cs.LG

    Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates

    Authors: Feiyan Hu, Eva Mohedano, Noel O'Connor, Kevin McGuinness

    Abstract: Current deep learning based video classification architectures are typically trained end-to-end on large volumes of data and require extensive computational resources. This paper aims to exploit audio-visual information in video classification with a 1 frame per second sampling rate. We propose Temporal Bilinear Encoding Networks (TBEN) for encoding both audio and visual long range temporal inform… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 8 pages

  3. arXiv:1907.01869  [pdf, other

    cs.CV cs.LG

    Simple vs complex temporal recurrences for video saliency prediction

    Authors: Panagiotis Linardos, Eva Mohedano, Juan Jose Nieto, Noel E. O'Connor, Xavier Giro-i-Nieto, Kevin McGuinness

    Abstract: This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain. The first modification is the addition of a ConvLSTM within the architecture, while the second is a conceptually simple exponential moving average of an internal convolutional state. We use weights pre-trained o… ▽ More

    Submitted 16 July, 2019; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Accepted at BMVC 2019

  4. arXiv:1904.08668  [pdf, other

    cs.CV

    An Efficient Approximate kNN Graph Method for Diffusion on Image Retrieval

    Authors: Federico Magliani, Kevin McGuinness, Eva Mohedano, Andrea Prati

    Abstract: The application of the diffusion in many computer vision and artificial intelligence projects has been shown to give excellent improvements in performance. One of the main bottlenecks of this technique is the quadratic growth of the kNN graph size due to the high-quantity of new connections between nodes in the graph, resulting in long computation times. Several strategies have been proposed to ad… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  5. arXiv:1903.10195  [pdf, other

    cs.MM cs.CV

    Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks

    Authors: Amanda Duarte, Francisco Roldan, Miquel Tubau, Janna Escur, Santiago Pascual, Amaia Salvador, Eva Mohedano, Kevin McGuinness, Jordi Torres, Xavier Giro-i-Nieto

    Abstract: Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker. In this work, we explore its potential to generate face images of a speaker by conditioning a Generative Adversarial Network (GAN) with raw speech input. We propose a deep neural network that is trained from scratch in an end-to-end fashion, generating a face directly from the… ▽ More

    Submitted 25 March, 2019; originally announced March 2019.

    Comments: ICASSP 2019. Projevct website at https://imatge-upc.github.io/wav2pix/

  6. arXiv:1808.09559  [pdf, other

    cs.CV

    Temporal Saliency Adaptation in Egocentric Videos

    Authors: Panagiotis Linardos, Eva Mohedano, Monica Cherto, Cathal Gurrin, Xavier Giro-i-Nieto

    Abstract: This work adapts a deep neural model for image saliency prediction to the temporal domain of egocentric video. We compute the saliency map for each video frame, firstly with an off-the-shelf model trained from static images, secondly by adding a a convolutional or conv-LSTM layers trained with a dataset for video saliency prediction. We study each configuration on EgoMon, a new dataset made of sev… ▽ More

    Submitted 4 September, 2018; v1 submitted 28 August, 2018; originally announced August 2018.

    Comments: Extended abstract at the ECCV 2018 Workshop on Egocentric Perception, Interaction and Computing (EPIC)

  7. arXiv:1711.10795  [pdf, other

    cs.CV cs.AI cs.IR

    Saliency Weighted Convolutional Features for Instance Search

    Authors: Eva Mohedano, Kevin McGuinness, Xavier Giro-i-Nieto, Noel E. O'Connor

    Abstract: This work explores attention models to weight the contribution of local convolutional representations for the instance search task. We present a retrieval framework based on bags of local convolutional features (BLCF) that benefits from saliency weighting to build an efficient image representation. The use of human visual attention models (saliency) allows significant improvements in retrieval per… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

  8. arXiv:1608.08139  [pdf, other

    cs.IR cs.CV

    Where is my Phone ? Personal Object Retrieval from Egocentric Images

    Authors: Cristian Reyes, Eva Mohedano, Kevin McGuinness, Noel E. O'Connor, Xavier Giro-i-Nieto

    Abstract: This work presents a retrieval pipeline and evaluation scheme for the problem of finding the last appearance of personal objects in a large dataset of images captured from a wearable camera. Each personal object is modelled by a small set of images that define a query for a visual search engine.The retrieved results are reranked considering the temporal timestamps of the images to increase the rel… ▽ More

    Submitted 2 March, 2017; v1 submitted 29 August, 2016; originally announced August 2016.

    Comments: Lifelogging Tools and Applications Workshop (LTA'16) at ACM Multimedia 2016

    ACM Class: H.3.3; I.4.9

  9. Bags of Local Convolutional Features for Scalable Instance Search

    Authors: Eva Mohedano, Amaia Salvador, Kevin McGuinness, Ferran Marques, Noel E. O'Connor, Xavier Giro-i-Nieto

    Abstract: This work proposes a simple instance retrieval pipeline based on encoding the convolutional features of CNN using the bag of words aggregation scheme (BoW). Assigning each local array of activations in a convolutional layer to a visual word produces an \textit{assignment map}, a compact representation that relates regions of an image with a visual word. We use the assignment map for fast spatial r… ▽ More

    Submitted 15 April, 2016; originally announced April 2016.

    Comments: Preprint of a short paper accepted in the ACM International Conference on Multimedia Retrieval (ICMR) 2016 (New York City, NY, USA)

  10. arXiv:1504.02356  [pdf, other

    cs.HC cs.CV cs.IR

    Exploring EEG for Object Detection and Retrieval

    Authors: Eva Mohedano, Amaia Salvador, Sergi Porta, Xavier Giró-i-Nieto, Graham Healy, Kevin McGuinness, Noel O'Connor, Alan F. Smeaton

    Abstract: This paper explores the potential for using Brain Computer Interfaces (BCI) as a relevance feedback mechanism in content-based image retrieval. We investigate if it is possible to capture useful EEG signals to detect if relevant objects are present in a dataset of realistic and complex images. We perform several experiments using a rapid serial visual presentation (RSVP) of images at different rat… ▽ More

    Submitted 9 April, 2015; originally announced April 2015.

    Comments: This preprint is the full version of a short paper accepted in the ACM International Conference on Multimedia Retrieval (ICMR) 2015 (Shanghai, China)

    ACM Class: H.1.2; H.3.3

  11. Object Segmentation in Images using EEG Signals

    Authors: Eva Mohedano, Graham Healy, Kevin McGuinness, Xavier Giro-i-Nieto, Noel E. O'Connor, Alan F. Smeaton

    Abstract: This paper explores the potential of brain-computer interfaces in segmenting objects from images. Our approach is centered around designing an effective method for displaying the image parts to the users such that they generate measurable brain reactions. When an image region, specifically a block of pixels, is displayed we estimate the probability of the block containing the object of interest us… ▽ More

    Submitted 19 August, 2014; originally announced August 2014.

    Comments: This is a preprint version prior to submission for peer-review of the paper accepted to the 22nd ACM International Conference on Multimedia (November 3-7, 2014, Orlando, Florida, USA) for the High Risk High Reward session. 10 pages

    ACM Class: H.1.2; I.4.6; C.3