Skip to main content

Showing 1–8 of 8 results for author: Diego, F

.
  1. arXiv:2301.13618  [pdf, other

    cs.LG cs.NI eess.SY

    Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning

    Authors: Gabriele Castellano, Juan-José Nieto, Jordi Luque, Ferrán Diego, Carlos Segura, Diego Perino, Flavio Esposito, Fulvio Risso, Aravindh Raman

    Abstract: Many real-time applications (e.g., Augmented/Virtual Reality, cognitive assistance) rely on Deep Neural Networks (DNNs) to process inference tasks. Edge computing is considered a key infrastructure to deploy such applications, as moving computation close to the data sources enables us to meet stringent latency and throughput requirements. However, the constrained nature of edge networks poses seve… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  2. arXiv:2104.08086  [pdf, other

    eess.AS

    Efficient Keyword Spotting by capturing long-range interactions with Temporal Lambda Networks

    Authors: Biel Tura, Santiago Escuder, Ferran Diego, Carlos Segura, Jordi Luque

    Abstract: Models based on attention mechanisms have shown unprecedented speech recognition performance. However, they are computationally expensive and unnecessarily complex for keyword spotting, a task targeted to small-footprint devices. This work explores the application of Lambda networks, an alternative framework for capturing long-range interactions without attention, for the keyword spotting task. We… ▽ More

    Submitted 1 July, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: speech recognition, keyword spotting, lambda networks

  3. arXiv:2006.00785  [pdf, ps, other

    cs.CV cs.CL cs.IR cs.MM

    Transcription-Enriched Joint Embeddings for Spoken Descriptions of Images and Videos

    Authors: Benet Oriol, Jordi Luque, Ferran Diego, Xavier Giro-i-Nieto

    Abstract: In this work, we propose an effective approach for training unique embedding representations by combining three simultaneous modalities: image and spoken and textual narratives. The proposed methodology departs from a baseline system that spawns a embedding space trained with only spoken narratives and image cues. Our experiments on the EPIC-Kitchen and Places Audio Caption datasets show that intr… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted for presentation at EPIC@CVPR2020 workshop

  4. arXiv:1911.07808  [pdf, other

    cs.CV

    Unsupervised Representation Learning by Discovering Reliable Image Relations

    Authors: Timo Milbich, Omair Ghori, Ferran Diego, Björn Ommer

    Abstract: Learning robust representations that allow to reliably establish relations between images is of paramount importance for virtually all of computer vision. Annotating the quadratic number of pairwise relations between training images is simply not feasible, while unsupervised inference is prone to noise, thus leaving the vast majority of these relations to be unreliable. To nevertheless find those… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: Accepted for Publication in 'Pattern Recognition Journal'

  5. arXiv:1810.09726  [pdf, other

    cs.CV

    CEREALS - Cost-Effective REgion-based Active Learning for Semantic Segmentation

    Authors: Radek Mackowiak, Philip Lenz, Omair Ghori, Ferran Diego, Oliver Lange, Carsten Rother

    Abstract: State of the art methods for semantic image segmentation are trained in a supervised fashion using a large corpus of fully labeled training images. However, gathering such a corpus is expensive, due to human annotation effort, in contrast to gathering unlabeled data. We propose an active learning-based strategy, called CEREALS, in which a human only has to hand-label a few, automatically selected,… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: Published at British Machine Vision Conference 2018 (BMVC)

  6. arXiv:1606.07029  [pdf, other

    q-bio.NC

    Sparse convolutional coding for neuronal ensemble identification

    Authors: Sven Peter, Daniel Durstewitz, Ferran Diego, Fred A. Hamprecht

    Abstract: Cell ensembles, originally proposed by Donald Hebb in 1949, are subsets of synchronously firing neurons and proposed to explain basic firing behavior in the brain. Despite having been studied for many years no conclusive evidence has been presented yet for their existence and involvement in information processing such that their identification is still a topic of modern research, especially since… ▽ More

    Submitted 22 June, 2016; originally announced June 2016.

    Comments: 12 pages, 6 figures

  7. arXiv:1412.3159  [pdf, ps, other

    cs.CV

    Road Detection via On--line Label Transfer

    Authors: José M. Álvarez, Ferran Diego, Joan Serrat, Antonio M. López

    Abstract: Vision-based road detection is an essential functionality for supporting advanced driver assistance systems (ADAS) such as road following and vehicle and pedestrian detection. The major challenges of road detection are dealing with shadows and lighting variations and the presence of other objects in the scene. Current road detection algorithms characterize road areas at pixel level and group pixel… ▽ More

    Submitted 9 December, 2014; originally announced December 2014.

  8. The hard X-ray shortages prompted by the clock bursts in GS 1826--238

    Authors: Ji Long, Zhang Shu, Chen YuPeng, Zhang Shuang-Nan, Torres F. Diego, Kretschmar Peter, Li Jian

    Abstract: We report on a study of GS 1826--238 using all available {\it RXTE} observations, concentrating on the behavior of the hard X-rays during type-I bursts. We find a hard X-ray shortage at 30--50 keV promoted by the shower of soft X-rays coming from type-I bursts. This shortage happens with a time delay after the peak of the soft flux of 3.6 $\pm$ 1.2 seconds.The behavior of hard X-rays during bursts… ▽ More

    Submitted 15 December, 2013; originally announced December 2013.

    Comments: 11 pages, 4 figures, Accepted to the ApJ