Skip to main content

Showing 1–10 of 10 results for author: Álvarez, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2401.05386  [pdf, ps, other

    eess.SP cs.HC cs.LG

    EMG subspace alignment and visualization for cross-subject hand gesture classification

    Authors: Martin Colot, Cédric Simar, Mathieu Petieau, Ana Maria Cebolla Alvarez, Guy Cheron, Gianluca Bontempi

    Abstract: Electromyograms (EMG)-based hand gesture recognition systems are a promising technology for human/machine interfaces. However, one of their main limitations is the long calibration time that is typically required to handle new users. The paper discusses and analyses the challenge of cross-subject generalization thanks to an original dataset containing the EMG signals of 14 human subjects during ha… ▽ More

    Submitted 18 December, 2023; originally announced January 2024.

    Comments: 8 pages + 1 appendix page 6 figures (one in appendix) Published in the Adapting to Change: Reliable Learning Across Domains workshop from ECML-PKDD 2023

  2. arXiv:2306.00854  [pdf, other

    eess.IV cs.CV

    Spatio-Angular Convolutions for Super-resolution in Diffusion MRI

    Authors: Matthew Lyon, Paul Armitage, Mauricio A Álvarez

    Abstract: Diffusion MRI (dMRI) is a widely used imaging modality, but requires long scanning times to acquire high resolution datasets. By leveraging the unique geometry present within this domain, we present a novel approach to dMRI angular super-resolution that extends upon the parametric continuous convolution (PCConv) framework. We introduce several additions to the operation including a Fourier feature… ▽ More

    Submitted 1 December, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  3. arXiv:2204.01399  [pdf, other

    eess.AS

    The Vicomtech Spoofing-Aware Biometric System for the SASV Challenge

    Authors: Juan M. Martín-Doñas, Iván G. Torre, Aitor Álvarez, Joaquin Arellano

    Abstract: This paper describes our proposed integration system for the spoofing-aware speaker verification challenge. It consists of a robust spoofing-aware verification system that use the speaker verification and antispoofing embeddings extracted from specialized neural networks. First, an integration network, fed with the test utterance's speaker verification and spoofing embeddings, is used to compute a… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Submitted to Interspeech 2022

  4. arXiv:2203.15598  [pdf, other

    eess.IV cs.CV

    Angular Super-Resolution in Diffusion MRI with a 3D Recurrent Convolutional Autoencoder

    Authors: Matthew Lyon, Paul Armitage, Mauricio A. Álvarez

    Abstract: High resolution diffusion MRI (dMRI) data is often constrained by limited scanning time in clinical settings, thus restricting the use of downstream analysis techniques that would otherwise be available. In this work we develop a 3D recurrent convolutional neural network (RCNN) capable of super-resolving dMRI volumes in the angular (q-space) domain. Our approach formulates the task of angular supe… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted to published in MIDL'22. Openreview link: https://openreview.net/forum?id=U6HJMtAgW-N

  5. arXiv:2203.01573  [pdf, other

    eess.AS cs.SD

    The Vicomtech Audio Deepfake Detection System based on Wav2Vec2 for the 2022 ADD Challenge

    Authors: Juan M. Martín-Doñas, Aitor Álvarez

    Abstract: This paper describes our submitted systems to the 2022 ADD challenge withing the tracks 1 and 2. Our approach is based on the combination of a pre-trained wav2vec2 feature extractor and a downstream classifier to detect spoofed audio. This method exploits the contextualized speech representations at the different transformer layers to fully capture discriminative information. Furthermore, the clas… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted by ICASSP 2022

  6. arXiv:2008.00667  [pdf, other

    eess.AS

    Learning Intonation Pattern Embeddings for Arabic Dialect Identification

    Authors: Aitor Arronte Alvarez, Elsayed Sabry Abdelaal Issa

    Abstract: This article presents a full end-to-end pipeline for Arabic Dialect Identification (ADI) using intonation patterns and acoustic representations. Recent approaches to language and dialect identification use linguistic-aware deep architectures that are able to capture phonetic differences amongst languages and dialects. Specifically, in ADI tasks, different combinations of linguistic features and ac… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Comments: Accepted for INTERSPEECH 2020

  7. arXiv:2008.00198  [pdf, other

    eess.AS cs.SD

    Singer Identification Using Convolutional Acoustic Motif Embeddings

    Authors: Aitor Arronte Alvarez, Francisco Gomez-Martin

    Abstract: Flamenco singing is characterized by pitch instability, micro-tonal ornamentations, large vibrato ranges, and a high degree of melodic variability. These musical features make the automatic identification of flamenco singers a difficult computational task. In this article we present an end-to-end pipeline for flamenco singer identification based on acoustic motif embeddings. In the approach taken,… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

    Comments: 5 pages

  8. arXiv:1912.08813  [pdf, other

    eess.IV cs.CV cs.LG

    Ambient Lighting Generation for Flash Images with Guided Conditional Adversarial Networks

    Authors: José Chávez, Rensso Mora, Edward Cayllahua-Cahuina

    Abstract: To cope with the challenges that low light conditions produce in images, photographers tend to use the light provided by the camera flash to get better illumination. Nevertheless, harsh shadows and non-uniform illumination can arise from using a camera flash, especially in low light conditions. Previous studies have focused on normalizing the lighting on flash images; however, to the best of our k… ▽ More

    Submitted 20 February, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: VISAPP 2020

  9. arXiv:1810.12679  [pdf, other

    eess.AS cs.LG cs.SD eess.SP stat.ML

    Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain

    Authors: Pablo A. Alvarado, Mauricio A. Álvarez, Dan Stowell

    Abstract: Gaussian process (GP) audio source separation is a time-domain approach that circumvents the inherent phase approximation issue of spectrogram based methods. Furthermore, through its kernel, GPs elegantly incorporate prior knowledge about the sources into the separation model. Despite these compelling advantages, the computational complexity of GP inference scales cubically with the number of audi… ▽ More

    Submitted 21 November, 2018; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: Paper submitted to the 44th International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019. To be held in Brighton, United Kingdom, between May 12 and May 17, 2019

  10. arXiv:1709.05409  [pdf, other

    eess.SY math.DS stat.ME stat.ML

    Gaussian Process Latent Force Models for Learning and Stochastic Control of Physical Systems

    Authors: Simo Särkkä, Mauricio A. Álvarez, Neil D. Lawrence

    Abstract: This article is concerned with learning and stochastic control in physical systems which contain unknown input signals. These unknown signals are modeled as Gaussian processes (GP) with certain parametrized covariance structures. The resulting latent force models (LFMs) can be seen as hybrid models that contain a first-principles physical model part and a non-parametric GP model part. We briefly r… ▽ More

    Submitted 13 August, 2018; v1 submitted 15 September, 2017; originally announced September 2017.