Skip to main content

Showing 1–2 of 2 results for author: Safari, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2008.01077  [pdf, other

    eess.AS cs.LG cs.SD

    Self-attention encoding and pooling for speaker recognition

    Authors: Pooyan Safari, Miquel India, Javier Hernando

    Abstract: The computing power of mobile devices limits the end-user applications in terms of storage size, processing, memory and energy consumption. These limitations motivate researchers for the design of more efficient deep models. On the other hand, self-attention networks based on Transformer architecture have attracted remarkable interests due to their high parallelization capabilities and strong perf… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

  2. arXiv:2007.13199  [pdf, other

    eess.AS cs.SD

    Double Multi-Head Attention for Speaker Verification

    Authors: Miquel India, Pooyan Safari, Javier Hernando

    Abstract: Most state-of-the-art Deep Learning systems for speaker verification are based on speaker embedding extractors. These architectures are commonly composed of a feature extractor front-end together with a pooling layer to encode variable-length utterances into fixed-length speaker vectors. In this paper we present Double Multi-Head Attention pooling, which extends our previous approach based on Self… ▽ More

    Submitted 9 January, 2021; v1 submitted 26 July, 2020; originally announced July 2020.