Skip to main content

Showing 1–7 of 7 results for author: Bousquet, P

Searching in archive eess. Search in all archives.
.
  1. arXiv:2403.19634  [pdf, ps, other

    cs.SD cs.CL eess.AS

    Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 2

    Authors: Pierre-Michel Bousquet, Mickael Rouvier

    Abstract: The SdSv challenge Task 2 provided an opportunity to assess efficiency and robustness of modern text-independent speaker verification systems. But it also made it possible to test new approaches, capable of taking into account the main issues of this challenge (duration, language, ...). This paper describes the contributions of our laboratory to the speaker recognition field. These contributions h… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: LIA system description for the Short Duration Speaker Verification (SdSv) challenge 2020 Task 2

  2. arXiv:2312.16885  [pdf, other

    cs.SD eess.AS

    Jeffreys divergence-based regularization of neural network output distribution applied to speaker recognition

    Authors: Pierre-Michel Bousquet, Mickael Rouvier

    Abstract: A new loss function for speaker recognition with deep neural network is proposed, based on Jeffreys Divergence. Adding this divergence to the cross-entropy loss function allows to maximize the target value of the output distribution while smoothing the non-target values. This objective function provides highly discriminative features. Beyond this effect, we propose a theoretical justification of i… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted in ICASSP 2023

  3. arXiv:2211.01091  [pdf, ps, other

    eess.AS cs.AI cs.SD

    I4U System Description for NIST SRE'20 CTS Challenge

    Authors: Kong Aik Lee, Tomi Kinnunen, Daniele Colibro, Claudio Vair, Andreas Nautsch, Hanwu Sun, Liang He, Tianyu Liang, Qiongqiong Wang, Mickael Rouvier, Pierre-Michel Bousquet, Rohan Kumar Das, Ignacio Viñals Bailo, Meng Liu, Héctor Deldago, Xuechen Liu, Md Sahidullah, Sandro Cumani, Boning Zhang, Koji Okabe, Hitoshi Yamamoto, Ruijie Tao, Haizhou Li, Alfonso Ortega Giménez, Longbiao Wang , et al. (1 additional authors not shown)

    Abstract: This manuscript describes the I4U submission to the 2020 NIST Speaker Recognition Evaluation (SRE'20) Conversational Telephone Speech (CTS) Challenge. The I4U's submission was resulted from active collaboration among researchers across eight research teams - I$^2$R (Singapore), UEF (Finland), VALPT (Italy, Spain), NEC (Japan), THUEE (China), LIA (France), NUS (Singapore), INRIA (France) and TJU (C… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: SRE 2021, NIST Speaker Recognition Evaluation Workshop, CTS Speaker Recognition Challenge, 14-12 December 2021

  4. arXiv:2110.05840  [pdf, other

    cs.CR eess.AS

    A bridge between features and evidence for binary attribute-driven perfect privacy

    Authors: Paul-Gauthier Noé, Andreas Nautsch, Driss Matrouf, Pierre-Michel Bousquet, Jean-François Bonastre

    Abstract: Attribute-driven privacy aims to conceal a single user's attribute, contrary to anonymisation that tries to hide the full identity of the user in some data. When the attribute to protect from malicious inferences is binary, perfect privacy requires the log-likelihood-ratio to be zero resulting in no strength-of-evidence. This work presents an approach based on normalizing flow that maps a feature… ▽ More

    Submitted 23 January, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: ICASSP 2022

  5. arXiv:2109.05977  [pdf, other

    eess.AS cs.SD

    Studying squeeze-and-excitation used in CNN for speaker verification

    Authors: Mickael Rouvier, Pierre-Michel Bousquet

    Abstract: In speaker verification, the extraction of voice representations is mainly based on the Residual Neural Network (ResNet) architecture. ResNet is built upon convolution layers which learn filters to capture local spatial patterns along all the input, then generate feature maps that jointly encode the spatial and channel information. Unfortunately, all feature maps in a convolution layer are learnt… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

  6. arXiv:2105.04310  [pdf, other

    eess.AS cs.SD

    Study on the temporal pooling used in deep neural networks for speaker verification

    Authors: Mickael Rouvier, Pierre-Michel Bousquet, Jarod Duret

    Abstract: The x-vector architecture has recently achieved state-of-the-art results on the speaker verification task. This architecture incorporates a central layer, referred to as temporal pooling, which stacks statistical parameters of the acoustic frame distribution. This work proposes to highlight the significant effect of the temporal pooling content on the training dynamics and task performance. An eva… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  7. arXiv:1904.07386  [pdf, other

    eess.AS cs.CL cs.SD

    I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

    Authors: Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, **g Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda , et al. (21 additional authors not shown)

    Abstract: The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE). The latest edition of such joint submission was in SRE 2018, in which the I4U submission was among the best-performing systems. SRE'18 also marks the 10-year anniversary of I4U consortium into NIST SRE series of evaluation. The primary objective of the current paper is to summarize the res… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: 5 pages