
Showing 1–5 of 5 results for author: Boyer, F

Searching in archive eess.

  1. arXiv:2306.07258  [pdf, other]

    cs.RO eess.SY physics.class-ph

    Input Decoupling of Lagrangian Systems via Coordinate Transformation: General Characterization and its Application to Soft Robotics

    Authors: Pietro Pustina, Cosimo Della Santina, Frédéric Boyer, Alessandro De Luca, Federico Renda

    Abstract: Suitable representations of dynamical systems can simplify their analysis and control. On this line of thought, this paper aims to answer the following question: Can a transformation of the generalized coordinates under which the actuators directly perform work on a subset of the configuration variables be found? Not only we show that the answer to this question is yes, but we also provide necessa…

    Submitted 23 February, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  2. arXiv:2201.05420  [pdf, other]

    eess.AS cs.SD

    A Study of Transducer based End-to-End ASR with ESPnet: Architecture, Auxiliary Loss and Decoding Strategies

    Authors: Florian Boyer, Yusuke Shinohara, Takaaki Ishii, Hirofumi Inaguma, Shinji Watanabe

    Abstract: In this study, we present recent developments of models trained with the RNN-T loss in ESPnet. It involves the use of various architectures such as recently proposed Conformer, multi-task learning with different auxiliary criteria and multiple decoding strategies, including our own proposition. Through experiments and benchmarks, we show that our proposed systems can be competitive against other s…

    Submitted 14 January, 2022; originally announced January 2022.

  3. arXiv:2012.13006  [pdf, other]

    eess.AS cs.SD

    The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

    Authors: Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang

    Abstract: This paper describes the recent development of ESPnet (https://github.com/espnet/espnet), an end-to-end speech processing toolkit. This project was initiated in December 2017 to mainly deal with end-to-end speech recognition experiments based on sequence-to-sequence modeling. The project has grown rapidly and now covers a wide range of speech processing applications. Now ESPnet also includes text…

    Submitted 23 December, 2020; originally announced December 2020.

  4. arXiv:2010.13956  [pdf, other]

    eess.AS cs.SD

    Recent Developments on ESPnet Toolkit Boosted by Conformer

    Authors: Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang

    Abstract: In this study, we present recent developments on ESPnet: End-to-End Speech Processing toolkit, which mainly involves a recently proposed architecture called Conformer, Convolution-augmented Transformer. This paper shows the results for a wide range of end-to-end speech processing applications, such as automatic speech recognition (ASR), speech translations (ST), speech separation (SS) and text-to-…

    Submitted 29 October, 2020; v1 submitted 26 October, 2020; originally announced October 2020.

  5. arXiv:1910.08502  [pdf, ps, other]

    cs.CL eess.AS

    End-to-End Speech Recognition: A review for the French Language

    Authors: Florian Boyer, Jean-Luc Rouas

    Abstract: Recently, end-to-end ASR based either on sequence-to-sequence networks or on the CTC objective function gained a lot of interest from the community, achieving competitive results over traditional systems using robust but complex pipelines. One of the main features of end-to-end systems, in addition to the ability to free themselves from extra linguistic resources such as dictionaries or language m…

    Submitted 23 October, 2019; v1 submitted 18 October, 2019; originally announced October 2019.

    Comments: 10 pages, 2 column-style