Skip to main content

Showing 1–4 of 4 results for author: Birkholz, P

.
  1. arXiv:2204.09381  [pdf, other

    cs.SD cs.CL eess.AS

    Exploration strategies for articulatory synthesis of complex syllable onsets

    Authors: Daniel R. van Niekerk, Anqi Xu, Branislav Gerazov, Paul K. Krug, Peter Birkholz, Yi Xu

    Abstract: High-quality articulatory speech synthesis has many potential applications in speech science and technology. However, develo** appropriate map**s from linguistic specification to articulatory gestures is difficult and time consuming. In this paper we construct an optimisation-based framework as a first step towards learning these map**s without manual intervention. We demonstrate the product… ▽ More

    Submitted 30 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: Accepted at Interspeech 2022

  2. PyRCN: A Toolbox for Exploration and Application of Reservoir Computing Networks

    Authors: Peter Steiner, Azarakhsh Jalalvand, Simon Stone, Peter Birkholz

    Abstract: Reservoir Computing Networks (RCNs) belong to a group of machine learning techniques that project the input space non-linearly into a high-dimensional feature space, where the underlying task can be solved linearly. Popular variants of RCNs are capable of solving complex tasks equivalently to widely used deep neural networks, but with a substantially simpler training paradigm based on linear regre… ▽ More

    Submitted 10 May, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Preprint accepted for publication in Engineering Applications of Artificial Intelligence

    Journal ref: Engineering Applications of Artificial Intelligence 113 (2022) 104964

  3. Cluster-based Input Weight Initialization for Echo State Networks

    Authors: Peter Steiner, Azarakhsh Jalalvand, Peter Birkholz

    Abstract: Echo State Networks (ESNs) are a special type of recurrent neural networks (RNNs), in which the input and recurrent connections are traditionally generated randomly, and only the output weights are trained. Despite the recent success of ESNs in various tasks of audio, image and radar recognition, we postulate that a purely random initialization is not the ideal way of initializing ESNs. The aim of… ▽ More

    Submitted 20 January, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in IEEE Transactions on Neural Network and Learning System (TNNLS), 2022

  4. arXiv:2005.09986  [pdf, other

    eess.AS cs.SD

    Evaluating Features and Metrics for High-Quality Simulation of Early Vocal Learning of Vowels

    Authors: Branislav Gerazov, Daniel van Niekerk, Anqi Xu, Paul Konstantin Krug, Peter Birkholz, Yi Xu

    Abstract: The way infants use auditory cues to learn to speak despite the acoustic mismatch of their vocal apparatus is a hot topic of scientific debate. The simulation of early vocal learning using articulatory speech synthesis offers a way towards gaining a deeper understanding of this process. One of the crucial parameters in these simulations is the choice of features and a metric to evaluate the acoust… ▽ More

    Submitted 2 April, 2021; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Submitted to INTERSPEECH 2021