Showing 1–2 of 2 results for author: Imamura, K

  1. arXiv:2406.17722  [pdf, other]

    cs.SD eess.AS

    Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals

    Authors: Kentaro Seki, Shinnosuke Takamichi, Norihiro Takamune, Yuki Saito, Kanami Imamura, Hiroshi Saruwatari

    Abstract: This paper proposes a new task called spatial voice conversion, which aims to convert a target voice while preserving spatial information and non-target signals. Traditional voice conversion methods focus on single-channel waveforms, ignoring the stereo listening experience inherent in human hearing. Our baseline approach addresses this gap by integrating blind source separation (BSS), voice conve…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. Algorithms of Sampling-Frequency-Independent Layers for Non-integer Strides

    Authors: Kanami Imamura, Tomohiko Nakamura, Norihiro Takamune, Kohei Yatabe, Hiroshi Saruwatari

    Abstract: In this paper, we propose algorithms for handling non-integer strides in sampling-frequency-independent (SFI) convolutional and transposed convolutional layers. The SFI layers have been developed for handling various sampling frequencies (SFs) by a single neural network. They are replaceable with their non-SFI counterparts and can be introduced into various network architectures. However, they cou…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures, accepted for European Signal Processing Conference 2023 (EUSIPCO 2023)

    Journal ref: European Signal Processing Conference, Sep. 2023, pp. 326--330