Skip to main content

Showing 1–5 of 5 results for author: Ozaki, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.13460  [pdf, other

    cs.LG stat.ML

    Multi-Objective Bayesian Optimization with Active Preference Learning

    Authors: Ryota Ozaki, Kazuki Ishikawa, Youhei Kanzaki, Shinya Suzuki, Shion Takeno, Ichiro Takeuchi, Masayuki Karasuyama

    Abstract: There are a lot of real-world black-box optimization problems that need to optimize multiple criteria simultaneously. However, in a multi-objective optimization (MOO) problem, identifying the whole Pareto front requires the prohibitive search cost, while in many practical scenarios, the decision maker (DM) only needs a specific solution among the set of the Pareto optimal solutions. We propose a B… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  2. Unsupervised Multimodal Word Discovery based on Double Articulation Analysis with Co-occurrence cues

    Authors: Akira Taniguchi, Hiroaki Murakami, Ryo Ozaki, Tadahiro Taniguchi

    Abstract: Human infants acquire their verbal lexicon with minimal prior knowledge of language based on the statistical properties of phonological distributions and the co-occurrence of other sensory stimuli. This study proposes a novel fully unsupervised learning method for discovering speech units using phonological information as a distributional cue and object information as a co-occurrence cue. The prop… ▽ More

    Submitted 21 August, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE TRANSACTIONS ON COGNITIVE DEVELOPMENTAL SYSTEMS

  3. arXiv:2104.01807  [pdf, other

    cs.SD cs.CL eess.AS

    StarGAN-based Emotional Voice Conversion for Japanese Phrases

    Authors: Asuka Moritani, Ryo Ozaki, Shoki Sakamoto, Hirokazu Kameoka, Tadahiro Taniguchi

    Abstract: This paper shows that StarGAN-VC, a spectral envelope transformation method for non-parallel many-to-many voice conversion (VC), is capable of emotional VC (EVC). Although StarGAN-VC has been shown to enable speaker identity conversion, its capability for EVC for Japanese phrases has not been clarified. In this paper, we describe the direct application of StarGAN-VC to an EVC task with minimal fun… ▽ More

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: Submitted to Interspeech 2021

  4. Double Articulation Analyzer with Prosody for Unsupervised Word and Phoneme Discovery

    Authors: Yasuaki Okuda, Ryo Ozaki, Tadahiro Taniguchi

    Abstract: Infants acquire words and phonemes from unsegmented speech signals using segmentation cues, such as distributional, prosodic, and co-occurrence cues. Many pre-existing computational models that represent the process tend to focus on distributional or prosodic cues. This paper proposes a nonparametric Bayesian probabilistic generative model called the prosodic hierarchical Dirichlet process-hidden… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 11 pages, Submitted to IEEE Transactions on Cognitive and Developmental Systems

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems, 2022

  5. Unsupervised Phoneme and Word Discovery from Multiple Speakers using Double Articulation Analyzer and Neural Network with Parametric Bias

    Authors: Ryo Nakashima, Ryo Ozaki, Tadahiro Taniguchi

    Abstract: This paper describes a new unsupervised machine learning method for simultaneous phoneme and word discovery from multiple speakers. Human infants can acquire knowledge of phonemes and words from interactions with his/her mother as well as with others surrounding him/her. From a computational perspective, phoneme and word discovery from multiple speakers is a more challenging problem than that from… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 21 pages. Submitted

    Journal ref: Front. Robot. AI, 2019, 6:92