Skip to main content

Showing 1–6 of 6 results for author: Igarashi, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.07280  [pdf, ps, other

    cs.SD eess.AS

    Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment

    Authors: Takuto Igarashi, Yuki Saito, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari

    Abstract: We propose noise-robust voice conversion (VC) which takes into account the recording quality and environment of noisy source speech. Conventional denoising training improves the noise robustness of a VC model by learning noisy-to-clean VC process. However, the naturalness of the converted speech is limited when the noise of the source speech is unseen during the training. To this end, our proposed… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, accepted for INTERSPEECH 2024, audio samples: http://y-saito.sakura.ne.jp/sython/Corpus/SRC4VC/IS2024_CDT_supplementary/demo_cdt.html

  2. arXiv:2406.07254  [pdf, ps, other

    cs.SD eess.AS

    SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark

    Authors: Yuki Saito, Takuto Igarashi, Kentaro Seki, Shinnosuke Takamichi, Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari

    Abstract: We present SRC4VC, a new corpus containing 11 hours of speech recorded on smartphones by 100 Japanese speakers. Although high-quality multi-speaker corpora can advance voice conversion (VC) technologies, they are not always suitable for testing VC when low-quality speech recording is given as the input. To this end, we first asked 100 crowdworkers to record their voice samples using smartphones. T… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted for INTERSPEECH 2024, corpus project page: https://y-saito.sakura.ne.jp/sython/Corpus/SRC4VC/index.html

  3. arXiv:2104.04291  [pdf, other

    cs.CV cs.LG eess.IV

    Brain Surface Reconstruction from MRI Images Based on Segmentation Networks Applying Signed Distance Maps

    Authors: Heng Fang, Xi Yang, Taichi Kin, Takeo Igarashi

    Abstract: Whole-brain surface extraction is an essential topic in medical imaging systems as it provides neurosurgeons with a broader view of surgical planning and abnormality detection. To solve the problem confronted in current deep learning skull strip** methods lacking prior shape information, we propose a new network architecture that incorporates knowledge of signed distance fields and introduce an… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted by IEEE ISBI 2021 (International Symposium on Biomedical Imaging)

  4. arXiv:2010.03190  [pdf, other

    cs.SD cs.HC eess.AS

    Generative Melody Composition with Human-in-the-Loop Bayesian Optimization

    Authors: Yijun Zhou, Yuki Koyama, Masataka Goto, Takeo Igarashi

    Abstract: Deep generative models allow even novice composers to generate various melodies by sampling latent vectors. However, finding the desired melody is challenging since the latent space is unintuitive and high-dimensional. In this work, we present an interactive system that supports generative melody composition with human-in-the-loop Bayesian optimization (BO). This system takes a mixed-initiative ap… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: 10 pages, 2 figures, Proceedings of the 2020 Joint Conference on AI Music Creativity (CSMC-MuMe 2020)

    ACM Class: J.5; H.5.5; H.5.2

  5. arXiv:2006.16161  [pdf, other

    eess.IV cs.LG

    A Two-step Surface-based 3D Deep Learning Pipeline for Segmentation of Intracranial Aneurysms

    Authors: Xi Yang, Ding Xia, Taichi Kin, Takeo Igarashi

    Abstract: The exact shape of intracranial aneurysms is critical in medical diagnosis and surgical planning. While voxel-based deep learning frameworks have been proposed for this segmentation task, their performance remains limited. In this study, we offer a two-step surface-based deep learning pipeline that achieves significantly higher performance. Our proposed model takes a surface model of entire princi… ▽ More

    Submitted 4 July, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: It is a pre-released version

  6. arXiv:2003.02920  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    IntrA: 3D Intracranial Aneurysm Dataset for Deep Learning

    Authors: Xi Yang, Ding Xia, Taichi Kin, Takeo Igarashi

    Abstract: Medicine is an important application area for deep learning models. Research in this field is a combination of medical expertise and data science knowledge. In this paper, instead of 2D medical images, we introduce an open-access 3D intracranial aneurysm dataset, IntrA, that makes the application of points-based and mesh-based classification and segmentation models available. Our dataset can be us… ▽ More

    Submitted 6 April, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: Accepted by cvpr2020, camera-ready version will be uploaded later