Skip to main content

Showing 1–12 of 12 results for author: Nishizaki, H

.
  1. arXiv:2401.03547  [pdf, ps, other

    cs.RO cs.HC

    Overview of Dialogue Robot Competition 2023

    Authors: Takashi Minato, Ryuichiro Higashinaka, Kurima Sakai, Tomo Funayama, Hiromitsu Nishizaki, Takayuki Naga

    Abstract: We have held dialogue robot competitions in 2020 and 2022 to compare the performances of interactive robots using an android that closely resembles a human. In 2023, the third competition DRC2023 was held. The task of DRC2023 was designed to be more challenging than the previous travel agent dialogue tasks. Since anyone can now develop a dialogue system using LLMs, the participating teams are requ… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Proceedings of Dialogue Robot Competition 2023. arXiv admin note: text overlap with arXiv:2210.12863

  2. arXiv:2312.14430   

    cs.RO

    Proceedings of the Dialogue Robot Competition 2023

    Authors: Ryuichiro Higashinaka, Takashi Minato, Hiromitsu Nishizaki, Takayuki Nagai

    Abstract: The Dialogic Robot Competition 2023 (DRC2023) is a competition for humanoid robots (android robots that closely resemble humans) to compete in interactive capabilities. This is the third year of the competition. The top four teams from the preliminary competition held in November 2023 will compete in the final competition on Saturday, December 23. The task for the interactive robots is to recommen… ▽ More

    Submitted 14 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: This is a proceedings of the Dialogue Robot Competition 2023

  3. arXiv:2307.01546  [pdf, other

    cs.SD eess.AS

    Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure

    Authors: Yikang Wang, Hiromitsu Nishizaki, Ming Li

    Abstract: Finding synthetic artifacts of spoofing data will help the anti-spoofing countermeasures (CMs) system discriminate between spoofed and real speech. The Conformer combines the best of convolutional neural network and the Transformer, allowing it to aggregate global and local information. This may benefit the CM system to capture the synthetic artifacts hidden both locally and globally. In this pape… ▽ More

    Submitted 30 October, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 7 pages, 2 figures

  4. arXiv:2211.06546  [pdf, other

    cs.SD eess.AS

    Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities

    Authors: Yikang Wang, Xingming Wang, Hiromitsu Nishizaki, Ming Li

    Abstract: A reliable voice anti-spoofing countermeasure system needs to robustly protect automatic speaker verification (ASV) systems in various kinds of spoofing scenarios. However, the performance of countermeasure systems could be degraded by channel effects and codecs. In this paper, we show that using the low-frequency subbands of signals as input can mitigate the negative impact introduced by codecs o… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 5 pages, 3 figures, accepted by ISCSLP 2022

  5. arXiv:2210.12863  [pdf, ps, other

    cs.RO cs.HC

    Overview of Dialogue Robot Competition 2022

    Authors: Takashi Minato, Ryuichiro Higashinaka, Kurima Sakai, Tomo Funayama, Hiromitsu Nishizaki, Takayuki Nagai

    Abstract: Although many competitions have been held on dialogue systems in the past, no competition has been organized specifically for dialogue with humanoid robots. As the first such attempt in the world, we held a dialogue robot competition in 2020 to compare the performances of interactive robots using an android that closely resembles a human. Dialogue Robot Competition 2022 (DRC2022) was the second co… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: This paper is part of the proceedings of the Dialogue Robot Competition 2022

  6. arXiv:2210.12034  other

    cs.RO

    Proceedings of the Dialogue Robot Competition 2022

    Authors: Ryuichiro Higashinaka, Takashi Minato, Hiromitsu Nishizaki, Takayuki Nagai

    Abstract: The proceedings contain papers on the dialogue systems developed by the twelve teams participating in DRC2022, as well as an overview paper summarizing the competition.

    Submitted 25 October, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Proceedings of the Dialogue Robot Competition 2022

  7. arXiv:2203.16085  [pdf, other

    cs.SD eess.AS

    Combination of Time-domain, Frequency-domain, and Cepstral-domain Acoustic Features for Speech Commands Classification

    Authors: Yikang Wang, Hiromitsu Nishizaki

    Abstract: In speech-related classification tasks, frequency-domain acoustic features such as logarithmic Mel-filter bank coefficients (FBANK) and cepstral-domain acoustic features such as Mel-frequency cepstral coefficients (MFCC) are often used. However, time-domain features perform more effectively in some sound classification tasks which contain non-vocal or weakly speech-related sounds. We previously pr… ▽ More

    Submitted 16 June, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 5 pages, 4 figures

  8. arXiv:2203.15473  [pdf, other

    eess.AS

    Frequency-Directional Attention Model for Multilingual Automatic Speech Recognition

    Authors: Akihiro Dobashi, Chee Siang Leow, Hiromitsu Nishizaki

    Abstract: This paper proposes a model for transforming speech features using the frequency-directional attention model for End-to-End (E2E) automatic speech recognition. The idea is based on the hypothesis that in the phoneme system of each language, the characteristics of the frequency bands of speech when uttering them are different. By transforming the input Mel filter bank features with an attention mod… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: submitted to INTERSPEECH2022

  9. arXiv:2110.03511  [pdf, other

    eess.AS cs.LG cs.SD

    Peer Collaborative Learning for Polyphonic Sound Event Detection

    Authors: Hayato Endo, Hiromitsu Nishizaki

    Abstract: This paper describes that semi-supervised learning called peer collaborative learning (PCL) can be applied to the polyphonic sound event detection (PSED) task, which is one of the tasks in the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge. Many deep learning models have been studied to find out what kind of sound events occur where and for how long in a given audio c… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Submitted to ICASSP 2022

  10. arXiv:2104.01384  [pdf, other

    eess.AS cs.CL

    ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi

    Authors: Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki

    Abstract: This paper describes the ExKaldi-RT online automatic speech recognition (ASR) toolkit that is implemented based on the Kaldi ASR toolkit and Python language. ExKaldi-RT provides tools for building online recognition pipelines. While similar tools are available built on Kaldi, a key feature of ExKaldi-RT that it works on Python, which has an easy-to-use interface that allows online ASR system devel… ▽ More

    Submitted 8 August, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Accepted at the IEEE 10th Global Conference on Consumer Electronics

  11. arXiv:1904.04364  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Audio Classification of Bit-Representation Waveform

    Authors: Masaki Okawa, Takuya Saito, Naoki Sawada, Hiromitsu Nishizaki

    Abstract: This study investigated the waveform representation for audio signal classification. Recently, many studies on audio waveform classification such as acoustic event detection and music genre classification have been published. Most studies on audio waveform classification have proposed the use of a deep learning (neural network) framework. Generally, a frequency analysis method such as Fourier tran… ▽ More

    Submitted 18 September, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Accepted at INTERSPEECH2019

  12. arXiv:cond-mat/9904348  [pdf, ps, other

    cond-mat.supr-con cond-mat.str-el

    Evidence for incommensurate spin fluctuations in Sr_2RuO_4

    Authors: Y. Sidis, M. Braden, P. Bourges, B. Hennion. S. Nishizaki, Y. Maeno, Y. Mori

    Abstract: We report first inelastic neutron scattering measurements in the normal state of Sr_2RuO_4 that reveal the existence of incommensurate magnetic spin fluctuations located at ${\bf q}_0=(\pm 0.6π/a, \pm 0.6π/a, 0)$. This finding confirms recent band structure calculations that have predicted incommensurate magnetic responses related to dynamical nesting properties of its Fermi surface.

    Submitted 23 April, 1999; originally announced April 1999.

    Journal ref: Phys. Rev. Lett., 83, 3320 (1999).