Showing 1–11 of 11 results for author: Kano, T

  1. arXiv:2406.18972  [pdf, ps, other]

    eess.AS cs.CL

    Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over

    Authors: Atsunori Ogawa, Naoyuki Kamo, Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Takatomo Kano, Naohiro Tawara, Marc Delcroix

    Abstract: Large language models (LLMs) have been successfully applied for rescoring automatic speech recognition (ASR) hypotheses. However, their ability to rescore ASR hypotheses of casual conversations has not been sufficiently explored. In this study, we reveal it by performing N-best ASR hypotheses rescoring using Llama2 on the CHiME-7 distant ASR (DASR) task. Llama2 is one of the most representative LL…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 5 pages

  2. arXiv:2312.14609  [pdf, ps, other]

    eess.AS cs.CL

    BLSTM-Based Confidence Estimation for End-to-End Speech Recognition

    Authors: Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix

    Abstract: Confidence estimation, in which we estimate the reliability of each recognized token (e.g., word, sub-word, and character) in automatic speech recognition (ASR) hypotheses and detect incorrectly recognized tokens, is an important function for developing ASR applications. In this study, we perform confidence estimation for end-to-end (E2E) ASR hypotheses. Recent E2E ASR systems show high performanc…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2021

  3. arXiv:2306.04233  [pdf, other]

    cs.CL cs.SD eess.AS

    Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization

    Authors: Kohei Matsuura, Takanori Ashihara, Takafumi Moriya, Tomohiro Tanaka, Takatomo Kano, Atsunori Ogawa, Marc Delcroix

    Abstract: End-to-end speech summarization (E2E SSum) directly summarizes input speech into easy-to-read short sentences with a single model. This approach is promising because, in contrast to the conventional cascade approach, it can utilize full acoustical information and mitigate the propagation of transcription errors. However, due to the high cost of collecting speech-summary pairs, an E2E SSum model…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted by Interspeech 2023

  4. arXiv:2111.08201  [pdf, other]

    eess.AS cs.CL

    Attention-based Multi-hypothesis Fusion for Speech Summarization

    Authors: Takatomo Kano, Atsunori Ogawa, Marc Delcroix, Shinji Watanabe

    Abstract: Speech summarization, which generates a text summary from speech, can be achieved by combining automatic speech recognition (ASR) and text summarization (TS). With this cascade approach, we can exploit state-of-the-art models and large training datasets for both subtasks, i.e., Transformer for ASR and Bidirectional Encoder Representations from Transformers (BERT) for TS. However, ASR errors direct…

    Submitted 15 November, 2021; originally announced November 2021.

  5. arXiv:2011.04845  [pdf, other]

    cs.CL

    Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS

    Authors: Katsuhito Sudoh, Takatomo Kano, Sashi Novitasari, Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura

    Abstract: This paper presents a newly developed, simultaneous neural speech-to-speech translation system and its evaluation. The system consists of three fully-incremental neural processing modules for automatic speech recognition (ASR), machine translation (MT), and text-to-speech synthesis (TTS). We investigated its overall latency in the system's Ear-Voice Span and speaking latency along with module-leve…

    Submitted 11 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: 6 pages

  6. arXiv:2007.11988  [pdf, ps, other]

    physics.soc-ph math.DS

    An agent-based model for interrelation between COVID-19 outbreak and economic activities

    Authors: Takeshi Kano, Kotaro Yasui, Taishi Mikami, Munehiro Asally, Akio Ishiguro

    Abstract: As of July 2020, acute respiratory syndrome caused by coronavirus COVID-19 is spreading over the world and causing severe economic damage. While minimizing human contact is effective in managing the outbreak, it causes severe economic losses. Strategies solving this dilemma by considering the interrelation between the spread of the virus and economic activities are urgently needed for mitigating the…

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 15 pages, 10 figures

  7. arXiv:1808.03812  [pdf, ps, other]

    cs.RO

    Swarm Robots Inspired by Friendship Formation Process

    Authors: Takeshi Kano, Naoki Matsui, Eiichi Naito, Takenobu Aoshima, Akio Ishiguro

    Abstract: Swarm robotic systems are systems in which multiple robots having simple functionality perform tasks through their cooperation, and are advantageous in that they can exhibit non-trivial macroscopic functions such as adaptability, fault tolerance, and scalability. We previously proposed a simple model of swarm formation inspired by friendship formation process in human society, and demonstrated via…

    Submitted 11 August, 2018; originally announced August 2018.

    Comments: 9 pages, 8 figures

  8. arXiv:1802.06003  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Structured-based Curriculum Learning for End-to-end English-Japanese Speech Translation

    Authors: Takatomo Kano, Sakriani Sakti, Satoshi Nakamura

    Abstract: Sequence-to-sequence attentional-based neural network architectures have been shown to provide a powerful model for machine translation and speech recognition. Recently, several works have attempted to extend the models for the end-to-end speech translation task. However, the usefulness of these models was only investigated on language pairs with similar syntax and word order (e.g., English-French or…

    Submitted 13 February, 2018; originally announced February 2018.

  9. arXiv:1706.00154  [pdf, other]

    cond-mat.soft

    Dynamic oscillatory cluster ordering of self-propelled droplets

    Authors: Shinpei Tanaka, Takeshi Kano

    Abstract: We report here a peculiar dynamically ordered state of clustering droplets of a mixture of organic solvent. These droplets are driven by the solutal Marangoni effect on the surface of aqueous surfactant solution. They form temporal ring clusters which start collapsing immediately after their formation. This process is repeated for more than several hours with the period of 5–20 minutes. We propose…

    Submitted 31 May, 2017; originally announced June 2017.

  10. arXiv:1310.7568  [pdf, ps, other]

    q-bio.QM cs.RO eess.SY

    Interlimb neural connection is not required for gait transition in quadruped locomotion

    Authors: Atsushi Tero, Masakazu Akiyama, Dai Owaki, Takeshi Kano, Akio Ishiguro, Ryo Kobayashi

    Abstract: Quadrupeds transition spontaneously to various gait patterns (e.g., walk, trot, pace, gallop) in response to the locomotion speed. The generation of these gait patterns has been the subject of debate for a long time. We propose a coupled oscillator model that is coupled with the physical interactions of the body. The results of this study showed that the gait pattern transitions spontaneously to w…

    Submitted 28 October, 2013; originally announced October 2013.

    Comments: 6 pages, 2 figures

  11. arXiv:1008.3050  [pdf, ps, other]

    physics.flu-dyn physics.comp-ph

    Energy Spectra of Quantum Turbulence: Large-scale Simulation and Modeling

    Authors: Narimasa Sasa, Takuma Kano, Masahiko Machida, Victor S. L'vov, Oleksii Rudenko, Makoto Tsubota

    Abstract: In $2048^3$ simulation of quantum turbulence within the Gross-Pitaevskii equation we demonstrate that the large scale motions have a classical Kolmogorov-1941 energy spectrum $E(k) \sim k^{-5/3}$, followed by an energy accumulation with $E(k) \sim \mathrm{const}$ at $k$ about the reciprocal mean intervortex distance. This behavior was predicted by the L'vov-Nazarenko-Rudenko bottleneck model of gradual eddy-wave cross…

    Submitted 13 June, 2011; v1 submitted 18 August, 2010; originally announced August 2010.

    Comments: (re)submitted to PRB: 5.5 pages, 4 figures

    Journal ref: Phys. Rev. B 84, 054525 (2011)