Skip to main content

Showing 1–6 of 6 results for author: Takashima, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.10316  [pdf, other

    cs.SD eess.AS

    Zero-shot Sound Event Classification Using a Sound Attribute Vector with Global and Local Feature Learning

    Authors: Yi-Han Lin, Xunquan Chen, Ryoichi Takashima, Tetsuya Takiguchi

    Abstract: This paper introduces a zero-shot sound event classification (ZS-SEC) method to identify sound events that have never occurred in training data. In our previous work, we proposed a ZS-SEC method using sound attribute vectors (SAVs), where a deep neural network model infers attribute information that describes the sound of an event class instead of inferring its class label directly. Our previous m… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP2023

  2. arXiv:2211.16034  [pdf, other

    cs.CV eess.IV

    Learn to See Faster: Pushing the Limits of High-Speed Camera with Deep Underexposed Image Denoising

    Authors: Weihao Zhuang, Tristan Hascoet, Ryoichi Takashima, Tetsuya Takiguchi

    Abstract: The ability to record high-fidelity videos at high acquisition rates is central to the study of fast moving phenomena. The difficulty of imaging fast moving scenes lies in a trade-off between motion blur and underexposure noise: On the one hand, recordings with long exposure times suffer from motion blur effects caused by movements in the recorded scene. On the other hand, the amount of light reac… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

  3. arXiv:2206.10886  [pdf, other

    cs.CV cs.LG

    Optical Flow Regularization of Implicit Neural Representations for Video Frame Interpolation

    Authors: Weihao Zhuang, Tristan Hascoet, Ryoichi Takashima, Tetsuya Takiguchi

    Abstract: Recent works have shown the ability of Implicit Neural Representations (INR) to carry meaningful representations of signal derivatives. In this work, we leverage this property to perform Video Frame Interpolation (VFI) by explicitly constraining the derivatives of the INR to satisfy the optical flow constraint equation. We achieve state of the art VFI on limited motion ranges using only a target v… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  4. arXiv:2203.13981  [pdf, ps, other

    eess.SP cs.CV cs.LG q-bio.NC

    Current Source Localization Using Deep Prior with Depth Weighting

    Authors: Rio Yamana, Hajime Yano, Ryoichi Takashima, Tetsuya Takiguchi, Seiji Nakagawa

    Abstract: This paper proposes a novel neuronal current source localization method based on Deep Prior that represents a more complicated prior distribution of current source using convolutional networks. Deep Prior has been suggested as a means of an unsupervised learning approach that does not require learning using training data, and randomly-initialized neural networks are used to update a source locatio… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

  5. arXiv:2010.11780  [pdf, other

    cs.CV

    FasterRCNN Monitoring of Road Damages: Competition and Deployment

    Authors: Hascoet Tristan, Yihao Zhang, Persch Andreas, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki

    Abstract: Maintaining aging infrastructure is a challenge currently faced by local and national administrators all around the world. An important prerequisite for efficient infrastructure maintenance is to continuously monitor (i.e., quantify the level of safety and reliability) the state of very large structures. Meanwhile, computer vision has made impressive strides in recent years, mainly due to successf… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  6. arXiv:1906.10876  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition

    Authors: Naoyuki Kanda, Shota Horiguchi, Ryoichi Takashima, Yusuke Fujita, Kenji Nagamatsu, Shinji Watanabe

    Abstract: In this paper, we propose a novel auxiliary loss function for target-speaker automatic speech recognition (ASR). Our method automatically extracts and transcribes target speaker's utterances from a monaural mixture of multiple speakers speech given a short sample of the target speaker. The proposed auxiliary loss function attempts to additionally maximize interference speaker ASR accuracy during t… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

    Comments: Accepted to INTERSPEECH 2019