Showing 1–10 of 10 results for author: Akaogi, M

  1. Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM

    Authors: Bagus Tris Atmaja, Masato Akagi

    Abstract: Automatic speech emotion recognition (SER) by a computer is a critical component for more natural human-machine interaction. As in human-human interaction, the capability to perceive emotion correctly is essential to take further steps in a particular situation. One issue in SER is whether it is necessary to combine acoustic features with other data such as facial expressions, text, and motion cap…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Published in Speech Communications

    Journal ref: Speech Commun., vol. 126, pp. 9-21, Feb. 2021

  2. arXiv:2206.13021  [pdf, other]

    cs.SD eess.AS

    Speak Like a Professional: Increasing Speech Intelligibility by Mimicking Professional Announcer Voice with Voice Conversion

    Authors: Tuan Vu Ho, Maori Kobayashi, Masato Akagi

    Abstract: In most of practical scenarios, the announcement system must deliver speech messages in a noisy environment, in which the background noise cannot be cancelled out. The local noise reduces speech intelligibility and increases listening effort of the listener, hence hamper the effectiveness of announcement system. There has been reported that voices of professional announcers are clearer and more co…

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Accepted at INTERSPEECH 2022

  3. arXiv:2004.02355  [pdf, other]

    eess.AS

    Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition

    Authors: Bagus Tris Atmaja, Masato Akagi

    Abstract: Modern deep learning architectures are ordinarily performed on high-performance computing facilities due to the large size of the input features and complexity of its model. This paper proposes traditional multilayer perceptrons (MLP) with deep layers and small input size to tackle that computation requirement limitation. The result shows that our proposed deep MLP outperformed modern deep learnin…

    Submitted 5 April, 2020; originally announced April 2020.

    Comments: 2 figures, 4 tables, submitted to EUSIPCO 2020

    Journal ref: Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020

  4. On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers

    Authors: Bagus Tris Atmaja, Masato Akagi

    Abstract: In this paper, we evaluate the different features sets, feature types, and classifiers on both song and speech emotion recognition. Three feature sets: GeMAPS, pyAudioAnalysis, and LibROSA; two feature types: low-level descriptors and high-level statistical functions; and four classifiers: multilayer perceptron, LSTM, GRU, and convolution neural networks are examined on both song and speech data w…

    Submitted 31 March, 2020; originally announced April 2020.

    Comments: 2 Figures, 2 Tables

    Journal ref: 2020 IEEE REGION 10 CONFERENCE (TENCON), 968-972

  5. Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition

    Authors: Bagus Tris Atmaja, Masato Akagi

    Abstract: The choice of a loss function is a critical part of machine learning. This paper evaluated two different loss functions commonly used in regression-task dimensional speech emotion recognition, an error-based and a correlation-based loss functions. We found that using a correlation-based loss function with a concordance correlation coefficient (CCC) loss resulted in better performance than an error…

    Submitted 18 November, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: 3 figures, 3 tables, submitted to ANV 2020

  6. The Effect of Silence Feature in Dimensional Speech Emotion Recognition

    Authors: Bagus Tris Atmaja, Masato Akagi

    Abstract: Silence is a part of human-to-human communication, which can be a clue for human emotion perception. For automatic emotion recognition by a computer, it is not clear whether silence is useful to determine human emotion within a speech. This paper presents an investigation of the effect of using silence feature in dimensional emotion recognition. Since the silence feature is extracted per utterance…

    Submitted 21 April, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 4 figures, 2 tables, accepted at speech prosody 2020

    Journal ref: 10th International Conference on Speech Prosody 2020, 26-30

  7. Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition

    Authors: Bagus Tris Atmaja, Masato Akagi

    Abstract: Due to its ability to accurately predict emotional state using multimodal features, audiovisual emotion recognition has recently gained more interest from researchers. This paper proposes two methods to predict emotional attributes from audio and visual data using a multitask learning and a fusion strategy. First, multitask learning is employed by adjusting three parameters for each attribute to i…

    Submitted 9 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: 3 figures, 3 tables, accepted at ICASSP 2020

  8. arXiv:1509.01849  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    A ferroelectric-like structural transition in a metal

    Authors: Youguo Shi, Yanfeng Guo, Xia Wang, Andrew J. Princep, Dmitry Khalyavin, Pascal Manuel, Yuichi Michiue, Akira Sato, Kenji Tsuda, Shan Yu, Masao Arai, Yuichi Shirako, Masaki Akaogi, Nanlin Wang, Kazunari Yamaura, Andrew T. Boothroyd

    Abstract: Metals cannot exhibit ferroelectricity because static internal electric fields are screened by conduction electrons, but in 1965, Anderson and Blount predicted the possibility of a ferroelectric metal, in which a ferroelectric-like structural transition occurs in the metallic state. Up to now, no clear example of such a material has been identified. Here we report on a centrosymmetric (R-3c) to no…

    Submitted 6 September, 2015; originally announced September 2015.

    Comments: Manuscript of published version, including Supplementary Information. See also News & Views article by V. Keppens, Nature Materials 12, 952 (2013)

    Journal ref: Nature Materials 12, 1024 (2013)

  9. arXiv:1206.0811  [pdf, ps, other]

    cond-mat.supr-con

    Superconductivity suppression of Ba0.5K0.5Fe2-2xM2xAs2 single crystals by substitution of transition-metal (M = Mn, Ru, Co, Ni, Cu, and Zn)

    Authors: Jun Li, Yanfeng Guo, Shoubao Zhang, Jie Yuan, Yoshihiro Tsujimoto, Xia Wang, C. I. Sathish, Ying Sun, Shan Yu, Wei Yi, Kazunari Yamaura, Eiji Takayama-Muromachi, Yuichi Shirako, Masaki Akaogi, Hiroshi Kontani

    Abstract: We investigated the doping effects of magnetic and nonmagnetic impurities on the single-crystalline p-type Ba0.5K0.5Fe2-2xM2xAs2 (M = Mn, Ru, Co, Ni, Cu and Zn) superconductors. The superconductivity indicates robustly against impurity of Ru, while weakly against the impurities of Mn, Co, Ni, Cu, and Zn. However, the present Tc suppression rate of both magnetic and nonmagnetic impurities remains m…

    Submitted 4 June, 2012; originally announced June 2012.

    Comments: 8 pages, 9 figures, to be published in Phys. Rev. B

  10. arXiv:1104.1461  [pdf]

    cond-mat.str-el cond-mat.supr-con

    Integer spin-chain antiferromagnetism of the 4d oxide CaRuO3 with post-perovskite structure

    Authors: Y. Shirako, H. Satsukawa, X. X. Wang, J. J. Li, Y. F. Guo, M. Arai, K. Yamaura, M. Yoshida, H. Kojitani, T. Katsumata, Y. Inaguma, K. Hiraki, T. Takahashi, M. Akaogi

    Abstract: A quasi-one dimensional magnetism was discovered in the post-perovskite CaRuO3 (Ru4+: 4d4, Cmcm), which is iso-compositional with the perovskite CaRuO3 (Pbnm). An antiferromagnetic spin-chain function with -J/kB = 350 K well reproduces the experimental curve of the magnetic susceptibility vs. temperature, suggesting long-range antiferromagnetic correlations. The anisotropic magnetism is probably o…

    Submitted 7 April, 2011; originally announced April 2011.

    Comments: Accepted for publication in Phys. Rev. B