Showing 1–6 of 6 results for author: Tokuhisa, R

  1. arXiv:2402.13765

    cs.LG stat.ML

    Accuracy-Preserving Calibration via Statistical Modeling on Probability Simplex

    Authors: Yasushi Esaki, Akihiro Nakamura, Keisuke Kawano, Ryoko Tokuhisa, Takuro Kutsuna

    Abstract: Classification models based on deep neural networks (DNNs) must be calibrated to measure the reliability of predictions. Some recent calibration methods have employed a probabilistic model on the probability simplex. However, these calibration methods cannot preserve the accuracy of pre-trained models, even those with a high classification accuracy. We propose an accuracy-preserving calibration me…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: accepted at the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

    Journal ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, PMLR 238:1666-1674, 2024

  2. Chat Translation Error Detection for Assisting Cross-lingual Communications

    Authors: Yunmeng Li, Jun Suzuki, Makoto Morishita, Kaori Abe, Ryoko Tokuhisa, Ana Brassard, Kentaro Inui

    Abstract: In this paper, we describe the development of a communication support system that detects erroneous translations to facilitate crosslingual communications due to the limitations of current machine chat translation methods. We trained an error detector as the baseline of the system and constructed a new Japanese-English bilingual chat corpus, BPersona-chat, which comprises multiturn colloquial chat…

    Submitted 2 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems, pages 88-95, November 2022, Online. Association for Computational Linguistics

  3. StyleDiff: Attribute Comparison Between Unlabeled Datasets in Latent Disentangled Space

    Authors: Keisuke Kawano, Takuro Kutsuna, Ryoko Tokuhisa, Akihiro Nakamura, Yasushi Esaki

    Abstract: One major challenge in machine learning applications is coping with mismatches between the datasets used in the development and those obtained in real-world applications. These mismatches may lead to inaccurate predictions and errors, resulting in poor product quality and unreliable systems. In this study, we propose StyleDiff to inform developers of the differences between the two datasets for th…

    Submitted 31 August, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: 25 pages, 17 figures, Image and Vision Computing

  4. arXiv:2211.10596

    cs.CL

    Bipartite-play Dialogue Collection for Practical Automatic Evaluation of Dialogue Systems

    Authors: Shiki Sato, Yosuke Kishinami, Hiroaki Sugiyama, Reina Akama, Ryoko Tokuhisa, Jun Suzuki

    Abstract: Automation of dialogue system evaluation is a driving force for the efficient development of dialogue systems. This paper introduces the bipartite-play method, a dialogue collection method for automating dialogue system evaluation. It addresses the limitations of existing dialogue collection methods: (i) inability to compare with systems that are not publicly available, and (ii) vulnerability to c…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 9 pages, Accepted to The AACL-IJCNLP 2022 Student Research Workshop (SRW)

  5. arXiv:2209.09746

    cs.CL

    Target-Guided Open-Domain Conversation Planning

    Authors: Yosuke Kishinami, Reina Akama, Shiki Sato, Ryoko Tokuhisa, Jun Suzuki, Kentaro Inui

    Abstract: Prior studies addressing target-oriented conversational tasks lack a crucial notion that has been intensively studied in the context of goal-oriented artificial intelligence agents, namely, planning. In this study, we propose the Target-Guided Open-Domain Conversation Planning (TGCP) task to evaluate whether neural conversational agents have goal-oriented conversation planning abilities. U…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 9 pages, Accepted to The 29th International Conference on Computational Linguistics (COLING 2022)

  6. arXiv:2208.02578

    cs.CL

    N-best Response-based Analysis of Contradiction-awareness in Neural Response Generation Models

    Authors: Shiki Sato, Reina Akama, Hiroki Ouchi, Ryoko Tokuhisa, Jun Suzuki, Kentaro Inui

    Abstract: Avoiding the generation of responses that contradict the preceding context is a significant challenge in dialogue response generation. One feasible method is post-processing, such as filtering out contradicting responses from a resulting n-best response list. In this scenario, the quality of the n-best list considerably affects the occurrence of contradictions because the final response is chosen…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 8 pages, Accepted to The 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2022)