Showing 1–23 of 23 results for author: Kuribayashi, T

Searching in archive cs.
  1. arXiv:2406.16078

    cs.CL

    First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning

    Authors: Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui

    Abstract: Multi-step reasoning is widely adopted in the community to explore the better performance of language models (LMs). We report on the systematic strategy that LMs use in this process. Our controlled experiments reveal that LMs rely more heavily on heuristics, such as lexical overlap, in the earlier stages of reasoning when more steps are required to reach an answer. Conversely, as LMs progress clos…

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2406.05967

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song, et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen…

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2404.11315

    cs.CL

    To Drop or Not to Drop? Predicting Argument Ellipsis Judgments: A Case Study in Japanese

    Authors: Yukiko Ishizuki, Tatsuki Kuribayashi, Yuichiroh Matsubayashi, Ryohei Sasano, Kentaro Inui

    Abstract: Speakers sometimes omit certain arguments of a predicate in a sentence; such omission is especially frequent in pro-drop languages. This study addresses a question about ellipsis -- what can explain the native speakers' ellipsis decisions? -- motivated by the interest in human discourse processing and writing assistance for this choice. To this end, we first collect large-scale human annotations o…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 13 pages; accepted by LREC-COLING 2024

  4. arXiv:2402.12363

    cs.CL

    Emergent Word Order Universals from Cognitively-Motivated Language Models

    Authors: Tatsuki Kuribayashi, Ryo Ueda, Ryo Yoshida, Yohei Oseki, Ted Briscoe, Timothy Baldwin

    Abstract: The world's languages exhibit certain so-called typological or implicational universals; for example, Subject-Object-Verb (SOV) languages typically use postpositions. Explaining the source of such biases is a key goal of linguistics. We study word-order universals through a computational simulation with language models (LMs). Our experiments show that typologically-typical word orders tend to have…

    Submitted 7 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 main conference, 22 pages

  5. arXiv:2311.07484

    cs.CL cs.AI

    Psychometric Predictive Power of Large Language Models

    Authors: Tatsuki Kuribayashi, Yohei Oseki, Timothy Baldwin

    Abstract: Instruction tuning aligns the response of large language models (LLMs) with human preferences. Despite such efforts in human--LLM alignment, we find that instruction tuning does not always make LLMs human-like from a cognitive modeling perspective. More specifically, next-word probabilities estimated by instruction-tuned LLMs are often worse at simulating human reading behavior than those estimate…

    Submitted 15 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 23 pages; Findings of NAACL 2024

  6. arXiv:2310.14868

    cs.CL

    Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism

    Authors: Mengyu Ye, Tatsuki Kuribayashi, Jun Suzuki, Goro Kobayashi, Hiroaki Funayama

    Abstract: Large language models (LLMs) take advantage of step-by-step reasoning instructions, e.g., chain-of-thought (CoT) prompting. Building on this, their ability to perform CoT-style reasoning robustly is of interest from a probing perspective. In this study, we inspect the step-by-step reasoning ability of LLMs with a focus on negation, which is a core linguistic phenomenon that is difficult to process…

    Submitted 23 October, 2023; originally announced October 2023.

  7. arXiv:2306.02920

    cs.CL

    Second Language Acquisition of Neural Language Models

    Authors: Miyu Oba, Tatsuki Kuribayashi, Hiroki Ouchi, Taro Watanabe

    Abstract: With the success of neural language models (LMs), their language acquisition has gained much attention. This work sheds light on the second language (L2) acquisition of LMs, while previous work has typically explored their first language (L1) acquisition. Specifically, we trained bilingual LMs with a scenario similar to human L2 acquisition and analyzed their cross-lingual transfer from linguistic…

    Submitted 5 June, 2023; originally announced June 2023.

  8. arXiv:2305.18294

    cs.CL

    Transformer Language Models Handle Word Frequency in Prediction Head

    Authors: Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi, Kentaro Inui

    Abstract: Prediction head is a crucial component of Transformer language models. Despite its direct impact on prediction, this component has often been overlooked in analyzing Transformers. In this study, we investigate the inner workings of the prediction head, specifically focusing on bias parameters. Our experiments with BERT and GPT-2 models reveal that the biases in their word prediction heads play a s…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 11 pages, 12 figures, accepted to ACL 2023 Findings (short paper)

  9. arXiv:2302.08148

    cs.AI cs.CL

    Empirical Investigation of Neural Symbolic Reasoning Strategies

    Authors: Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Ana Brassard, Masashi Yoshikawa, Keisuke Sakaguchi, Kentaro Inui

    Abstract: Neural reasoning accuracy improves when generating intermediate reasoning steps. However, the source of this improvement is yet unclear. Here, we investigate and factorize the benefit of generating intermediate steps for symbolic reasoning. Specifically, we decompose the reasoning strategy w.r.t. step granularity and chaining strategy. With a purely symbolic numerical reasoning dataset (e.g., A=1,…

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: This paper is accepted as the findings at EACL 2023, and the earlier version (non-archival) of this work got the Best Paper Award in the Student Research Workshop of AACL 2022

  10. arXiv:2302.07866

    cs.CL cs.AI

    Do Deep Neural Networks Capture Compositionality in Arithmetic Reasoning?

    Authors: Keito Kudo, Yoichi Aoki, Tatsuki Kuribayashi, Ana Brassard, Masashi Yoshikawa, Keisuke Sakaguchi, Kentaro Inui

    Abstract: Compositionality is a pivotal property of symbolic reasoning. However, how well recent neural models capture compositionality remains underexplored in the symbolic reasoning tasks. This study empirically addresses this question by systematically examining recently published pre-trained seq2seq models with a carefully controlled dataset of multi-hop arithmetic symbolic reasoning. We introduce a ski…

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: accepted by EACL 2023

  11. arXiv:2302.00667

    cs.CL

    Does Vision Accelerate Hierarchical Generalization of Neural Language Learners?

    Authors: Tatsuki Kuribayashi

    Abstract: Neural language models (LMs) are arguably less data-efficient than humans -- why does this gap occur? In this study, we hypothesize that this gap stems from the learners' accessibility to modalities other than text, specifically, vision. We conducted two complementary experiments (using noisy, realistic data and a simplified, artificial one) toward the advantage of vision in the syntactic generali…

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: Work in progress

  12. arXiv:2302.00456

    cs.CL

    Analyzing Feed-Forward Blocks in Transformers through the Lens of Attention Maps

    Authors: Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi, Kentaro Inui

    Abstract: Transformers are ubiquitous in wide tasks. Interpreting their internals is a pivotal goal. Nevertheless, their particular components, feed-forward (FF) blocks, have typically been less analyzed despite their substantial parameter amounts. We analyze the input contextualization effects of FF blocks by rendering them in the attention maps as a human-friendly visualization scheme. Our experiments wit…

    Submitted 15 April, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: ICLR 2024 Spotlight; 37 pages, 32 figures, 3 tables

  13. arXiv:2205.11463

    cs.CL

    Context Limitations Make Neural Language Models More Human-Like

    Authors: Tatsuki Kuribayashi, Yohei Oseki, Ana Brassard, Kentaro Inui

    Abstract: Language models (LMs) have been used in cognitive modeling as well as engineering studies -- they compute information-theoretic complexity metrics that simulate humans' cognitive load during reading. This study highlights a limitation of modern neural LMs as the model of choice for this purpose: there is a discrepancy between their context access capacities and that of humans. Our results showed t…

    Submitted 1 November, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted by EMNLP2022 (main long)

  14. arXiv:2109.13497

    cs.CL cs.LG

    Instance-Based Neural Dependency Parsing

    Authors: Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Masashi Yoshikawa, Kentaro Inui

    Abstract: Interpretable rationales for model predictions are crucial in practical applications. We develop neural models that possess an interpretable inference process for dependency parsing. Our models adopt instance-based inference, where dependency edges are extracted and labeled by comparing them to edges in a training set. The training edges are explicitly used for the predictions; thus, it is easy to…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 15 pages, accepted to TACL 2021

  15. arXiv:2109.07152

    cs.CL

    Incorporating Residual and Normalization Layers into Analysis of Masked Language Models

    Authors: Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi, Kentaro Inui

    Abstract: Transformer architecture has become ubiquitous in the natural language processing field. To interpret the Transformer-based models, their attention patterns have been extensively analyzed. However, the Transformer architecture is not only composed of the multi-head attention; other components can also contribute to Transformers' progressive performance. In this study, we extended the scope of the…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: 22 pages, accepted to EMNLP 2021 main conference

  16. arXiv:2106.01229

    cs.CL

    Lower Perplexity is Not Always Human-Like

    Authors: Tatsuki Kuribayashi, Yohei Oseki, Takumi Ito, Ryo Yoshida, Masayuki Asahara, Kentaro Inui

    Abstract: In computational psycholinguistics, various language models have been evaluated against human reading behavior (e.g., eye movement) to build human-like computational models. However, most previous efforts have focused almost exclusively on English, despite the recent trend towards linguistic universal within the general community. In order to fill the gap, this paper investigates whether the estab…

    Submitted 1 November, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2021

  17. arXiv:2011.01785

    cs.CL

    Modeling Event Salience in Narratives via Barthes' Cardinal Functions

    Authors: Takaki Otake, Sho Yokoi, Naoya Inoue, Ryo Takahashi, Tatsuki Kuribayashi, Kentaro Inui

    Abstract: Events in a narrative differ in salience: some are more important to the story than others. Estimating event salience is useful for tasks such as story generation, and as a tool for text analysis in narratology and folkloristics. To compute event salience without any annotations, we adopt Barthes' definition of event salience and propose several unsupervised methods that require only a pre-trained…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: accepted to COLING 2020

  18. arXiv:2010.04332

    cs.CL

    Langsmith: An Interactive Academic Text Revision System

    Authors: Takumi Ito, Tatsuki Kuribayashi, Masatoshi Hidaka, Jun Suzuki, Kentaro Inui

    Abstract: Despite the current diversity and inclusion initiatives in the academic community, researchers with a non-native command of English still face significant obstacles when writing papers in English. This paper presents the Langsmith editor, which assists inexperienced, non-native researchers to write English papers, especially in the natural language processing (NLP) field. Our system can suggest fl…

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: Accepted at EMNLP 2020 (system demonstrations)

  19. arXiv:2005.00842

    cs.CL

    Language Models as an Alternative Evaluator of Word Order Hypotheses: A Case Study in Japanese

    Authors: Tatsuki Kuribayashi, Takumi Ito, Jun Suzuki, Kentaro Inui

    Abstract: We examine a methodology using neural language models (LMs) for analyzing the word order of language. This LM-based method has the potential to overcome the difficulties existing methods face, such as the propagation of preprocessor errors in count-based methods. In this study, we explore whether the LM-based method is valid for analyzing the word order. As a case study, this study focuses on Japa…

    Submitted 2 May, 2020; originally announced May 2020.

    Comments: Accepted by ACL2020

  20. arXiv:2004.14514

    cs.CL cs.LG

    Instance-Based Learning of Span Representations: A Case Study through Named Entity Recognition

    Authors: Hiroki Ouchi, Jun Suzuki, Sosuke Kobayashi, Sho Yokoi, Tatsuki Kuribayashi, Ryuto Konno, Kentaro Inui

    Abstract: Interpretable rationales for model predictions play a critical role in practical applications. In this study, we develop models possessing interpretable inference process for structured prediction. Specifically, we present a method of instance-based learning that learns similarities between spans. At inference time, each span is assigned a class label based on its similar spans in the training set…

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: Accepted by ACL2020

  21. arXiv:2004.10102

    cs.CL

    Attention is Not Only a Weight: Analyzing Transformers with Vector Norms

    Authors: Goro Kobayashi, Tatsuki Kuribayashi, Sho Yokoi, Kentaro Inui

    Abstract: Attention is a key component of Transformers, which have recently achieved considerable success in natural language processing. Hence, attention is being extensively studied to investigate various linguistic capabilities of Transformers, focusing on analyzing the parallels between attention weights and specific linguistic phenomena. This paper shows that attention weights alone are only one of the…

    Submitted 6 October, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: 19 pages, accepted by EMNLP 2020

  22. arXiv:1910.09180

    cs.CL

    Diamonds in the Rough: Generating Fluent Sentences from Early-Stage Drafts for Academic Writing Assistance

    Authors: Takumi Ito, Tatsuki Kuribayashi, Hayato Kobayashi, Ana Brassard, Masato Hagiwara, Jun Suzuki, Kentaro Inui

    Abstract: The writing process consists of several stages such as drafting, revising, editing, and proofreading. Studies on writing assistance, such as grammatical error correction (GEC), have mainly focused on sentence editing and proofreading, where surface-level issues such as typographical, spelling, or grammatical errors should be corrected. We broaden this focus to include the earlier revising stage, w…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Proceedings of the 12th International Conference on Natural Language Generation (INLG 2019)

  23. arXiv:1909.02621

    cs.CL

    TEASPN: Framework and Protocol for Integrated Writing Assistance Environments

    Authors: Masato Hagiwara, Takumi Ito, Tatsuki Kuribayashi, Jun Suzuki, Kentaro Inui

    Abstract: Language technologies play a key role in assisting people with their writing. Although there has been steady progress in e.g., grammatical error correction (GEC), human writers are yet to benefit from this progress due to the high development cost of integrating with writing software. We propose TEASPN, a protocol and an open-source framework for achieving integrated writing assistance environment…

    Submitted 5 September, 2019; originally announced September 2019.

    Comments: Accepted at EMNLP 2019 (system demonstrations)