Showing 1–1 of 1 results for author: Jordehi, A Y

Search v0.5.6 released 2020-02-24

arXiv:2403.02451 [pdf, other]

cs.CL

Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground

Authors: Adil Soubki, John Murzaku, Arash Yousefi Jordehi, Peter Zeng, Magdalena Markowska, Seyed Abolghasem Mirroshandel, Owen Rambow

Abstract: Evaluating the theory of mind (ToM) capabilities of language models (LMs) has recently received a great deal of attention. However, many existing benchmarks rely on synthetic data, which risks misaligning the resulting experiments with human behavior. We introduce the first ToM dataset based on naturally occurring spoken dialogs, Common-ToM, and show that LMs struggle to demonstrate ToM. We then s… ▽ More Evaluating the theory of mind (ToM) capabilities of language models (LMs) has recently received a great deal of attention. However, many existing benchmarks rely on synthetic data, which risks misaligning the resulting experiments with human behavior. We introduce the first ToM dataset based on naturally occurring spoken dialogs, Common-ToM, and show that LMs struggle to demonstrate ToM. We then show that integrating a simple, explicit representation of beliefs improves LM performance on Common-ToM. △ Less

Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Journal ref: ACL 2024 Findings

Search v0.5.6 released 2020-02-24