Skip to main content

Showing 1–6 of 6 results for author: Tsvilodub, P

.
  1. arXiv:2407.03805  [pdf, other

    cs.CL

    Cognitive Modeling with Scaffolded LLMs: A Case Study of Referential Expression Generation

    Authors: Polina Tsvilodub, Michael Franke, Fausto Carcassi

    Abstract: To what extent can LLMs be used as part of a cognitive model of language generation? In this paper, we approach this question by exploring a neuro-symbolic implementation of an algorithmic cognitive model of referential expression generation by Dale & Reiter (1995). The symbolic task analysis implements the generation as an iterative procedure that scaffolds symbolic and gpt-3.5-turbo-based module… ▽ More

    Submitted 8 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures, 2 algorithms, to appear at the ICML 2024 workshop on Large Language Models and Cognition

  2. arXiv:2406.09012  [pdf, other

    cs.CL

    Bayesian Statistical Modeling with Predictors from LLMs

    Authors: Michael Franke, Polina Tsvilodub, Fausto Carcassi

    Abstract: State of the art large language models (LLMs) have shown impressive performance on a variety of benchmark tasks and are increasingly used as components in larger applications, where LLM-based predictions serve as proxies for human judgements or decision. This raises questions about the human-likeness of LLM-derived information, alignment with human intuition, and whether LLMs could possibly be con… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures, parallel submission to a journal

  3. arXiv:2405.05776  [pdf, other

    cs.CL

    Experimental Pragmatics with Machines: Testing LLM Predictions for the Inferences of Plain and Embedded Disjunctions

    Authors: Polina Tsvilodub, Paul Marty, Sonia Ramotowska, Jacopo Romoli, Michael Franke

    Abstract: Human communication is based on a variety of inferences that we draw from sentences, often going beyond what is literally said. While there is wide agreement on the basic distinction between entailment, implicature, and presupposition, the status of many inferences remains controversial. In this paper, we focus on three inferences of plain and embedded disjunctions, and compare them with regular s… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 8 pages, 3 figures, to appear in the Proceedings of the 46th Annual Conference of the Cognitive Science Society (2024)

  4. arXiv:2403.00998  [pdf, other

    cs.CL

    Predictions from language models for multiple-choice tasks are not robust under variation of scoring methods

    Authors: Polina Tsvilodub, Hening Wang, Sharon Grosch, Michael Franke

    Abstract: This paper systematically compares different methods of deriving item-level predictions of language models for multiple-choice tasks. It compares scoring methods for answer options based on free generation of responses, various probability-based scores, a Likert-scale style rating method, and embedding similarity. In a case study on pragmatic language interpretation, we find that LLM predictions a… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

  5. arXiv:2305.12777  [pdf, other

    cs.CL

    Evaluating Pragmatic Abilities of Image Captioners on A3DS

    Authors: Polina Tsvilodub, Michael Franke

    Abstract: Evaluating grounded neural language model performance with respect to pragmatic qualities like the trade off between truthfulness, contrastivity and overinformativity of generated utterances remains a challenge in absence of data collected from humans. To enable such evaluation, we present a novel open source image-text dataset "Annotated 3D Shapes" (A3DS) comprising over nine million exhaustive n… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 5 pages, 2 figures, to appear in the 61st Proceedings of the Association for Computational Linguistics (ACL 2023)

  6. arXiv:2305.07151  [pdf, other

    cs.CL

    Overinformative Question Answering by Humans and Machines

    Authors: Polina Tsvilodub, Michael Franke, Robert D. Hawkins, Noah D. Goodman

    Abstract: When faced with a polar question, speakers often provide overinformative answers going beyond a simple "yes" or "no". But what principles guide the selection of additional information? In this paper, we provide experimental evidence from two studies suggesting that overinformativeness in human answering is driven by considerations of relevance to the questioner's goals which they flexibly adjust g… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 7 pages, 2 figures, to appear in the Proceedings of the 45th Annual Conference of the Cognitive Science Society (2023)