Showing 1–40 of 40 results for author: Schlangen, D

  1. arXiv:2406.18403  [pdf, other]

    cs.CL

    LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

    Authors: Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, Desmond Elliott, Raquel Fernández, Albert Gatt, Esam Ghaleb, Mario Giulianelli, Michael Hanna, Alexander Koller, André F. T. Martins, Philipp Mondorf, Vera Neplenbroek, Sandro Pezzelle, Barbara Plank, David Schlangen, Alessandro Suglia, Aditya K Surikuchi, Ece Takmaz, Alberto Testoni

    Abstract: There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human anno…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.17553  [pdf, other]

    cs.CL

    Retrieval-Augmented Code Generation for Situated Action Generation: A Case Study on Minecraft

    Authors: Chalamalasetti Kranti, Sherzod Hakimov, David Schlangen

    Abstract: In the Minecraft Collaborative Building Task, two players collaborate: an Architect (A) provides instructions to a Builder (B) to assemble a specified structure using 3D blocks. In this work, we investigate the use of large language models (LLMs) to predict the sequence of actions taken by the Builder. Leveraging LLMs' in-context learning abilities, we use few-shot prompting techniques, that signi…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: under review

  3. arXiv:2406.14051  [pdf, other]

    cs.CL cs.AI

    How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics

    Authors: Nidhir Bhavsar, Jonathan Jordan, Sherzod Hakimov, David Schlangen

    Abstract: What makes a good Large Language Model (LLM)? That it performs well on the relevant benchmarks -- which hopefully measure, with some validity, the presence of capabilities that are also challenged in real application. But what makes the model perform well? What gives a model its abilities? We take a recently introduced type of benchmark that is meant to challenge capabilities in a goal-directed, a…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: under review

  4. arXiv:2406.14035  [pdf, other]

    cs.CL cs.AI

    Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models

    Authors: Sherzod Hakimov, Yerkezhan Abdullayeva, Kushal Koshti, Antonia Schmidt, Yan Weiser, Anne Beyer, David Schlangen

    Abstract: While the situation has improved for text-only models, it again seems to be the case currently that multimodal (text and image) models develop faster than ways to evaluate them. In this paper, we bring a recently developed evaluation paradigm from text models to multimodal models, namely evaluation through the goal-oriented game (self) play, complementing reference-based and preference-based evalu…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: under review

  5. arXiv:2406.08202  [pdf, other]

    cs.CL

    A Dialogue Game for Eliciting Balanced Collaboration

    Authors: Isidora Jeknić, David Schlangen, Alexander Koller

    Abstract: Collaboration is an integral part of human dialogue. Typical task-oriented dialogue games assign asymmetric roles to the participants, which limits their ability to elicit naturalistic role-taking in collaboration and its negotiation. We present a novel and simple online setup that favors balanced collaboration: a two-player 2D object placement game in which the players must negotiate the goal sta…

    Submitted 11 July, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2405.20859  [pdf, other]

    cs.CL cs.AI

    clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents

    Authors: Anne Beyer, Kranti Chalamalasetti, Sherzod Hakimov, Brielen Madureira, Philipp Sadler, David Schlangen

    Abstract: It has been established in recent work that Large Language Models (LLMs) can be prompted to "self-play" conversational games that probe certain capabilities (general instruction following, strategic goal orientation, language understanding abilities), where the resulting interactive game play can be automatically scored. In this paper, we take one of the proposed frameworks for setting up such gam…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: under review

  7. arXiv:2405.01139  [pdf, other]

    cs.CL

    It Couldn't Help But Overhear: On the Limits of Modelling Meta-Communicative Grounding Acts with Supervised Learning

    Authors: Brielen Madureira, David Schlangen

    Abstract: Active participation in a conversation is key to building common ground, since understanding is jointly tailored by producers and recipients. Overhearers are deprived of the privilege of performing grounding acts and can only conjecture about intended meanings. Still, data generation and annotation, modelling, training and evaluation of NLP dialogue models place reliance on the overhearing paradig…

    Submitted 13 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: work in progress

  8. arXiv:2403.17497  [pdf, other]

    cs.CL cs.CV

    Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies

    Authors: Philipp Sadler, Sherzod Hakimov, David Schlangen

    Abstract: In collaborative goal-oriented settings, the participants are not only interested in achieving a successful outcome, but do also implicitly negotiate the effort they put into the interaction (by adapting to each other). In this work, we propose a challenging interactive reference game that requires two players to coordinate on vision and language observations. The learning signal in this game is a…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 9 pages, Accepted at LREC-COLING 2024

  9. arXiv:2402.13113  [pdf, other]

    cs.CL

    When Only Time Will Tell: Interpreting How Transformers Process Local Ambiguities Through the Lens of Restart-Incrementality

    Authors: Brielen Madureira, Patrick Kahardipraja, David Schlangen

    Abstract: Incremental models that process sentences one token at a time will sometimes encounter points where more than one interpretation is possible. Causal models are forced to output one interpretation and continue, whereas models that can revise may edit their previous output as the ambiguity is resolved. In this work, we look at how restart-incremental Transformers build and update internal states, in…

    Submitted 2 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL 2024

  10. arXiv:2402.04824  [pdf, other]

    cs.CL

    Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game

    Authors: Philipp Sadler, Sherzod Hakimov, David Schlangen

    Abstract: Albrecht and Stone (2018) state that modeling of changing behaviors remains an open problem "due to the essentially unconstrained nature of what other agents may do". In this work we evaluate the adaptability of neural artificial agents towards assumed partner behaviors in a collaborative reference game. In this game success is achieved when a knowledgeable Guide can verbally lead a Follower to th…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Work presented at the "Cooperative Multi-Agent Systems Decision-making and Learning" workshop (AAAI'24)

  11. arXiv:2401.17039  [pdf, other]

    cs.CL

    Taking Action Towards Graceful Interaction: The Effects of Performing Actions on Modelling Policies for Instruction Clarification Requests

    Authors: Brielen Madureira, David Schlangen

    Abstract: Clarification requests are a mechanism to help solve communication problems, e.g. due to ambiguity or underspecification, in instruction-following interactions. Despite their importance, even skilful models struggle with producing or interpreting such repair acts. In this work, we test three hypotheses concerning the effects of action taking as an auxiliary task in modelling iCR policies. Contrary…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to UnImplicit workshop at EACL 2024

  12. arXiv:2310.18229  [pdf, other]

    cs.CL

    Revising with a Backward Glance: Regressions and Skips during Reading as Cognitive Signals for Revision Policies in Incremental Processing

    Authors: Brielen Madureira, Pelin Çelikkol, David Schlangen

    Abstract: In NLP, incremental processors produce output in instalments, based on incoming prefixes of the linguistic input. Some tokens trigger revisions, causing edits to the output hypothesis, but little is known about why models revise when they revise. A policy that detects the time steps where revisions should happen can improve efficiency. Still, retrieving a suitable signal to train a revision policy…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to CoNLL 2023

  13. arXiv:2310.18038  [pdf, other]

    cs.CL cs.CY

    On General Language Understanding

    Authors: David Schlangen

    Abstract: Natural Language Processing prides itself to be an empirically-minded, if not outright empiricist field, and yet lately it seems to get itself into essentialist debates on issues of meaning and measurement ("Do Large Language Models Understand Language, And If So, How Much?"). This is not by accident: Here, as everywhere, the evidence underspecifies the understanding. As a remedy, this paper sketc…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  14. arXiv:2308.06095  [pdf, other]

    cs.CL cs.AI cs.LG

    Neural Conversation Models and How to Rein Them in: A Survey of Failures and Fixes

    Authors: Fabian Galetzka, Anne Beyer, David Schlangen

    Abstract: Recent conditional language models are able to continue any kind of text source in an often seemingly fluent way. This fact encouraged research in the area of open-domain conversational systems that are based on powerful language models and aim to imitate an interlocutor by generating appropriate contributions to a written dialogue. From a linguistic perspective, however, the complexity of contrib…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Represents the state of the field in 2022; partially based on the first author's 2022 PhD thesis

  15. arXiv:2307.15508  [pdf, other]

    cs.CL

    The Road to Quality is Paved with Good Revisions: A Detailed Evaluation Methodology for Revision Policies in Incremental Sequence Labelling

    Authors: Brielen Madureira, Patrick Kahardipraja, David Schlangen

    Abstract: Incremental dialogue model components produce a sequence of output prefixes based on incoming input. Mistakes can occur due to local ambiguities or to wrong hypotheses, making the ability to revise past outputs a desirable property that can be governed by a policy. In this work, we formalise and characterise edits and revisions in incremental sequence labelling and propose metrics to evaluate revi…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted at SIGdial 2023

  16. arXiv:2306.02377  [pdf, other]

    cs.CL

    "Are you telling me to put glasses on the dog?" Content-Grounded Annotation of Instruction Clarification Requests in the CoDraw Dataset

    Authors: Brielen Madureira, David Schlangen

    Abstract: Instruction Clarification Requests are a mechanism to solve communication problems, which is very functional in instruction-following interactions. Recent work has argued that the CoDraw dataset is a valuable source of naturally occurring iCRs. Beyond identifying when iCRs should be made, dialogue models should also be able to generate them with suitable form and content. In this work, we introduc…

    Submitted 26 July, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: A 2-page version will appear at SemDial 2023 as a poster

  17. arXiv:2305.15087  [pdf, other]

    cs.CL cs.CV

    Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples

    Authors: Philipp Sadler, David Schlangen

    Abstract: NLP tasks are typically defined extensionally through datasets containing example instantiations (e.g., pairs of image i and text t), but motivated intensionally through capabilities invoked in verbal descriptions of the task (e.g., "t is a description of i, for which the content of i needs to be recognised and understood"). We present Pento-DIARef, a diagnostic dataset in a visual domain of puzzl…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 9 pages, Accepted to EACL 2023

  18. arXiv:2305.13782  [pdf, other]

    cs.CL

    Images in Language Space: Exploring the Suitability of Large Language Models for Vision & Language Tasks

    Authors: Sherzod Hakimov, David Schlangen

    Abstract: Large language models have demonstrated robust performance on various language tasks using zero-shot or few-shot learning paradigms. While being actively researched, multimodal models that can additionally handle images as input have yet to catch up in size and generality with language-only models. In this work, we ask whether language-only models can be utilised for tasks that require visual inpu…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 Findings

  19. arXiv:2305.13455  [pdf, other]

    cs.CL

    Clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents

    Authors: Kranti Chalamalasetti, Jana Götze, Sherzod Hakimov, Brielen Madureira, Philipp Sadler, David Schlangen

    Abstract: Recent work has proposed a methodology for the systematic evaluation of "Situated Language Understanding Agents" -- agents that operate in rich linguistic and non-linguistic contexts -- through testing them in carefully constructed interactive settings. Other recent work has argued that Large Language Models (LLMs), if suitably set up, can be understood as (simulators of) such agents. A connection sugge…

    Submitted 23 November, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  20. arXiv:2305.12880  [pdf, other]

    cs.CV cs.CL

    Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers

    Authors: Philipp Sadler, Sherzod Hakimov, David Schlangen

    Abstract: The ability to pick up on language signals in an ongoing interaction is crucial for future machine learning models to collaborate and interact with humans naturally. In this paper, we present an initial study that evaluates intra-episodic feedback given in a collaborative setting. We use a referential language game as a controllable example of a task-oriented collaborative joint activity. A teache…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 5 pages, Accepted at Findings of ACL 2023

  21. arXiv:2305.10845  [pdf, other]

    cs.CL

    TAPIR: Learning Adaptive Revision for Incremental Natural Language Understanding with a Two-Pass Model

    Authors: Patrick Kahardipraja, Brielen Madureira, David Schlangen

    Abstract: Language is by its very nature incremental in how it is produced and processed. This property can be exploited by NLP systems to produce fast responses, which has been shown to be beneficial for real-time interactive applications. Recent neural network-based approaches for incremental processing mainly use RNNs or Transformers. RNNs are fast but monotonic (cannot correct earlier output, which can…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 (Findings)

  22. arXiv:2304.07007  [pdf, other]

    cs.CL

    Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy

    Authors: David Schlangen

    Abstract: How does one measure "ability to understand language"? If it is a person's ability that is being measured, this is a question that almost never poses itself in an unqualified manner: Whatever formal test is applied, it takes place on the background of the person's language use in daily social practice, and what is measured is a specialised variety of language understanding (e.g., of a second langu…

    Submitted 14 April, 2023; originally announced April 2023.

  23. arXiv:2302.14406  [pdf, other]

    cs.CL

    Instruction Clarification Requests in Multimodal Collaborative Dialogue Games: Tasks, and an Analysis of the CoDraw Dataset

    Authors: Brielen Madureira, David Schlangen

    Abstract: In visual instruction-following dialogue games, players can engage in repair mechanisms in face of an ambiguous or underspecified instruction that cannot be fully mapped to actions in the world. In this work, we annotate Instruction Clarification Requests (iCRs) in CoDraw, an existing dataset of interactions in a multimodal collaborative dialogue game. We show that it contains lexically and semant…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2023

  24. arXiv:2302.08590  [pdf, other]

    cs.CL

    What A Situated Language-Using Agent Must be Able to Do: A Top-Down Analysis

    Authors: David Schlangen

    Abstract: Even in our increasingly text-intensive times, the primary site of language use is situated, co-present interaction. It is primary ontogenetically and phylogenetically, and it is arguably also still primary in negotiating everyday social situations. Situated interaction is also the final frontier of Natural Language Processing, where, compared to the area of text processing, very little progress h…

    Submitted 16 February, 2023; originally announced February 2023.

  25. arXiv:2206.02885  [pdf, other]

    cs.CL cs.CY

    Norm Participation Grounds Language

    Authors: David Schlangen

    Abstract: The striking recent advances in eliciting seemingly meaningful language behaviour from language-only machine learning models have only made more apparent, through the surfacing of clear limitations, the need to go beyond the language-only mode and to ground these models "in the world". Proposals for doing so vary in the details, but what unites them is that the solution is sought in the addition o…

    Submitted 23 August, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Published at "(Dis)embodiment 2022: A CLASP Conference", Gothenburg, Sweden, September 2022

  26. arXiv:2204.06970  [pdf]

    cs.CL

    Can Visual Dialogue Models Do Scorekeeping? Exploring How Dialogue Representations Incrementally Encode Shared Knowledge

    Authors: Brielen Madureira, David Schlangen

    Abstract: Cognitively plausible visual dialogue models should keep a mental scoreboard of shared established facts in the dialogue context. We propose a theory-based evaluation method for investigating to what degree models pretrained on the VisDial dataset incrementally build representations that appropriately do scorekeeping. Our conclusion is that the ability to make the distinction between shared and pr…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted at ACL 2022, short paper

  27. arXiv:2202.01155  [pdf, other]

    cs.CL

    The slurk Interaction Server Framework: Better Data for Better Dialog Models

    Authors: Jana Götze, Maike Paetzel-Prüsmann, Wencke Liermann, Tim Diekmann, David Schlangen

    Abstract: This paper presents the slurk software, a lightweight interaction server for setting up dialog data collections and running experiments. Slurk enables a multitude of settings including text-based, speech and video interaction between two or more humans or humans and bots, and a multimodal display area for presenting shared or private interactive context. The software is implemented in Python with…

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: submitted to LREC 2022

  28. arXiv:2109.07364  [pdf, other]

    cs.CL

    Towards Incremental Transformers: An Empirical Analysis of Transformer Models for Incremental NLU

    Authors: Patrick Kahardipraja, Brielen Madureira, David Schlangen

    Abstract: Incremental processing allows interactive systems to respond based on partial inputs, which is a desirable property e.g. in dialogue agents. The currently popular Transformer architecture inherently processes sequences as a whole, abstracting away the notion of time. Recent work attempts to apply Transformers incrementally via restart-incrementality by repeatedly feeding, to an unchanged model, in…

    Submitted 2 May, 2024; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021 (contains corrigendum)

  29. arXiv:2105.03495  [pdf, other]

    cs.CL

    Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

    Authors: Anne Beyer, Sharid Loáiciga, David Schlangen

    Abstract: Coherent discourse is distinguished from a mere collection of utterances by the satisfaction of a diverse set of constraints, for example choice of expression, logical relation between denoted events, and implicit compatibility with world-knowledge. Do neural language models encode such constraints? We design an extendable set of test suites addressing different aspects of discourse and dialogue c…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted as long paper at NAACL 2021

  30. arXiv:2010.05330  [pdf, other]

    cs.CL

    Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU

    Authors: Brielen Madureira, David Schlangen

    Abstract: While humans process language incrementally, the best language encoders currently used in NLP do not. Both bidirectional LSTMs and Transformers assume that the sequence that is to be encoded is available in full, to be processed either forwards and backwards (BiLSTMs) or as a whole (Transformers). We investigate how they behave under incremental interfaces, when partial output must be provided bas…

    Submitted 28 March, 2024; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted to the EMNLP 2020 conference (long paper). V2 has minor updates, see note in last page

  31. arXiv:2007.09774  [pdf, other]

    cs.CL

    An Overview of Natural Language State Representation for Reinforcement Learning

    Authors: Brielen Madureira, David Schlangen

    Abstract: A suitable state representation is a fundamental part of the learning process in Reinforcement Learning. In various tasks, the state can either be described by natural language or be natural language itself. This survey outlines the strategies used in the literature to build natural language state representations. We appeal for more linguistically interpretable and grounded representations, carefu…

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted to the ICML 2020 Workshop on Language in Reinforcement Learning (LaReL). 4 pages

  32. arXiv:2007.04792  [pdf, other]

    cs.CL

    Targeting the Benchmark: On Methodology in Current Natural Language Processing Research

    Authors: David Schlangen

    Abstract: It has become a common pattern in our field: One group introduces a language task, exemplified by a dataset, which they argue is challenging enough to serve as a benchmark. They also provide a baseline model for it, which then soon is improved upon by other groups. Often, research efforts then move on, and the pattern repeats itself. What is typically left implicit is the argumentation for why thi…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: text overlap with arXiv:1908.10747

  33. arXiv:2003.13342  [pdf, other]

    cs.CL

    A Corpus of Controlled Opinionated and Knowledgeable Movie Discussions for Training Neural Conversation Models

    Authors: Fabian Galetzka, Chukwuemeka U. Eneh, David Schlangen

    Abstract: Fully data driven Chatbots for non-goal oriented dialogues are known to suffer from inconsistent behaviour across their turns, stemming from a general difficulty in controlling parameters like their assumed background personality and knowledge of facts. One reason for this is the relative lack of labeled data from which personality consistency and fact usage could be learned together with dialogue…

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: 8 Pages, 8 Figures, 5 Tables. Accepted paper for LREC 2020 conference

  34. arXiv:1911.03936  [pdf, other]

    cs.CL cs.CV cs.LG

    Can Neural Image Captioning be Controlled via Forced Attention?

    Authors: Philipp Sadler, Tatjana Scheffler, David Schlangen

    Abstract: Learned dynamic weighting of the conditioning signal (attention) has been shown to improve neural language generation in a variety of settings. The weights applied when generating a particular output sequence have also been viewed as providing a potentially explanatory insight into the internal workings of the generator. In this paper, we reverse the direction of this connection and ask whether th…

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: Accepted short paper for the 12th International Conference on Natural Language Generation

  35. arXiv:1908.11279  [pdf, other]

    cs.CL

    Grounded Agreement Games: Emphasizing Conversational Grounding in Visual Dialogue Settings

    Authors: David Schlangen

    Abstract: Where early work on dialogue in Computational Linguistics put much emphasis on dialogue structure and its relation to the mental states of the dialogue participants (e.g., Allen 1979, Grosz & Sidner 1986), current work mostly reduces dialogue to the task of producing at any one time a next utterance; e.g. in neural chatbot or Visual Dialogue settings. As a methodological decision, this is sound: E…

    Submitted 29 August, 2019; originally announced August 2019.

  36. arXiv:1908.10747  [pdf, other]

    cs.CL cs.AI

    Language Tasks and Language Games: On Methodology in Current Natural Language Processing Research

    Authors: David Schlangen

    Abstract: "This paper introduces a new task and a new dataset", "we improve the state of the art in X by Y" -- it is rare to find a current natural language processing paper (or AI paper more generally) that does not contain such statements. What is mostly left implicit, however, is the assumption that this necessarily constitutes progress, and what it constitutes progress towards. Here, we make more precis…

    Submitted 28 August, 2019; originally announced August 2019.

  37. arXiv:1907.05084  [pdf, other]

    cs.CL cs.CV

    MeetUp! A Corpus of Joint Activity Dialogues in a Visual Environment

    Authors: Nikolai Ilinykh, Sina Zarrieß, David Schlangen

    Abstract: Building computer systems that can converse about their visual environment is one of the oldest concerns of research in Artificial Intelligence and Computational Linguistics (see, for example, Winograd's 1972 SHRDLU system). Only recently, however, have methods from computer vision and natural language processing become powerful enough to make this vision seem more attainable. Pushed especially by…

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: In Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (semdial / LondonLogue), London, September 2019

  38. arXiv:1906.05518  [pdf, other]

    cs.CL

    Know What You Don't Know: Modeling a Pragmatic Speaker that Refers to Objects of Unknown Categories

    Authors: Sina Zarrieß, David Schlangen

    Abstract: Zero-shot learning in Language & Vision is the task of correctly labelling (or naming) objects of novel categories. Another strand of work in L&V aims at pragmatically informative rather than "correct" object descriptions, e.g. in reference games. We combine these lines of research and model zero-shot reference games, where a speaker needs to successfully refer to a novel object in an image. Ins…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Accepted at ACL 2019

  39. arXiv:1904.07318  [pdf, other]

    cs.CL cs.CV

    Natural Language Semantics With Pictures: Some Language & Vision Datasets and Potential Uses for Computational Semantics

    Authors: David Schlangen

    Abstract: Propelling, and propelled by, the "deep learning revolution", recent years have seen the introduction of ever larger corpora of images annotated with natural language expressions. We survey some of these corpora, taking a perspective that reverses the usual directionality, as it were, by viewing the images as semantic annotation of the natural language expressions. We discuss datasets that can be…

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Presented at the 13th International Conference on Computational Semantics (IWCS 2019), Gothenburg

  40. arXiv:1510.02125  [pdf, other]

    cs.CL

    Resolving References to Objects in Photographs using the Words-As-Classifiers Model

    Authors: David Schlangen, Sina Zarriess, Casey Kennington

    Abstract: A common use of language is to refer to visually present objects. Modelling it in computers requires modelling the link between language and perception. The "words as classifiers" model of grounded semantics views words as classifiers of perceptual contexts, and composes the meaning of a phrase through composition of the denotations of its component words. It was recently shown to perform well in…

    Submitted 3 June, 2016; v1 submitted 7 October, 2015; originally announced October 2015.

    Comments: 11 pages; as in Proceedings of ACL 2016, Berlin, 2016