Skip to main content

Showing 1–37 of 37 results for author: Konstas, I

.
  1. arXiv:2407.03967  [pdf, other

    cs.CL cs.AI cs.RO

    Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks

    Authors: Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas

    Abstract: Evaluating the generalisation capabilities of multimodal models based solely on their performance on out-of-distribution data fails to capture their true robustness. This work introduces a comprehensive evaluation framework that systematically examines the role of instructions and inputs in the generalisation abilities of such models, considering architectural design, input perturbations across la… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2406.19297  [pdf, other

    cs.CV

    Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation

    Authors: Malvina Nikandrou, Georgios Pantazopoulos, Ioannis Konstas, Alessandro Suglia

    Abstract: Continual learning focuses on incrementally training a model on a sequence of tasks with the aim of learning new tasks while minimizing performance drop on previous tasks. Existing approaches at the intersection of Continual Learning and Visual Question Answering (VQA) do not study how the multimodal nature of the input affects the learning dynamics of a model. In this paper, we demonstrate that e… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.13807  [pdf, other

    cs.CV cs.AI cs.CL

    AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

    Authors: Alessandro Suglia, Claudio Greco, Katie Baker, Jose L. Part, Ioannis Papaioannou, Arash Eshghi, Ioannis Konstas, Oliver Lemon

    Abstract: AI personal assistants deployed via robots or wearables require embodied understanding to collaborate with humans effectively. However, current Vision-Language Models (VLMs) primarily focus on third-person view videos, neglecting the richness of egocentric perceptual experience. To address this gap, we propose three key contributions. First, we introduce the Egocentric Video Understanding Dataset… ▽ More

    Submitted 21 June, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Code available https://github.com/alanaai/EVUD

  4. arXiv:2312.02431  [pdf, other

    cs.CL cs.AI

    Visually Grounded Language Learning: a review of language games, datasets, tasks, and models

    Authors: Alessandro Suglia, Ioannis Konstas, Oliver Lemon

    Abstract: In recent years, several machine learning models have been proposed. They are trained with a language modelling objective on large-scale text-only data. With such pretraining, they can achieve impressive results on many Natural Language Understanding and Generation tasks. However, many facets of meaning cannot be learned by ``listening to the radio" only. In the literature, many Vision+Language (V… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Preprint for JAIR before copyediting

  5. arXiv:2311.04067  [pdf, other

    cs.LG cs.AI cs.CV

    Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

    Authors: Georgios Pantazopoulos, Malvina Nikandrou, Amit Parekh, Bhathiya Hemanthage, Arash Eshghi, Ioannis Konstas, Verena Rieser, Oliver Lemon, Alessandro Suglia

    Abstract: Interactive and embodied tasks pose at least two fundamental challenges to existing Vision & Language (VL) models, including 1) grounding language in trajectories of actions and observations, and 2) referential disambiguation. To tackle these challenges, we propose an Embodied MultiModal Agent (EMMA): a unified encoder-decoder model that reasons over images and trajectories, and casts action predi… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  6. arXiv:2307.16689  [pdf, other

    cs.CL

    No that's not what I meant: Handling Third Position Repair in Conversational Question Answering

    Authors: Vevake Balaraman, Arash Eshghi, Ioannis Konstas, Ioannis Papaioannou

    Abstract: The ability to handle miscommunication is crucial to robust and faithful conversational AI. People usually deal with miscommunication immediately as they detect it, using highly systematic interactional mechanisms called repair. One important type of repair is Third Position Repair (TPR) whereby a speaker is initially misunderstood but then corrects the misunderstanding as it becomes apparent afte… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Accepted at SIGDIAL'23

  7. arXiv:2305.19911  [pdf, other

    cs.LG cs.CL

    Neuron to Graph: Interpreting Language Model Neurons at Scale

    Authors: Alex Foote, Neel Nanda, Esben Kran, Ioannis Konstas, Shay Cohen, Fazl Barez

    Abstract: Advances in Large Language Models (LLMs) have led to remarkable capabilities, yet their inner mechanisms remain largely unknown. To understand these models, we need to unravel the functions of individual neurons and their contribution to the network. This paper introduces a novel automated approach designed to scale interpretability techniques across a vast array of neurons within LLMs, to make th… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  8. arXiv:2305.17553  [pdf, other

    cs.CL cs.AI cs.LG

    Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark

    Authors: Jason Hoelscher-Obermaier, Julia Persson, Esben Kran, Ioannis Konstas, Fazl Barez

    Abstract: Recent model editing techniques promise to mitigate the problem of memorizing false or outdated associations during LLM training. However, we show that these techniques can introduce large unwanted side effects which are not detected by existing specificity benchmarks. We extend the existing CounterFact benchmark to include a dynamic component and dub our benchmark CounterFact+. Additionally, we e… ▽ More

    Submitted 3 June, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: To be published in ACL Findings 2023; for code see https://github.com/apartresearch/specificityplus; for a homepage see https://specificityplus.apartresearch.com/; updated Figures to uniform style

    ACM Class: I.2.7

  9. arXiv:2305.16519  [pdf, other

    cs.CL

    The Dangers of trusting Stochastic Parrots: Faithfulness and Trust in Open-domain Conversational Question Answering

    Authors: Sabrina Chiesurin, Dimitris Dimakopoulos, Marco Antonio Sobrevilla Cabezudo, Arash Eshghi, Ioannis Papaioannou, Verena Rieser, Ioannis Konstas

    Abstract: Large language models are known to produce output which sounds fluent and convincing, but is also often wrong, e.g. "unfaithful" with respect to a rationale as retrieved from a knowledge base. In this paper, we show that task-based systems which exhibit certain advanced linguistic dialog behaviors, such as lexical alignment (repeating what the user said), are in fact preferred and trusted more, wh… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 5 pages, ACL Findings 2023

  10. arXiv:2305.15507  [pdf, other

    cs.CL cs.AI

    The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python

    Authors: Antonio Valerio Miceli-Barone, Fazl Barez, Ioannis Konstas, Shay B. Cohen

    Abstract: Large Language Models (LLMs) have successfully been applied to code generation tasks, raising the question of how well these models understand programming. Typical programming languages have invariances and equivariances in their semantics that human programmers intuitively understand and exploit, such as the (near) invariance to the renaming of identifiers. We show that LLMs not only fail to prop… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 17 pages, 5 figure, ACL 2023

  11. arXiv:2305.06074  [pdf, other

    cs.CL cs.LG

    iLab at SemEval-2023 Task 11 Le-Wi-Di: Modelling Disagreement or Modelling Perspectives?

    Authors: Nikolas Vitsakis, Amit Parekh, Tanvi Dinkar, Gavin Abercrombie, Ioannis Konstas, Verena Rieser

    Abstract: There are two competing approaches for modelling annotator disagreement: distributional soft-labelling approaches (which aim to capture the level of disagreement) or modelling perspectives of individual annotators or groups thereof. We adapt a multi-task architecture -- which has previously shown success in modelling perspectives -- to evaluate its performance on the SEMEVAL Task 11. We do so by c… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: To appear in the Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023). Association for Computational Linguistics, 2023

  12. arXiv:2304.12918  [pdf, other

    cs.LG

    N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models

    Authors: Alex Foote, Neel Nanda, Esben Kran, Ionnis Konstas, Fazl Barez

    Abstract: Understanding the function of individual neurons within language models is essential for mechanistic interpretability research. We propose $\textbf{Neuron to Graph (N2G)}$, a tool which takes a neuron and its dataset examples, and automatically distills the neuron's behaviour on those examples to an interpretable graph. This presents a less labour intensive approach to interpreting neurons than cu… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: To be published at ICLR 2023 Workshop on Trustworthy and Reliable Large-Scale Machine Learning Models

  13. arXiv:2211.04534  [pdf, other

    cs.CV cs.CL

    Going for GOAL: A Resource for Grounded Football Commentaries

    Authors: Alessandro Suglia, José Lopes, Emanuele Bastianelli, Andrea Vanzo, Shubham Agarwal, Malvina Nikandrou, Lu Yu, Ioannis Konstas, Verena Rieser

    Abstract: Recent video+language datasets cover domains where the interaction is highly structured, such as instructional videos, or where the interaction is scripted, such as TV shows. Both of these properties can lead to spurious cues to be exploited by models rather than learning to ground language. In this paper, we present GrOunded footbAlL commentaries (GOAL), a novel dataset of football (or `soccer')… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Preprint formatted using the ACM Multimedia template (8 pages + appendix)

  14. Mind the Labels: Describing Relations in Knowledge Graphs With Pretrained Models

    Authors: Zdeněk Kasner, Ioannis Konstas, Ondřej Dušek

    Abstract: Pretrained language models (PLMs) for data-to-text (D2T) generation can use human-readable data labels such as column headings, keys, or relation names to generalize to out-of-domain examples. However, the models are well-known in producing semantically inaccurate outputs if these labels are ambiguous or incomplete, which is often the case in D2T datasets. In this paper, we expose this issue on th… ▽ More

    Submitted 16 October, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Long paper at EACL '23. Code and data: https://github.com/kasnerz/rel2text

    ACM Class: I.2.7

  15. arXiv:2210.00044  [pdf, other

    cs.LG

    Task Formulation Matters When Learning Continually: A Case Study in Visual Question Answering

    Authors: Mavina Nikandrou, Lu Yu, Alessandro Suglia, Ioannis Konstas, Verena Rieser

    Abstract: Continual learning aims to train a model incrementally on a sequence of tasks without forgetting previous knowledge. Although continual learning has been widely studied in computer vision, its application to Vision+Language tasks is not that straightforward, as settings can be parameterized in multiple ways according to their input modalities. In this paper, we present a detailed study of how diff… ▽ More

    Submitted 20 January, 2024; v1 submitted 30 September, 2022; originally announced October 2022.

  16. arXiv:2110.01295  [pdf, other

    cs.CL

    SPaR.txt, a cheap Shallow Parsing approach for Regulatory texts

    Authors: Ruben Kruiper, Ioannis Konstas, Alasdair Gray, Farhad Sadeghineko, Richard Watson, Bimal Kumar

    Abstract: Automated Compliance Checking (ACC) systems aim to semantically parse building regulations to a set of rules. However, semantic parsing is known to be hard and requires large amounts of training data. The complexity of creating such training data has led to research that focuses on small sub-tasks, such as shallow parsing or the extraction of a limited subset of rules. This study introduces a shal… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: To be published in the NLLP workshop at EMNLP 2021, 9 pages (15 including reference and appendices). For the ScotReg corpus, SPaR.txt dataset and code see: http://github.com/rubenkruiper/SPaR.txt

  17. arXiv:2109.10650  [pdf, other

    cs.CL

    MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization

    Authors: Xinnuo Xu, Ondřej Dušek, Shashi Narayan, Verena Rieser, Ioannis Konstas

    Abstract: One of the most challenging aspects of current single-document news summarization is that the summary often contains 'extrinsic hallucinations', i.e., facts that are not present in the source document, which are often derived via world knowledge. This causes summarization systems to act more like open-ended language models tending to hallucinate facts that are erroneous. In this paper, we mitigate… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Journal ref: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing Findings (EMNLP2021 Findings)

  18. arXiv:2106.05580  [pdf, other

    cs.CL

    AGGGEN: Ordering and Aggregating while Generating

    Authors: Xinnuo Xu, Ondřej Dušek, Verena Rieser, Ioannis Konstas

    Abstract: We present AGGGEN (pronounced 'again'), a data-to-text model which re-introduces two explicit sentence planning stages into neural data-to-text systems: input ordering and input aggregation. In contrast to previous work using sentence planning, our model is still end-to-end: AGGGEN performs sentence planning at the same time as generating text by learning latent alignments (via semantic facts) bet… ▽ More

    Submitted 17 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: Correct the first citation in the Zero-shot Few-shot scenarios paragraph in Section 7

    Journal ref: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL2021)

  19. arXiv:2105.13710  [pdf, other

    cs.CL

    OTTers: One-turn Topic Transitions for Open-Domain Dialogue

    Authors: Karin Sevegnani, David M. Howcroft, Ioannis Konstas, Verena Rieser

    Abstract: Mixed initiative in open-domain dialogue requires a system to pro-actively introduce new topics. The one-turn topic transition task explores how a system connects two topics in a cooperative and coherent manner. The goal of the task is to generate a "bridging" utterance connecting the new topic to the topic of the previous conversation turn. We are especially interested in commonsense explanations… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Journal ref: ACL2021

  20. arXiv:2102.00424  [pdf, other

    cs.CL cs.CV cs.LG

    An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games

    Authors: Alessandro Suglia, Yonatan Bisk, Ioannis Konstas, Antonio Vergari, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

    Abstract: Guessing games are a prototypical instance of the "learning by interacting" paradigm. This work investigates how well an artificial agent can benefit from playing guessing games when later asked to perform on novel NLP downstream tasks such as Visual Question Answering (VQA). We propose two ways to exploit playing guessing games: 1) a supervised learning scenario in which the agent learns to mimic… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: Accepted paper for the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)

  21. arXiv:2011.02917  [pdf, other

    cs.CL cs.CV cs.LG

    Imagining Grounded Conceptual Representations from Perceptual Information in Situated Guessing Games

    Authors: Alessandro Suglia, Antonio Vergari, Ioannis Konstas, Yonatan Bisk, Emanuele Bastianelli, Andrea Vanzo, Oliver Lemon

    Abstract: In visual guessing games, a Guesser has to identify a target object in a scene by asking questions to an Oracle. An effective strategy for the players is to learn conceptual representations of objects that are both discriminative and expressive enough to ask questions and guess correctly. However, as shown by Suglia et al. (2020), existing models fail to learn truly multi-modal representations, re… ▽ More

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted to the International Conference on Computational Linguistics (COLING) 2020

  22. arXiv:2006.02174  [pdf, other

    cs.CL cs.AI cs.LG

    CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

    Authors: Alessandro Suglia, Ioannis Konstas, Andrea Vanzo, Emanuele Bastianelli, Desmond Elliott, Stella Frank, Oliver Lemon

    Abstract: Approaches to Grounded Language Learning typically focus on a single task-based final performance measure that may not depend on desirable properties of the learned hidden representations, such as their ability to predict salient attributes or to generalise to unseen situations. To remedy this, we present GROLLA, an evaluation framework for Grounded Language Learning with Attributes with three sub… ▽ More

    Submitted 3 June, 2020; originally announced June 2020.

    Comments: Accepted to the Annual Conference of the Association for Computational Linguistics (ACL) 2020

  23. arXiv:2005.07753  [pdf, other

    cs.CL

    A Scientific Information Extraction Dataset for Nature Inspired Engineering

    Authors: Ruben Kruiper, Julian F. V. Vincent, Jessica Chen-Burger, Marc P. Y. Desmulliez, Ioannis Konstas

    Abstract: Nature has inspired various ground-breaking technological developments in applications ranging from robotics to aerospace engineering and the manufacturing of medical devices. However, accessing the information captured in scientific biology texts is a time-consuming and hard task that requires domain-specific knowledge. Improving access for outsiders can help interdisciplinary research like Natur… ▽ More

    Submitted 26 May, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: Published in Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). Updated dataset statistics, results unchanged

  24. arXiv:2005.07751  [pdf, other

    cs.CL

    In Layman's Terms: Semi-Open Relation Extraction from Scientific Texts

    Authors: Ruben Kruiper, Julian F. V. Vincent, Jessica Chen-Burger, Marc P. Y. Desmulliez, Ioannis Konstas

    Abstract: Information Extraction (IE) from scientific texts can be used to guide readers to the central information in scientific documents. But narrow IE systems extract only a fraction of the information captured, and Open IE systems do not perform well on the long and complex sentences encountered in scientific texts. In this work we combine the output of both types of systems to achieve Semi-Open Relati… ▽ More

    Submitted 26 May, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: To be published in ACL 2020 conference proceedings. Updated dataset statistics, results unchanged

  25. arXiv:2005.07493  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    History for Visual Dialog: Do we really need it?

    Authors: Shubham Agarwal, Trung Bui, Joon-Young Lee, Ioannis Konstas, Verena Rieser

    Abstract: Visual Dialog involves "understanding" the dialog history (what has been discussed previously) and the current question (what is asked), in addition to grounding information in the image, to generate the correct response. In this paper, we show that co-attention models which explicitly encode dialog history outperform models that don't, achieving state-of-the-art performance (72 % NDCG on val set)… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: ACL'20

  26. arXiv:1910.13299  [pdf, other

    cs.CL

    Findings of the Third Workshop on Neural Generation and Translation

    Authors: Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

    Abstract: This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019). First, we summarize the research trends of papers presented in the proceedings. Second, we describe the results of the two shared tasks 1) efficient neural machine translation (NMT) where pa… ▽ More

    Submitted 29 October, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Fixed the metadata (author list)

  27. arXiv:1910.04731  [pdf, other

    cs.CL

    Automatic Quality Estimation for Natural Language Generation: Ranting (Jointly Rating and Ranking)

    Authors: Ondřej Dušek, Karin Sevegnani, Ioannis Konstas, Verena Rieser

    Abstract: We present a recurrent neural network based system for automatic quality estimation of natural language generation (NLG) outputs, which jointly learns to assign numerical ratings to individual outputs and to provide pairwise rankings of two different outputs. The latter is trained using pairwise hinge loss over scores from two copies of the rating network. We use learning to rank and synthetic d… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: Accepted as a short paper at INLG 2019

    ACM Class: I.2.7

  28. arXiv:1909.06644  [pdf, ps, other

    cs.CL

    Current Challenges in Spoken Dialogue Systems and Why They Are Critical for Those Living with Dementia

    Authors: Angus Addlesee, Arash Eshghi, Ioannis Konstas

    Abstract: Dialogue technologies such as Amazon's Alexa have the potential to transform the healthcare industry. However, current systems are not yet naturally interactive: they are often turn-based, have naive end-of-turn detection and completely ignore many types of verbal and visual feedback - such as backchannels, hesitation markers, filled pauses, gaze, brow furrows and disfluencies - that are crucial i… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: Published at Dialog for Good 2019 - Workshop on Speech and Language Technology Serving Society

    Journal ref: Dialog for Good (2019)

  29. arXiv:1904.03651  [pdf, other

    cs.CL

    SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression

    Authors: Christos Baziotis, Ion Androutsopoulos, Ioannis Konstas, Alexandros Potamianos

    Abstract: Neural sequence-to-sequence models are currently the dominant approach in several natural language processing tasks, but require large parallel corpora. We present a sequence-to-sequence-to-sequence autoencoder (SEQ^3), consisting of two chained encoder-decoder pairs, with words used as a sequence of discrete latent variables. We apply the proposed model to unsupervised abstractive sentence compre… ▽ More

    Submitted 9 June, 2019; v1 submitted 7 April, 2019; originally announced April 2019.

    Comments: Accepted to NAACL 2019

  30. arXiv:1810.11955  [pdf, other

    cs.CL

    Improving Context Modelling in Multimodal Dialogue Generation

    Authors: Shubham Agarwal, Ondrej Dusek, Ioannis Konstas, Verena Rieser

    Abstract: In this work, we investigate the task of textual response generation in a multimodal task-oriented dialogue system. Our work is based on the recently released Multimodal Dialogue (MMD) dataset (Saha et al., 2017) in the fashion domain. We introduce a multimodal extension to the Hierarchical Recurrent Encoder-Decoder (HRED) model and show that this extension outperforms strong baselines in terms of… ▽ More

    Submitted 20 October, 2018; originally announced October 2018.

    Journal ref: Proceedings of the 11th International Conference on Natural Language Generation, pages 129-134, Tilburg, The Netherlands, 2018

  31. arXiv:1810.11954  [pdf, other

    cs.CL cs.AI

    A Knowledge-Grounded Multimodal Search-Based Conversational Agent

    Authors: Shubham Agarwal, Ondrej Dusek, Ioannis Konstas, Verena Rieser

    Abstract: Multimodal search-based dialogue is a challenging new task: It extends visually grounded question answering systems into multi-turn conversations with access to an external database. We address this new challenge by learning a neural response generation system from the recently released Multimodal Dialogue (MMD) dataset (Saha et al., 2017). We introduce a knowledge-grounded multimodal conversation… ▽ More

    Submitted 20 October, 2018; originally announced October 2018.

    Journal ref: Proceedings of the 2018 EMNLP Workshop SCAI: The 2nd International Workshop on Search-Oriented Conversational AI, pages 59-66, Brussels, Belgium, October 2018

  32. arXiv:1809.06873  [pdf, other

    cs.CL

    Better Conversations by Modeling,Filtering,and Optimizing for Coherence and Diversity

    Authors: Xinnuo Xu, Ondřej Dušek, Ioannis Konstas, Verena Rieser

    Abstract: We present three enhancements to existing encoder-decoder models for open-domain conversational agents, aimed at effectively modeling coherence and promoting output diversity: (1) We introduce a measure of coherence as the GloVe embedding similarity between the dialogue context and the generated response, (2) we filter our training corpora based on the measure of coherence to obtain topically cohe… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

    Journal ref: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3981-3991, Brussels, Belgium, November 2018

  33. arXiv:1808.09588  [pdf, ps, other

    cs.CL

    Map** Language to Code in Programmatic Context

    Authors: Srinivasan Iyer, Ioannis Konstas, Alvin Cheung, Luke Zettlemoyer

    Abstract: Source code is rarely written in isolation. It depends significantly on the programmatic context, such as the class that the code would reside in. To study this phenomenon, we introduce the task of generating class member functions given English documentation and the programmatic context provided by the rest of the class. This task is challenging because the desired code can vary greatly depending… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: Accepted at EMNLP 2018

  34. arXiv:1704.08760  [pdf, other

    cs.CL

    Learning a Neural Semantic Parser from User Feedback

    Authors: Srinivasan Iyer, Ioannis Konstas, Alvin Cheung, Jayant Krishnamurthy, Luke Zettlemoyer

    Abstract: We present an approach to rapidly and easily build natural language interfaces to databases for new domains, whose performance improves over time based on user feedback, and requires minimal intervention. To achieve this, we adapt neural sequence models to map utterances directly to SQL with its full expressivity, bypassing any intermediate meaning representations. These models are immediately dep… ▽ More

    Submitted 27 April, 2017; originally announced April 2017.

    Comments: Accepted at ACL 2017

  35. arXiv:1704.08381  [pdf, other

    cs.CL

    Neural AMR: Sequence-to-Sequence Models for Parsing and Generation

    Authors: Ioannis Konstas, Srinivasan Iyer, Mark Yatskar, Ye** Choi, Luke Zettlemoyer

    Abstract: Sequence-to-sequence models have shown strong performance across a broad range of applications. However, their application to parsing and generating text usingAbstract Meaning Representation (AMR)has been limited, due to the relatively limited amount of labeled data and the non-sequential nature of the AMR graphs. We present a novel training procedure that can lift this limitation using millions o… ▽ More

    Submitted 18 August, 2017; v1 submitted 26 April, 2017; originally announced April 2017.

    Comments: Accepted in ACL 2017

  36. arXiv:1702.01841  [pdf, ps, other

    cs.CL

    The Effect of Different Writing Tasks on Linguistic Style: A Case Study of the ROC Story Cloze Task

    Authors: Roy Schwartz, Maarten Sap, Ioannis Konstas, Li Zilles, Ye** Choi, Noah A. Smith

    Abstract: A writer's style depends not just on personal traits but also on her intent and mental state. In this paper, we show how variants of the same writing task can lead to measurable differences in writing style. We present a case study based on the story cloze task (Mostafazadeh et al., 2016a), where annotators were assigned similar writing tasks with different constraints: (1) writing an entire story… ▽ More

    Submitted 13 July, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: 11 pages, CoNLL 2017

  37. arXiv:1610.06210  [pdf, other

    cs.CL

    A Theme-Rewriting Approach for Generating Algebra Word Problems

    Authors: Rik Koncel-Kedziorski, Ioannis Konstas, Luke Zettlemoyer, Hannaneh Hajishirzi

    Abstract: Texts present coherent stories that have a particular theme or overall setting, for example science fiction or western. In this paper, we present a text generation method called {\it rewriting} that edits existing human-authored narratives to change their theme without changing the underlying story. We apply the approach to math word problems, where it might help students stay more engaged by quic… ▽ More

    Submitted 19 October, 2016; originally announced October 2016.

    Comments: To appear EMNLP 2016