Skip to main content

Showing 1–12 of 12 results for author: Arkoudas, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.03762  [pdf, ps, other

    cs.CL

    GPT-4 Can't Reason

    Authors: Konstantine Arkoudas

    Abstract: GPT-4 was released in March 2023 to wide acclaim, marking a very substantial improvement across the board over GPT-3.5 (OpenAI's previously best model, which had powered the initial release of ChatGPT). However, despite the genuinely impressive improvement, there are good reasons to be highly skeptical of GPT-4's ability to reason. This position paper discusses the nature of reasoning; criticizes… ▽ More

    Submitted 10 August, 2023; v1 submitted 21 July, 2023; originally announced August 2023.

  2. arXiv:2301.09809  [pdf, other

    cs.CL

    Low-Resource Compositional Semantic Parsing with Concept Pretraining

    Authors: Subendhu Rongali, Mukund Sridhar, Haidar Khan, Konstantine Arkoudas, Wael Hamza, Andrew McCallum

    Abstract: Semantic parsing plays a key role in digital voice assistants such as Alexa, Siri, and Google Assistant by map** natural language to structured meaning representations. When we want to improve the capabilities of a voice assistant by adding a new domain, the underlying semantic parsing model needs to be retrained using thousands of annotated examples from the new domain, which is time-consuming… ▽ More

    Submitted 30 January, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: EACL 2023

  3. arXiv:2212.00265  [pdf, other

    cs.CL cs.LG

    PIZZA: A new benchmark for complex end-to-end task-oriented parsing

    Authors: Konstantine Arkoudas, Nicolas Guenon des Mesnards, Melanie Rubino, Sandesh Swamy, Saarthak Khanna, Weiqi Sun, Khan Haidar

    Abstract: Much recent work in task-oriented parsing has focused on finding a middle ground between flat slots and intents, which are inexpressive but easy to annotate, and powerful representations such as the lambda calculus, which are expressive but costly to annotate. This paper continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, whose semant… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: Accepted for publication at AMLC 2022

  4. arXiv:2206.07808  [pdf, other

    cs.CL cs.AI cs.LG

    Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

    Authors: Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, ** Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak , et al. (16 additional authors not shown)

    Abstract: We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. Though we train using 70% spoken-form data, our teacher models perform co… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: KDD 2022

    ACM Class: I.2.7

    Journal ref: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  5. arXiv:2206.05352  [pdf, other

    cs.CL cs.LG

    Cross-TOP: Zero-Shot Cross-Schema Task-Oriented Parsing

    Authors: Melanie Rubino, Nicolas Guenon des Mesnards, Uday Shah, Nanjiang Jiang, Weiqi Sun, Konstantine Arkoudas

    Abstract: Deep learning methods have enabled task-oriented semantic parsing of increasingly complex utterances. However, a single model is still typically trained and deployed for each task separately, requiring labeled training data for each, which makes it challenging to support new tasks, even within a single business vertical (e.g., food-ordering or travel booking). In this paper we describe Cross-TOP (… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted for publication at NAACL 2022 workshop DeepLo, "Deep Learning for Low-Resource NLP"

  6. arXiv:2205.02068  [pdf, other

    cs.CL

    Compositional Task-Oriented Parsing as Abstractive Question Answering

    Authors: Wenting Zhao, Konstantine Arkoudas, Weiqi Sun, Claire Cardie

    Abstract: Task-oriented parsing (TOP) aims to convert natural language into machine-readable representations of specific tasks, such as setting an alarm. A popular approach to TOP is to apply seq2seq models to generate linearized parse trees. A more recent line of work argues that pretrained seq2seq models are better at generating outputs that are themselves natural language, so they replace linearized pars… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: accepted at NAACL'22

  7. arXiv:2204.14243  [pdf, other

    cs.CL

    Training Naturalized Semantic Parsers with Very Little Data

    Authors: Subendhu Rongali, Konstantine Arkoudas, Melanie Rubino, Wael Hamza

    Abstract: Semantic parsing is an important NLP problem, particularly for voice assistants such as Alexa and Google Assistant. State-of-the-art (SOTA) semantic parsers are seq2seq architectures based on large language models that have been pretrained on vast amounts of text. To better leverage that pretraining, recent work has explored a reformulation of semantic parsing whereby the output sequences are them… ▽ More

    Submitted 4 May, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: IJCAI 2022

  8. Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing Models

    Authors: Weiqi Sun, Haidar Khan, Nicolas Guenon des Mesnards, Melanie Rubino, Konstantine Arkoudas

    Abstract: Semantic parsing is a key NLP task that maps natural language to structured meaning representations. As in many other NLP tasks, SOTA performance in semantic parsing is now attained by fine-tuning a large pretrained language model (PLM). While effective, this approach is inefficient in the presence of multiple downstream tasks, as a new set of values for all parameters of the PLM needs to be store… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 9 pages, 4 figures, submitted to the ACM Web Conference 2022 (WWW '22) and accepted as a full-length research track paper. To be published in the proceedings and ACM Digital Library

  9. arXiv:2012.08549  [pdf, other

    cs.CL

    Exploring Transfer Learning For End-to-End Spoken Language Understanding

    Authors: Subendhu Rongali, Beiye Liu, Liwei Cai, Konstantine Arkoudas, Chengwei Su, Wael Hamza

    Abstract: Voice Assistants such as Alexa, Siri, and Google Assistant typically use a two-stage Spoken Language Understanding pipeline; first, an Automatic Speech Recognition (ASR) component to process customer speech and generate text transcriptions, followed by a Natural Language Understanding (NLU) component to map transcriptions to an actionable hypothesis. An end-to-end (E2E) system that goes directly f… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: AAAI 2021

  10. arXiv:2012.02763  [pdf, other

    cs.CL cs.AI cs.LG

    Delexicalized Paraphrase Generation

    Authors: Boya Yu, Konstantine Arkoudas, Wael Hamza

    Abstract: We present a neural model for paraphrasing and train it to generate delexicalized sentences. We achieve this by creating training data in which each input is paired with a number of reference paraphrases. These sets of reference paraphrases represent a weak type of semantic equivalence based on annotated slots and intents. To understand semantics from different types of slots, other than anonymizi… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  11. arXiv:1907.11049  [pdf, other

    cs.CL

    Grammatical Sequence Prediction for Real-Time Neural Semantic Parsing

    Authors: Chunyang Xiao, Christoph Teichmann, Konstantine Arkoudas

    Abstract: While sequence-to-sequence (seq2seq) models achieve state-of-the-art performance in many natural language processing tasks, they can be too slow for real-time applications. One performance bottleneck is predicting the most likely next token over a large vocabulary; methods to circumvent this bottleneck are a current research topic. We focus specifically on using seq2seq models for semantic parsing… ▽ More

    Submitted 25 July, 2019; originally announced July 2019.

  12. arXiv:1906.09450  [pdf, ps, other

    cs.CL cs.IR

    Semantically Driven Auto-completion

    Authors: Konstantine Arkoudas, Mohamed Yahya

    Abstract: The Bloomberg Terminal has been a leading source of financial data and analytics for over 30 years. Through its thousands of functions, the Terminal allows its users to query and run analytics over a large array of data sources, including structured, semi-structured, and unstructured data; as well as plot charts, set up event-driven alerts and triggers, create interactive maps, exchange informatio… ▽ More

    Submitted 22 June, 2019; originally announced June 2019.

    Comments: 12 pages, under submission