Skip to main content

Showing 1–15 of 15 results for author: Briscoe, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.12363  [pdf, other

    cs.CL

    Emergent Word Order Universals from Cognitively-Motivated Language Models

    Authors: Tatsuki Kuribayashi, Ryo Ueda, Ryo Yoshida, Yohei Oseki, Ted Briscoe, Timothy Baldwin

    Abstract: The world's languages exhibit certain so-called typological or implicational universals; for example, Subject-Object-Verb (SOV) languages typically use postpositions. Explaining the source of such biases is a key goal of linguistics. We study word-order universals through a computational simulation with language models (LMs). Our experiments show that typologically-typical word orders tend to have… ▽ More

    Submitted 7 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 main conference, 22 pages

  2. Grammatical Error Correction: A Survey of the State of the Art

    Authors: Christopher Bryant, Zheng Yuan, Muhammad Reza Qorib, Hannan Cao, Hwee Tou Ng, Ted Briscoe

    Abstract: Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject-verb agreement, but also orthographic and semantic errors, such as misspellings and word choice errors respectively. The field has seen significant progress in the last decade, m… ▽ More

    Submitted 29 April, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Journal ref: Computational Linguistics (2023) 49 (3): 643-701

  3. arXiv:2011.06306  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Analyzing Neural Discourse Coherence Models

    Authors: Youmna Farag, Josef Valvoda, Helen Yannakoudakis, Ted Briscoe

    Abstract: In this work, we systematically investigate how well current models of coherence can capture aspects of text implicated in discourse organisation. We devise two datasets of various linguistic alterations that undermine coherence and test model sensitivity to changes in syntax and semantics. We furthermore probe discourse embedding space and examine the knowledge that is encoded in representations… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Journal ref: CODI workshop in EMNLP2020

  4. Text Readability Assessment for Second Language Learners

    Authors: Menglin Xia, Ekaterina Kochmar, Ted Briscoe

    Abstract: This paper addresses the task of readability assessment for the texts aimed at second language (L2) learners. One of the major challenges in this task is the lack of significantly sized level-annotated data. For the present work, we collected a dataset of CEFR-graded texts tailored for learners of English as an L2 and investigated text readability assessment for both native and L2 learners. We app… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications

  5. arXiv:1906.07555  [pdf, other

    cs.CL

    Automatic learner summary assessment for reading comprehension

    Authors: Menglin Xia, Ekaterina Kochmar, Ted Briscoe

    Abstract: Automating the assessment of learner summaries provides a useful tool for assessing learner reading comprehension. We present a summarization task for evaluating non-native reading comprehension and propose three novel approaches to automatically assess the learner summaries. We evaluate our models on two datasets we created and show that our models outperform traditional approaches that rely on e… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: NAACL2019

  6. arXiv:1804.06898  [pdf, other

    cs.CL cs.AI

    Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input

    Authors: Youmna Farag, Helen Yannakoudakis, Ted Briscoe

    Abstract: We demonstrate that current state-of-the-art approaches to Automated Essay Scoring (AES) are not well-suited to capturing adversarially crafted input of grammatical but incoherent sequences of sentences. We develop a neural model of local coherence that can effectively learn connectedness features between sentences, and propose a framework for integrating and jointly training the local coherence m… ▽ More

    Submitted 30 April, 2020; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: 9, NAACL 2018

    Journal ref: The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2018)

  7. arXiv:1711.10837  [pdf, other

    cs.CL

    Curriculum Q-Learning for Visual Vocabulary Acquisition

    Authors: Ahmed H. Zaidi, Russell Moore, Ted Briscoe

    Abstract: The structure of curriculum plays a vital role in our learning process, both as children and adults. Presenting material in ascending order of difficulty that also exploits prior knowledge can have a significant impact on the rate of learning. However, the notion of difficulty and prior knowledge differs from person to person. Motivated by the need for a personalised curriculum, we present a novel… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: Accepted at Visually Grounded Interaction and Language Workshop (NIPS 2017)

  8. arXiv:1707.06841  [pdf, other

    cs.CL cs.LG cs.NE

    An Error-Oriented Approach to Word Embedding Pre-Training

    Authors: Youmna Farag, Marek Rei, Ted Briscoe

    Abstract: We propose a novel word embedding pre-training approach that exploits writing errors in learners' scripts. We compare our method to previous models that tune the embeddings based on script scores and the discrimination between correct and corrupt word contexts in addition to the generic commonly-used embeddings pre-trained on large corpora. The comparison is achieved by using the aforementioned mo… ▽ More

    Submitted 21 July, 2017; originally announced July 2017.

    Comments: 10 pages, 2 figures, 4 tables, BEA 2017

    Journal ref: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

  9. arXiv:1707.05236  [pdf, other

    cs.CL cs.LG

    Artificial Error Generation with Machine Translation and Syntactic Patterns

    Authors: Marek Rei, Mariano Felice, Zheng Yuan, Ted Briscoe

    Abstract: Shortage of available training data is holding back progress in the area of automated error detection. This paper investigates two alternative methods for artificially generating writing errors, in order to create additional resources. We propose treating error generation as a machine translation task, where grammatically correct text is translated to contain errors. In addition, we explore a syst… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

    ACM Class: I.2.7; I.2.6; I.5.1

  10. arXiv:cs/9907013  [pdf, ps, other

    cs.CL

    Corpus Annotation for Parser Evaluation

    Authors: John Carroll, Guido Minnen, Ted Briscoe

    Abstract: We describe a recently developed corpus annotation scheme for evaluating parsers that avoids shortcomings of current methods. The scheme encodes grammatical relations between heads and dependents, and has been used to mark up a new public-domain corpus of naturally occurring English text. We show how the corpus can be used to evaluate the accuracy of a robust parser, and relate the corpus to ext… ▽ More

    Submitted 8 July, 1999; originally announced July 1999.

    Comments: 7 pages, LaTeX (uses eaclap.sty)

    ACM Class: I.2.7

    Journal ref: Proceedings of the EACL99 workshop on Linguistically Interpreted Corpora (LINC), Bergen, Norway, June 12

  11. Can Subcategorisation Probabilities Help a Statistical Parser?

    Authors: John Carroll, Guido Minnen, Ted Briscoe

    Abstract: Research into the automatic acquisition of lexical information from corpora is starting to produce large-scale computational lexicons containing data on the relative frequencies of subcategorisation alternatives for individual verbal predicates. However, the empirical question of whether this type of frequency information can in practice improve the accuracy of a statistical parser has not yet b… ▽ More

    Submitted 21 June, 1998; originally announced June 1998.

    Comments: 9 pages, uses colacl.sty

    Journal ref: 6th Workshop on Very Large Corpora, Montreal, Canada, 1998

  12. Co-evolution of Language and of the Language Acquisition Device

    Authors: Ted Briscoe

    Abstract: A new account of parameter setting during grammatical acquisition is presented in terms of Generalized Categorial Grammar embedded in a default inheritance hierarchy, providing a natural partial ordering on the setting of parameters. Experiments show that several experimentally effective learners can be defined in this framework. Evolutionary simulations suggest that a learner with default initi… ▽ More

    Submitted 1 May, 1997; originally announced May 1997.

    Comments: 10 pages, latex, 2 postscript figures, uses aclap.sty and graphics.sty, to appear ACL-EACL97

  13. Automatic Extraction of Subcategorization from Corpora

    Authors: Ted Briscoe, John Carroll

    Abstract: We describe a novel technique and implemented system for constructing a subcategorization dictionary from textual corpora. Each dictionary entry encodes the relative frequency of occurrence of a comprehensive set of subcategorization classes for English. An initial experiment, on a sample of 14 verbs which exhibit multiple complementation patterns, demonstrates that the technique achieves accura… ▽ More

    Submitted 4 February, 1997; originally announced February 1997.

    Comments: 8 pages; requires aclap.sty. To appear in ANLP-97

  14. Apportioning Development Effort in a Probabilistic LR Parsing System through Evaluation

    Authors: John Carroll, Ted Briscoe

    Abstract: We describe an implemented system for robust domain-independent syntactic parsing of English, using a unification-based grammar of part-of-speech and punctuation labels coupled with a probabilistic LR parser. We present evaluations of the system's performance along several different dimensions; these enable us to assess the contribution that each individual part is making to the success of the s… ▽ More

    Submitted 12 April, 1996; originally announced April 1996.

    Comments: 10 pages, 1 Postscript figure. To Appear in Proceedings of the Conference on Empirical Methods in Natural Language Processing, University of Pennsylvania, May 1996

    Journal ref: Conference on Empirical Methods in Natural Language Processing (EMNLP-96), 92-100

  15. Develo** and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels

    Authors: Ted Briscoe, John Carroll

    Abstract: We describe an approach to robust domain-independent syntactic parsing of unrestricted naturally-occurring (English) input. The technique involves parsing sequences of part-of-speech and punctuation labels using a unification-based grammar coupled with a probabilistic LR parser. We describe the coverage of several corpora using this grammar and report the results of a parsing experiment using pr… ▽ More

    Submitted 9 October, 1995; originally announced October 1995.

    Comments: 11 pages, standard LaTeX

    Journal ref: 4th International Workshop on Parsing Technologies (IWPT-95), 48-58