
Emergent Word Order Universals from Cognitively-Motivated Language Models

Authors: Tatsuki Kuribayashi, Ryo Ueda, Ryo Yoshida, Yohei Oseki, Ted Briscoe, Timothy Baldwin

Abstract: The world's languages exhibit certain so-called typological or implicational universals; for example, Subject-Object-Verb (SOV) languages typically use postpositions. Explaining the source of such biases is a key goal of linguistics. We study word-order universals through a computational simulation with language models (LMs). Our experiments show that typologically-typical word orders tend to have lower perplexity estimated by LMs with cognitively plausible biases: syntactic biases, specific parsing strategies, and memory limitations. This suggests that the interplay of cognitive biases and predictability (perplexity) can explain many aspects of word-order universals. It also showcases the advantage of cognitively-motivated LMs, typically employed in cognitive modeling, in the simulation of language universals.

Submitted 7 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: Accepted by ACL 2024 main conference, 22 pages

arXiv:2211.05166

doi 10.1162/coli_a_00478

Grammatical Error Correction: A Survey of the State of the Art

Authors: Christopher Bryant, Zheng Yuan, Muhammad Reza Qorib, Hannan Cao, Hwee Tou Ng, Ted Briscoe

Abstract: Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject-verb agreement, but also orthographic and semantic errors, such as misspellings and word choice errors respectively. The field has seen significant progress in the last decade, motivated in part by a series of five shared tasks, which drove the development of rule-based methods, statistical classifiers, statistical machine translation, and finally neural machine translation systems which represent the current dominant state of the art. In this survey paper, we condense the field into a single article and first outline some of the linguistic challenges of the task, introduce the most popular datasets that are available to researchers (for both English and other languages), and summarise the various methods and techniques that have been developed with a particular focus on artificial error generation. We next describe the many different approaches to evaluation as well as concerns surrounding metric reliability, especially in relation to subjective human judgements, before concluding with an overview of recent progress and suggestions for future work and remaining challenges. We hope that this survey will serve as a comprehensive resource for researchers who are new to the field or who want to be kept apprised of recent developments.

Submitted 29 April, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

Journal ref: Computational Linguistics (2023) 49 (3): 643-701

arXiv:2011.06306

Analyzing Neural Discourse Coherence Models

Authors: Youmna Farag, Josef Valvoda, Helen Yannakoudakis, Ted Briscoe

Abstract: In this work, we systematically investigate how well current models of coherence can capture aspects of text implicated in discourse organisation. We devise two datasets of various linguistic alterations that undermine coherence and test model sensitivity to changes in syntax and semantics. We furthermore probe discourse embedding space and examine the knowledge that is encoded in representations of coherence. We hope this study shall provide further insight into how to frame the task and improve models of coherence assessment further. Finally, we make our datasets publicly available as a resource for researchers to use to test discourse coherence models.

Submitted 12 November, 2020; originally announced November 2020.

Journal ref: CODI workshop in EMNLP2020

arXiv:1906.07580

doi 10.18653/v1/W16-0502

Text Readability Assessment for Second Language Learners

Authors: Menglin Xia, Ekaterina Kochmar, Ted Briscoe

Abstract: This paper addresses the task of readability assessment for the texts aimed at second language (L2) learners. One of the major challenges in this task is the lack of significantly sized level-annotated data. For the present work, we collected a dataset of CEFR-graded texts tailored for learners of English as an L2 and investigated text readability assessment for both native and L2 learners. We applied a generalization method to adapt models trained on larger native corpora to estimate text readability for learners, and explored domain adaptation and self-learning techniques to make use of the native data to improve system performance on the limited L2 data. In our experiments, the best performing model for readability on learner texts achieves an accuracy of 0.797 and PCC of $0.938$.

Submitted 18 June, 2019; originally announced June 2019.

Comments: Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications

arXiv:1906.07555

Automatic learner summary assessment for reading comprehension

Authors: Menglin Xia, Ekaterina Kochmar, Ted Briscoe

Abstract: Automating the assessment of learner summaries provides a useful tool for assessing learner reading comprehension. We present a summarization task for evaluating non-native reading comprehension and propose three novel approaches to automatically assess the learner summaries. We evaluate our models on two datasets we created and show that our models outperform traditional approaches that rely on exact word match on this task. Our best model produces quality assessments close to professional examiners.

Submitted 18 June, 2019; originally announced June 2019.

Comments: NAACL2019

arXiv:1804.06898

Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input

Authors: Youmna Farag, Helen Yannakoudakis, Ted Briscoe

Abstract: We demonstrate that current state-of-the-art approaches to Automated Essay Scoring (AES) are not well-suited to capturing adversarially crafted input of grammatical but incoherent sequences of sentences. We develop a neural model of local coherence that can effectively learn connectedness features between sentences, and propose a framework for integrating and jointly training the local coherence model with a state-of-the-art AES model. We evaluate our approach against a number of baselines and experimentally demonstrate its effectiveness on both the AES task and the task of flagging adversarial input, further contributing to the development of an approach that strengthens the validity of neural essay scoring models.

Submitted 30 April, 2020; v1 submitted 18 April, 2018; originally announced April 2018.

Comments: 9, NAACL 2018

Journal ref: The 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2018)

arXiv:1711.10837

Curriculum Q-Learning for Visual Vocabulary Acquisition

Authors: Ahmed H. Zaidi, Russell Moore, Ted Briscoe

Abstract: The structure of curriculum plays a vital role in our learning process, both as children and adults. Presenting material in ascending order of difficulty that also exploits prior knowledge can have a significant impact on the rate of learning. However, the notion of difficulty and prior knowledge differs from person to person. Motivated by the need for a personalised curriculum, we present a novel method of curriculum learning for vocabulary words in the form of visual prompts. We employ a reinforcement learning model grounded in pedagogical theories that emulates the actions of a tutor. We simulate three students with different levels of vocabulary knowledge in order to evaluate how well our model adapts to the environment. The results of the simulation reveal that through interaction, the model is able to identify the areas of weakness, as well as push students to the edge of their ZPD. We hypothesise that these methods can also be effective in training agents to learn language representations in a simulated environment where it has previously been shown that order of words and prior knowledge play an important role in the efficacy of language learning.

Submitted 29 November, 2017; originally announced November 2017.

Comments: Accepted at Visually Grounded Interaction and Language Workshop (NIPS 2017)

arXiv:1707.06841

An Error-Oriented Approach to Word Embedding Pre-Training

Authors: Youmna Farag, Marek Rei, Ted Briscoe

Abstract: We propose a novel word embedding pre-training approach that exploits writing errors in learners' scripts. We compare our method to previous models that tune the embeddings based on script scores and the discrimination between correct and corrupt word contexts in addition to the generic commonly-used embeddings pre-trained on large corpora. The comparison is achieved by using the aforementioned models to bootstrap a neural network that learns to predict a holistic score for scripts. Furthermore, we investigate augmenting our model with error corrections and monitor the impact on performance. Our results show that our error-oriented approach outperforms other comparable ones which is further demonstrated when training on more data. Additionally, extending the model with corrections provides further performance gains when data sparsity is an issue.

Submitted 21 July, 2017; originally announced July 2017.

Comments: 10 pages, 2 figures, 4 tables, BEA 2017

Journal ref: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

arXiv:1707.05236

Artificial Error Generation with Machine Translation and Syntactic Patterns

Authors: Marek Rei, Mariano Felice, Zheng Yuan, Ted Briscoe

Abstract: Shortage of available training data is holding back progress in the area of automated error detection. This paper investigates two alternative methods for artificially generating writing errors, in order to create additional resources. We propose treating error generation as a machine translation task, where grammatically correct text is translated to contain errors. In addition, we explore a system for extracting textual patterns from an annotated corpus, which can then be used to insert errors into grammatically correct sentences. Our experiments show that the inclusion of artificially generated errors significantly improves error detection accuracy on both FCE and CoNLL 2014 datasets.

Submitted 17 July, 2017; originally announced July 2017.

Comments: The 12th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2017)

ACM Class: I.2.7; I.2.6; I.5.1

arXiv:cs/9907013

Corpus Annotation for Parser Evaluation

Authors: John Carroll, Guido Minnen, Ted Briscoe

Abstract: We describe a recently developed corpus annotation scheme for evaluating parsers that avoids shortcomings of current methods. The scheme encodes grammatical relations between heads and dependents, and has been used to mark up a new public-domain corpus of naturally occurring English text. We show how the corpus can be used to evaluate the accuracy of a robust parser, and relate the corpus to extant resources.

Submitted 8 July, 1999; originally announced July 1999.

Comments: 7 pages, LaTeX (uses eaclap.sty)

ACM Class: I.2.7

Journal ref: Proceedings of the EACL99 workshop on Linguistically Interpreted Corpora (LINC), Bergen, Norway, June 12

arXiv:cmp-lg/9806013

Can Subcategorisation Probabilities Help a Statistical Parser?

Authors: John Carroll, Guido Minnen, Ted Briscoe

Abstract: Research into the automatic acquisition of lexical information from corpora is starting to produce large-scale computational lexicons containing data on the relative frequencies of subcategorisation alternatives for individual verbal predicates. However, the empirical question of whether this type of frequency information can in practice improve the accuracy of a statistical parser has not yet been answered. In this paper we describe an experiment with a wide-coverage statistical grammar and parser for English and subcategorisation frequencies acquired from ten million words of text which shows that this information can significantly improve parse accuracy.

Submitted 21 June, 1998; originally announced June 1998.

Comments: 9 pages, uses colacl.sty

Journal ref: 6th Workshop on Very Large Corpora, Montreal, Canada, 1998

arXiv:cmp-lg/9705001

Co-evolution of Language and of the Language Acquisition Device

Authors: Ted Briscoe

Abstract: A new account of parameter setting during grammatical acquisition is presented in terms of Generalized Categorial Grammar embedded in a default inheritance hierarchy, providing a natural partial ordering on the setting of parameters. Experiments show that several experimentally effective learners can be defined in this framework. Evolutionary simulations suggest that a learner with default initial settings for parameters will emerge, provided that learning is memory limited and the environment of linguistic adaptation contains an appropriate language.

Submitted 1 May, 1997; originally announced May 1997.

Comments: 10 pages, latex, 2 postscript figures, uses aclap.sty and graphics.sty, to appear ACL-EACL97

arXiv:cmp-lg/9702002

Automatic Extraction of Subcategorization from Corpora

Authors: Ted Briscoe, John Carroll

Abstract: We describe a novel technique and implemented system for constructing a subcategorization dictionary from textual corpora. Each dictionary entry encodes the relative frequency of occurrence of a comprehensive set of subcategorization classes for English. An initial experiment, on a sample of 14 verbs which exhibit multiple complementation patterns, demonstrates that the technique achieves accuracy comparable to previous approaches, which are all limited to a highly restricted set of subcategorization classes. We also demonstrate that a subcategorization dictionary built with the system improves the accuracy of a parser by an appreciable amount.

Submitted 4 February, 1997; originally announced February 1997.

Comments: 8 pages; requires aclap.sty. To appear in ANLP-97

arXiv:cmp-lg/9604004

Apportioning Development Effort in a Probabilistic LR Parsing System through Evaluation

Authors: John Carroll, Ted Briscoe

Abstract: We describe an implemented system for robust domain-independent syntactic parsing of English, using a unification-based grammar of part-of-speech and punctuation labels coupled with a probabilistic LR parser. We present evaluations of the system's performance along several different dimensions; these enable us to assess the contribution that each individual part is making to the success of the system as a whole, and thus prioritise the effort to be devoted to its further enhancement. Currently, the system is able to parse around 80% of sentences in a substantial corpus of general text containing a number of distinct genres. On a random sample of 250 such sentences the system has a mean crossing bracket rate of 0.71 and recall and precision of 83% and 84% respectively when evaluated against manually-disambiguated analyses.

Submitted 12 April, 1996; originally announced April 1996.

Comments: 10 pages, 1 Postscript figure. To Appear in Proceedings of the Conference on Empirical Methods in Natural Language Processing, University of Pennsylvania, May 1996

Journal ref: Conference on Empirical Methods in Natural Language Processing (EMNLP-96), 92-100

arXiv:cmp-lg/9510005

Developing and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels

Authors: Ted Briscoe, John Carroll

Abstract: We describe an approach to robust domain-independent syntactic parsing of unrestricted naturally-occurring (English) input. The technique involves parsing sequences of part-of-speech and punctuation labels using a unification-based grammar coupled with a probabilistic LR parser. We describe the coverage of several corpora using this grammar and report the results of a parsing experiment using probabilities derived from bracketed training data. We report the first substantial experiments to assess the contribution of punctuation to deriving an accurate syntactic analysis, by parsing identical texts both with and without naturally-occurring punctuation marks.

Submitted 9 October, 1995; originally announced October 1995.

Comments: 11 pages, standard LaTeX

Journal ref: 4th International Workshop on Parsing Technologies (IWPT-95), 48-58

Showing 1–15 of 15 results for author: Briscoe, T