Investigating representations of verb bias in neural language models

Hawkins, Robert D.; Yamakoshi, Takateru; Griffiths, Thomas L.; Goldberg, Adele E.

arXiv:2010.02375 (cs)

[Submitted on 5 Oct 2020 (v1), last revised 15 Oct 2020 (this version, v2)]

Abstract: Languages typically provide more than one grammatical construction to express certain types of messages. A speaker's choice of construction is known to depend on multiple factors, including the choice of main verb, a phenomenon known as verb bias. Here we introduce DAIS, a large benchmark dataset containing 50K human judgments for 5K distinct sentence pairs in the English dative alternation. This dataset includes 200 unique verbs and systematically varies the definiteness and length of arguments. We use this dataset, as well as an existing corpus of naturally occurring data, to evaluate how well recent neural language models capture human preferences. Results show that larger models perform better than smaller models, and that transformer architectures (e.g., GPT-2) tend to outperform recurrent architectures (e.g., LSTMs) even under comparable parameter and training settings. Additional analyses of internal feature representations suggest that transformers may better integrate specific lexical information with grammatical constructions.
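
The evaluation described in the abstract can be made concrete with a rough sketch. This is an illustration only, not the authors' code. It assumes that a model's preference for a sentence pair is summarized as the difference in log-probability between the double-object variant and the prepositional variant, and that agreement with human judgments is measured by Pearson correlation. The abstract does not state which measures the paper uses, and the function names below are hypothetical.

```python
import math
from typing import Sequence


def preference_score(logp_double_object: float, logp_prepositional: float) -> float:
    """Hypothetical preference measure (an assumption, not taken from the paper).

    Positive values mean the model assigns more probability to the
    double-object form ("gave the girl the book") than to the
    prepositional form ("gave the book to the girl").
    """
    return logp_double_object - logp_prepositional


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)
```

Under these assumptions, one would compute `preference_score` for every sentence pair from a model's sentence log-probabilities and then pass the resulting scores, together with the matching human ratings, to `pearson`. For instance, `preference_score(-10.0, -12.0)` gives `2.0`, a preference for the double-object form.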

Comments:	Accepted to EMNLP
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2010.02375 [cs.CL]
	(or arXiv:2010.02375v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.02375

Submission history

From: Robert Hawkins
[v1] Mon, 5 Oct 2020 22:39:08 UTC (1,996 KB)
[v2] Thu, 15 Oct 2020 19:37:48 UTC (1,999 KB)
