Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach

Du, Wenyu; Lin, Zhouhan; Shen, Yikang; O'Donnell, Timothy J.; Bengio, Yoshua; Zhang, Yue

arXiv:2005.05864 (cs)

[Submitted on 12 May 2020]

Abstract: It is commonly believed that knowledge of syntactic structure should improve language modeling. However, effectively and computationally efficiently incorporating syntactic structure into neural language models has been a challenging topic. In this paper, we make use of a multi-task objective, i.e., the models simultaneously predict words as well as ground truth parse trees in a form called "syntactic distances", where information between these two separate objectives shares the same intermediate representation. Experimental results on the Penn Treebank and Chinese Treebank datasets show that when ground truth parse trees are provided as additional training signals, the model is able to achieve lower perplexity and induce trees with better quality.
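
As an illustration of the "syntactic distances" form mentioned in the abstract, the following is a minimal sketch under one common formulation, which is an assumption here and not necessarily the exact variant used in the paper: for a binary parse tree, the distance between two adjacent words is taken to be the height of their lowest common ancestor. The function names `tree_to_distances` and `distances_to_tree` and the nested-tuple tree encoding are hypothetical choices made for this example.

```python
from typing import List, Tuple, Union

# A binary tree is either a word (leaf) or a pair of subtrees.
# This encoding is an illustrative assumption, not the paper's data format.
Tree = Union[str, Tuple["Tree", "Tree"]]


def tree_to_distances(tree: Tree) -> Tuple[List[str], List[int], int]:
    """Return (words, distances, height) for a binary tree.

    distances[i] is the height of the lowest common ancestor of
    words[i] and words[i + 1], so there are len(words) - 1 entries.
    """
    if isinstance(tree, str):
        return [tree], [], 0
    left, right = tree
    l_words, l_dist, l_height = tree_to_distances(left)
    r_words, r_dist, r_height = tree_to_distances(right)
    height = max(l_height, r_height) + 1
    # The boundary between the two subtrees gets this node's height.
    return l_words + r_words, l_dist + [height] + r_dist, height


def distances_to_tree(words: List[str], dists: List[int]) -> Tree:
    """Rebuild a binary tree by recursively splitting at the largest distance."""
    if len(words) == 1:
        return words[0]
    split = max(range(len(dists)), key=lambda i: dists[i])
    left = distances_to_tree(words[: split + 1], dists[:split])
    right = distances_to_tree(words[split + 1 :], dists[split + 1 :])
    return (left, right)


example_tree: Tree = (("the", "cat"), ("sat", ("on", "mats")))
example_words, example_dists, _ = tree_to_distances(example_tree)
rebuilt_tree = distances_to_tree(example_words, example_dists)
```

In this sketch the sequence of distances carries the same information as the binary tree, which is what allows a tree to be expressed as a per-position prediction target alongside next-word prediction. How the two objectives are weighted and trained is not specified in the abstract and is not shown here.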

Comments:	ACL20
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2005.05864 [cs.CL]
	(or arXiv:2005.05864v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.05864

Submission history

From: Wenyu Du
[v1] Tue, 12 May 2020 15:35:00 UTC (490 KB)
