Investigating Pretrained Language Models for Graph-to-Text Generation

Ribeiro, Leonardo F. R.; Schmitt, Martin; Schütze, Hinrich; Gurevych, Iryna

Computer Science > Computation and Language

arXiv:2007.08426v1 (cs)

[Submitted on 16 Jul 2020 (this version), latest version 27 Sep 2021 (v3)]

Title:Investigating Pretrained Language Models for Graph-to-Text Generation

Authors:Leonardo F. R. Ribeiro, Martin Schmitt, Hinrich Schütze, Iryna Gurevych

View PDF

Abstract:Graph-to-text generation, a subtask of data-to-text generation, aims to generate fluent texts from graph-based data. Many graph-to-text models have shown strong performance in this task employing specialized graph encoders. However, recent approaches employ large pretrained language models (PLMs) achieving state-of-the-art results in data-to-text generation. In this paper, we aim to investigate the impact of large PLMs in graph-to-text generation. We present a study across three graph domains: meaning representations, Wikipedia knowledge graphs (KGs) and scientific KGs. Our analysis shows that PLMs such as BART and T5 achieve state-of-the-art results in graph-to-text benchmarks without explicitly encoding the graph structure. We also demonstrate that task-adaptive pretraining strategies are beneficial to the target task, improving even further the state of the art in two benchmarks for graph-to-text generation. In a final analysis, we investigate possible reasons for the PLMs' success on graph-to-text tasks. We find evidence that their knowledge about the world gives them a big advantage, especially when generating texts from KGs.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2007.08426 [cs.CL]
	(or arXiv:2007.08426v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2007.08426

Submission history

From: Leonardo F. R. Ribeiro [view email]
[v1] Thu, 16 Jul 2020 16:05:34 UTC (128 KB)
[v2] Wed, 23 Dec 2020 16:37:44 UTC (7,301 KB)
[v3] Mon, 27 Sep 2021 13:50:11 UTC (5,539 KB)

Computer Science > Computation and Language

Title:Investigating Pretrained Language Models for Graph-to-Text Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Investigating Pretrained Language Models for Graph-to-Text Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators