Transductive Learning for Abstractive News Summarization

Bražinskas, Arthur; Liu, Mengwen; Nallapati, Ramesh; Ravi, Sujith; Dreyer, Markus

arXiv:2104.09500 (cs)

[Submitted on 17 Apr 2021 (v1), last revised 16 Apr 2022 (this version, v2)]

Abstract: Pre-trained and fine-tuned news summarizers are expected to generalize to news articles unseen in the fine-tuning (training) phase. However, these articles often contain specifics, such as new events and people, that a summarizer could not learn about in training. This applies to scenarios such as a news publisher training a summarizer on dated news and then summarizing incoming recent news. In this work, we explore the first application of transductive learning to summarization, where we further fine-tune models on test set inputs. Specifically, we construct pseudo summaries from salient article sentences and use randomly masked articles as input. Moreover, this approach is also beneficial in the fine-tuning phase, where we jointly predict extractive pseudo references and abstractive gold summaries in the training set. We show that our approach yields state-of-the-art results on the CNN/DM and NYT datasets, improving ROUGE-L by 1.05 and 0.74, respectively. Importantly, our approach does not require any changes to the original architecture. Moreover, we show the benefits of transduction from dated to more recent CNN news. Finally, through human and automatic evaluation, we demonstrate improvements in summary abstractiveness and coherence.
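The pseudo-summary construction described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the salience score (word overlap with the rest of the article), the naive sentence splitter, the masking rate, the `<mask>` token, and the names `split_sentences`, `salience`, and `build_example` are all assumptions made for illustration. The sketch only shows the general shape of the idea, which is to pair a randomly masked article (input) with a few salient sentences taken from that article (target).

```python
import random
import re
from collections import Counter


def split_sentences(article: str) -> list[str]:
    # Naive splitter on sentence-final punctuation (assumption);
    # a real pipeline would use a proper sentence tokenizer.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", article) if s.strip()]


def salience(sentence: str, others: list[str]) -> float:
    # Hypothetical salience score: the fraction of the sentence's words
    # that also occur in the other sentences of the article.
    words = re.findall(r"\w+", sentence.lower())
    if not words:
        return 0.0
    context = Counter(w for o in others for w in re.findall(r"\w+", o.lower()))
    return sum(1 for w in words if context[w] > 0) / len(words)


def build_example(article: str, k: int = 2, mask_prob: float = 0.15,
                  mask_token: str = "<mask>", seed: int = 0) -> tuple[str, str]:
    """Return (masked_article, pseudo_summary) for one unlabeled article."""
    rng = random.Random(seed)
    sents = split_sentences(article)
    scores = [salience(s, sents[:i] + sents[i + 1:]) for i, s in enumerate(sents)]
    # Pick the k highest-scoring sentences, then restore document order.
    top = sorted(range(len(sents)), key=lambda i: (-scores[i], i))[:k]
    pseudo_summary = " ".join(sents[i] for i in sorted(top))
    # Replace each whitespace-separated token with the mask token
    # independently with probability mask_prob.
    masked = [mask_token if rng.random() < mask_prob else tok
              for tok in article.split()]
    return " ".join(masked), pseudo_summary
```

Under these assumptions, each resulting pair could be fed to an existing sequence-to-sequence summarizer as an ordinary (input, target) fine-tuning example, which is consistent with the abstract's statement that no architectural changes are needed.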

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2104.09500 [cs.CL]
	(or arXiv:2104.09500v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2104.09500

Submission history

From: Arthur Bražinskas
[v1] Sat, 17 Apr 2021 17:33:12 UTC (356 KB)
[v2] Sat, 16 Apr 2022 20:23:12 UTC (1,331 KB)
