USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation

Mehri, Shikib; Eskenazi, Maxine

arXiv:2005.00456 (cs)

[Submitted on 1 May 2020]

Abstract: The lack of meaningful automatic evaluation metrics for dialog has impeded open-domain dialog research. Standard language generation metrics have been shown to be ineffective for evaluating dialog models. To this end, this paper presents USR, an UnSupervised and Reference-free evaluation metric for dialog. USR is a reference-free metric that trains unsupervised models to measure several desirable qualities of dialog. USR is shown to strongly correlate with human judgment on both Topical-Chat (turn-level: 0.42, system-level: 1.0) and PersonaChat (turn-level: 0.48, system-level: 1.0). USR additionally produces interpretable measures for several desirable properties of dialog.
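
The abstract reports correlations at two granularities, turn-level and system-level. The sketch below illustrates one common way such numbers can be computed. It is not the authors' code. The abstract does not name the correlation coefficient, so Spearman rank correlation is assumed here, and the toy scores, system names, and function names are all hypothetical.

```python
from statistics import mean


def rank(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(rank(x), rank(y))


# Hypothetical toy data: (system name, metric score, human rating) per response.
turns = [
    ("A", 0.2, 2), ("A", 0.4, 1), ("A", 0.3, 3),
    ("B", 0.5, 3), ("B", 0.7, 4), ("B", 0.6, 2),
    ("C", 0.9, 5), ("C", 0.8, 4), ("C", 0.7, 5),
]


def turn_level(turns):
    """Correlate metric and human scores over individual responses."""
    return spearman([m for _, m, _ in turns], [h for _, _, h in turns])


def system_level(turns):
    """Average scores per system first, then correlate the per-system means."""
    systems = sorted({s for s, _, _ in turns})
    metric_means = [mean(m for s, m, _ in turns if s == name) for name in systems]
    human_means = [mean(h for s, _, h in turns if s == name) for name in systems]
    return spearman(metric_means, human_means)


if __name__ == "__main__":
    print("turn-level:", round(turn_level(turns), 3))
    print("system-level:", round(system_level(turns), 3))
```

Averaging per system smooths out per-response noise and leaves only a handful of points to rank, which is one reason a metric can rank systems perfectly at system level while showing a much lower turn-level correlation.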

Comments:	Accepted to ACL 2020 as long paper
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2005.00456 [cs.CL]
	(or arXiv:2005.00456v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.00456

Submission history

From: Shikib Mehri
[v1] Fri, 1 May 2020 15:50:50 UTC (251 KB)
