AVA: an Automatic eValuation Approach to Question Answering Systems

Vu, Thuy; Moschitti, Alessandro

Computer Science > Computation and Language

arXiv:2005.00705 (cs)

[Submitted on 2 May 2020]

Title:AVA: an Automatic eValuation Approach to Question Answering Systems

Authors:Thuy Vu, Alessandro Moschitti

View PDF

Abstract:We introduce AVA, an automatic evaluation approach for Question Answering, which given a set of questions associated with Gold Standard answers, can estimate system Accuracy. AVA uses Transformer-based language models to encode question, answer, and reference text. This allows for effectively measuring the similarity between the reference and an automatic answer, biased towards the question semantics. To design, train and test AVA, we built multiple large training, development, and test sets on both public and industrial benchmarks. Our innovative solutions achieve up to 74.7% in F1 score in predicting human judgement for single answers. Additionally, AVA can be used to evaluate the overall system Accuracy with an RMSE, ranging from 0.02 to 0.09, depending on the availability of multiple references.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2005.00705 [cs.CL]
	(or arXiv:2005.00705v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2005.00705
Journal reference:	NAACL 2021

Submission history

From: Thuy Vu [view email]
[v1] Sat, 2 May 2020 05:00:16 UTC (363 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-05

Change to browse by:

cs
cs.AI
cs.IR
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Thuy Vu
Alessandro Moschitti

export BibTeX citation

Computer Science > Computation and Language

Title:AVA: an Automatic eValuation Approach to Question Answering Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:AVA: an Automatic eValuation Approach to Question Answering Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators