WYWEB: A NLP Evaluation Benchmark For Classical Chinese

Zhou, Bo; Chen, Qianglong; Wang, Tianyu; Zhong, Xiaomi; Zhang, Yin

arXiv:2305.14150 (cs)

[Submitted on 23 May 2023]

Abstract: To fully evaluate the overall performance of different NLP models in a given domain, many evaluation benchmarks have been proposed, such as GLUE, SuperGLUE, and CLUE. The field of natural language understanding has traditionally focused on benchmarks for various tasks in languages such as Chinese and English, as well as in multilingual settings. However, little attention has been given to classical Chinese, also known as "wen yan wen", which has a rich history spanning thousands of years and holds significant cultural and academic value. For the prosperity of the NLP community, in this paper we introduce the WYWEB evaluation benchmark, which consists of nine NLP tasks in classical Chinese, covering sentence classification, sequence labeling, reading comprehension, and machine translation. We evaluate existing pre-trained language models, all of which struggle with this benchmark. We also introduce a number of supplementary datasets and additional tools to help facilitate further progress on classical Chinese NLU. The GitHub repository is this https URL.

Comments:	Accepted by ACL 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Report number:	2023.findings-acl.204
Cite as:	arXiv:2305.14150 [cs.CL]
	(or arXiv:2305.14150v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.14150
Journal reference:	https://aclanthology.org/2023.findings-acl.204

Submission history

From: Bo Zhou
[v1] Tue, 23 May 2023 15:15:11 UTC (7,408 KB)
