-
Structure-Tags Improve Text Classification for Scholarly Document Quality Prediction
Abstract: Training recurrent neural networks on long texts, in particular scholarly documents, causes problems for learning. While hierarchical attention networks (HANs) are effective in solving these problems, they still lose important information about the structure of the text. To tackle these problems, we propose the use of HANs combined with structure-tags which mark the role of sentences in the docume… ▽ More
Submitted 17 December, 2020; v1 submitted 30 April, 2020; originally announced May 2020.
Comments: This new version of the paper brings the paper up-to-date with the improved paper, published at the First Workshop on Scholarly Document Processing, at EMNLP 2020. .Additionally, minor corrections were made including addition of color to Figures 1,2. The changes in comparison to the first arXiv version are substantial, including various additional results, and substantial improvements to the text
ACM Class: I.2.7
Journal ref: Proceedings of the First Workshop on Scholarly Document Processing. Association for Computational Linguistics. (2020) 158-167. EMNLP|SDP 2020 https://www.aclweb.org/anthology/2020.sdp-1.18