Will-They-Won't-They: A Very Large Dataset for Stance Detection on Twitter
Authors:
Costanza Conforti,
Jakob Berndt,
Mohammad Taher Pilehvar,
Chryssi Giannitsarou,
Flavio Toxvaerd,
Nigel Collier
Abstract:
We present a new challenging stance detection dataset, called Will-They-Won't-They (WT-WT), which contains 51,284 tweets in English, making it by far the largest available dataset of the type. All the annotations are carried out by experts; therefore, the dataset constitutes a high-quality and reliable benchmark for future research in stance detection. Our experiments with a wide range of recent s…
▽ More
We present a new challenging stance detection dataset, called Will-They-Won't-They (WT-WT), which contains 51,284 tweets in English, making it by far the largest available dataset of the type. All the annotations are carried out by experts; therefore, the dataset constitutes a high-quality and reliable benchmark for future research in stance detection. Our experiments with a wide range of recent state-of-the-art stance detection systems show that the dataset poses a strong challenge to existing models in this domain.
△ Less
Submitted 1 May, 2020;
originally announced May 2020.