Skip to main content

Showing 1–2 of 2 results for author: Best, M L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.02876  [pdf, other

    cs.CL cs.CY

    Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation

    Authors: Aman Khullar, Daniel Nkemelu, Cuong V. Nguyen, Michael L. Best

    Abstract: A growing body of work has focused on text classification methods for detecting the increasing amount of hate speech posted online. This progress has been limited to only a select number of highly-resourced languages causing detection systems to either under-perform or not exist in limited data contexts. This is majorly caused by a lack of training data which is expensive to collect and curate in… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted at ACM Journal on Computing and Sustainable Societies

  2. arXiv:2303.16828  [pdf, other

    cs.CY

    Tackling Hate Speech in Low-resource Languages with Context Experts

    Authors: Daniel Nkemelu, Harshil Shah, Irfan Essa, Michael L. Best

    Abstract: Given Myanmars historical and socio-political context, hate speech spread on social media has escalated into offline unrest and violence. This paper presents findings from our remote study on the automatic detection of hate speech online in Myanmar. We argue that effectively addressing this problem will require community-based approaches that combine the knowledge of context experts with machine l… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: ICTD 2022 Conference paper