Skip to main content

Showing 1–11 of 11 results for author: Hee, M S

.
  1. arXiv:2405.01842  [pdf, ps, other

    cs.CL

    SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore

    Authors: Ri Chi Ng, Nirmalendu Prakash, Ming Shan Hee, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models for translation and paraphrasing into Singapore's main languages, and refining these with native ann… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  2. arXiv:2401.16727  [pdf, other

    cs.CL

    Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models

    Authors: Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Tanmoy Chakraborty, Roy Ka-Wei Lee

    Abstract: In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content. This comprehensive survey delves into the recent strides in HS moderation, spotlighting the burgeoning role of large language models (LLMs) and large multimodal models (LMMs). Our exploration begins with a thorough analysis of curre… ▽ More

    Submitted 1 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint; Under-Review

  3. arXiv:2312.09693  [pdf, other

    cs.AI

    Prompting Large Language Models for Topic Modeling

    Authors: Han Wang, Nirmalendu Prakash, Nguyen Khoi Hoang, Ming Shan Hee, Usman Naseem, Roy Ka-Wei Lee

    Abstract: Topic modeling is a widely used technique for revealing underlying thematic structures within textual data. However, existing models have certain limitations, particularly when dealing with short text datasets that lack co-occurring words. Moreover, these models often neglect sentence-level semantics, focusing primarily on token-level semantics. In this paper, we propose PromptTopic, a novel topic… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figures, IEEE International Conference on Big Data

    ACM Class: I.2.7

  4. arXiv:2312.06094  [pdf, other

    cs.CL cs.CV cs.MM

    MATK: The Meme Analytical Tool Kit

    Authors: Ming Shan Hee, Aditi Kumaresan, Nguyen Khoi Hoang, Nirmalendu Prakash, Rui Cao, Roy Ka-Wei Lee

    Abstract: The rise of social media platforms has brought about a new digital culture called memes. Memes, which combine visuals and text, can strongly influence public opinions on social and cultural issues. As a result, people have become interested in categorizing memes, leading to the development of various datasets and multimodal models that show promising results in this field. However, there is curren… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted at ACM Multimedia'23 Open-Source Software Competition Track

    ACM Class: I.1.4

  5. arXiv:2312.06093  [pdf, other

    cs.CL cs.CV cs.MM

    PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models

    Authors: Nirmalendu Prakash, Han Wang, Nguyen Khoi Hoang, Ming Shan Hee, Roy Ka-Wei Lee

    Abstract: The proliferation of social media has given rise to a new form of communication: memes. Memes are multimodal and often contain a combination of text and visual elements that convey meaning, humor, and cultural significance. While meme analysis has been an active area of research, little work has been done on unsupervised multimodal topic modeling of memes, which is important for content moderation… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted at ACM Multimedia'23 Research Track

    ACM Class: I.1.4; I.1.7

  6. arXiv:2308.08088  [pdf, other

    cs.CV cs.IR cs.MM

    Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection

    Authors: Rui Cao, Ming Shan Hee, Adriel Kuek, Wen-Haw Chong, Roy Ka-Wei Lee, **g Jiang

    Abstract: Hateful meme detection is a challenging multimodal task that requires comprehension of both vision and language, as well as cross-modal interactions. Recent studies have tried to fine-tune pre-trained vision-language models (PVLMs) for this task. However, with increasing model sizes, it becomes important to leverage powerful PVLMs more efficiently, rather than simply fine-tuning them. Recently, re… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: Camera-ready for 23, ACM MM

  7. arXiv:2305.17911  [pdf, other

    cs.SI cs.AI cs.CL cs.CV

    TotalDefMeme: A Multi-Attribute Meme dataset on Total Defence in Singapore

    Authors: Nirmalendu Prakash, Ming Shan Hee, Roy Ka-Wei Lee

    Abstract: Total Defence is a defence policy combining and extending the concept of military defence and civil defence. While several countries have adopted total defence as their defence policy, very few studies have investigated its effectiveness. With the rapid proliferation of social media and digitalisation, many social studies have been focused on investigating policy effectiveness through specially cu… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 6 pages. Accepted at ACM MMSys 2023

    ACM Class: I.2.7

  8. arXiv:2305.17680  [pdf, other

    cs.CL cs.AI

    Evaluating GPT-3 Generated Explanations for Hateful Content Moderation

    Authors: Han Wang, Ming Shan Hee, Md Rabiul Awal, Kenny Tsu Wei Choo, Roy Ka-Wei Lee

    Abstract: Recent research has focused on using large language models (LLMs) to generate explanations for hate speech through fine-tuning or prompting. Despite the growing interest in this area, these generated explanations' effectiveness and potential limitations remain poorly understood. A key concern is that these explanations, generated by LLMs, may lead to erroneous judgments about the nature of flagged… ▽ More

    Submitted 30 August, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 figures, Accepted by International Joint Conference on Artificial Intelligence(IJCAI)

    ACM Class: I.2.7

  9. arXiv:2305.17678  [pdf, other

    cs.CL cs.AI cs.CV

    Decoding the Underlying Meaning of Multimodal Hateful Memes

    Authors: Ming Shan Hee, Wen-Haw Chong, Roy Ka-Wei Lee

    Abstract: Recent studies have proposed models that yielded promising performance for the hateful meme classification task. Nevertheless, these proposed models do not generate interpretable explanations that uncover the underlying meaning and support the classification output. A major reason for the lack of explainable hateful meme methods is the absence of a hateful meme dataset that contains ground truth e… ▽ More

    Submitted 19 June, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: 9 pages. Accepted by IJCAI 2023

    ACM Class: I.2.7; I.2.10

  10. arXiv:2204.01734  [pdf, other

    cs.CV cs.SI

    On Explaining Multimodal Hateful Meme Detection Models

    Authors: Ming Shan Hee, Roy Ka-Wei Lee, Wen-Haw Chong

    Abstract: Hateful meme detection is a new multimodal task that has gained significant traction in academic and industry research communities. Recently, researchers have applied pre-trained visual-linguistic models to perform the multimodal classification task, and some of these solutions have yielded promising results. However, what these visual-linguistic models learn for the hateful meme classification ta… ▽ More

    Submitted 6 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  11. arXiv:1902.08737  [pdf, other

    cs.SI

    Linky: Visualizing User Identity Linkage Results For Multiple Online Social Networks

    Authors: Roy Ka-Wei Lee, Ming Shan Hee, Philips Kokoh Prasetyo, Ee-Peng Lim

    Abstract: User identity linkage across online social networks is an emerging research topic that has attracted attention in recent years. Many user identity linkage methods have been proposed so far and most of them utilize user profile, content and network information to determine if two social media accounts belong to the same person. In most cases, user identity linkage methods are evaluated by performin… ▽ More

    Submitted 23 February, 2019; originally announced February 2019.

    Comments: 2018 IEEE International Conference on Data Mining Workshops (ICDMW)