Skip to main content

Showing 1–3 of 3 results for author: Almohaimeed, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18725  [pdf, other

    cs.LG cs.CL

    Jailbreaking LLMs with Arabic Transliteration and Arabizi

    Authors: Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou

    Abstract: This study identifies the potential vulnerabilities of Large Language Models (LLMs) to 'jailbreak' attacks, specifically focusing on the Arabic language and its various forms. While most research has concentrated on English-based prompt manipulation, our investigation broadens the scope to investigate the Arabic language. We initially tested the AdvBench benchmark in Standardized Arabic, finding t… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures

  2. Ar-Spider: Text-to-SQL in Arabic

    Authors: Saleh Almohaimeed, Saad Almohaimeed, Mansour Al Ghanim, Liqiang Wang

    Abstract: In Natural Language Processing (NLP), one of the most important tasks is text-to-SQL semantic parsing, which focuses on enabling users to interact with the database in a more natural manner. In recent years, text-to-SQL has made significant progress, but most were English-centric. In this paper, we introduce Ar-Spider 1, the first Arabic cross-domain text-to-SQL dataset. Due to the unique nature o… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: ACM SAC Conference (SAC 24)

  3. arXiv:2311.06446  [pdf, ps, other

    cs.CL cs.AI

    THOS: A Benchmark Dataset for Targeted Hate and Offensive Speech

    Authors: Saad Almohaimeed, Saleh Almohaimeed, Ashfaq Ali Shafin, Bogdan Carbunar, Ladislau Bölöni

    Abstract: Detecting harmful content on social media, such as Twitter, is made difficult by the fact that the seemingly simple yes/no classification conceals a significant amount of complexity. Unfortunately, while several datasets have been collected for training classifiers in hate and offensive speech, there is a scarcity of datasets labeled with a finer granularity of target classes and specific targets.… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.