Showing 1–2 of 2 results for author: Haron, H

  1. arXiv:2404.00565  [pdf, other]

    cs.CL

    Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition

    Authors: Saied Alshahrani, Hesham Haroon, Ali Elfilali, Mariama Njie, Jeanna Matthews

    Abstract: Wikipedia articles (content pages) are commonly used corpora in Natural Language Processing (NLP) research, especially in low-resource languages other than English. Yet, a few research studies have studied the three Arabic Wikipedia editions, Arabic Wikipedia (AR), Egyptian Arabic Wikipedia (ARZ), and Moroccan Arabic Wikipedia (ARY), and documented issues in the Egyptian Arabic Wikipedia edition r…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: This paper has been accepted at LREC-COLING 2024: The 6th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT6)

  2. arXiv:1402.6764  [pdf]

    cs.SE cs.CL

    A method to identify potential ambiguous Malay words through Ambiguity Attributes mapping: An exploratory Study

    Authors: Hazlina Haron, Abdul Azim Abd. Ghani

    Abstract: We describe here a methodology to identify a list of ambiguous Malay words that are commonly being used in Malay documentations such as Requirement Specification. We compiled several relevant and appropriate requirement quality attributes and sentence rules from previous literatures and adopt it to come out with a set of ambiguity attributes that most suit Malay words. The extracted Malay ambiguou…

    Submitted 26 February, 2014; originally announced February 2014.

    Comments: Paper was presented at The Fourth International Conference of Computer Science and Information Technology (CCSIT2014) in Sydney, Australia on Feb 22, 2014