Showing 1–1 of 1 results for author: Sanian, M V

  1. arXiv:2404.06644  [pdf, other]

    cs.CL cs.AI

    Khayyam Challenge (PersianMMLU): Is Your LLM Truly Wise to The Persian Language?

    Authors: Omid Ghahroodi, Marzia Nouri, Mohammad Vali Sanian, Alireza Sahebi, Doratossadat Dastgheib, Ehsaneddin Asgari, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban

    Abstract: Evaluating Large Language Models (LLMs) is challenging due to their generative nature, necessitating precise evaluation methodologies. Additionally, non-English LLM evaluation lags behind English, resulting in the absence or weakness of LLMs for many languages. In response to this necessity, we introduce Khayyam Challenge (also known as PersianMMLU), a meticulously curated collection comprising 20…

    Submitted 9 April, 2024; originally announced April 2024.