Skip to main content

Showing 1–2 of 2 results for author: Myrzakhan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07545  [pdf, other

    cs.CL cs.AI

    Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena

    Authors: Aidar Myrzakhan, Sondos Mahmoud Bsharat, Zhiqiang Shen

    Abstract: Multiple-choice questions (MCQ) are frequently used to assess large language models (LLMs). Typically, an LLM is given a question and selects the answer deemed most probable after adjustments for factors like length. Unfortunately, LLMs may inherently favor certain answer choice IDs, such as A/B/C/D, due to inherent biases of priori unbalanced probabilities, influencing the prediction of answers b… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Code and dataset are available at https://github.com/VILA-Lab/Open-LLM-Leaderboard

  2. arXiv:2312.16171  [pdf, other

    cs.CL cs.AI

    Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4

    Authors: Sondos Mahmoud Bsharat, Aidar Myrzakhan, Zhiqiang Shen

    Abstract: This paper introduces 26 guiding principles designed to streamline the process of querying and prompting large language models. Our goal is to simplify the underlying concepts of formulating questions for various scales of large language models, examining their abilities, and enhancing user comprehension on the behaviors of different scales of large language models when feeding into different prom… ▽ More

    Submitted 18 January, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Github at: https://github.com/VILA-Lab/ATLAS