Showing 1–6 of 6 results for author: Alzahrani, N

  1. arXiv:2402.01781  [pdf, other]

    cs.CL cs.AI cs.LG

    When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

    Authors: Norah Alzahrani, Hisham Abdullah Alyahya, Yazeed Alnumay, Sultan Alrashed, Shaykhah Alsubaie, Yusef Almushaykeh, Faisal Mirza, Nouf Alotaibi, Nora Altwairesh, Areeb Alowisheq, M Saiful Bari, Haidar Khan

    Abstract: Large Language Model (LLM) leaderboards based on benchmark rankings are regularly used to guide practitioners in model selection. Often, the published leaderboard rankings are taken at face value - we show this is a (potentially costly) mistake. Under existing leaderboards, the relative performance of LLMs is highly sensitive to (often minute) details. We show that for popular multiple-choice ques…

    Submitted 3 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: updated with ACL 2024 camera ready version

  2. arXiv:2308.09843  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Large thermo-spin effects in Heusler alloy based spin-gapless semiconductor thin films

    Authors: Amit Chanda, Deepika Rani, Derick DeTellem, Noha Alzahrani, Dario A. Arena, Sarath Witanachchi, Ratnamala Chatterjee, Manh-Huong Phan, Hariharan Srikanth

    Abstract: Recently, Heusler alloy-based spin gapless semiconductors (SGSs) with high Curie temperature (TC) and sizeable spin polarization have emerged as potential candidates for tunable spintronic applications. We report a comprehensive investigation of the temperature-dependent anomalous Nernst effect (ANE) and intrinsic longitudinal spin Seebeck effect (LSSE) in CoFeCrGa thin films grown on MgO substrates. Our findings show the a…

    Submitted 18 August, 2023; originally announced August 2023.

  3. arXiv:2211.12003  [pdf, other]

    cs.SE cs.FL

    Application of property-based testing tools for metamorphic testing

    Authors: Nasser Alzahrani, Maria Spichkova, James Harland

    Abstract: Metamorphic testing (MT) is a general approach for testing a specific kind of software system, the so-called "non-testable" systems, where "classical" testing approaches are difficult to apply. MT is an effective approach for addressing the test oracle problem and the test case generation problem. The test oracle problem is when it is difficult to determine the correct expected output of a parti…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Preprint. Accepted to the 17th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE 2022). Final version published by SCITEPRESS, http://www.scitepress.org

  4. arXiv:1705.10032  [pdf, other]

    cs.SE

    From Temporal Models to Property-Based Testing

    Authors: Nasser Alzahrani, Maria Spichkova, Jan Olaf Blech

    Abstract: This paper presents a framework to apply property-based testing (PBT) on top of temporal formal models. The aim of this work is to help software engineers to understand temporal models that are presented formally and to make use of the advantages of formal methods: the core time-based constructs of a formal method are schematically translated to the BeSpaceD extension of the Scala programming lang…

    Submitted 28 May, 2017; originally announced May 2017.

    Comments: Preprint. Accepted to the 12th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE 2017). Final version published by SCITEPRESS, http://www.scitepress.org

  5. arXiv:1612.01686  [pdf, other]

    cs.SE

    Spatio-temporal Models for Formal Analysis and Property-based Testing

    Authors: Nasser Alzahrani, Maria Spichkova, Jan Olaf Blech

    Abstract: This paper presents our ongoing work on spatio-temporal models for formal analysis and property-based testing. Our proposed framework aims at reducing the impedance mismatch between formal methods and practitioners. We introduce a set of formal methods and explain their interplay and benefits in terms of usability.

    Submitted 9 December, 2016; v1 submitted 6 December, 2016; originally announced December 2016.

    Comments: Preprint. Accepted to Software Technologies: Applications and Foundations (STAF 2016). Final version published by Springer International Publishing AG

  6. arXiv:1512.04743  [pdf, ps, other]

    stat.CO

    Model comparison with missing data using MCMC and importance sampling

    Authors: Panayiota Touloupou, Naif Alzahrani, Peter Neal, Simon E. F. Spencer, Trevelyan J. McKinley

    Abstract: Selecting between competing statistical models is a challenging problem, especially when the competing models are non-nested. In this paper we offer a simple solution by devising an algorithm that combines MCMC and importance sampling to obtain computationally efficient estimates of the marginal likelihood, which can then be used to compare the models. The algorithm is successfully applied to longi…

    Submitted 15 December, 2015; originally announced December 2015.

    Comments: 34 pages