Skip to main content

Showing 1–2 of 2 results for author: Malwat, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.06233  [pdf, other

    cs.CL

    LEGOBench: Scientific Leaderboard Generation Benchmark

    Authors: Shruti Singh, Shoaib Alam, Husain Malwat, Mayank Singh

    Abstract: The ever-increasing volume of paper submissions makes it difficult to stay informed about the latest state-of-the-art research. To address this challenge, we introduce LEGOBench, a benchmark for evaluating systems that generate scientific leaderboards. LEGOBench is curated from 22 years of preprint submission data on arXiv and more than 11k machine learning leaderboards on the PapersWithCode porta… ▽ More

    Submitted 21 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  2. arXiv:2309.12616  [pdf, other

    cs.CL

    Unlocking Model Insights: A Dataset for Automated Model Card Generation

    Authors: Shruti Singh, Hitesh Lodwal, Husain Malwat, Rakesh Thakur, Mayank Singh

    Abstract: Language models (LMs) are no longer restricted to ML community, and instruction-tuned LMs have led to a rise in autonomous AI agents. As the accessibility of LMs grows, it is imperative that an understanding of their capabilities, intended usage, and development cycle also improves. Model cards are a popular practice for documenting detailed information about an ML model. To automate model card ge… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.