Showing 1–3 of 3 results for author: Alshikh, W

  1. arXiv:2405.02048  [pdf, ps, other]

    cs.IR cs.AI

    Comparative Analysis of Retrieval Systems in the Real World

    Authors: Dmytro Mozolevskyi, Waseem AlShikh

    Abstract: This research paper presents a comprehensive analysis of integrating advanced language models with search and retrieval systems in the fields of information retrieval and natural language processing. The objective is to evaluate and compare various state-of-the-art methods based on their performance in terms of accuracy and efficiency. The analysis explores different combinations of technologies,…

    Submitted 3 May, 2024; originally announced May 2024.

  2. arXiv:2402.17553  [pdf, other]

    cs.AI cs.CL cs.CV cs.HC

    OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

    Authors: Raghav Kapoor, Yash Parag Butala, Melisa Russak, Jing Yu Koh, Kiran Kamble, Waseem Alshikh, Ruslan Salakhutdinov

    Abstract: For decades, human-computer interaction has fundamentally been manual. Even today, almost all productive work done on the computer necessitates human input at every step. Autonomous virtual agents represent an exciting step in automating many of these menial tasks. Virtual agents would empower users with limited technical proficiency to harness the full possibilities of computer systems. They coul…

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2307.03692  [pdf, other]

    cs.CL cs.AI

    Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning

    Authors: Waseem AlShikh, Manhal Daaboul, Kirk Goddard, Brock Imel, Kiran Kamble, Parikshith Kulkarni, Melisa Russak

    Abstract: In this paper, we introduce the Instruction Following Score (IFS), a metric that detects language models' ability to follow instructions. The metric has a dual purpose. First, IFS can be used to distinguish between base and instruct models. We benchmark publicly available base and instruct models, and show that the ratio of well formatted responses to partial and full sentences can be an effective…

    Submitted 5 July, 2023; originally announced July 2023.