Skip to main content

Showing 1–2 of 2 results for author: Butala, Y P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17553  [pdf, other

    cs.AI cs.CL cs.CV cs.HC

    OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

    Authors: Raghav Kapoor, Yash Parag Butala, Melisa Russak, **g Yu Koh, Kiran Kamble, Waseem Alshikh, Ruslan Salakhutdinov

    Abstract: For decades, human-computer interaction has fundamentally been manual. Even today, almost all productive work done on the computer necessitates human input at every step. Autonomous virtual agents represent an exciting step in automating many of these menial tasks. Virtual agents would empower users with limited technical proficiency to harness the full possibilities of computer systems. They coul… ▽ More

    Submitted 28 February, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  2. arXiv:2306.06190  [pdf, other

    cs.CL cs.LG

    $FastDoc$: Domain-Specific Fast Pre-training Technique using Document-Level Metadata and Taxonomy

    Authors: Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, Yash Parag Butala, Pawan Goyal, Niloy Ganguly

    Abstract: As the demand for sophisticated Natural Language Processing (NLP) models continues to grow, so does the need for efficient pre-training techniques. Current NLP models undergo resource-intensive pre-training. In response, we introduce $FastDoc$ (Fast Pre-training Technique using Document-Level Metadata and Taxonomy), a novel approach designed to significantly reduce computational demands.… ▽ More

    Submitted 14 November, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 38 pages, 7 figures

    MSC Class: 68T50 ACM Class: I.2.7