Skip to main content

Showing 1–3 of 3 results for author: Tejaswi, A

.
  1. arXiv:2406.14670  [pdf, other

    cs.CL cs.AI cs.LG

    Exploring Design Choices for Building Language-Specific LLMs

    Authors: Atula Tejaswi, Nilesh Gupta, Eunsol Choi

    Abstract: Despite rapid progress in large language models (LLMs), their performance on a vast majority of languages remain unsatisfactory. In this paper, we study building language-specific LLMs by adapting monolingual and multilingual LLMs. We conduct systematic experiments on how design choices (base model selection, vocabulary extension, and continued fine-tuning) impact the adapted LLM, both in terms of… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures, 11 tables

  2. arXiv:2405.19597  [pdf, other

    cs.LG cs.AI cs.CL

    SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors

    Authors: Vijay Lingam, Atula Tejaswi, Aditya Vavre, Aneesh Shetty, Gautham Krishna Gudur, Joydeep Ghosh, Alex Dimakis, Eunsol Choi, Aleksandar Bojchevski, Sujay Sanghavi

    Abstract: Popular parameter-efficient fine-tuning (PEFT) methods, such as LoRA and its variants, freeze pre-trained model weights \(W\) and inject learnable matrices \(ΔW\). These \(ΔW\) matrices are structured for efficient parameterization, often using techniques like low-rank approximations or scaling vectors. However, these methods typically show a performance gap compared to full fine-tuning. Although… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 17 pages, 5 figures, 14 tables

  3. arXiv:1803.04710  [pdf, ps, other

    nucl-ex nucl-th

    Effect of direct reaction channels on deep sub-barrier fusion in asymmetric systems

    Authors: Md. Moin Shaikh, S. Nath, J. Gehlot, Tathagata Banerjee, Ish Mukul, R. Dubey, A. Shamlath, P. V. Laveen, M. Shareef, A. Jhingan, N. Madhavan, Tapan Rajbongshi, P. Jisha, G. Naga Jyothi, A. Tejaswi, Rudra N. Sahoo, Anjali Rani

    Abstract: A steeper fall of fusion excitation function, compared to the predictions of coupled-channels models, at energies below the lowest barrier between the reaction partners, is termed as deep sub-barrier fusion hindrance. This phenomenon has been observed in many symmetric and nearly-symmetric systems. Different physical origins of the hindrance have been proposed. This work aims to study the probable… ▽ More

    Submitted 25 May, 2018; v1 submitted 13 March, 2018; originally announced March 2018.

    Comments: 6 Pages 3 figures