Skip to main content

Showing 1–4 of 4 results for author: Madhusudhan, S T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17415  [pdf, other

    cs.CL cs.AI cs.LG

    Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels

    Authors: Razvan-Gabriel Dumitru, Vikas Yadav, Rishabh Maheshwary, Paul-Ioan Clotan, Sathwik Tejaswi Madhusudhan, Mihai Surdeanu

    Abstract: We present a simple variable quantization approach that quantizes different layers of a large language model (LLM) at different bit levels. Specifically, we quantize the most important layers to higher bit precision and less important layers to lower bits to achieve floating point quantization levels. We propose two effective strategies to measure the importance of layers within LLMs: the first me… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: submitted to EMNLP, 15 pages, 10 figures, 4 tables

    ACM Class: I.2.7; I.2.0

  2. arXiv:2406.16783  [pdf, other

    cs.CL cs.AI cs.LG

    M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models

    Authors: Rishabh Maheshwary, Vikas Yadav, Hoang Nguyen, Khyati Mahajan, Sathwik Tejaswi Madhusudhan

    Abstract: Instruction finetuning (IFT) is critical for aligning Large Language Models (LLMs) to follow instructions. While many effective IFT datasets have been introduced recently, they predominantly focus on high-resource languages like English. To better align LLMs across a broad spectrum of languages and tasks, we propose a fully synthetic, novel taxonomy (Evol) guided Multilingual, Multi-turn instructi… ▽ More

    Submitted 28 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 39 pages

  3. arXiv:2403.07230  [pdf, other

    cs.CL cs.AI cs.LG

    Curry-DPO: Enhancing Alignment using Curriculum Learning & Ranked Preferences

    Authors: Pulkit Pattnaik, Rishabh Maheshwary, Kelechi Ogueji, Vikas Yadav, Sathwik Tejaswi Madhusudhan

    Abstract: Direct Preference Optimization (DPO) is an effective technique that leverages pairwise preference data (usually one chosen and rejected response pair per user prompt) to align LLMs to human preferences. In practice, multiple responses can exist for a given prompt with varying quality relative to each other. With availability of such quality ratings for multiple responses, we propose utilizing thes… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Work in progress

  4. arXiv:2402.10168  [pdf, other

    cs.SD cs.AI cs.IR cs.LG eess.AS

    DeepSRGM -- Sequence Classification and Ranking in Indian Classical Music with Deep Learning

    Authors: Sathwik Tejaswi Madhusudhan, Girish Chowdhary

    Abstract: A vital aspect of Indian Classical Music (ICM) is Raga, which serves as a melodic framework for compositions and improvisations alike. Raga Recognition is an important music information retrieval task in ICM as it can aid numerous downstream applications ranging from music recommendations to organizing huge music collections. In this work, we propose a deep learning based approach to Raga recognit… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.