Showing 1–3 of 3 results for author: Karkhanis, D

  1. arXiv:2402.13228  [pdf, other]

    cs.CL cs.AI cs.LG

    Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

    Authors: Arka Pal, Deep Karkhanis, Samuel Dooley, Manley Roberts, Siddartha Naidu, Colin White

    Abstract: Direct Preference Optimisation (DPO) is effective at significantly improving the performance of large language models (LLMs) on downstream tasks such as reasoning, summarisation, and alignment. Using pairs of preferred and dispreferred data, DPO models the relative probability of picking one response over another. In this work, first we show theoretically that the standard DPO loss can lead to a r…

    Submitted 3 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  2. arXiv:2308.10882  [pdf, other]

    cs.AI cs.CL

    Giraffe: Adventures in Expanding Context Lengths in LLMs

    Authors: Arka Pal, Deep Karkhanis, Manley Roberts, Samuel Dooley, Arvind Sundararajan, Siddartha Naidu

    Abstract: Modern large language models (LLMs) that rely on attention mechanisms are typically trained with fixed context lengths which enforce upper limits on the length of input sequences that they can handle at evaluation time. To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of w…

    Submitted 21 August, 2023; originally announced August 2023.

  3. arXiv:2201.07999  [pdf, other]

    cs.LG cs.CL

    Sentiment Analysis: Predicting Yelp Scores

    Authors: Bhanu Prakash Reddy Guda, Mashrin Srivastava, Deep Karkhanis

    Abstract: In this work, we predict the sentiment of restaurant reviews based on a subset of the Yelp Open Dataset. We utilize the meta features and text available in the dataset and evaluate several machine learning and state-of-the-art deep learning approaches for the prediction task. Through several qualitative experiments, we show the success of the deep models with attention mechanism in learning a bala…

    Submitted 19 January, 2022; originally announced January 2022.