
Showing 1–10 of 10 results for author: Kasa, S R

  1. arXiv:2407.06443  [pdf, other]

    cs.AI

    Exposing Privacy Gaps: Membership Inference Attack on Preference Data for LLM Alignment

    Authors: Qizhang Feng, Siva Rajesh Kasa, Hyokun Yun, Choon Hui Teo, Sravan Babu Bodapati

    Abstract: Large Language Models (LLMs) have seen widespread adoption due to their remarkable natural language capabilities. However, when deploying them in real-world settings, it is important to align LLMs to generate texts according to acceptable human standards. Methods such as Proximal Policy Optimization (PPO) and Direct Preference Optimization (DPO) have made significant progress in refining LLMs usin…

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2405.11775  [pdf, other]

    cs.CL cs.LG

    Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques

    Authors: Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

    Abstract: Ordinal Classification (OC) is a widely encountered challenge in Natural Language Processing (NLP), with applications in various domains such as sentiment analysis, rating prediction, and more. Previous approaches to tackle OC have primarily focused on modifying existing or creating novel loss functions that explicitly account for the ordinal nature of labels. However, with the advent of…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Findings of ACL 2024

  3. arXiv:2402.10229  [pdf, other]

    stat.CO cs.LG

    Mixture-Models: a one-stop Python Library for Model-based Clustering using various Mixture Models

    Authors: Siva Rajesh Kasa, Hu Yijie, Santhosh Kumar Kasa, Vaibhav Rajan

    Abstract: Mixture-Models is an open-source Python library for fitting Gaussian Mixture Models (GMM) and their variants, such as Parsimonious GMMs, Mixture of Factor Analyzers, MClust models, Mixture of Student's t distributions, etc. It streamlines the implementation and analysis of these models using various first/second order optimization routines such as Gradient Descent and Newton-CG through au…

    Submitted 8 February, 2024; originally announced February 2024.

  4. arXiv:2312.16549  [pdf, other]

    cs.LG cs.AI cs.CL

    How Robust are LLMs to In-Context Majority Label Bias?

    Authors: Karan Gupta, Sumegh Roychowdhury, Siva Rajesh Kasa, Santhosh Kumar Kasa, Anish Bhanushali, Nikhil Pattisapu, Prasanna Srinivasa Murthy

    Abstract: In the In-Context Learning (ICL) setup, various forms of label biases can manifest. One such manifestation is majority label bias, which arises when the distribution of labeled examples in the in-context samples is skewed towards one or more specific classes making Large Language Models (LLMs) more prone to predict those labels. Such discrepancies can arise from various factors, including logistic…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 6 pages, 3 figures, 2 tables. Accepted at Workshop on Responsible Language Modeling, AAAI 2024 (www.aaai.org)

  5. arXiv:2311.03320  [pdf, other]

    cs.CL

    Tackling Concept Shift in Text Classification using Entailment-style Modeling

    Authors: Sumegh Roychowdhury, Karan Gupta, Siva Rajesh Kasa, Prasanna Srinivasa Murthy, Alok Chandra

    Abstract: Pre-trained language models (PLMs) have seen tremendous success in text classification (TC) problems in the context of Natural Language Processing (NLP). In many real-world text classification tasks, the class definitions being learned do not remain constant but rather change with time - this is known as Concept Shift. Most techniques for handling concept shift rely on retraining the old classifie…

    Submitted 6 November, 2023; originally announced November 2023.

    Journal ref: NeurIPS 2023 - Workshop on Distribution Shifts

  6. Improved Inference of Gaussian Mixture Copula Model for Clustering and Reproducibility Analysis using Automatic Differentiation

    Authors: Siva Rajesh Kasa, Vaibhav Rajan

    Abstract: Copulas provide a modular parameterization of multivariate distributions that decouples the modeling of marginals from the dependencies between them. Gaussian Mixture Copula Model (GMCM) is a highly flexible copula that can model many kinds of multi-modal dependencies, as well as asymmetric and tail dependencies. They have been effectively used in clustering non-Gaussian data and in Reproducibilit…

    Submitted 24 October, 2020; originally announced October 2020.

  7. arXiv:2009.05805  [pdf, other]

    cs.LG stat.ML

    Multi-way Spectral Clustering of Augmented Multi-view Data through Deep Collective Matrix Tri-factorization

    Authors: Ragunathan Mariappan, Siva Rajesh Kasa, Vaibhav Rajan

    Abstract: We present the first deep learning based architecture for collective matrix tri-factorization (DCMTF) of arbitrary collections of matrices, also known as augmented multi-view data. DCMTF can be used for multi-way spectral clustering of heterogeneous collections of relational data matrices to discover latent clusters in each input matrix, across both dimensions, as well as the strengths of associat…

    Submitted 24 January, 2022; v1 submitted 12 September, 2020; originally announced September 2020.

  8. arXiv:2007.12786  [pdf, other]

    stat.ML cs.LG stat.CO

    Model-based Clustering using Automatic Differentiation: Confronting Misspecification and High-Dimensional Data

    Authors: Siva Rajesh Kasa, Vaibhav Rajan

    Abstract: We study two practically important cases of model based clustering using Gaussian Mixture Models: (1) when there is misspecification and (2) on high dimensional data, in the light of recent advances in Gradient Descent (GD) based optimization using Automatic Differentiation (AD). Our simulation studies show that EM has better clustering performance, measured by Adjusted Rand Index, compared to GD…

    Submitted 8 July, 2020; originally announced July 2020.

  9. arXiv:1812.05928  [pdf, other]

    stat.CO stat.ME

    Automatic Differentiation in Mixture Models

    Authors: Siva Rajesh Kasa, Vaibhav Rajan

    Abstract: In this article, we discuss two specific classes of models - Gaussian Mixture Copula models and Mixture of Factor Analyzers - and the advantages of doing inference with gradient descent using automatic differentiation. Gaussian mixture models are a popular class of clustering methods, that offers a principled statistical approach to clustering. However, the underlying assumption, that every mixing…

    Submitted 13 December, 2018; originally announced December 2018.

    Comments: 19 pages, 4 figures. arXiv admin note: text overlap with arXiv:1301.1505, arXiv:1502.05767, arXiv:1503.06302 by other authors

  10. arXiv:1809.07114  [pdf, other]

    stat.AP

    A Test for detecting Structural Breakdowns in Markets using Eigenvalue Decompositions

    Authors: Malay Bhattacharyya, Siva Rajesh Kasa

    Abstract: Correlations among stock returns during volatile markets differ substantially compared to those from quieter markets. During times of financial crisis, it has been observed that traditional dependency in global markets breaks down. However, such an upheaval in dependency structure happens over a span of several months, with the breakdown coinciding with a major bankruptcy or sovereign default. Eve…

    Submitted 12 September, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

    Comments: 15 pages, 9th International Conference of the Financial Engineering and Banking Society (FEBS)