Search | arXiv e-print repository

Exploring Safety-Utility Trade-Offs in Personalized Language Models

Authors: Anvesh Rao Vij**i, Somnath Basu Roy Chowdhury, Snigdha Chaturvedi

Abstract: As large language models (LLMs) become increasingly integrated into daily applications, it is essential to ensure they operate fairly across diverse user demographics. In this work, we show that LLMs suffer from personalization bias, where their performance is impacted when they are personalized to a user's identity. We quantify personalization bias by evaluating the performance of LLMs along two… ▽ More As large language models (LLMs) become increasingly integrated into daily applications, it is essential to ensure they operate fairly across diverse user demographics. In this work, we show that LLMs suffer from personalization bias, where their performance is impacted when they are personalized to a user's identity. We quantify personalization bias by evaluating the performance of LLMs along two axes - safety and utility. We measure safety by examining how benign LLM responses are to unsafe prompts with and without personalization. We measure utility by evaluating the LLM's performance on various tasks, including general knowledge, mathematical abilities, programming, and reasoning skills. We find that various LLMs, ranging from open-source models like Llama (Touvron et al., 2023) and Mistral (Jiang et al., 2023) to API-based ones like GPT-3.5 and GPT-4o (Ouyang et al., 2022), exhibit significant variance in performance in terms of safety-utility trade-offs depending on the user's identity. Finally, we discuss several strategies to mitigate personalization bias using preference tuning and prompt-based defenses. △ Less

Submitted 16 June, 2024; originally announced June 2024.

Comments: Work in Progress

arXiv:2211.00676 [pdf, other]

Towards Inter-character Relationship-driven Story Generation

Authors: Anvesh Rao Vij**i, Faeze Brahman, Snigdha Chaturvedi

Abstract: In this paper, we introduce the task of modeling interpersonal relationships for story generation. For addressing this task, we propose Relationships as Latent Variables for Story Generation, (ReLiSt). ReLiSt generates stories sentence by sentence and has two major components - a relationship selector and a story continuer. The relationship selector specifies a latent variable to pick the relation… ▽ More In this paper, we introduce the task of modeling interpersonal relationships for story generation. For addressing this task, we propose Relationships as Latent Variables for Story Generation, (ReLiSt). ReLiSt generates stories sentence by sentence and has two major components - a relationship selector and a story continuer. The relationship selector specifies a latent variable to pick the relationship to exhibit in the next sentence and the story continuer generates the next sentence while expressing the selected relationship in a coherent way. Our automatic and human evaluations demonstrate that ReLiSt is able to generate stories with relationships that are more faithful to desired relationships while maintaining the content quality. The relationship assignments to sentences during inference bring interpretability to ReLiSt. △ Less

Submitted 1 November, 2022; originally announced November 2022.

Comments: EMNLP 2022

arXiv:2208.09912 [pdf, other]

A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework

Authors: Avinash Madasu, Anvesh Rao Vij**i

Abstract: A well formed query is defined as a query which is formulated in the manner of an inquiry, and with correct interrogatives, spelling and grammar. While identifying well formed queries is an important task, few works have attempted to address it. In this paper we propose transformer based language model - Bidirectional Encoder Representations from Transformers (BERT) to this task. We further imbibe… ▽ More A well formed query is defined as a query which is formulated in the manner of an inquiry, and with correct interrogatives, spelling and grammar. While identifying well formed queries is an important task, few works have attempted to address it. In this paper we propose transformer based language model - Bidirectional Encoder Representations from Transformers (BERT) to this task. We further imbibe BERT with parts-of-speech information inspired from earlier works. Furthermore, we also train the model in multiple curriculum settings for improvement in performance. Curriculum Learning over the task is experimented with Baby Steps and One Pass techniques. Proposed architecture performs exceedingly well on the task. The best approach achieves accuracy of 83.93%, outperforming previous state-of-the-art at 75.0% and reaching close to the approximate human upper bound of 88.4%. △ Less

Submitted 21 August, 2022; originally announced August 2022.

Comments: ICPR 2022

arXiv:2102.09990 [pdf, other]

Analyzing Curriculum Learning for Sentiment Analysis along Task Difficulty, Pacing and Visualization Axes

Authors: Anvesh Rao Vij**i, Kaveri Anuranjana, Radhika Mamidi

Abstract: While Curriculum Learning (CL) has recently gained traction in Natural language Processing Tasks, it is still not adequately analyzed. Previous works only show their effectiveness but fail short to explain and interpret the internal workings fully. In this paper, we analyze curriculum learning in sentiment analysis along multiple axes. Some of these axes have been proposed by earlier works that ne… ▽ More While Curriculum Learning (CL) has recently gained traction in Natural language Processing Tasks, it is still not adequately analyzed. Previous works only show their effectiveness but fail short to explain and interpret the internal workings fully. In this paper, we analyze curriculum learning in sentiment analysis along multiple axes. Some of these axes have been proposed by earlier works that need more in-depth study. Such analysis requires understanding where curriculum learning works and where it does not. Our axes of analysis include Task difficulty on CL, comparing CL pacing techniques, and qualitative analysis by visualizing the movement of attention scores in the model as curriculum phases progress. We find that curriculum learning works best for difficult tasks and may even lead to a decrement in performance for tasks with higher performance without curriculum learning. We see that One-Pass curriculum strategies suffer from catastrophic forgetting and attention movement visualization within curriculum pacing. This shows that curriculum learning breaks down the challenging main task into easier sub-tasks solved sequentially. △ Less

Submitted 2 March, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

Comments: Accepted for presentation at WASSA 2021 at EACL

arXiv:2101.05478 [pdf, other]

WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm

Authors: Akshay Krishna Sheshadri, Anvesh Rao Vij**i, Sukhdeep Kharbanda

Abstract: Automatic Speech Recognition (ASR) systems are evaluated using Word Error Rate (WER), which is calculated by comparing the number of errors between the ground truth and the transcription of the ASR system. This calculation, however, requires manual transcription of the speech signal to obtain the ground truth. Since transcribing audio signals is a costly process, Automatic WER Evaluation (e-WER) m… ▽ More Automatic Speech Recognition (ASR) systems are evaluated using Word Error Rate (WER), which is calculated by comparing the number of errors between the ground truth and the transcription of the ASR system. This calculation, however, requires manual transcription of the speech signal to obtain the ground truth. Since transcribing audio signals is a costly process, Automatic WER Evaluation (e-WER) methods have been developed to automatically predict the WER of a speech system by only relying on the transcription and the speech signal features. While WER is a continuous variable, previous works have shown that positing e-WER as a classification problem is more effective than regression. However, while converting to a classification setting, these approaches suffer from heavy class imbalance. In this paper, we propose a new balanced paradigm for e-WER in a classification setting. Within this paradigm, we also propose WER-BERT, a BERT based architecture with speech features for e-WER. Furthermore, we introduce a distance loss function to tackle the ordinal nature of e-WER classification. The proposed approach and paradigm are evaluated on the Librispeech dataset and a commercial (black box) ASR system, Google Cloud's Speech-to-Text API. The results and experiments demonstrate that WER-BERT establishes a new state-of-the-art in automatic WER estimation. △ Less

Submitted 13 February, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

Comments: Accepted Long Paper at EACL 2021

arXiv:2101.05274 [pdf, ps, other]

A note on balancing sequences and application to cryptography

Authors: K. Anitha, I. Mumtaj Fathima, A R Vijayalakshmi

Abstract: In this paper, we prove the lower bound for the number of balancing non-Wieferich primes in arithmetic progressions. More precisely, for any given integer $r\geq2$ there are $\gg\log x$ balancing non-Wieferich primes $p\leq x$ such that $p\equiv\pm1 \pmod{r}$, under the assumption of the $abc$ conjecture for the number field $\mathbb{Q}(\sqrt{2})$. Further, we discuss some applications of balancin… ▽ More In this paper, we prove the lower bound for the number of balancing non-Wieferich primes in arithmetic progressions. More precisely, for any given integer $r\geq2$ there are $\gg\log x$ balancing non-Wieferich primes $p\leq x$ such that $p\equiv\pm1 \pmod{r}$, under the assumption of the $abc$ conjecture for the number field $\mathbb{Q}(\sqrt{2})$. Further, we discuss some applications of balancing sequences in cryptography. △ Less

Submitted 14 September, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

Comments: 18 pages, some applications of balancing sequences are given

MSC Class: 11B25; 11B39; 11A41; 11T71; 14G50; 94A60

arXiv:2101.04906 [pdf, ps, other]

Some new results on negative polynomial Pell's equation

Authors: K. Anitha, I. Mumtaj Fathima, A R Vijayalakshmi

Abstract: We consider the negative polynomial Pell's equation $P^2(X)-D(X)Q^2(X)=-1$, where $D(X)\in \mathbb{Z}[X]$ be some fixed, monic, square-free, even degree polynomials. In this paper, we investigate the existence of polynomial solutions $P(X), \, Q(X)$ with integer coefficients. We consider the negative polynomial Pell's equation $P^2(X)-D(X)Q^2(X)=-1$, where $D(X)\in \mathbb{Z}[X]$ be some fixed, monic, square-free, even degree polynomials. In this paper, we investigate the existence of polynomial solutions $P(X), \, Q(X)$ with integer coefficients. △ Less

Submitted 9 June, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

Comments: 10 pages, some more equations added

MSC Class: 11A99; 11C08; 11D99

arXiv:2101.04901 [pdf, ps, other]

Lucas non-Wieferich primes in arithmetic progressions and the $abc$ conjecture

Authors: K. Anitha, I. Mumtaj Fathima, A R Vijayalakshmi

Abstract: We prove the lower bound for the number of Lucas non-Wieferich primes in arithmetic progressions. More precisely, for any given integer $k\geq 2$ there are $\gg \log x$ Lucas non-Wieferich primes $p\leq x$ such that $p\equiv\pm1\pmod{k}$, assuming the $abc$ conjecture for number fields. Further, we discuss some applications of Lucas sequences in Cryptography. We prove the lower bound for the number of Lucas non-Wieferich primes in arithmetic progressions. More precisely, for any given integer $k\geq 2$ there are $\gg \log x$ Lucas non-Wieferich primes $p\leq x$ such that $p\equiv\pm1\pmod{k}$, assuming the $abc$ conjecture for number fields. Further, we discuss some applications of Lucas sequences in Cryptography. △ Less

Submitted 12 July, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

Comments: 12 pages, added some applications in cryptography

MSC Class: 11B39; 11A41; 11B25 (Primary)

Showing 1–8 of 8 results for author: Vij**i, A R