Skip to main content

Showing 1–4 of 4 results for author: Bhotia, T S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2109.13767  [pdf, other

    cs.CL cs.AI

    Identifying and Mitigating Gender Bias in Hyperbolic Word Embeddings

    Authors: Vaibhav Kumar, Tenzin Singhay Bhotia, Vaibhav Kumar, Tanmoy Chakraborty

    Abstract: Euclidean word embedding models such as GloVe and Word2Vec have been shown to reflect human-like gender biases. In this paper, we extend the study of gender bias to the recently popularized hyperbolic word embeddings. We propose gyrocosine bias, a novel measure for quantifying gender bias in hyperbolic word representations and observe a significant presence of gender bias. To address this problem,… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: 8 pages

  2. arXiv:2109.13711  [pdf, other

    cs.CL

    One to rule them all: Towards Joint Indic Language Hate Speech Detection

    Authors: Mehar Bhatia, Tenzin Singhay Bhotia, Akshat Agarwal, Prakash Ramesh, Shubham Gupta, Kumar Shridhar, Felix Laumann, Ayushman Dash

    Abstract: This paper is a contribution to the Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC) 2021 shared task. Social media today is a hotbed of toxic and hateful conversations, in various languages. Recent news reports have shown that current models struggle to automatically identify hate posted in minority languages. Therefore, efficiently curbing hate speech is a crit… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: submitted to FIRE 2021 in the HASOC-FIRE shared task on hate speech and offensive language detection

  3. arXiv:2010.13168  [pdf, other

    cs.CL cs.CY

    Fair Embedding Engine: A Library for Analyzing and Mitigating Gender Bias in Word Embeddings

    Authors: Vaibhav Kumar, Tenzin Singhay Bhotia, Vaibhav Kumar

    Abstract: Non-contextual word embedding models have been shown to inherit human-like stereotypical biases of gender, race and religion from the training corpora. To counter this issue, a large body of research has emerged which aims to mitigate these biases while kee** the syntactic and semantic utility of embeddings intact. This paper describes Fair Embedding Engine (FEE), a library for analysing and mit… ▽ More

    Submitted 25 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures

  4. arXiv:2006.01938  [pdf, other

    cs.CL cs.LG

    Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings

    Authors: Vaibhav Kumar, Tenzin Singhay Bhotia, Vaibhav Kumar, Tanmoy Chakraborty

    Abstract: Word embeddings are the standard model for semantic and syntactic representations of words. Unfortunately, these models have been shown to exhibit undesirable word associations resulting from gender, racial, and religious biases. Existing post-processing methods for debiasing word embeddings are unable to mitigate gender bias hidden in the spatial arrangement of word vectors. In this paper, we pro… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: TACL 2020