Showing 1–4 of 4 results for author: Abhishek, T

  1. arXiv:2312.15181  [pdf, other]

    cs.CL

    Multilingual Bias Detection and Mitigation for Indian Languages

    Authors: Ankita Maity, Anubhav Sharma, Rudra Dhar, Tushar Abhishek, Manish Gupta, Vasudeva Varma

    Abstract: Lack of diverse perspectives causes neutrality bias in Wikipedia content, leading to millions of readers worldwide being exposed to potentially inaccurate information. Hence, neutrality bias detection and mitigation is a critical problem. Although previous studies have proposed effective solutions for English, no work exists for Indian languages. First, we contribute two large datasets, mWikiBias…

    Submitted 23 December, 2023; originally announced December 2023.

  2. arXiv:2209.11252  [pdf, other]

    cs.CL

    XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages

    Authors: Shivprasad Sagare, Tushar Abhishek, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma

    Abstract: Multiple business scenarios require automated generation of descriptive human-readable text from structured input data. Hence, fact-to-text generation systems have been developed for various downstream tasks like generating soccer reports, weather and financial reports, medical reports, person biographies, etc. Unfortunately, previous work on fact-to-text (F2T) generation has focused primarily…

    Submitted 22 September, 2022; originally announced September 2022.

  3. arXiv:2202.00291  [pdf, other]

    cs.CL

    XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages

    Authors: Tushar Abhishek, Shivprasad Sagare, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma

    Abstract: Multiple critical scenarios (like Wikipedia text generation given English Infoboxes) need automated generation of descriptive text in low resource (LR) languages from English fact triples. Previous work has focused on English fact-to-text (F2T) generation. To the best of our knowledge, there has been no previous attempt on cross-lingual alignment or generation for LR languages. Building an effecti…

    Submitted 24 April, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Update the code repository and acknowledgement

  4. arXiv:2109.02176  [pdf, other]

    cs.CL

    Transformer Models for Text Coherence Assessment

    Authors: Tushar Abhishek, Daksh Rawat, Manish Gupta, Vasudeva Varma

    Abstract: Coherence is an important aspect of text quality and is crucial for ensuring its readability. It is especially desirable for outputs from text generation systems like summarization, question answering, machine translation, question generation, table-to-text, etc. An automated coherence scoring model is also helpful in essay scoring or providing writing feedback. A large body of previous work has le…

    Submitted 23 February, 2022; v1 submitted 5 September, 2021; originally announced September 2021.

    Comments: added link to the codebase