Skip to main content

Showing 1–4 of 4 results for author: Kanumolu, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.11349  [pdf, other

    cs.CL

    TeClass: A Human-Annotated Relevance-based Headline Classification and Generation Dataset for Telugu

    Authors: Gopichand Kanumolu, Lokesh Madasu, Nirmal Surange, Manish Shrivastava

    Abstract: News headline generation is a crucial task in increasing productivity for both the readers and producers of news. This task can easily be aided by automated News headline-generation models. However, the presence of irrelevant headlines in scraped news articles results in sub-optimal performance of generation models. We propose that relevance-based headline classification can greatly aid the task o… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  2. arXiv:2402.08638  [pdf, other

    cs.CL

    SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages

    Authors: Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine De Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Winata , et al. (2 additional authors not shown)

    Abstract: Exploring and quantifying semantic relatedness is central to representing language and holds significant implications across various NLP tasks. While earlier NLP research primarily focused on semantic similarity, often within the English language context, we instead investigate the broader phenomenon of semantic relatedness. In this paper, we present \textit{SemRel}, a new semantic relatedness dat… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted to the Findings of ACL 2024

  3. arXiv:2312.01500  [pdf, other

    cs.CL

    Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?

    Authors: Gopichand Kanumolu, Lokesh Madasu, Pavan Baswani, Ananya Mukherjee, Manish Shrivastava

    Abstract: Fluency is a crucial goal of all Natural Language Generation (NLG) systems. Widely used automatic evaluation metrics fall short in capturing the fluency of machine-generated text. Assessing the fluency of NLG systems poses a challenge since these models are not limited to simply reusing words from the input but may also generate abstractions. Existing reference-based fluency evaluations, such as w… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted at IJCNLP-AACL SEALP Workshop

  4. arXiv:2311.17743  [pdf, other

    cs.CL cs.AI

    Mukhyansh: A Headline Generation Dataset for Indic Languages

    Authors: Lokesh Madasu, Gopichand Kanumolu, Nirmal Surange, Manish Shrivastava

    Abstract: The task of headline generation within the realm of Natural Language Processing (NLP) holds immense significance, as it strives to distill the true essence of textual content into concise and attention-grabbing summaries. While noteworthy progress has been made in headline generation for widely spoken languages like English, there persist numerous challenges when it comes to generating headlines i… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at PACLIC 2023