Skip to main content

Showing 1–3 of 3 results for author: Sagare, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.12308  [pdf, other

    cs.CL

    XWikiGen: Cross-lingual Summarization for Encyclopedic Text Generation in Low Resource Languages

    Authors: Dhaval Taunk, Shivprasad Sagare, Anupam Patil, Shivansh Subramanian, Manish Gupta, Vasudeva Varma

    Abstract: Lack of encyclopedic text contributors, especially on Wikipedia, makes automated text generation for low resource (LR) languages a critical problem. Existing work on Wikipedia text generation has focused on English only where English reference articles are summarized to generate English Wikipedia pages. But, for low-resource languages, the scarcity of reference articles makes monolingual summariza… ▽ More

    Submitted 18 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  2. arXiv:2209.11252  [pdf, other

    cs.CL

    XF2T: Cross-lingual Fact-to-Text Generation for Low-Resource Languages

    Authors: Shivprasad Sagare, Tushar Abhishek, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma

    Abstract: Multiple business scenarios require an automated generation of descriptive human-readable text from structured input data. Hence, fact-to-text generation systems have been developed for various downstream tasks like generating soccer reports, weather and financial reports, medical reports, person biographies, etc. Unfortunately, previous work on fact-to-text (F2T) generation has focused primarily… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  3. arXiv:2202.00291  [pdf, other

    cs.CL

    XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages

    Authors: Tushar Abhishek, Shivprasad Sagare, Bhavyajeet Singh, Anubhav Sharma, Manish Gupta, Vasudeva Varma

    Abstract: Multiple critical scenarios (like Wikipedia text generation given English Infoboxes) need automated generation of descriptive text in low resource (LR) languages from English fact triples. Previous work has focused on English fact-to-text (F2T) generation. To the best of our knowledge, there has been no previous attempt on cross-lingual alignment or generation for LR languages. Building an effecti… ▽ More

    Submitted 24 April, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Update the code repository and acknowledgement