Skip to main content

Showing 1–1 of 1 results for author: Pandya, M R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2005.00042  [pdf

    cs.IR cs.CL

    Method for Customizable Automated Tagging: Addressing the Problem of Over-tagging and Under-tagging Text Documents

    Authors: Maharshi R. Pandya, Jessica Reyes, Bob Vanderheyden

    Abstract: Using author provided tags to predict tags for a new document often results in the overgeneration of tags. In the case where the author doesn't provide any tags, our documents face the severe under-tagging issue. In this paper, we present a method to generate a universal set of tags that can be applied widely to a large document corpus. Using IBM Watson's NLU service, first, we collect keywords/ph… ▽ More

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: Work done by Maharshi R. Pandya and Jessica Reyes as IBM interns under leadership of Bob Vanderheyden. Article to be published

    ACM Class: I.2.7