Skip to main content

Showing 1–1 of 1 results for author: Gamidi, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:1609.06423  [pdf, other

    cs.DL cs.IR

    OCR++: A Robust Framework For Information Extraction from Scholarly Articles

    Authors: Mayank Singh, Barnopriyo Barua, Priyank Palod, Manvi Garg, Sidhartha Satapathy, Samuel Bushi, Kumar Ayush, Krishna Sai Rohith, Tulasi Gamidi, Pawan Goyal, Animesh Mukherjee

    Abstract: This paper proposes OCR++, an open-source framework designed for a variety of information extraction tasks from scholarly articles including metadata (title, author names, affiliation and e-mail), structure (section headings and body text, table and figure headings, URLs and footnotes) and bibliography (citation instances and references). We analyze a diverse set of scientific articles written in… ▽ More

    Submitted 23 September, 2016; v1 submitted 21 September, 2016; originally announced September 2016.