Showing 1–1 of 1 results for author: Jagdale, A

  1. A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents

    Authors: Norman Meuschke, Apurva Jagdale, Timo Spinde, Jelena Mitrović, Bela Gipp

    Abstract: Extracting information from academic PDF documents is crucial for numerous indexing, retrieval, and analysis use cases. Choosing the best tool to extract specific content elements is difficult because many technically diverse tools are available, but recent performance benchmarks are rare. Moreover, such benchmarks typically cover only a few content elements like header metadata or bibliographic…

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: iConference 2023