Showing 1–2 of 2 results for author: Sheikh, T U

  1. arXiv:2406.06236 [pdf, other]

    cs.CV

    UnSupDLA: Towards Unsupervised Document Layout Analysis

    Authors: Talha Uddin Sheikh, Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Document layout analysis is a key area in document research, involving techniques like text mining and visual analysis. Despite various methods developed to tackle layout analysis, a critical but frequently overlooked problem is the scarcity of labeled data needed for analyses. With the rise of internet use, an overwhelming number of documents are now available online, making the process of accura…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: ICDAR 2024 - Workshop

  2. arXiv:2405.03660 [pdf, other]

    cs.CV

    CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification

    Authors: Sankalp Sinha, Muhammad Saif Ullah Khan, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Zero-shot learning has been extensively investigated in the broader field of visual recognition, attracting significant interest recently. However, the current work on zero-shot learning in document image classification remains scarce. The existing studies either focus exclusively on zero-shot inference, or their evaluation does not align with the established criteria of zero-shot evaluation in th…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 18 pages, 4 figures; accepted at ICDAR 2024