Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks

Rajesh, Bulla; Agrawal, Manav Kamlesh; Bhuva, Milan; Kishore, Kisalaya; Javed, Mohammed

arXiv:2209.05921 (cs)

[Submitted on 13 Sep 2022]

Abstract: Image binarization techniques are widely used to enhance noisy and/or degraded images for Document Image Analysis (DIA) applications such as word spotting, document retrieval, and OCR. Most existing techniques feed pixel images into Convolutional Neural Networks to accomplish document binarization, which may not produce effective results when working with compressed images that need to be processed without full decompression. Therefore, this paper proposes document image binarization directly on the JPEG compressed stream of document images by employing Dual Discriminator Generative Adversarial Networks (DD-GANs). The two discriminator networks, Global and Local, work on different image ratios, and focal loss is used as the generator loss. The proposed model has been thoroughly tested on different versions of the DIBCO dataset, which present challenges such as holes, erased or smudged ink, dust, and misplaced fibres. The model proved to be highly robust and efficient in terms of both time and space complexity, and it achieved state-of-the-art performance in the JPEG compressed domain.
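
The abstract names focal loss as the generator loss but gives no formula or hyperparameters. As a rough illustration only, the sketch below implements the standard binary focal loss, FL(p_t) = -alpha_t * (1 - p_t)^gamma * log(p_t), over per-pixel foreground probabilities. The function name `binary_focal_loss` and the defaults `gamma=2.0` and `alpha=0.25` are assumptions made for this example; they are not taken from the paper and may differ from what the authors used.

```python
import math


def binary_focal_loss(probs, targets, gamma=2.0, alpha=0.25, eps=1e-7):
    """Mean binary focal loss over paired probabilities and 0/1 targets.

    Hypothetical helper for illustration; gamma and alpha are assumed
    defaults, not values reported in the abstract.
    """
    if len(probs) != len(targets):
        raise ValueError("probs and targets must have the same length")
    if not probs:
        raise ValueError("at least one prediction is required")

    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)            # clamp to avoid log(0)
        p_t = p if t == 1 else 1.0 - p             # probability of the true class
        alpha_t = alpha if t == 1 else 1.0 - alpha  # class-balancing weight
        # The (1 - p_t) ** gamma factor shrinks the loss of well-classified pixels.
        total += -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(probs)


if __name__ == "__main__":
    easy = binary_focal_loss([0.95], [1])  # confident, correct prediction
    hard = binary_focal_loss([0.55], [1])  # uncertain prediction
    print(f"easy pixel loss: {easy:.6f}, hard pixel loss: {hard:.6f}")
```

With `gamma=0` and `alpha=0.5` the expression reduces to half the ordinary binary cross-entropy, which gives a simple sanity check. Larger `gamma` values shift the training signal toward hard pixels, which is one plausible reason to use such a loss on documents where text pixels are far fewer than background pixels.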

Comments:	Accepted at the IAPR-endorsed first International Conference on Computer Vision and Machine Intelligence (CVMI2022), held at IIIT Allahabad
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2209.05921 [cs.CV]
	(or arXiv:2209.05921v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2209.05921

Submission history

From: Dr. Mohammed Javed
[v1] Tue, 13 Sep 2022 12:07:32 UTC (36,515 KB)
