Showing 1–8 of 8 results for author: Harit, G

  1. arXiv:2306.02142  [pdf, other]

    cs.CV

    TransDocAnalyser: A Framework for Offline Semi-structured Handwritten Document Analysis in the Legal Domain

    Authors: Sagar Chakraborty, Gaurav Harit, Saptarshi Ghosh

    Abstract: State-of-the-art offline Optical Character Recognition (OCR) frameworks perform poorly on semi-structured handwritten domain-specific documents due to their inability to localize and label form fields with domain-specific semantics. Existing techniques for semi-structured document analysis have primarily used datasets comprising invoices, purchase orders, receipts, and identity-card documents for…

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted in 17th International Conference on Document Analysis and Recognition (ICDAR) as an Oral presentation

    ACM Class: I.2.1

  2. EKTVQA: Generalized use of External Knowledge to empower Scene Text in Text-VQA

    Authors: Arka Ujjal Dey, Ernest Valveny, Gaurav Harit

    Abstract: The open-ended question answering task of Text-VQA often requires reading and reasoning about rarely seen or completely unseen scene-text content of an image. We address this zero-shot nature of the problem by proposing the generalized use of external knowledge to augment our understanding of the scene text. We design a framework to extract, validate, and reason with knowledge using a standard mul…

    Submitted 15 July, 2022; v1 submitted 22 August, 2021; originally announced August 2021.

    Comments: Accepted at IEEE Access

    Journal ref: IEEE.ACCESS 10 (2022) 72092-72106

  3. arXiv:2002.12096  [pdf, other]

    cs.CV

    Action Quality Assessment using Siamese Network-Based Deep Metric Learning

    Authors: Hiteshi Jain, Gaurav Harit, Avinash Sharma

    Abstract: Automated vision-based score estimation models can be used as an alternate opinion to avoid judgment bias. In the past works the score estimation models were learned by regressing the video representations to the ground truth score provided by the judges. However such regression-based solutions lack interpretability in terms of giving reasons for the awarded score. One solution to make the scores…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 12 pages, 5 Figures, 8 tables

  4. Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding

    Authors: Arka Ujjal Dey, Suman Kumar Ghosh, Ernest Valveny, Gaurav Harit

    Abstract: Images with visual and scene text content are ubiquitous in everyday life. However, current image interpretation systems are mostly limited to using only the visual features, neglecting to leverage the scene text content. In this paper, we propose to jointly use scene text and visual channels for robust semantic interpretation of images. We do not only extract and encode visual and scene text cues…

    Submitted 4 December, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

    Comments: The paper is under consideration at Pattern Recognition Letters

  5. arXiv:1609.02687  [pdf, other]

    cs.IR

    Extraction of Layout Entities and Sub-layout Query-based Retrieval of Document Images

    Authors: Anukriti Bansal, Sumantra Dutta Roy, Gaurav Harit

    Abstract: Layouts and sub-layouts constitute an important clue while searching a document on the basis of its structure, or when textual content is unknown/irrelevant. A sub-layout specifies the arrangement of document entities within a smaller portion of the document. We propose an efficient graph-based matching algorithm, integrated with hash-based indexing, to prune a possibly large search space. A user…

    Submitted 9 September, 2016; originally announced September 2016.

  6. An Interactive Medical Image Segmentation Framework Using Iterative Refinement

    Authors: Pratik Kalshetti, Manas Bundele, Parag Rahangdale, Dinesh Jangra, Chiranjoy Chattopadhyay, Gaurav Harit, Abhay Elhence

    Abstract: Image segmentation is often performed on medical images for identifying diseases in clinical evaluation. Hence it has become one of the major research areas. Conventional image segmentation techniques are unable to provide satisfactory segmentation results for medical images as they contain irregularities. They need to be pre-processed before segmentation. In order to obtain the most suitable meth…

    Submitted 4 June, 2016; originally announced June 2016.

    Comments: 19 pages, 19 figures, Submitted for review in Computers in Biology and Medicine

  7. Topographic Feature Extraction for Bengali and Hindi Character Images

    Authors: Soumen Bag, Gaurav Harit

    Abstract: Feature selection and extraction plays an important role in different classification based problems such as face recognition, signature verification, optical character recognition (OCR) etc. The performance of OCR highly depends on the proper selection and extraction of feature set. In this paper, we present novel features based on the topography of a character as visible from different viewing di…

    Submitted 14 July, 2011; originally announced July 2011.

    Journal ref: Signal & Image Processing : An International Journal (SIPIJ), vol.2, no.2, pp. 181-196, June 2011

  8. arXiv:1103.0738  [pdf, ps, other]

    cs.CV cs.DL

    A Medial Axis Based Thinning Strategy for Character Images

    Authors: Soumen Bag, Gaurav Harit

    Abstract: Thinning of character images is a big challenge. Removal of strokes or deformities in thinning is a difficult problem. In this paper, we have proposed a medial axis based thinning strategy used for performing skeletonization of printed and handwritten character images. In this method, we have used shape characteristics of text to get skeleton of nearly same as the true character shape. This approa…

    Submitted 3 March, 2011; originally announced March 2011.

    Comments: 6 pages, 5 figures. In proceedings of the second National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), pp. 67-72, Jaipur, India, 2010