Showing 1–3 of 3 results for author: Kitagishi, I

  1. arXiv:2403.00528  [pdf]

    cs.CL cs.CV

    Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction

    Authors: Edward Whittaker, Ikuo Kitagishi

    Abstract: Language Models (LMs) such as BERT have been shown to perform well on the task of identifying Named Entities (NE) in text. A BERT LM is typically used as a classifier to classify individual tokens in the input text, or to classify spans of tokens, as belonging to one of a set of possible NE categories. In this paper, we hypothesise that decoder-only Large Language Models (LLMs) can also be used…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 9 pages, 1 figure

    ACM Class: H.3.3; H.3.4; I.2.7; I.7.1; I.7.5

  2. arXiv:2303.05763  [pdf, other]

    cs.CV cs.AI cs.HC

    Automatic Detection and Rectification of Paper Receipts on Smartphones

    Authors: Edward Whittaker, Masashi Tanaka, Ikuo Kitagishi

    Abstract: We describe the development of a real-time smartphone app that allows the user to digitize paper receipts in a novel way by "waving" their phone over the receipts and letting the app automatically detect and rectify the receipts for subsequent text recognition. We show that traditional computer vision algorithms for edge and corner detection do not robustly detect the non-linear and discontinuou…

    Submitted 10 March, 2023; originally announced March 2023.

  3. arXiv:2212.05525  [pdf, other]

    cs.CL cs.CV

    Extending TrOCR for Text Localization-Free OCR of Full-Page Scanned Receipt Images

    Authors: Hongkuan Zhang, Edward Whittaker, Ikuo Kitagishi

    Abstract: Digitization of scanned receipts aims to extract text from receipt images and save it into structured documents. This is usually split into two sub-tasks: text localization and optical character recognition (OCR). Most existing OCR models only focus on the cropped text instance images, which require the bounding box information provided by a text region detection model. Introducing an additional d…

    Submitted 16 October, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: ICCV2023 RCV Workshop