Showing 1–1 of 1 results for author: Driesen, K

  1. arXiv:1708.04671  [pdf, other]

    cs.CV

    Sequence-to-Label Script Identification for Multilingual OCR

    Authors: Yasuhisa Fujii, Karel Driesen, Jonathan Baccash, Ash Hurst, Ashok C. Popat

    Abstract: We describe a novel line-level script identification method. Previous work repurposed an OCR model generating per-character script codes, counted to obtain line-level script identification. This has two shortcomings. First, as a sequence-to-sequence model it is more complex than necessary for the sequence-to-label problem of line script identification. This makes it harder to train and inefficient…

    Submitted 17 August, 2017; v1 submitted 15 August, 2017; originally announced August 2017.

    Comments: ICDAR2017, The 14th IAPR International Conference on Document Analysis and Recognition, Kyoto, Japan

    MSC Class: 68T45; ACM Class: I.7.5