Skip to main content

Showing 1–21 of 21 results for author: Paquet, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12105  [pdf, other

    cs.CV

    Sheet Music Transformer ++: End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music

    Authors: Antonio Ríos-Vila, Jorge Calvo-Zaragoza, David Rizo, Thierry Paquet

    Abstract: Optical Music Recognition is a field that has progressed significantly, bringing accurate systems that transcribe effectively music scores into digital formats. Despite this, there are still several limitations that hinder OMR from achieving its full potential. Specifically, state of the art OMR still depends on multi-stage pipelines for performing full-page transcription, as well as it has only b… ▽ More

    Submitted 21 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2404.19329  [pdf, other

    cs.CV

    End-to-end information extraction in handwritten documents: Understanding Paris marriage records from 1880 to 1940

    Authors: Thomas Constum, Lucas Preel, Théo Larcher, Pierrick Tranouez, Thierry Paquet, Sandra Brée

    Abstract: The EXO-POPP project aims to establish a comprehensive database comprising 300,000 marriage records from Paris and its suburbs, spanning the years 1880 to 1940, which are preserved in over 130,000 scans of double pages. Each marriage record may encompass up to 118 distinct types of information that require extraction from plain text. In this paper, we introduce the M-POPP dataset, a subset of the… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: To be published in: International Conference on Document Analysis and Recognition - ICDAR 2024

  3. arXiv:2402.07596  [pdf, other

    cs.CV cs.SD eess.AS

    Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription

    Authors: Antonio Ríos-Vila, Jorge Calvo-Zaragoza, Thierry Paquet

    Abstract: State-of-the-art end-to-end Optical Music Recognition (OMR) has, to date, primarily been carried out using monophonic transcription techniques to handle complex score layouts, such as polyphony, often by resorting to simplifications or specific adaptations. Despite their efficacy, these approaches imply challenges related to scalability and limitations. This paper presents the Sheet Music Transfor… ▽ More

    Submitted 29 April, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Submitted to the International Conference on Document Analysis and Recognition 2024

  4. arXiv:2311.12814  [pdf, other

    q-bio.BM cs.AI cs.LG

    HydraScreen: A Generalizable Structure-Based Deep Learning Approach to Drug Discovery

    Authors: Alvaro Prat, Hisham Abdel Aty, Gintautas Kamuntavičius, Tanya Paquet, Povilas Norvaišas, Piero Gasparotto, Roy Tal

    Abstract: We propose HydraScreen, a deep-learning approach that aims to provide a framework for more robust machine-learning-accelerated drug discovery. HydraScreen utilizes a state-of-the-art 3D convolutional neural network, designed for the effective representation of molecular structures and interactions in protein-ligand binding. We design an end-to-end pipeline for high-throughput screening and lead op… ▽ More

    Submitted 22 September, 2023; originally announced November 2023.

  5. Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document Recognition

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Recent advances in handwritten text recognition enabled to recognize whole documents in an end-to-end way: the Document Attention Network (DAN) recognizes the characters one after the other through an attention-based prediction process until reaching the end of the document. However, this autoregressive process leads to inference that cannot benefit from any parallelization optimization. In this p… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Journal ref: International Conference on Document Analysis and Recognition - ICDAR 2023

  6. arXiv:2209.03771  [pdf, ps, other

    cs.LG

    Stochastic gradient descent with gradient estimator for categorical features

    Authors: Paul Peseux, Maxime Berar, Thierry Paquet, Victor Nicollet

    Abstract: Categorical data are present in key areas such as health or supply chain, and this data require specific treatment. In order to apply recent machine learning models on such data, encoding is needed. In order to build interpretable models, one-hot encoding is still a very good solution, but such encoding creates sparse data. Gradient estimators are not suited for sparse data: the gradient is mainly… ▽ More

    Submitted 18 April, 2023; v1 submitted 8 September, 2022; originally announced September 2022.

  7. arXiv:2208.13391  [pdf, other

    cs.CV

    Confidence Estimation for Object Detection in Document Images

    Authors: Mélodie Boillet, Christopher Kermorvant, Thierry Paquet

    Abstract: Deep neural networks are becoming increasingly powerful and large and always require more labelled data to be trained. However, since annotating data is time-consuming, it is now necessary to develop systems that show good performance while learning on a limited amount of data. These data must be correctly chosen to obtain models that are still efficient. For this, the systems must be able to dete… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

  8. Robust Text Line Detection in Historical Documents: Learning and Evaluation Methods

    Authors: Mélodie Boillet, Christopher Kermorvant, Thierry Paquet

    Abstract: Text line segmentation is one of the key steps in historical document understanding. It is challenging due to the variety of fonts, contents, writing styles and the quality of documents that have degraded through the years. In this paper, we address the limitations that currently prevent people from building line segmentation models with a high generalization capacity. We present a study conduct… ▽ More

    Submitted 21 October, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Journal ref: International Journal on Document Analysis and Recognition (IJDAR) (2022)

  9. DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition is a challenging computer vision task. It is traditionally handled by a two-step approach, combining line segmentation followed by text line recognition. For the first time, we propose an end-to-end segmentation-free architecture for the task of handwritten document recognition: the Document Attention Network. In addition to text recognition, the model is… ▽ More

    Submitted 13 December, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2023

  10. Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers

    Authors: Mélodie Boillet, Martin Maarand, Thierry Paquet, Christopher Kermorvant

    Abstract: The segmentation of complex images into semantic regions has seen a growing interest these last years with the advent of Deep Learning. Until recently, most existing methods for Historical Document Analysis focused on the visual appearance of documents, ignoring the rich information that textual content can offer. However, the segmentation of complex documents into semantic regions is sometimes im… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Journal ref: The 6th International Workshop on Historical Document Imaging and Processing (2021)

  11. SPAN: a Simple Predict & Align Network for Handwritten Paragraph Recognition

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict & Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without an… ▽ More

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Document Analysis and Recognition - ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science, vol 12823

  12. Multiple Document Datasets Pre-training Improves Text Line Detection With Deep Neural Networks

    Authors: Mélodie Boillet, Christopher Kermorvant, Thierry Paquet

    Abstract: In this paper, we introduce a fully convolutional network for the document layout analysis task. While state-of-the-art methods are using models pre-trained on natural scene images, our method Doc-UFCN relies on a U-shaped model trained from scratch for detecting objects from historical documents. We consider the line segmentation task and more generally the layout analysis problem as a pixel-wise… ▽ More

    Submitted 29 March, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Journal ref: 2020 25th International Conference on Pattern Recognition (ICPR)

  13. Recurrence-free unconstrained handwritten text recognition using gated fully convolutional network

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition is a major step in most document analysis tasks. This is generally processed by deep recurrent neural networks and more specifically with the use of Long Short-Term Memory cells. The main drawbacks of these components are the large number of parameters involved and their sequential execution during training and prediction. One alternative solution to usin… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Journal ref: 2020 17th International Conference on Frontiers in Handwriting Recognition (ICFHR)

  14. Have convolutions already made recurrence obsolete for unconstrained handwritten text recognition ?

    Authors: Denis Coquenet, Yann Soullard, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition remains an important challenge for deep neural networks. These last years, recurrent networks and more specifically Long Short-Term Memory networks have achieved state-of-the-art performance in this field. Nevertheless, they are made of a large number of trainable parameters and training recurrent neural networks does not support parallelism. This has a d… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Journal ref: 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW)

  15. End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

    Authors: Denis Coquenet, Clément Chatelain, Thierry Paquet

    Abstract: Unconstrained handwritten text recognition remains challenging for computer vision systems. Paragraph text recognition is traditionally achieved by two models: the first one for line segmentation and the second one for text line recognition. We propose a unified end-to-end model using hybrid attention to tackle this task. This model is designed to iteratively process a paragraph image line by line… ▽ More

    Submitted 3 December, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 2022

  16. arXiv:1901.07957  [pdf, other

    cs.LG stat.ML

    CTCModel: a Keras Model for Connectionist Temporal Classification

    Authors: Yann Soullard, Cyprien Ruffino, Thierry Paquet

    Abstract: We report an extension of a Keras Model, called CTCModel, to perform the Connectionist Temporal Classification (CTC) in a transparent way. Combined with Recurrent Neural Networks, the Connectionist Temporal Classification is the reference method for dealing with unsegmented input sequences, i.e. with data that are a couple of observation and label sequences where each label is related to a subset… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

  17. A Unified Multilingual Handwriting Recognition System using multigrams sub-lexical units

    Authors: Wassim Swaileh, Yann Soullard, Thierry Paquet

    Abstract: We address the design of a unified multilingual system for handwriting recognition. Most of multi- lingual systems rests on specialized models that are trained on a single language and one of them is selected at test time. While some recognition systems are based on a unified optical model, dealing with a unified language model remains a major issue, as traditional language models are generally tr… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: preprint

    Journal ref: Pattern Recognition Letter 2018

  18. arXiv:1808.07277  [pdf

    cs.CV

    A syllable based model for handwriting recognition

    Authors: Wassim Swaileh, Thierry Paquet

    Abstract: In this paper, we introduce a new modeling approach of texts for handwriting recognition based on syllables. We propose a supervised syllabification approach for the French and English languages for building a vocabulary of syllables. Statistical n-gram language models of syllables are trained on French and English Wikipedia corpora. The handwriting recognition system, based on optical HMM context… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

  19. arXiv:1707.07432  [pdf, other

    cs.CV

    LV-ROVER: Lexicon Verified Recognizer Output Voting Error Reduction

    Authors: Bruno Stuner, Clément Chatelain, Thierry Paquet

    Abstract: Offline handwritten text line recognition is a hard task that requires both an efficient optical character recognizer and language model. Handwriting recognition state of the art methods are based on Long Short Term Memory (LSTM) recurrent neural networks (RNN) coupled with the use of linguistic knowledge. Most of the proposed approaches in the literature focus on improving one of the two componen… ▽ More

    Submitted 24 July, 2017; originally announced July 2017.

    Comments: Submitted to Pattern Recognition Letters

  20. arXiv:1612.07528  [pdf, other

    cs.CV

    Handwriting recognition using Cohort of LSTM and lexicon verification with extremely large lexicon

    Authors: Bruno Stuner, Clément Chatelain, Thierry Paquet

    Abstract: State-of-the-art methods for handwriting recognition are based on Long Short Term Memory (LSTM) recurrent neural networks (RNN), which now provides very impressive character recognition performance. The character recognition is generally coupled with a lexicon driven decoding process which integrates dictionaries. Unfortunately these dictionaries are limited to hundred of thousands words for the b… ▽ More

    Submitted 25 September, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: 31 pages, paper submitted to Pattern Recognition

  21. arXiv:1210.0999  [pdf

    cs.IR cs.CV cs.DL

    Logical segmentation for article extraction in digitized old newspapers

    Authors: Thomas Palfray, David Hébert, Stéphane Nicolas, Pierrick Tranouez, Thierry Paquet

    Abstract: Newspapers are documents made of news item and informative articles. They are not meant to be red iteratively: the reader can pick his items in any order he fancies. Ignoring this structural property, most digitized newspaper archives only offer access by issue or at best by page to their content. We have built a digitization workflow that automatically extracts newspaper articles from images, whi… ▽ More

    Submitted 3 October, 2012; originally announced October 2012.

    Comments: ACM Document Engineering, France (2012)