-
QPLEX: Realizing the Integration of Quantum Computing into Combinatorial Optimization Software
Authors:
Juan Giraldo,
José Ossorio,
Norha M. Villegas,
Gabriel Tamura,
Ulrike Stege
Abstract:
Quantum computing has the potential to surpass the capabilities of current classical computers when solving complex problems. Combinatorial optimization has emerged as one of the key target areas for quantum computers as problems found in this field play a critical role in many different industrial application sectors (e.g., enhancing manufacturing operations or improving decision processes). Curr…
▽ More
Quantum computing has the potential to surpass the capabilities of current classical computers when solving complex problems. Combinatorial optimization has emerged as one of the key target areas for quantum computers as problems found in this field play a critical role in many different industrial application sectors (e.g., enhancing manufacturing operations or improving decision processes). Currently, there are different types of high-performance optimization software (e.g., ILOG CPLEX and Gurobi) that support engineers and scientists in solving optimization problems using classical computers. In order to utilize quantum resources, users require domain-specific knowledge of quantum algorithms, SDKs and libraries, which can be a limiting factor for any practitioner who wants to integrate this technology into their workflows. Our goal is to add software infrastructure to a classical optimization package so that application developers can interface with quantum platforms readily when setting up their workflows. This paper presents a tool for the seamless utilization of quantum resources through a classical interface. Our approach consists of a Python library extension that provides a backend to facilitate access to multiple quantum providers. Our pipeline enables optimization software developers to experiment with quantum resources selectively and assess performance improvements of hybrid quantum-classical optimization solutions.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
Replicating and extending chain-ladder via an age-period-cohort structure on the claim development in a run-off triangle
Authors:
Gabriele Pittarello,
Munir Hiabu,
Andrés M. Villegas
Abstract:
This paper introduces yet another stochastic model replicating chain-ladder estimates and furthermore considers extensions that add flexibility to the modeling. In its simplest form, the proposed model replicates the chain-ladder's development factors using a GLM model with averaged hazard rates running in reversed development time as response. This is in contrast to the existing reserving literat…
▽ More
This paper introduces yet another stochastic model replicating chain-ladder estimates and furthermore considers extensions that add flexibility to the modeling. In its simplest form, the proposed model replicates the chain-ladder's development factors using a GLM model with averaged hazard rates running in reversed development time as response. This is in contrast to the existing reserving literature within the GLM framework where claim amounts are modeled as response. Modeling the averaged hazard rate corresponds to modeling the claim development and is arguably closer to the actual chain-ladder algorithm. Furthermore, since exposure does not need to be modeled, the model only has half the number of parameters compared to when modeling the claim amounts. This lesser complexity can be used to easily introduce model extensions that may better fit the data. We provide a new R-package, $\texttt{clmplus}$, where the models are implemented and can be fed with run-off triangles. We conduct an empirical study on 30 publicly available run-off triangles making a case for the benefit of having $\texttt{clmplus}$ in the actuary's toolbox.
△ Less
Submitted 10 November, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
The Athena X-ray Integral Field Unit: a consolidated design for the system requirement review of the preliminary definition phase
Authors:
Didier Barret,
Vincent Albouys,
Jan-Willem den Herder,
Luigi Piro,
Massimo Cappi,
Juhani Huovelin,
Richard Kelley,
J. Miguel Mas-Hesse,
Stéphane Paltani,
Gregor Rauw,
Agata Rozanska,
Jiri Svoboda,
Joern Wilms,
Noriko Yamasaki,
Marc Audard,
Simon Bandler,
Marco Barbera,
Xavier Barcons,
Enrico Bozzo,
Maria Teresa Ceballos,
Ivan Charles,
Elisa Costantini,
Thomas Dauser,
Anne Decourchelle,
Lionel Duband
, et al. (274 additional authors not shown)
Abstract:
The Athena X-ray Integral Unit (X-IFU) is the high resolution X-ray spectrometer, studied since 2015 for flying in the mid-30s on the Athena space X-ray Observatory, a versatile observatory designed to address the Hot and Energetic Universe science theme, selected in November 2013 by the Survey Science Committee. Based on a large format array of Transition Edge Sensors (TES), it aims to provide sp…
▽ More
The Athena X-ray Integral Unit (X-IFU) is the high resolution X-ray spectrometer, studied since 2015 for flying in the mid-30s on the Athena space X-ray Observatory, a versatile observatory designed to address the Hot and Energetic Universe science theme, selected in November 2013 by the Survey Science Committee. Based on a large format array of Transition Edge Sensors (TES), it aims to provide spatially resolved X-ray spectroscopy, with a spectral resolution of 2.5 eV (up to 7 keV) over an hexagonal field of view of 5 arc minutes (equivalent diameter). The X-IFU entered its System Requirement Review (SRR) in June 2022, at about the same time when ESA called for an overall X-IFU redesign (including the X-IFU cryostat and the cooling chain), due to an unanticipated cost overrun of Athena. In this paper, after illustrating the breakthrough capabilities of the X-IFU, we describe the instrument as presented at its SRR, browsing through all the subsystems and associated requirements. We then show the instrument budgets, with a particular emphasis on the anticipated budgets of some of its key performance parameters. Finally we briefly discuss on the ongoing key technology demonstration activities, the calibration and the activities foreseen in the X-IFU Instrument Science Center, and touch on communication and outreach activities, the consortium organisation, and finally on the life cycle assessment of X-IFU aiming at minimising the environmental footprint, associated with the development of the instrument. Thanks to the studies conducted so far on X-IFU, it is expected that along the design-to-cost exercise requested by ESA, the X-IFU will maintain flagship capabilities in spatially resolved high resolution X-ray spectroscopy, enabling most of the original X-IFU related scientific objectives of the Athena mission to be retained. (abridged).
△ Less
Submitted 28 November, 2022; v1 submitted 30 August, 2022;
originally announced August 2022.
-
Content and Style Aware Generation of Text-line Images for Handwriting Recognition
Authors:
Lei Kang,
Pau Riba,
Marçal Rusiñol,
Alicia Fornés,
Mauricio Villegas
Abstract:
Handwritten Text Recognition has achieved an impressive performance in public benchmarks. However, due to the high inter- and intra-class variability between handwriting styles, such recognizers need to be trained using huge volumes of manually labeled training data. To alleviate this labor-consuming problem, synthetic data produced with TrueType fonts has been often used in the training loop to g…
▽ More
Handwritten Text Recognition has achieved an impressive performance in public benchmarks. However, due to the high inter- and intra-class variability between handwriting styles, such recognizers need to be trained using huge volumes of manually labeled training data. To alleviate this labor-consuming problem, synthetic data produced with TrueType fonts has been often used in the training loop to gain volume and augment the handwriting style variability. However, there is a significant style bias between synthetic and real data which hinders the improvement of recognition performance. To deal with such limitations, we propose a generative method for handwritten text-line images, which is conditioned on both visual appearance and textual content. Our method is able to produce long text-line samples with diverse handwriting styles. Once properly trained, our method can also be adapted to new target data by only accessing unlabeled text-line images to mimic handwritten styles and produce images with any textual content. Extensive experiments have been done on making use of the generated samples to boost Handwritten Text Recognition performance. Both qualitative and quantitative results demonstrate that the proposed approach outperforms the current state of the art.
△ Less
Submitted 12 April, 2022;
originally announced April 2022.
-
Framework para Caracterizar Fake News en Terminos de Emociones
Authors:
Luis Rojas Rubio,
Claudio Meneses Villegas
Abstract:
Social networks have become one of the main information channels for human beings due to the immediate and social interactivity they offer, allowing in some cases to publish what each user considers relevant. This has brought with it the generation of false news or Fake News, publications that only seek to generate uncertainty, misinformation or skew the opinion of readers. It has been shown that…
▽ More
Social networks have become one of the main information channels for human beings due to the immediate and social interactivity they offer, allowing in some cases to publish what each user considers relevant. This has brought with it the generation of false news or Fake News, publications that only seek to generate uncertainty, misinformation or skew the opinion of readers. It has been shown that the human being is not capable of fully identifying whether an article is really a fact or a Fake News, due to this it is that models arise that seek to characterize and identify articles based on data mining and machine learning. This article proposes a three-layer framework, the main objective of which is to characterize the emotions present in Fake News and to be a tool for future work that identifies the emotional state and intentional state of the public.
△ Less
Submitted 13 December, 2021;
originally announced December 2021.
-
The Catalan Language CLUB
Authors:
Carlos Rodriguez-Penagos,
Carme Armentano-Oller,
Marta Villegas,
Maite Melero,
Aitor Gonzalez,
Ona de Gibert Bonet,
Casimiro Carrino Pio
Abstract:
The Catalan Language Understanding Benchmark (CLUB) encompasses various datasets representative of different NLU tasks that enable accurate evaluations of language models, following the General Language Understanding Evaluation (GLUE) example. It is part of AINA and PlanTL, two public funding initiatives to empower the Catalan language in the Artificial Intelligence era.
The Catalan Language Understanding Benchmark (CLUB) encompasses various datasets representative of different NLU tasks that enable accurate evaluations of language models, following the General Language Understanding Evaluation (GLUE) example. It is part of AINA and PlanTL, two public funding initiatives to empower the Catalan language in the Artificial Intelligence era.
△ Less
Submitted 3 December, 2021;
originally announced December 2021.
-
Spanish Legalese Language Model and Corpora
Authors:
Asier Gutiérrez-Fandiño,
Jordi Armengol-Estapé,
Aitor Gonzalez-Agirre,
Marta Villegas
Abstract:
There are many Language Models for the English language according to its worldwide relevance. However, for the Spanish language, even if it is a widely spoken language, there are very few Spanish Language Models which result to be small and too general. Legal slang could be think of a Spanish variant on its own as it is very complicated in vocabulary, semantics and phrase understanding. For this w…
▽ More
There are many Language Models for the English language according to its worldwide relevance. However, for the Spanish language, even if it is a widely spoken language, there are very few Spanish Language Models which result to be small and too general. Legal slang could be think of a Spanish variant on its own as it is very complicated in vocabulary, semantics and phrase understanding. For this work we gathered legal-domain corpora from different sources, generated a model and evaluated against Spanish general domain tasks. The model provides reasonable results in those tasks.
△ Less
Submitted 23 October, 2021;
originally announced October 2021.
-
Spanish Biomedical Crawled Corpus: A Large, Diverse Dataset for Spanish Biomedical Language Models
Authors:
Casimiro Pio Carrino,
Jordi Armengol-Estapé,
Ona de Gibert Bonet,
Asier Gutiérrez-Fandiño,
Aitor Gonzalez-Agirre,
Martin Krallinger,
Marta Villegas
Abstract:
We introduce CoWeSe (the Corpus Web Salud Español), the largest Spanish biomedical corpus to date, consisting of 4.5GB (about 750M tokens) of clean plain text. CoWeSe is the result of a massive crawler on 3000 Spanish domains executed in 2020. The corpus is openly available and already preprocessed. CoWeSe is an important resource for biomedical and health NLP in Spanish and has already been emplo…
▽ More
We introduce CoWeSe (the Corpus Web Salud Español), the largest Spanish biomedical corpus to date, consisting of 4.5GB (about 750M tokens) of clean plain text. CoWeSe is the result of a massive crawler on 3000 Spanish domains executed in 2020. The corpus is openly available and already preprocessed. CoWeSe is an important resource for biomedical and health NLP in Spanish and has already been employed to train domain-specific language models and to produce word embbedings. We released the CoWeSe corpus under a Creative Commons Attribution 4.0 International license, both in Zenodo (\url{https://zenodo.org/record/4561971\#.YTI5SnVKiEA}).
△ Less
Submitted 16 September, 2021;
originally announced September 2021.
-
Biomedical and Clinical Language Models for Spanish: On the Benefits of Domain-Specific Pretraining in a Mid-Resource Scenario
Authors:
Casimiro Pio Carrino,
Jordi Armengol-Estapé,
Asier Gutiérrez-Fandiño,
Joan Llop-Palao,
Marc Pàmies,
Aitor Gonzalez-Agirre,
Marta Villegas
Abstract:
This work presents biomedical and clinical language models for Spanish by experimenting with different pretraining choices, such as masking at word and subword level, varying the vocabulary size and testing with domain data, looking for better language representations. Interestingly, in the absence of enough clinical data to train a model from scratch, we applied mixed-domain pretraining and cross…
▽ More
This work presents biomedical and clinical language models for Spanish by experimenting with different pretraining choices, such as masking at word and subword level, varying the vocabulary size and testing with domain data, looking for better language representations. Interestingly, in the absence of enough clinical data to train a model from scratch, we applied mixed-domain pretraining and cross-domain transfer approaches to generate a performant bio-clinical model suitable for real-world clinical data. We evaluated our models on Named Entity Recognition (NER) tasks for biomedical documents and challenging hospital discharge reports. When compared against the competitive mBERT and BETO models, we outperform them in all NER tasks by a significant margin. Finally, we studied the impact of the model's vocabulary on the NER performances by offering an interesting vocabulary-centric analysis. The results confirm that domain-specific pretraining is fundamental to achieving higher performances in downstream NER tasks, even within a mid-resource scenario. To the best of our knowledge, we provide the first biomedical and clinical transformer-based pretrained language models for Spanish, intending to boost native Spanish NLP applications in biomedicine. Our best models are freely available in the HuggingFace hub: https://huggingface.co/BSC-TeMU.
△ Less
Submitted 17 September, 2021; v1 submitted 8 September, 2021;
originally announced September 2021.
-
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan
Authors:
Jordi Armengol-Estapé,
Casimiro Pio Carrino,
Carlos Rodriguez-Penagos,
Ona de Gibert Bonet,
Carme Armentano-Oller,
Aitor Gonzalez-Agirre,
Maite Melero,
Marta Villegas
Abstract:
Multilingual language models have been a crucial breakthrough as they considerably reduce the need of data for under-resourced languages. Nevertheless, the superiority of language-specific models has already been proven for languages having access to large amounts of data. In this work, we focus on Catalan with the aim to explore to what extent a medium-sized monolingual language model is competit…
▽ More
Multilingual language models have been a crucial breakthrough as they considerably reduce the need of data for under-resourced languages. Nevertheless, the superiority of language-specific models has already been proven for languages having access to large amounts of data. In this work, we focus on Catalan with the aim to explore to what extent a medium-sized monolingual language model is competitive with state-of-the-art large multilingual models. For this, we: (1) build a clean, high-quality textual Catalan corpus (CaText), the largest to date (but only a fraction of the usual size of the previous work in monolingual language models), (2) train a Transformer-based language model for Catalan (BERTa), and (3) devise a thorough evaluation in a diversity of settings, comprising a complete array of downstream tasks, namely, Part of Speech Tagging, Named Entity Recognition and Classification, Text Classification, Question Answering, and Semantic Textual Similarity, with most of the corresponding datasets being created ex novo. The result is a new benchmark, the Catalan Language Understanding Benchmark (CLUB), which we publish as an open resource, together with the clean textual corpus, the language model, and the cleaning pipeline. Using state-of-the-art multilingual models and a monolingual model trained only on Wikipedia as baselines, we consistently observe the superiority of our model across tasks and settings.
△ Less
Submitted 16 July, 2021;
originally announced July 2021.
-
MarIA: Spanish Language Models
Authors:
Asier Gutiérrez-Fandiño,
Jordi Armengol-Estapé,
Marc Pàmies,
Joan Llop-Palao,
Joaquín Silveira-Ocampo,
Casimiro Pio Carrino,
Aitor Gonzalez-Agirre,
Carme Armentano-Oller,
Carlos Rodriguez-Penagos,
Marta Villegas
Abstract:
This work presents MarIA, a family of Spanish language models and associated resources made available to the industry and the research community. Currently, MarIA includes RoBERTa-base, RoBERTa-large, GPT2 and GPT2-large Spanish language models, which can arguably be presented as the largest and most proficient language models in Spanish. The models were pretrained using a massive corpus of 570GB…
▽ More
This work presents MarIA, a family of Spanish language models and associated resources made available to the industry and the research community. Currently, MarIA includes RoBERTa-base, RoBERTa-large, GPT2 and GPT2-large Spanish language models, which can arguably be presented as the largest and most proficient language models in Spanish. The models were pretrained using a massive corpus of 570GB of clean and deduplicated texts with 135 billion words extracted from the Spanish Web Archive crawled by the National Library of Spain between 2009 and 2019. We assessed the performance of the models with nine existing evaluation datasets and with a novel extractive Question Answering dataset created ex novo. Overall, MarIA models outperform the existing Spanish models across a variety of NLU tasks and training settings.
△ Less
Submitted 5 April, 2022; v1 submitted 15 July, 2021;
originally announced July 2021.
-
Overview of BioASQ 2020: The eighth BioASQ challenge on Large-Scale Biomedical Semantic Indexing and Question Answering
Authors:
Anastasios Nentidis,
Anastasia Krithara,
Konstantinos Bougiatiotis,
Martin Krallinger,
Carlos Rodriguez-Penagos,
Marta Villegas,
Georgios Paliouras
Abstract:
In this paper, we present an overview of the eighth edition of the BioASQ challenge, which ran as a lab in the Conference and Labs of the Evaluation Forum (CLEF) 2020. BioASQ is a series of challenges aiming at the promotion of systems and methodologies for large-scale biomedical semantic indexing and question answering. To this end, shared tasks are organized yearly since 2012, where different te…
▽ More
In this paper, we present an overview of the eighth edition of the BioASQ challenge, which ran as a lab in the Conference and Labs of the Evaluation Forum (CLEF) 2020. BioASQ is a series of challenges aiming at the promotion of systems and methodologies for large-scale biomedical semantic indexing and question answering. To this end, shared tasks are organized yearly since 2012, where different teams develop systems that compete on the same demanding benchmark datasets that represent the real information needs of experts in the biomedical domain. This year, the challenge has been extended with the introduction of a new task on medical semantic indexing in Spanish. In total, 34 teams with more than 100 systems participated in the three tasks of the challenge. As in previous years, the results of the evaluation reveal that the top-performing systems managed to outperform the strong baselines, which suggests that state-of-the-art systems keep pushing the frontier of research through continuous improvements.
△ Less
Submitted 28 June, 2021;
originally announced June 2021.
-
Persistent Homology Captures the Generalization of Neural Networks Without A Validation Set
Authors:
Asier Gutiérrez-Fandiño,
David Pérez-Fernández,
Jordi Armengol-Estapé,
Marta Villegas
Abstract:
The training of neural networks is usually monitored with a validation (holdout) set to estimate the generalization of the model. This is done instead of measuring intrinsic properties of the model to determine whether it is learning appropriately. In this work, we suggest studying the training of neural networks with Algebraic Topology, specifically Persistent Homology (PH). Using simplicial comp…
▽ More
The training of neural networks is usually monitored with a validation (holdout) set to estimate the generalization of the model. This is done instead of measuring intrinsic properties of the model to determine whether it is learning appropriately. In this work, we suggest studying the training of neural networks with Algebraic Topology, specifically Persistent Homology (PH). Using simplicial complex representations of neural networks, we study the PH diagram distance evolution on the neural network learning process with different architectures and several datasets. Results show that the PH diagram distance between consecutive neural network states correlates with the validation accuracy, implying that the generalization error of a neural network could be intrinsically estimated without any holdout set.
△ Less
Submitted 31 May, 2021;
originally announced June 2021.
-
Collagenase Nanocapsules: An Approach to Fibrosis Treatment
Authors:
MR Villegas,
A Baeza,
A Usategui,
PL Ortiz-Romero,
JL Pablos,
M Vallet-Regi
Abstract:
Fibrosis is a common lesion in different pathologic diseases and is defined by the excessive accumulation of collagen. Different approaches have been used to treat different conditions characterized by fibrosis. FDA and EMA approved collagenase to treat palmar fibromatosis, Dupuyten disease. EMA approved additionally its use in severe Peyronie disease, but it has been used off label in other condi…
▽ More
Fibrosis is a common lesion in different pathologic diseases and is defined by the excessive accumulation of collagen. Different approaches have been used to treat different conditions characterized by fibrosis. FDA and EMA approved collagenase to treat palmar fibromatosis, Dupuyten disease. EMA approved additionally its use in severe Peyronie disease, but it has been used off label in other conditions.1, 2. Approved treatment includes up to 3, in palmar fibromatosis or up to 8, in penile fibromatosis, injections followed by finger extension or penile modelling procedures, typically causing severe pain. Frequently single injections are enough to treat palmar fibromatosis. 3, The need to inject repeatedly doses of this enzyme can be originated by by the labile nature of collagenase which exhibits a complete activity loss after short periods of time. Herein, a novel strategy to manage this enzyme based on the synthesis of polymeric nanocapsules which contains collagenase housed within their matrix is presented. These nanocapsules have been engineered for achieving a gradual release of the encapsulated enzyme for longer times which can be up to ten days. The efficacy of these nanocapsules have been tested in murine model of local dermal fibrosis yielding higher fibrosis reduction in comparison with the injection of free enzyme which represent a significant improvement over conventional therapy.
△ Less
Submitted 18 March, 2021;
originally announced March 2021.
-
Spanish Biomedical and Clinical Language Embeddings
Authors:
Asier Gutiérrez-Fandiño,
Jordi Armengol-Estapé,
Casimiro Pio Carrino,
Ona De Gibert,
Aitor Gonzalez-Agirre,
Marta Villegas
Abstract:
We computed both Word and Sub-word Embeddings using FastText. For Sub-word embeddings we selected Byte Pair Encoding (BPE) algorithm to represent the sub-words. We evaluated the Biomedical Word Embeddings obtaining better results than previous versions showing the implication that with more data, we obtain better representations.
We computed both Word and Sub-word Embeddings using FastText. For Sub-word embeddings we selected Byte Pair Encoding (BPE) algorithm to represent the sub-words. We evaluated the Biomedical Word Embeddings obtaining better results than previous versions showing the implication that with more data, we obtain better representations.
△ Less
Submitted 25 February, 2021;
originally announced February 2021.
-
Characterizing and Measuring the Similarity of Neural Networks with Persistent Homology
Authors:
David Pérez-Fernández,
Asier Gutiérrez-Fandiño,
Jordi Armengol-Estapé,
Marta Villegas
Abstract:
Characterizing the structural properties of neural networks is crucial yet poorly understood, and there are no well-established similarity measures between networks. In this work, we observe that neural networks can be represented as abstract simplicial complex and analyzed using their topological 'fingerprints' via Persistent Homology (PH). We then describe a PH-based representation proposed for…
▽ More
Characterizing the structural properties of neural networks is crucial yet poorly understood, and there are no well-established similarity measures between networks. In this work, we observe that neural networks can be represented as abstract simplicial complex and analyzed using their topological 'fingerprints' via Persistent Homology (PH). We then describe a PH-based representation proposed for characterizing and measuring similarity of neural networks. We empirically show the effectiveness of this representation as a descriptor of different architectures in several datasets. This approach based on Topological Data Analysis is a step towards better understanding neural networks and serves as a useful similarity measure.
△ Less
Submitted 31 May, 2021; v1 submitted 19 January, 2021;
originally announced January 2021.
-
A Vulnerability Study on Academic Collaboration Networks Based on Network Dynamics
Authors:
Asier Gutiérrez-Fandiño,
Jordi Armengol-Estapé,
Marta Villegas
Abstract:
Researchers that work for the same institution use their email as the main communication tool. Email can be one of the most fruitful attack vectors of research institutions as they also contain access to all accounts and thus to all private information. We propose an approach for analyzing in terms of security research institutions' communication networks. We first obtained institutions' communica…
▽ More
Researchers that work for the same institution use their email as the main communication tool. Email can be one of the most fruitful attack vectors of research institutions as they also contain access to all accounts and thus to all private information. We propose an approach for analyzing in terms of security research institutions' communication networks. We first obtained institutions' communication networks as well as a method to analyze possible breaches of collected emails. We downloaded the network of 4 different research centers, three from Spain and one from Portugal. We then ran simulations of Susceptible-Exposed-Infected-Recovered (SEIR) complex network dynamics model for analyzing the vulnerability of the network. More than half of the nodes have more than one security breach, and our simulation results show that more than 90\% of the networks' nodes are vulnerable. This method can be employed for enhancing security of research centers and can make email accounts' use security-aware. It may additionally open new research lines in communication security. Finally, we manifest that, due to confidentiality reasons, the sources we utilized for obtaining communication networks should not be providing the information that we were able to gather.
△ Less
Submitted 31 March, 2021; v1 submitted 21 December, 2020;
originally announced December 2020.
-
Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition
Authors:
Lei Kang,
Pau Riba,
Marçal Rusiñol,
Alicia Fornés,
Mauricio Villegas
Abstract:
The advent of recurrent neural networks for handwriting recognition marked an important milestone reaching impressive recognition accuracies despite the great variability that we observe across different writing styles. Sequential architectures are a perfect fit to model text lines, not only because of the inherent temporal aspect of text, but also to learn probability distributions over sequences…
▽ More
The advent of recurrent neural networks for handwriting recognition marked an important milestone reaching impressive recognition accuracies despite the great variability that we observe across different writing styles. Sequential architectures are a perfect fit to model text lines, not only because of the inherent temporal aspect of text, but also to learn probability distributions over sequences of characters and words. However, using such recurrent paradigms comes at a cost at training stage, since their sequential pipelines prevent parallelization. In this work, we introduce a non-recurrent approach to recognize handwritten text by the use of transformer models. We propose a novel method that bypasses any recurrence. By using multi-head self-attention layers both at the visual and textual stages, we are able to tackle character recognition as well as to learn language-related dependencies of the character sequences to be decoded. Our model is unconstrained to any predefined vocabulary, being able to recognize out-of-vocabulary words, i.e. words that do not appear in the training vocabulary. We significantly advance over prior art and demonstrate that satisfactory recognition accuracies are yielded even in few-shot learning scenarios.
△ Less
Submitted 26 May, 2020;
originally announced May 2020.
-
GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images
Authors:
Lei Kang,
Pau Riba,
Yaxing Wang,
Marçal Rusiñol,
Alicia Fornés,
Mauricio Villegas
Abstract:
Although current image generation methods have reached impressive quality levels, they are still unable to produce plausible yet diverse images of handwritten words. On the contrary, when writing by hand, a great variability is observed across different writers, and even when analyzing words scribbled by the same individual, involuntary variations are conspicuous. In this work, we take a step clos…
▽ More
Although current image generation methods have reached impressive quality levels, they are still unable to produce plausible yet diverse images of handwritten words. On the contrary, when writing by hand, a great variability is observed across different writers, and even when analyzing words scribbled by the same individual, involuntary variations are conspicuous. In this work, we take a step closer to producing realistic and varied artificially rendered handwritten words. We propose a novel method that is able to produce credible handwritten word images by conditioning the generative process with both calligraphic style features and textual content. Our generator is guided by three complementary learning objectives: to produce realistic images, to imitate a certain handwriting style and to convey a specific textual content. Our model is unconstrained to any predefined vocabulary, being able to render whatever input word. Given a sample writer, it is also able to mimic its calligraphic features in a few-shot setup. We significantly advance over prior art and demonstrate with qualitative, quantitative and human-based evaluations the realistic aspect of our synthetically produced images.
△ Less
Submitted 21 July, 2020; v1 submitted 5 March, 2020;
originally announced March 2020.
-
Candidate Fusion: Integrating Language Modelling into a Sequence-to-Sequence Handwritten Word Recognition Architecture
Authors:
Lei Kang,
Pau Riba,
Mauricio Villegas,
Alicia Fornés,
Marçal Rusiñol
Abstract:
Sequence-to-sequence models have recently become very popular for tackling handwritten word recognition problems. However, how to effectively integrate an external language model into such recognizer is still a challenging problem. The main challenge faced when training a language model is to deal with the language model corpus which is usually different to the one used for training the handwritte…
▽ More
Sequence-to-sequence models have recently become very popular for tackling handwritten word recognition problems. However, how to effectively integrate an external language model into such recognizer is still a challenging problem. The main challenge faced when training a language model is to deal with the language model corpus which is usually different to the one used for training the handwritten word recognition system. Thus, the bias between both word corpora leads to incorrectness on the transcriptions, providing similar or even worse performances on the recognition task. In this work, we introduce Candidate Fusion, a novel way to integrate an external language model to a sequence-to-sequence architecture. Moreover, it provides suggestions from an external language knowledge, as a new input to the sequence-to-sequence recognizer. Hence, Candidate Fusion provides two improvements. On the one hand, the sequence-to-sequence recognizer has the flexibility not only to combine the information from itself and the language model, but also to choose the importance of the information provided by the language model. On the other hand, the external language model has the ability to adapt itself to the training corpus and even learn the most commonly errors produced from the recognizer. Finally, by conducting comprehensive experiments, the Candidate Fusion proves to outperform the state-of-the-art language models for handwritten word recognition tasks.
△ Less
Submitted 21 December, 2019;
originally announced December 2019.
-
A Neural Model for Text Localization, Transcription and Named Entity Recognition in Full Pages
Authors:
Manuel Carbonell,
Alicia Fornés,
Mauricio Villegas,
Josep Lladós
Abstract:
In the last years, the consolidation of deep neural network architectures for information extraction in document images has brought big improvements in the performance of each of the tasks involved in this process, consisting of text localization, transcription, and named entity recognition. However, this process is traditionally performed with separate methods for each task. In this work we propo…
▽ More
In the last years, the consolidation of deep neural network architectures for information extraction in document images has brought big improvements in the performance of each of the tasks involved in this process, consisting of text localization, transcription, and named entity recognition. However, this process is traditionally performed with separate methods for each task. In this work we propose an end-to-end model that combines a one stage object detection network with branches for the recognition of text and named entities respectively in a way that shared features can be learned simultaneously from the training error of each of the tasks. By doing so the model jointly performs handwritten text detection, transcription, and named entity recognition at page level with a single feed forward step. We exhaustively evaluate our approach on different datasets, discussing its advantages and limitations compared to sequential approaches. The results show that the model is capable of benefiting from shared features for simultaneously solving interdependent tasks.
△ Less
Submitted 4 May, 2020; v1 submitted 20 December, 2019;
originally announced December 2019.
-
Optically Cooling Cesium Lead Tribromide Nanoparticles
Authors:
Benjamin J. Roman,
Noel Mireles Villegas,
Kylie Lytle,
Matthew T. Sheldon
Abstract:
One photon up-conversion photoluminescence is an optical phenomenon whereby the thermal energy of a fluorescent material increases the energy of an emitted photon compared with the energy of the photon that was absorbed. When this occurs with near unity efficiency, the emitting material undergoes a net decrease in temperature--so called optical cooling. Because the up-conversion mechanism is therm…
▽ More
One photon up-conversion photoluminescence is an optical phenomenon whereby the thermal energy of a fluorescent material increases the energy of an emitted photon compared with the energy of the photon that was absorbed. When this occurs with near unity efficiency, the emitting material undergoes a net decrease in temperature--so called optical cooling. Because the up-conversion mechanism is thermally activated, the yield of up-converted photoluminescence is also a reporter of the temperature of the emitter. Taking advantage of this optical signature, cesium lead trihalide nanocrystals are shown to cool during the up-conversion of 532 nm CW laser excitation. Raman thermometric analysis of a substrate the nanocrystals were deposited on further verifies the decrease in the local environmental temperature by as much as 25 degrees during optical pum**. This is the first demonstration of optical cooling driven by colloidal semiconductor nanocrystal up-conversion.
△ Less
Submitted 14 September, 2020; v1 submitted 10 December, 2019;
originally announced December 2019.
-
Unsupervised Adaptation for Synthetic-to-Real Handwritten Word Recognition
Authors:
Lei Kang,
Marçal Rusiñol,
Alicia Fornés,
Pau Riba,
Mauricio Villegas
Abstract:
Handwritten Text Recognition (HTR) is still a challenging problem because it must deal with two important difficulties: the variability among writing styles, and the scarcity of labelled data. To alleviate such problems, synthetic data generation and data augmentation are typically used to train HTR systems. However, training with such data produces encouraging but still inaccurate transcriptions…
▽ More
Handwritten Text Recognition (HTR) is still a challenging problem because it must deal with two important difficulties: the variability among writing styles, and the scarcity of labelled data. To alleviate such problems, synthetic data generation and data augmentation are typically used to train HTR systems. However, training with such data produces encouraging but still inaccurate transcriptions in real words. In this paper, we propose an unsupervised writer adaptation approach that is able to automatically adjust a generic handwritten word recognizer, fully trained with synthetic fonts, towards a new incoming writer. We have experimentally validated our proposal using five different datasets, covering several challenges (i) the document source: modern and historic samples, which may involve paper degradation problems; (ii) different handwriting styles: single and multiple writer collections; and (iii) language, which involves different character combinations. Across these challenging collections, we show that our system is able to maintain its performance, thus, it provides a practical and generic approach to deal with new document collections without requiring any expensive and tedious manual annotation step.
△ Less
Submitted 26 May, 2020; v1 submitted 18 September, 2019;
originally announced September 2019.
-
Joint Recognition of Handwritten Text and Named Entities with a Neural End-to-end Model
Authors:
Manuel Carbonell,
Mauricio Villegas,
Alicia Fornés,
Josep Lladós
Abstract:
When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recognitio…
▽ More
When extracting information from handwritten documents, text transcription and named entity recognition are usually faced as separate subsequent tasks. This has the disadvantage that errors in the first module affect heavily the performance of the second module. In this work we propose to do both tasks jointly, using a single neural network with a common architecture used for plain text recognition. Experimentally, the work has been tested on a collection of historical marriage records. Results of experiments are presented to show the effect on the performance for different configurations: different ways of encoding the information, doing or not transfer learning and processing at text line or multi-line region level. The results are comparable to state of the art reported in the ICDAR 2017 Information Extraction competition, even though the proposed technique does not use any dictionaries, language modeling or post processing.
△ Less
Submitted 22 March, 2018; v1 submitted 16 March, 2018;
originally announced March 2018.
-
Search for neutral Higgs bosons decaying into four taus at LEP2
Authors:
ALEPH Collaboration,
S. Schael,
R. Barate,
R. Brunelière,
I. De Bonis,
D. Decamp,
C. Goy,
S. Jézéquel,
J. -P. Lees,
F. Martin,
E. Merle,
M. -N. Minard,
B. Pietrzyk,
B. Trocmé S. Bravo,
M. P. Casado,
M. Chmeissani,
J. M. Crespo,
E. Fernandez,
M. Fernandez-Bosman,
Ll. Garrido,
M. Martinez,
A. Pacheco,
H. Ruiz,
A. Colaleo,
D. Creanza
, et al. (236 additional authors not shown)
Abstract:
A search for the production and non-standard decay of a Higgs boson, h, into four taus through intermediate pseudoscalars, a, is conducted on 683 pb-1 of data collected by the ALEPH experiment at centre-of-mass energies from 183 to 209 GeV. No excess of events above background is observed, and exclusion limits are placed on the combined production cross section times branching ratio, ξ^2 = σ(e+e…
▽ More
A search for the production and non-standard decay of a Higgs boson, h, into four taus through intermediate pseudoscalars, a, is conducted on 683 pb-1 of data collected by the ALEPH experiment at centre-of-mass energies from 183 to 209 GeV. No excess of events above background is observed, and exclusion limits are placed on the combined production cross section times branching ratio, ξ^2 = σ(e+e- --> Zh)/σ_{SM}(e+e- --> Zh) x B(h --> aa)x B(a --> τ^+τ^-)^2. For mh < 107 GeV/c2 and 4 < ma < 10 GeV/c2, ξ^2 > 1 is excluded at the 95% confidence level.
△ Less
Submitted 19 April, 2010; v1 submitted 2 March, 2010;
originally announced March 2010.
-
On Discrete Quasiprobability Distributions
Authors:
C. A. Munoz Villegas,
A. Chavez Chavez,
S. Chumakov,
Yu. Fofanov,
A. B. Klimov
Abstract:
We analyze quasi probability distributions in discrete phase space related to the discrete Heisenberg-Weyl group. In particular, we discuss the relation between the Discrete Wigner and Q- functions.
We analyze quasi probability distributions in discrete phase space related to the discrete Heisenberg-Weyl group. In particular, we discuss the relation between the Discrete Wigner and Q- functions.
△ Less
Submitted 7 July, 2003;
originally announced July 2003.