Search | arXiv e-print repository

X-ray Coulomb explosion imaging reveals role of molecular structure in internal conversion

Authors: Till Jahnke, Sebastian Mai, Surjendu Bhattacharyya, Keyu Chen, Rebecca Boll, Maria Elena Castellani, Simon Dold, Avijit Duley, Ulrike Frühling, Alice E. Green, Markus Ilchen, Rebecca Ingle, Gregor Kastirke, Huynh Van Sa Lam, Fabiano Lever, Dennis Mayer, Tommaso Mazza, Terence Mullins, Yevheniy Ovcharenko, Björn Senfftleben, Florian Trinter, Atia Tul Noor, Sergey Usenko, Anbu Selvam Venkatachalam, Artem Rudenko , et al. (4 additional authors not shown)

Abstract: Molecular photoabsorption results in an electronic excitation/ionization which couples to the rearrangement of the nuclei. The resulting intertwined change of nuclear and electronic degrees of freedom determines the conversion of photoenergy into other molecular energy forms. Nucleobases are excellent candidates for studying such dynamics, and great effort has been taken in the past to observe the… ▽ More Molecular photoabsorption results in an electronic excitation/ionization which couples to the rearrangement of the nuclei. The resulting intertwined change of nuclear and electronic degrees of freedom determines the conversion of photoenergy into other molecular energy forms. Nucleobases are excellent candidates for studying such dynamics, and great effort has been taken in the past to observe the electronic changes induced by the initial excitation in a time-resolved manner using ultrafast electron spectroscopy. The linked geometrical changes during nucleobase photorelaxation have so far not been observed directly in time-resolved experiments. Here, we present a study on a thionucleobase, where we extract comprehensive information on the molecular rearrangement using Coulomb explosion imaging. Our measurement links the extracted deplanarization of the molecular geometry to the previously studied temporal evolution of the electronic properties of the system. In particular, the protons of the exploded molecule are well-suited messengers carrying rich information on the molecule's geometry at distinct times after the initial electronic excitation. The combination of ultrashort laser pulses to trigger molecular dynamics, intense X-ray free-electron laser pulses for the explosion of the molecule, and multi-particle coincidence detection opens new avenues for time-resolved studies of complex molecules in the gas phase. △ Less

Submitted 24 May, 2024; originally announced May 2024.

Comments: 19 pages, 8 figures

arXiv:2311.12482 [pdf]

Monitoring the evolution of relative product populations at early times during a photochemical reaction

Authors: Joao Pedro Figueira Nunes, Lea Maria Ibele, Shashank Pathak, Andrew R. Attar, Surjendu Bhattacharyya, Rebecca Boll, Kurtis Borne, Martin Centurion, Benjamin Erk, Ming-Fu Lin, Ruaridh J. G. Forbes, Nate Goff, Christopher S. Hansen, Matthias Hoffmann, David M. P. Holland, Rebecca A. Ingle, Duan Luo, Sri Bhavya Muvva, Alex Reid, Arnaud Rouzée, Artem Rudenko, Sajib Kumar Saha, Xiaozhe Shen, Anbu Selvam Venkatachalam, Xijie Wang , et al. (9 additional authors not shown)

Abstract: Identifying multiple rival reaction products and transient species formed during ultrafast photochemical reactions and determining their time-evolving relative populations are key steps towards understanding and predicting photochemical outcomes. Yet, most contemporary ultrafast studies struggle with clearly identifying and quantifying competing molecular structures/species amongst the emerging re… ▽ More Identifying multiple rival reaction products and transient species formed during ultrafast photochemical reactions and determining their time-evolving relative populations are key steps towards understanding and predicting photochemical outcomes. Yet, most contemporary ultrafast studies struggle with clearly identifying and quantifying competing molecular structures/species amongst the emerging reaction products. Here, we show that mega-electronvolt ultrafast electron diffraction in combination with ab initio molecular dynamics calculations offer a powerful route to determining time-resolved populations of the various isomeric products formed after UV (266 nm) excitation of the five-membered heterocyclic molecule 2(5H)-thiophenone. This strategy provides experimental validation of the predicted high (~50%) yield of an episulfide isomer containing a strained 3-membered ring within ~1 ps of photoexcitation and highlights the rapidity of interconversion between the rival highly vibrationally excited photoproducts in their ground electronic state. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2308.15037 [pdf, other]

Is it an i or an l: Test-time Adaptation of Text Line Recognition Models

Authors: Debapriya Tula, Sujoy Paul, Gagan Madan, Peter Garst, Reeve Ingle, Gaurav Aggarwal

Abstract: Recognizing text lines from images is a challenging problem, especially for handwritten documents due to large variations in writing styles. While text line recognition models are generally trained on large corpora of real and synthetic data, such models can still make frequent mistakes if the handwriting is inscrutable or the image acquisition process adds corruptions, such as noise, blur, compre… ▽ More Recognizing text lines from images is a challenging problem, especially for handwritten documents due to large variations in writing styles. While text line recognition models are generally trained on large corpora of real and synthetic data, such models can still make frequent mistakes if the handwriting is inscrutable or the image acquisition process adds corruptions, such as noise, blur, compression, etc. Writing style is generally quite consistent for an individual, which can be leveraged to correct mistakes made by such models. Motivated by this, we introduce the problem of adapting text line recognition models during test time. We focus on a challenging and realistic setting where, given only a single test image consisting of multiple text lines, the task is to adapt the model such that it performs better on the image, without any labels. We propose an iterative self-training approach that uses feedback from the language model to update the optical model, with confident self-labels in each iteration. The confidence measure is based on an augmentation mechanism that evaluates the divergence of the prediction of the model in a local region. We perform rigorous evaluation of our method on several benchmark datasets as well as their corrupted versions. Experimental results on multiple datasets spanning multiple scripts show that the proposed adaptation method offers an absolute improvement of up to 8% in character error rate with just a few iterations of self-training at test time. △ Less

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.09671 [pdf, other]

OCR Language Models with Custom Vocabularies

Authors: Peter Garst, Reeve Ingle, Yasuhisa Fujii

Abstract: Language models are useful adjuncts to optical models for producing accurate optical character recognition (OCR) results. One factor which limits the power of language models in this context is the existence of many specialized domains with language statistics very different from those implied by a general language model - think of checks, medical prescriptions, and many other specialized document… ▽ More Language models are useful adjuncts to optical models for producing accurate optical character recognition (OCR) results. One factor which limits the power of language models in this context is the existence of many specialized domains with language statistics very different from those implied by a general language model - think of checks, medical prescriptions, and many other specialized document classes. This paper introduces an algorithm for efficiently generating and attaching a domain specific word based language model at run time to a general language model in an OCR system. In order to best use this model the paper also introduces a modified CTC beam search decoder which effectively allows hypotheses to remain in contention based on possible future completion of vocabulary words. The result is a substantial reduction in word error rate in recognizing material from specialized domains. △ Less

Submitted 18 August, 2023; originally announced August 2023.

arXiv:2305.11938 [pdf, other]

doi 10.18653/v1/2023.findings-emnlp.125

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

Authors: Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson , et al. (2 additional authors not shown)

Abstract: Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot;… ▽ More Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot; its focus on user-centric tasks -- tasks with broad adoption by speakers of high-resource languages; and its focus on under-represented languages where this scarce-data scenario tends to be most realistic. XTREME-UP evaluates the capabilities of language models across 88 under-represented languages over 9 key user-centric technologies including ASR, OCR, MT, and information access tasks that are of general utility. We create new datasets for OCR, autocomplete, semantic parsing, and transliteration, and build on and refine existing datasets for other tasks. XTREME-UP provides methodology for evaluating many modeling scenarios including text-only, multi-modal (vision, audio, and text),supervised parameter tuning, and in-context learning. We evaluate commonly used models on the benchmark. We release all code and scripts to train and evaluate models △ Less

Submitted 24 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

arXiv:2110.05270 [pdf]

Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block

Authors: Durvesh Malpure, Onkar Litake, Rajesh Ingle

Abstract: In recent developments in the field of Computer Vision, a rise is seen in the use of transformer-based architectures. They are surpassing the state-of-the-art set by CNN architectures in accuracy but on the other hand, they are computationally very expensive to train from scratch. As these models are quite recent in the Computer Vision field, there is a need to study it's transfer learning capabil… ▽ More In recent developments in the field of Computer Vision, a rise is seen in the use of transformer-based architectures. They are surpassing the state-of-the-art set by CNN architectures in accuracy but on the other hand, they are computationally very expensive to train from scratch. As these models are quite recent in the Computer Vision field, there is a need to study it's transfer learning capabilities and compare it with CNNs so that we can understand which architecture is better when applied to real world problems with small data. In this work, we follow a simple yet restrictive method for fine-tuning both CNN and Transformer models pretrained on ImageNet1K on CIFAR-10 and compare them with each other. We only unfreeze the last transformer/encoder or last convolutional block of a model and freeze all the layers before it while adding a simple MLP at the end for classification. This simple modification lets us use the raw learned weights of both these neural networks. From our experiments, we find out that transformers-based architectures not only achieve higher accuracy than CNNs but some transformers even achieve this feat with around 4 times lesser number of parameters. △ Less

Submitted 11 October, 2021; originally announced October 2021.

Comments: 8 pages, 4 figures

arXiv:2104.07787 [pdf, other]

Rethinking Text Line Recognition Models

Authors: Daniel Hernandez Diaz, Siyang Qin, Reeve Ingle, Yasuhisa Fujii, Alessandro Bissacco

Abstract: In this paper, we study the problem of text line recognition. Unlike most approaches targeting specific domains such as scene-text or handwritten documents, we investigate the general problem of develo** a universal architecture that can extract text from any image, regardless of source or input modality. We consider two decoder families (Connectionist Temporal Classification and Transformer) an… ▽ More In this paper, we study the problem of text line recognition. Unlike most approaches targeting specific domains such as scene-text or handwritten documents, we investigate the general problem of develo** a universal architecture that can extract text from any image, regardless of source or input modality. We consider two decoder families (Connectionist Temporal Classification and Transformer) and three encoder modules (Bidirectional LSTMs, Self-Attention, and GRCLs), and conduct extensive experiments to compare their accuracy and performance on widely used public datasets of scene and handwritten text. We find that a combination that so far has received little attention in the literature, namely a Self-Attention encoder coupled with the CTC decoder, when compounded with an external language model and trained on both public and internal data, outperforms all the others in accuracy and computational complexity. Unlike the more common Transformer-based models, this architecture can handle inputs of arbitrary length, a requirement for universal line recognition. Using an internal dataset collected from multiple sources, we also expose the limitations of current public datasets in evaluating the accuracy of line recognizers, as the relatively narrow image width and sequence length distributions do not allow to observe the quality degradation of the Transformer approach when applied to the transcription of long lines. △ Less

Submitted 21 April, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

Comments: 11 pages, 6 figures

arXiv:2006.01234 [pdf]

Doming and spin cascade in Ferric Haems: Femtosecond X-ray Absorption and X-ray Emission Studies

Authors: Camila Bacellar, Dominik Kinschel, Giulia F. Mancini, Rebecca A. Ingle, Jérémy Rouxel, Oliviero Cannelli, Claudio Cirelli, Gregor Knopp, Jakub Szlachetko, Frederico A. Lima, Samuel Menzi, Georgios Pamfilidis, Katharina Kubicek, Dmitry Khakhulin, Wojciech Gawelda, Angel Rodriguez-Fernandez, Mykola Biednov, Christian Bressler, Christopher A. Arrell, Philip J. M. Johnson, Christopher Milne, Majed Chergui

Abstract: The structure-function relationship is at the heart of biology and major protein deformations are correlated to specific functions. In the case of heme proteins, doming is associated with the respiratory function in hemoglobin and myoglobin, while ruffling has been correlated with electron transfer processes, such as in the case of Cytochrome c (Cyt c). The latter has indeed evolved to become an i… ▽ More The structure-function relationship is at the heart of biology and major protein deformations are correlated to specific functions. In the case of heme proteins, doming is associated with the respiratory function in hemoglobin and myoglobin, while ruffling has been correlated with electron transfer processes, such as in the case of Cytochrome c (Cyt c). The latter has indeed evolved to become an important electron transfer protein in humans. In its ferrous form, it undergoes ligand release and doming upon photoexcitation, but its ferric form does not release the distal ligand, while the return to the ground state has been attributed to thermal relaxation. Here, by combining femtosecond Fe K-edge X-ray absorption near-edge structure (XANES) studies and femtosecond Fe Kalpha and Kbeta X-ray emission spectroscopy (XES), we demonstrate that the photocycle of ferric Cyt c is entirely due to a cascade among excited spin states of the Iron ion, causing the ferric heme to undergo doming, which we identify for the first time. We also argue that this pattern is common to all ferric haems, raising the question of the biological relevance of doming in such proteins. △ Less

Submitted 1 June, 2020; originally announced June 2020.

arXiv:1912.00531 [pdf]

Tracking the Ultraviolet Photochemistry of Thiophenone During and Beyond the Initial Ultrafast Ring Opening

Authors: Shashank Pathak, Lea M. Ibele, Rebecca Boll, Carlo Callegari, Alexander Demidovich, Benjamin Erk, Raimund Feifel, Ruaridh Forbes, Michele Di Fraia, Luca Giannessi, Christopher S. Hansen, David M. P. Holland, Rebecca A. Ingle, Robert Mason, Oksana Plekan, Kevin C. Prince, Arnaud Rouzée, Richard J. Squibb, Jan Tross, Michael N. R. Ashfold, Basile F. E. Curchod, Daniel Rolles

Abstract: Photoinduced isomerization reactions, including ring-opening reactions, lie at the heart of many processes in nature. The mechanisms of such reactions are determined by a delicate interplay of coupled electronic and nuclear dynamics unfolding on the femtosecond scale, followed by the slower redistribution of energy into different vibrational degrees of freedom. Here we apply time-resolved photoele… ▽ More Photoinduced isomerization reactions, including ring-opening reactions, lie at the heart of many processes in nature. The mechanisms of such reactions are determined by a delicate interplay of coupled electronic and nuclear dynamics unfolding on the femtosecond scale, followed by the slower redistribution of energy into different vibrational degrees of freedom. Here we apply time-resolved photoelectron spectroscopy with a seeded extreme ultraviolet free electron laser to trace the ultrafast ring opening of gas phase thiophenone molecules following photoexcitation at 265 nm. When combined with cutting edge ab initio electronic structure and molecular dynamics calculations of both the excited and ground state molecules, the results provide unprecedented insights into both electronic and nuclear dynamics of this fundamental class of reactions. The initial ring opening and non-adiabatic coupling to the electronic ground state is shown to be driven by ballistic SC bond extension and to be complete within 350 femtoseconds. Theory and experiment also allow clear visualization of the rich ground-state dynamics involving formation of, and interconversion between, several ring opened isomers and the reformed cyclic structure, and fragmentation (CO loss) over much longer timescales. △ Less

Submitted 14 March, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

Comments: 40 pages, 21 figures Changes from the previous version: 1) Added theoretical calculations for explaining long timescale changes in shown in Figure 4(a). 2) Reworked on fitting (modelling) of the experimental data in Figure 2(b). Added another panel i.e. Figure 2(c). 3) Other minor changes and rewording in response to questions and suggestions by the referees

arXiv:1904.09150 [pdf, other]

A Scalable Handwritten Text Recognition System

Authors: R. Reeve Ingle, Yasuhisa Fujii, Thomas Deselaers, Jonathan Baccash, Ashok C. Popat

Abstract: Many studies on (Offline) Handwritten Text Recognition (HTR) systems have focused on building state-of-the-art models for line recognition on small corpora. However, adding HTR capability to a large scale multilingual OCR system poses new challenges. This paper addresses three problems in building such systems: data, efficiency, and integration. Firstly, one of the biggest challenges is obtaining… ▽ More Many studies on (Offline) Handwritten Text Recognition (HTR) systems have focused on building state-of-the-art models for line recognition on small corpora. However, adding HTR capability to a large scale multilingual OCR system poses new challenges. This paper addresses three problems in building such systems: data, efficiency, and integration. Firstly, one of the biggest challenges is obtaining sufficient amounts of high quality training data. We address the problem by using online handwriting data collected for a large scale production online handwriting recognition system. We describe our image data generation pipeline and study how online data can be used to build HTR models. We show that the data improve the models significantly under the condition where only a small number of real images is available, which is usually the case for HTR models. It enables us to support a new script at substantially lower cost. Secondly, we propose a line recognition model based on neural networks without recurrent connections. The model achieves a comparable accuracy with LSTM-based models while allowing for better parallelism in training and inference. Finally, we present a simple way to integrate HTR models into an OCR system. These constitute a solution to bring HTR capability into a large scale OCR system. △ Less

Submitted 14 June, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

Comments: ICDAR 2019

Showing 1–10 of 10 results for author: Ingle, R