Skip to main content

Showing 1–50 of 158 results for author: Titov, I

.
  1. arXiv:2407.04543  [pdf, other

    cs.CL

    Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Models need appropriate inductive biases to effectively learn from small amounts of data and generalize systematically outside of the training distribution. While Transformers are highly versatile and powerful, they can still benefit from enhanced structural inductive biases for seq2seq tasks, especially those involving syntactic transformations, such as converting active to passive voice or seman… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2405.14324  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Magnetic microstructure of nanocrystalline Fe-Nb-B alloys as seen by small-angle neutron and X-ray scattering

    Authors: Venus Rai, Ivan Titov, Michael P. Adams, Kiyonori Suzuki, Joachim Kohlbrecher, Andreas Michels

    Abstract: We have investigated the magnetic microstructure of two-phase Fe-Nb-B~based Nanoperm alloys using unpolarized small-angle neutron scattering (SANS) and small-angle X-ray scattering (SAXS). Our SANS analysis reveals a significantly large magnetic scattering contribution due to spin misalignment, primarily originating from the substantial jump in the longitudinal magnetization at the interfaces betw… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.02134  [pdf, other

    cs.CL

    Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection

    Authors: Guillem Ramírez, Alexandra Birch, Ivan Titov

    Abstract: Researchers and practitioners operating on a limited budget face the cost-performance trade-off dilemma. The challenging decision often centers on whether to use a large LLM with better performance or a smaller one with reduced costs. This has motivated recent research in the optimisation of LLM calls. Either a cascading strategy is used, where a smaller LLM or both are called sequentially, or a r… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2401.15241  [pdf, other

    cs.CL cs.AI

    Unlearning Traces the Influential Training Data of Language Models

    Authors: Masaru Isonuma, Ivan Titov

    Abstract: Identifying the training datasets that influence a language model's outputs is essential for minimizing the generation of harmful content and enhancing its performance. Ideally, we can measure the influence of each dataset by removing it from training; however, it is prohibitively expensive to retrain a model multiple times. This paper presents UnTrac: unlearning traces the influence of a training… ▽ More

    Submitted 13 June, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: 14 pages, to appear in ACL2024 main conference (long paper)

  5. arXiv:2401.09367  [pdf, other

    cond-mat.soft physics.optics

    Optical and thermal effects in the neighborhood of the spherical layered nanoparticle of the "metallic core -- J-aggregate shell'' structure

    Authors: A. V. Korotun, N. A. Smirnova, V. I. Reva, I. M. Titov, G. M. Shilo

    Abstract: The relations for the polarizability of the metallic nanoparticles, coated with the shell of cyanine dyes, are obtained in the article. The frequency dependencies for light absorption and scattering efficiencies, the heating of the composite nanoparticle and the electric field amplification in its neighborhood are studied. It is established that all the dependencies have three maxima which corresp… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 18 pages, 13 figures, 3 tables

    Journal ref: Condensed Matter Physics, 2023, vol. 26, No. 4, 43704

  6. arXiv:2312.02748  [pdf, other

    cs.CL cs.LG

    Compositional Generalization for Data-to-Text Generation

    Authors: Xinnuo Xu, Ivan Titov, Mirella Lapata

    Abstract: Data-to-text generation involves transforming structured data, often represented as predicate-argument tuples, into coherent textual descriptions. Despite recent advances, systems still struggle when confronted with unseen combinations of predicates, producing unfaithful descriptions (e.g. hallucinations or omissions). We refer to this issue as compositional generalisation, and it encouraged us to… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Journal ref: Findings of EMNLP 2023

  7. arXiv:2311.10236  [pdf, other

    cs.CL

    Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study

    Authors: Maike Züfle, Verna Dankers, Ivan Titov

    Abstract: With the ever-growing presence of social media platforms comes the increased spread of harmful content and the need for robust hate speech detection systems. Such systems easily overfit to specific targets and keywords, and evaluating them without considering distribution shifts that might occur between train and test data overestimates their benefit. We challenge hate speech models via new train-… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted at the GenBench workshop at EMNLP 2023; 9 pages in the main paper, 5 pages with references and 4 pages with appendices

  8. arXiv:2311.05379  [pdf, other

    cs.CL

    Memorisation Cartography: Map** out the Memorisation-Generalisation Continuum in Neural Machine Translation

    Authors: Verna Dankers, Ivan Titov, Dieuwke Hupkes

    Abstract: When training a neural network, it will quickly memorise some source-target map**s from your dataset but never learn some others. Yet, memorisation is not easily expressed as a binary feature that is good or bad: individual datapoints lie on a memorisation-generalisation continuum. What determines a datapoint's position on that spectrum, and how does that spectrum influence neural models' perfor… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Published in EMNLP 2023; 21 pages total (9 in the main paper, 3 pages with limitations, acknowledgments and references, 9 pages with appendices)

  9. arXiv:2310.16484  [pdf, other

    cs.CL

    Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training

    Authors: Max Müller-Eberstein, Rob van der Goot, Barbara Plank, Ivan Titov

    Abstract: Representational spaces learned via language modeling are fundamental to Natural Language Processing (NLP), however there has been limited understanding regarding how and when during training various types of linguistic information emerge and interact. Leveraging a novel information theoretic probing suite, which enables direct comparisons of not just task performance, but their representational s… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Findings)

  10. Cross-Modal Conceptualization in Bottleneck Models

    Authors: Danis Alukaev, Semen Kiselev, Ilya Pershin, Bulat Ibragimov, Vladimir Ivanov, Alexey Kornaev, Ivan Titov

    Abstract: Concept Bottleneck Models (CBMs) assume that training examples (e.g., x-ray images) are annotated with high-level concepts (e.g., types of abnormalities), and perform classification by first predicting the concepts, followed by predicting the label relying on these concepts. The main difficulty in using CBMs comes from having to choose concepts that are predictive of the label and then having to l… ▽ More

    Submitted 17 December, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023; camera-ready version

  11. arXiv:2310.14107  [pdf, other

    cs.CL cs.AI

    On the Transferability of Visually Grounded PCFGs

    Authors: Yanpeng Zhao, Ivan Titov

    Abstract: There has been a significant surge of interest in visually grounded grammar induction in recent times. While a variety of models have been developed for the task and have demonstrated impressive performance, they have not been evaluated on text domains that are different from the training domain, so it is unclear if the improvements brought by visual groundings are transferable. Our study aims to… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023. Our code is available at https://github.com/zhaoyanpeng/cpcfg

  12. arXiv:2310.13561  [pdf, other

    cs.CL cs.LG

    Cache & Distil: Optimising API Calls to Large Language Models

    Authors: Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov

    Abstract: Large-scale deployment of generative AI tools often depends on costly API calls to a Large Language Model (LLM) to fulfil user queries. To curtail the frequency of these calls, one can employ a smaller language model -- a student -- which is continuously trained on the responses of the LLM. This student gradually gains proficiency in independently handling an increasing number of user requests, a… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  13. arXiv:2310.00796  [pdf, other

    cs.CL

    Injecting a Structural Inductive Bias into a Seq2Seq Model by Simulation

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Strong inductive biases enable learning from little data and help generalization outside of the training distribution. Popular neural architectures such as Transformers lack strong structural inductive biases for seq2seq NLP tasks on their own. Consequently, they struggle with systematic generalization beyond the training distribution, e.g. with extrapolating to longer inputs, even when pre-traine… ▽ More

    Submitted 16 February, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  14. arXiv:2307.00620  [pdf, other

    hep-ph

    Polarization of recoil photon in non-linear Compton process

    Authors: A. I. Titov

    Abstract: The polarization of recoil photon ($γ'$) in the non-linear Compton process $e + \vec L \to \vec γ' +e'$ in the interaction of a relativistic electron with a linearly polarized laser beam ($\vec L$) is studied within the Furry picture in the lowest-order, tree-level S matrix element. In particular, we consider the asymmetry of differential cross sections ${\cal A}$ for two independent axes describi… ▽ More

    Submitted 26 March, 2024; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: 8 pages, 7 figures

  15. arXiv:2305.18485  [pdf, other

    cs.LG cs.AI

    Autoencoding Conditional Neural Processes for Representation Learning

    Authors: Victor Prokhorov, Ivan Titov, N. Siddharth

    Abstract: Conditional neural processes (CNPs) are a flexible and efficient family of models that learn to learn a stochastic process from data. They have seen particular application in contextual image completion - observing pixel values at some locations to predict a distribution over values at other unobserved locations. However, the choice of pixels in learning CNPs is typically either random or derived… ▽ More

    Submitted 17 February, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  16. arXiv:2305.16971  [pdf, other

    cs.LG

    Theoretical and Practical Perspectives on what Influence Functions Do

    Authors: Andrea Schioppa, Katja Filippova, Ivan Titov, Polina Zablotskaia

    Abstract: Influence functions (IF) have been seen as a technique for explaining model predictions through the lens of the training data. Their utility is assumed to be in identifying training examples "responsible" for a prediction so that, for example, correcting a prediction is possible by intervening on those examples (removing or editing them) and retraining the model. However, recent empirical studies… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  17. arXiv:2305.16954  [pdf, other

    cs.CL

    Compositional Generalization without Trees using Multiset Tagging and Latent Permutations

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Seq2seq models have been shown to struggle with compositional generalization in semantic parsing, i.e. generalizing to unseen compositions of phenomena that the model handles correctly in isolation. We phrase semantic parsing as a two-step process: we first tag each input token with a multiset of output tokens. Then we arrange the tokens into an output sequence using a new way of parameterizing… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  18. Fingerprint of vortex-like flux closure in isotropic Nd-Fe-B bulk magnet

    Authors: Mathias Bersweiler, Yojiro Oba, Evelyn Pratami Sinaga, Inma Peral, Ivan Titov, Michael P. Adams, Konstantin L. Metlov, Andreas Michels

    Abstract: Taking advantage of recent progress in neutron instrumentation and in the understanding of magnetic-field-dependent small-angle neutron scattering, here, we study the three-dimensional magnetization distribution within an isotropic Nd-Fe-B bulk magnet. The magnetic neutron scattering cross section of this system features the so-called spike anisotropy, which points towards the presence of a strong… ▽ More

    Submitted 17 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 16 pages, 5 figures

    Journal ref: Physical Review B 108, 094434 (2023)

  19. arXiv:2301.13714  [pdf, other

    cs.CL

    Recursive Neural Networks with Bottlenecks Diagnose (Non-)Compositionality

    Authors: Verna Dankers, Ivan Titov

    Abstract: A recent line of work in NLP focuses on the (dis)ability of models to generalise compositionally for artificial languages. However, when considering natural language tasks, the data involved is not strictly, or locally, compositional. Quantifying the compositionality of data is a challenging task, which has been investigated primarily for short utterances. We use recursive neural models (Tree-LSTM… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Published in EMNLP 2023 findings; 18 pages total (9 in the main paper, 3 pages of limitations and references and 6 pages with appendices)

  20. arXiv:2211.07906  [pdf, other

    cs.CL

    Hierarchical Phrase-based Sequence-to-Sequence Learning

    Authors: Bailin Wang, Ivan Titov, Jacob Andreas, Yoon Kim

    Abstract: We describe a neural transducer that maintains the flexibility of standard sequence-to-sequence (seq2seq) models while incorporating hierarchical phrases as a source of inductive bias during training and as explicit constraints during inference. Our approach trains two models: a discriminative parser based on a bracketing transduction grammar whose derivation tree hierarchically aligns source and… ▽ More

    Submitted 15 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  21. arXiv:2210.03183  [pdf, other

    cs.CL

    Compositional Generalisation with Structured Reordering and Fertility Layers

    Authors: Matthias Lindemann, Alexander Koller, Ivan Titov

    Abstract: Seq2seq models have been shown to struggle with compositional generalisation, i.e. generalising to new and potentially more complex structures than seen during training. Taking inspiration from grammar-based models that excel at compositional generalisation, we present a flexible end-to-end differentiable neural model that composes two structural operations: a fertility step, which we introduce in… ▽ More

    Submitted 15 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: EACL 2023 camera-ready

    ACM Class: I.2.7

  22. arXiv:2205.15301  [pdf, other

    cs.CL

    Can Transformer be Too Compositional? Analysing Idiom Processing in Neural Machine Translation

    Authors: Verna Dankers, Christopher G. Lucas, Ivan Titov

    Abstract: Unlike literal expressions, idioms' meanings do not directly follow from their parts, posing a challenge for neural machine translation (NMT). NMT models are often unable to translate idioms accurately and over-generate compositional, literal translations. In this work, we investigate whether the non-compositionality of idioms is reflected in the mechanics of the dominant NMT model, Transformer, b… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Published at ACL 2022

  23. arXiv:2201.06802  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall

    Uniaxial polarization analysis of bulk ferromagnets: Theory and first experimental Results

    Authors: A. Malyeyev, I. Titov, C. D. Dewhurst, K. Suzuki, D. Honecker, A. Michels

    Abstract: Based on Brown's static equations of micromagnetics, we compute the uniaxial polarization of the scattered neutron beam of a bulk magnetic material. The theoretical expressions are compared to experimental data on a soft magnetic nanocrystalline alloy. The micromagnetic SANS theory provides a general framework for polarized real-space neutron methods, and it opens up a new avenue for magnetic neut… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

  24. arXiv:2112.06837  [pdf, other

    cs.CL cs.LG

    Sparse Interventions in Language Models with Differentiable Masking

    Authors: Nicola De Cao, Leon Schmid, Dieuwke Hupkes, Ivan Titov

    Abstract: There has been a lot of interest in understanding what information is captured by hidden representations of language models (LMs). Typically, interpretation methods i) do not guarantee that the model actually uses the encoded information, and ii) do not discover small subsets of neurons responsible for a considered phenomenon. Inspired by causal mediation analysis, we propose a method that discove… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 12 pages, 4 figures, 6 tables

  25. arXiv:2109.04325  [pdf, other

    cs.CL cs.AI cs.LG

    Learning Opinion Summarizers by Selecting Informative Reviews

    Authors: Arthur Bražinskas, Mirella Lapata, Ivan Titov

    Abstract: Opinion summarization has been traditionally approached with unsupervised, weakly-supervised and few-shot learning techniques. In this work, we collect a large dataset of summaries paired with user reviews for over 31,000 products, enabling supervised training. However, the number of reviews per product is large (320 on average), making summarization - and especially training a summarizer - imprac… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  26. arXiv:2109.03792  [pdf, other

    cs.CL cs.AI stat.ML

    Highly Parallel Autoregressive Entity Linking with Discriminative Correction

    Authors: Nicola De Cao, Wilker Aziz, Ivan Titov

    Abstract: Generative approaches have been recently shown to be effective for both Entity Disambiguation and Entity Linking (i.e., joint mention detection and disambiguation). However, the previously proposed autoregressive formulation for EL suffers from i) high computational cost due to a complex (deep) decoder, ii) non-parallelizable decoding that scales with the source sequence length, and iii) the need… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP2021 Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Code at https://github.com/nicola-decao/efficient-autoregressive-EL . 8 pages, 1 figure, 3 tables

  27. arXiv:2109.01396  [pdf, other

    cs.CL

    Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT

    Authors: Elena Voita, Rico Sennrich, Ivan Titov

    Abstract: Differently from the traditional statistical MT that decomposes the translation task into distinct separately learned components, neural machine translation uses a single neural network to model the entire translation process. Despite neural machine translation being de-facto standard, it is still not clear how NMT models acquire different competences over the course of training, and how this mirr… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  28. Positron energy distribution in factorized trident process

    Authors: A. I. Titov, U. Hernandez Acosta, B. Kampfer

    Abstract: We estimate the energy distribution of positrons produced in the interaction of ultra-relativistic electrons with a high-intensity laser beam. The underlying trident process is factorized on the probabilistic level. That is, we deploy a two-step mechanism for the formation of electron-positron pairs. In the first step, a high-energy photon is produced as a result of nonlinear Compton scattering. I… ▽ More

    Submitted 29 December, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: 8 pages, 10 figures

  29. arXiv:2106.05634  [pdf, other

    cs.CL

    Exploring Unsupervised Pretraining Objectives for Machine Translation

    Authors: Christos Baziotis, Ivan Titov, Alexandra Birch, Barry Haddow

    Abstract: Unsupervised cross-lingual pretraining has achieved strong results in neural machine translation (NMT), by drastically reducing the need for large parallel data. Most approaches adapt masked-language modeling (MLM) to sequence-to-sequence architectures, by masking parts of the input and reconstructing them in the decoder. In this work, we systematically compare masking with alternative objectives… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: Findings of ACL 2021

  30. arXiv:2106.04252  [pdf, other

    cs.CL

    Meta-Learning to Compositionally Generalize

    Authors: Henry Conklin, Bailin Wang, Kenny Smith, Ivan Titov

    Abstract: Natural language is compositional; the meaning of a sentence is a function of the meaning of its parts. This property allows humans to create and interpret novel sentences, generalizing robustly outside their prior experience. Neural networks have been shown to struggle with this kind of generalization, in particular performing poorly on tasks designed to assess compositional generalization (i.e.… ▽ More

    Submitted 29 June, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: ACL2021 Camera Ready; fix a small typo

  31. arXiv:2106.03257  [pdf, other

    cs.CL cs.LG

    Structured Reordering for Modeling Latent Alignments in Sequence Transduction

    Authors: Bailin Wang, Mirella Lapata, Ivan Titov

    Abstract: Despite success in many domains, neural models struggle in settings where train and test examples are drawn from different distributions. In particular, in contrast to humans, conventional sequence-to-sequence (seq2seq) models fail to generalize systematically, i.e., interpret sentences representing novel combinations of concepts (e.g., text segments) seen in training. Traditional grammar formalis… ▽ More

    Submitted 26 October, 2021; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  32. arXiv:2105.11758  [pdf, other

    hep-ph

    Rise and fall of laser-intensity effects in spectrally resolved Compton process

    Authors: Uwe Hernandez Acosta, Alexander I. Titov, Burkhard Kämpfer

    Abstract: The spectrally resolved differential cross section of Compton scattering, $d σ/ d ω' \vert_{ω' = const}$, rises from small towards larger laser intensity parameter $ξ$, reaches a maximum, and falls towards the asymptotic strong-field region. Expressed by invariant quantities: $d σ/du \vert_{u = const}$ rises from small towards larger values of $ξ$, reaches a maximum at… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: 12 pages, 8 figures

  33. Role of higher-order effects in spin-misalignment small-angle neutron scattering of high-pressure torsion nickel

    Authors: Yojiro Oba, Mathias Bersweiler, Ivan Titov, Nozomu Adachi, Yoshikazu Todaka, Elliot Paul Gilbert, Nina-Juliane Steinke, Konstantin L. Metlov, Andreas Michels

    Abstract: Magnetic-field-dependent unpolarized small-angle neutron scattering (SANS) experiments demonstrate that high-pressure torsion (HPT) straining induces spin misalignments in pure Ni, which persist in magnetic fields up to 4 T. The spin-misalignment scattering patterns are elongated perpendicular to the applied magnetic field due to an unusual predominant longitudinal $sin^2(θ)$-type angular anisotro… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: 9 figures

    Journal ref: Phys. Rev. Materials 5, 084410 (2021)

  34. arXiv:2104.08164  [pdf, other

    cs.CL cs.AI cs.LG

    Editing Factual Knowledge in Language Models

    Authors: Nicola De Cao, Wilker Aziz, Ivan Titov

    Abstract: The factual knowledge acquired during pre-training and stored in the parameters of Language Models (LMs) can be useful in downstream tasks (e.g., question answering or textual inference). However, some facts can be incorrectly induced or become obsolete over time. We present KnowledgeEditor, a method which can be used to edit this knowledge and, thus, fix 'bugs' or unexpected predictions without t… ▽ More

    Submitted 8 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP2021 Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Code at https://github.com/nicola-decao/KnowledgeEditor . 16 pages, 6 figures, 2 tables

  35. arXiv:2104.07012  [pdf, other

    cs.CL cs.LG

    Sparse Attention with Linear Units

    Authors: Biao Zhang, Ivan Titov, Rico Sennrich

    Abstract: Recently, it has been argued that encoder-decoder models can be made more interpretable by replacing the softmax function in the attention with its sparse variants. In this work, we introduce a novel, simple method for achieving sparsity in attention: we replace the softmax activation with a ReLU, and show that sparsity naturally emerges from such a formulation. Training stability is achieved with… ▽ More

    Submitted 6 October, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: EMNLP2021, code is available at https://github.com/bzhangGo/zero

  36. arXiv:2104.05819  [pdf, other

    cs.CL

    Learning from Executions for Semantic Parsing

    Authors: Bailin Wang, Mirella Lapata, Ivan Titov

    Abstract: Semantic parsing aims at translating natural language (NL) utterances onto machine-interpretable programs, which can be executed against a real-world environment. The expensive annotation of utterance-program pairs has long been acknowledged as a major bottleneck for the deployment of contemporary neural models to real-life applications. In this work, we focus on the task of semi-supervised learni… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: NAACL 2021 Camera Ready

  37. arXiv:2103.02298  [pdf, other

    cs.CL

    An Empirical Study of Compound PCFGs

    Authors: Yanpeng Zhao, Ivan Titov

    Abstract: Compound probabilistic context-free grammars (C-PCFGs) have recently established a new state of the art for unsupervised phrase-structure grammar induction. However, due to the high space and time complexities of chart-based representation and inference, it is difficult to investigate C-PCFGs comprehensively. In this work, we rely on a fast implementation of C-PCFGs to conduct an evaluation comple… ▽ More

    Submitted 21 October, 2023; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted to Adapt-NLP at EACL 2021 (Added results on Brown of Penn Treebank and English Web Treebank). Our code is available at https://github.com/zhaoyanpeng/cpcfg

  38. Impact of laser polarization on q-exponential photon tails in non-linear Compton scattering

    Authors: B. Kampfer, A. I. Titov

    Abstract: Non-linear Compton scattering of ultra-relativistic electrons traversing high-intensity laser pulses generates also hard photons. These photon high-energy tails are considered for parameters in reach at the forthcoming experiments LUXE and E-320. We consider the invariant differential cross sections $d σ/ du$ between the IR and UV regions and analyze the impact of the laser polarization and find q… ▽ More

    Submitted 18 February, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

    Journal ref: Phys. Rev. A 103, 033101 (2021)

  39. Neutron study of magnetic correlations in rare-earth-free Mn-Bi magnets

    Authors: Artem Malyeyev, Ivan Titov, Philipp Bender, Mathias Bersweiler, Vitaliy Pipich, Sebastian Mühlbauer, Semih Ener, Oliver Gutfleisch, Andreas Michels

    Abstract: We report the results of an unpolarized small-angle neutron scattering (SANS) study on Mn-Bi-based rare-earth-free permanent magnets. The magnetic SANS cross section is dominated by long-wavelength transversal magnetization fluctuations and has been analyzed in terms of the Guinier-Porod model and the distance distribution function. This provides the radius of gyration which, in the remanent state… ▽ More

    Submitted 26 February, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Journal ref: Phys. Rev. Materials 5, 034407 (2021)

  40. arXiv:2011.01846  [pdf, other

    cs.CL

    Detecting Word Sense Disambiguation Biases in Machine Translation for Model-Agnostic Adversarial Attacks

    Authors: Denis Emelin, Ivan Titov, Rico Sennrich

    Abstract: Word sense disambiguation is a well-known source of translation errors in NMT. We posit that some of the incorrect disambiguation choices are due to models' over-reliance on dataset artifacts found in training data, specifically superficial word co-occurrences, rather than a deeper understanding of the source text. We introduce a method for the prediction of disambiguation errors based on statisti… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted to EMNLP 2020

  41. arXiv:2010.14481  [pdf, other

    cs.CL cs.LG

    Fast Interleaved Bidirectional Sequence Generation

    Authors: Biao Zhang, Ivan Titov, Rico Sennrich

    Abstract: Independence assumptions during sequence generation can speed up inference, but parallel generation of highly inter-dependent tokens comes at a cost in quality. Instead of assuming independence between neighbouring tokens (semi-autoregressive decoding, SA), we take inspiration from bidirectional sequence generation and introduce a decoder that generates target words from the left-to-right and righ… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

    Comments: WMT2020, source code is at https://github.com/bzhangGo/zero/tree/master/docs/interleaved_bidirectional_transformer

  42. arXiv:2010.12676  [pdf, other

    cs.CL cs.LG

    A Differentiable Relaxation of Graph Segmentation and Alignment for AMR Parsing

    Authors: Chunchuan Lyu, Shay B. Cohen, Ivan Titov

    Abstract: Abstract Meaning Representations (AMR) are a broad-coverage semantic formalism which represents sentence meaning as a directed acyclic graph. To train most AMR parsers, one needs to segment the graph into subgraphs and align each such subgraph to a word in a sentence; this is normally done at preprocessing, relying on hand-crafted rules. In contrast, we treat both alignment and segmentation as lat… ▽ More

    Submitted 24 October, 2022; v1 submitted 23 October, 2020; originally announced October 2020.

  43. arXiv:2010.11988  [pdf, other

    cs.CL

    Meta-Learning for Domain Generalization in Semantic Parsing

    Authors: Bailin Wang, Mirella Lapata, Ivan Titov

    Abstract: The importance of building semantic parsers which can be applied to new domains and generate programs unseen at training has long been acknowledged, and datasets testing out-of-domain performance are becoming increasingly available. However, little or no attention has been devoted to learning algorithms or objectives which promote domain generalization, with virtually all existing approaches relyi… ▽ More

    Submitted 12 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: NAACL2021 Camera Ready

  44. arXiv:2010.10907  [pdf, other

    cs.CL

    Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation

    Authors: Elena Voita, Rico Sennrich, Ivan Titov

    Abstract: In Neural Machine Translation (and, more generally, conditional language modeling), the generation of a target token is influenced by two types of context: the source and the prefix of the target sequence. While many attempts to understand the internal workings of NMT models have been made, none of them explicitly evaluates relative source and target contributions to a generation decision. We argu… ▽ More

    Submitted 25 June, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: ACL 2021 (more accurate results with the improved LRP code)

  45. arXiv:2010.08518  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Adaptive Feature Selection for End-to-End Speech Translation

    Authors: Biao Zhang, Ivan Titov, Barry Haddow, Rico Sennrich

    Abstract: Information in speech signals is not evenly distributed, making it an additional challenge for end-to-end (E2E) speech translation (ST) to learn to focus on informative features. In this paper, we propose adaptive feature selection (AFS) for encoder-decoder based E2E ST. We first pre-train an ASR encoder and apply AFS to dynamically estimate the importance of each encoded speech feature to SR. A S… ▽ More

    Submitted 20 October, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: EMNLP2020 Findings; source code is at https://github.com/bzhangGo/zero

  46. arXiv:2010.00577  [pdf, other

    cs.CL cs.LG stat.ML

    Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking

    Authors: Michael Sejr Schlichtkrull, Nicola De Cao, Ivan Titov

    Abstract: Graph neural networks (GNNs) have become a popular approach to integrating structural inductive biases into NLP models. However, there has been little work on interpreting them, and specifically on understanding which parts of the graphs (e.g. syntactic trees or co-reference structures) contribute to a prediction. In this work, we introduce a post-hoc method for interpreting the predictions of GNN… ▽ More

    Submitted 3 October, 2022; v1 submitted 1 October, 2020; originally announced October 2020.

  47. arXiv:2009.12404  [pdf, other

    cs.CL cs.CV

    Visually Grounded Compound PCFGs

    Authors: Yanpeng Zhao, Ivan Titov

    Abstract: Exploiting visual groundings for language understanding has recently been drawing much attention. In this work, we study visually grounded grammar induction and learn a constituency parser from both unlabeled text and its visual groundings. Existing work on this task (Shi et al., 2019) optimizes a parser via Reinforce and derives the learning signal only from the alignment of images and sentences.… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020. Our code is available at https://github.com/zhaoyanpeng/vpcfg

  48. Non-linear Breit-Wheeler process with linearly polarized beams

    Authors: Alexander I. Titov, Burkhard Kampfer

    Abstract: We study the non-linear Breit-Wheeler process $\vec γ' + \vec L \to e^+ + e^-$ in the interaction of linearly polarized probe photons ($\vec γ'$) with a linearly polarized laser beam ($\vec L$). In particular, we consider the asymmetry of the total cross section and the azimuthal electron distributions when the polarizations of the photon and laser beams in the initial state are mutually perpendic… ▽ More

    Submitted 18 November, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 9 pages, 8 figures

  49. Anisometric mesoscale nuclear and magnetic texture in sintered Nd-Fe-B magnets

    Authors: I. Titov, D. Honecker, D. Mettus, A. Feoktystov, J. Kohlbrecher, P. Strunz, A. Michels

    Abstract: By means of temperature and wavelength-dependent small-angle neutron scattering (SANS) experiments on sintered isotropic and textured Nd-Fe-B magnets we provide evidence for the existence of an anisometric structure in the microstructure of the textured magnets. This conclusion is reached by observing a characteristic cross-shaped angular anisotropy in the total unpolarized SANS cross section at t… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

    Journal ref: Phys. Rev. Materials 4, 054419 (2020)

  50. arXiv:2005.00278  [pdf, other

    cs.CL cs.LG

    Unsupervised Transfer of Semantic Role Models from Verbal to Nominal Domain

    Authors: Yanpeng Zhao, Ivan Titov

    Abstract: Semantic role labeling (SRL) is an NLP task involving the assignment of predicate arguments to types, called semantic roles. Though research on SRL has primarily focused on verbal predicates and many resources available for SRL provide annotations only for verbs, semantic relations are often triggered by other linguistic constructions, e.g., nominalizations. In this work, we investigate a transfer… ▽ More

    Submitted 26 September, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: Our code is available at https://github.com/zhaoyanpeng/srltransfer