Skip to main content

Showing 1–15 of 15 results for author: Vazquez, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.17918  [pdf, other

    cs.CL

    I Have an Attention Bridge to Sell You: Generalization Capabilities of Modular Translation Architectures

    Authors: Timothee Mickus, Raúl Vázquez, Joseph Attieh

    Abstract: Modularity is a paradigm of machine translation with the potential of bringing forth models that are large at training time and small during inference. Within this field of study, modular approaches, and in particular attention bridges, have been argued to improve the generalization capabilities of models by fostering language-independent representations. In the present paper, we study whether mod… ▽ More

    Submitted 30 April, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

  2. arXiv:2403.07726  [pdf, other

    cs.CL

    SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes

    Authors: Timothee Mickus, Elaine Zosa, Raúl Vázquez, Teemu Vahtola, Jörg Tiedemann, Vincent Segonne, Alessandro Raganato, Marianna Apidianaki

    Abstract: This paper presents the results of the SHROOM, a shared task focused on detecting hallucinations: outputs from natural language generation (NLG) systems that are fluent, yet inaccurate. Such cases of overgeneration put in jeopardy many NLG applications, where correctness is often mission-critical. The shared task was conducted with a newly constructed dataset of 4000 model outputs labeled by 5 ann… ▽ More

    Submitted 29 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: SemEval 2024 shared task. Pre-review version

  3. arXiv:2403.07544  [pdf, other

    cs.CL

    MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki

    Authors: Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh, Michele Boggia, Ona De Gibert, Shaoxiong Ji, Niki Andreas Lopi, Alessandro Raganato, Raúl Vázquez, Jörg Tiedemann

    Abstract: NLP in the age of monolithic large language models is approaching its limits in terms of size and information that can be handled. The trend goes to modularization, a necessary step into the direction of designing smaller sub-networks and components with specialized functionality. In this paper, we present the MAMMOTH toolkit: a framework designed for training massively multilingual modular machin… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Presented as a demo at EACL 2024

  4. arXiv:2401.02511  [pdf, other

    eess.SY cs.AI cs.LG math.DS math.OC

    Gain Scheduling with a Neural Operator for a Transport PDE with Nonlinear Recirculation

    Authors: Maxence Lamarque, Luke Bhan, Rafael Vazquez, Miroslav Krstic

    Abstract: To stabilize PDE models, control laws require space-dependent functional gains mapped by nonlinear operators from the PDE functional coefficients. When a PDE is nonlinear and its "pseudo-coefficient" functions are state-dependent, a gain-scheduling (GS) nonlinear design is the simplest approach to the design of nonlinear feedback. The GS version of PDE backstep** employs gains obtained by solvin… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

  5. arXiv:2310.14313  [pdf, other

    math.NA cs.CE

    Arbitrary order spline representation of cohomology generators for isogeometric analysis of eddy current problems

    Authors: Bernard Kapidani, Melina Merkel, Sebastian Schöps, Rafael Vázquez

    Abstract: The eddy current problem has many relevant practical applications in science, ranging from non-destructive testing to magnetic confinement of plasma in fusion reactors. It arises when electrical conductors are immersed in an external time-varying magnetic field operating at frequencies for which electromagnetic wave propagation effects can be neglected. Popular formulations of the eddy current p… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  6. arXiv:2310.06977  [pdf, other

    cs.CL

    Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings

    Authors: Timothee Mickus, Raúl Vázquez

    Abstract: A recent body of work has demonstrated that Transformer embeddings can be linearly decomposed into well-defined sums of factors, that can in turn be related to specific network inputs or components. There is however still a dearth of work studying whether these mathematical reformulations are empirically meaningful. In the present work, we study representations from machine-translation decoders us… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to BlackBoxNLP 2023

  7. arXiv:2212.01936  [pdf, other

    cs.CL

    Democratizing Neural Machine Translation with OPUS-MT

    Authors: Jörg Tiedemann, Mikko Aulamo, Daria Bakshandaeva, Michele Boggia, Stig-Arne Grönroos, Tommi Nieminen, Alessandro Raganato, Yves Scherrer, Raul Vazquez, Sami Virpioja

    Abstract: This paper presents the OPUS ecosystem with a focus on the development of open machine translation models and tools, and their integration into end-user applications, development platforms and professional workflows. We discuss our on-going mission of increasing language coverage and translation quality, and also describe on-going work on the development of modular translation models and speed-opt… ▽ More

    Submitted 4 July, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

  8. Torque Computation with the Isogeometric Mortar Method for the Simulation of Electric Machines

    Authors: Melina Merkel, Bernard Kapidani, Sebastian Schöps, Rafael Vázquez

    Abstract: In this work isogeometric mortaring is used for the simulation of a six pole permanent magnet synchronous machine. Isogeometric mortaring is especially well suited for the efficient computation of rotating electric machines as it allows for an exact geometry representation for arbitrary rotation angles without the need of remeshing. The appropriate B-spline spaces needed for the solution of Maxwel… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Journal ref: IEEE Transactions on Magnetics, vol. 58, no. 9, Sept. 2022, Art no. 8107604

  9. Tree-Cotree Decomposition of Isogeometric Mortared Spaces in H(curl) on Multi-Patch Domains

    Authors: Bernard Kapidani, Melina Merkel, Sebastian Schöps, Rafael Vázquez

    Abstract: When applying isogeometric analysis to engineering problems, one often deals with multi-patch spline spaces that have incompatible discretisations, e.g. in the case of moving objects. In such cases mortaring has been shown to be advantageous. This contribution discusses the appropriate B-spline spaces needed for the solution of Maxwell's equations in the functions space H(curl) and the correspondi… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Journal ref: Computer Methods in Applied Mechanics and Engineering, Vol. 395, pp. 114949, 2022

  10. arXiv:1906.04040  [pdf, other

    cs.CL

    The University of Helsinki submissions to the WMT19 news translation task

    Authors: Aarne Talman, Umut Sulubacak, Raúl Vázquez, Yves Scherrer, Sami Virpioja, Alessandro Raganato, Arvi Hurskainen, Jörg Tiedemann

    Abstract: In this paper, we present the University of Helsinki submissions to the WMT 2019 shared task on news translation in three language pairs: English-German, English-Finnish and Finnish-English. This year, we focused first on cleaning and filtering the training data using multiple data-filtering approaches, resulting in much smaller and cleaner training sets. For English-German, we trained both senten… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: To appear in WMT19

  11. arXiv:1901.00759  [pdf, other

    cs.CE math.NA physics.comp-ph

    Isogeometric Mortar Coupling for Electromagnetic Problems

    Authors: Annalisa Buffa, Jacopo Corno, Carlo de Falco, Sebastian Schöps, Rafael Vázquez

    Abstract: This paper discusses and analyses two domain decomposition approaches for electromagnetic problems that allow the combination of domains discretised by either Nédélec-type polynomial finite elements or spline-based isogeometric analysis. The first approach is a new isogeometric mortar method and the second one is based on a modal basis for the Lagrange multiplier space, called state-space concaten… ▽ More

    Submitted 27 December, 2018; originally announced January 2019.

    MSC Class: 35Q60; 49M27; 65D07; 68Q25; 68R10; 68U05; 78M10

  12. Multilingual NMT with a language-independent attention bridge

    Authors: Raúl Vázquez, Alessandro Raganato, Jörg Tiedemann, Mathias Creutz

    Abstract: In this paper, we propose a multilingual encoder-decoder architecture capable of obtaining multilingual sentence representations by means of incorporating an intermediate {\em attention bridge} that is shared across all languages. That is, we train the model with language-specific encoders and decoders that are connected via self-attention with a shared layer that we call attention bridge. This la… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019) Pages 33-39

  13. arXiv:1811.00111  [pdf, other

    eess.SY cs.MA math.OC

    On finite-time and fixed-time consensus algorithms for dynamic networks switching among disconnected digraphs

    Authors: David Gómez-Gutiérrez, Carlos Renato Vázquez, Sergej Čelikovský, Juan Diego Sánchez-Torres, Javier Ruiz León

    Abstract: The aim of this paper is to analyze a class of consensus algorithms with finite-time or fixed-time convergence for dynamic networks formed by agents with first-order dynamics. In particular, in the analyzed class a single evaluation of a nonlinear function of the consensus error is performed per each node. The classical assumption of switching among connected graphs is dropped here, allowing to re… ▽ More

    Submitted 25 June, 2021; v1 submitted 31 October, 2018; originally announced November 2018.

    Comments: Please cite the publisher's version}. For the publisher's version and full citation details see: https://doi.org/10.1080/00207179.2018.1543896 The following links provide access, for a limited time, to a free copy of the publisher's version: https://www.tandfonline.com/eprint/FSW8JJRVPHMXJ3XUUXZH/full?target=10.1080/00207179.2018.1543896

    Journal ref: International Journal of Control, 93(9), 2120-2134, 2020

  14. arXiv:1808.10802  [pdf, other

    cs.CL

    The MeMAD Submission to the WMT18 Multimodal Translation Task

    Authors: Stig-Arne Grönroos, Benoit Huet, Mikko Kurimo, Jorma Laaksonen, Bernard Merialdo, Phu Pham, Mats Sjöberg, Umut Sulubacak, Jörg Tiedemann, Raphael Troncy, Raúl Vázquez

    Abstract: This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice. We have the top scoring system for both English-to-German and E… ▽ More

    Submitted 3 September, 2018; v1 submitted 31 August, 2018; originally announced August 2018.

    Comments: To appear in WMT18

  15. arXiv:1709.06004  [pdf, other

    cs.CE math.NA

    Recent Advances of Isogeometric Analysis in Computational Electromagnetics

    Authors: Zeger Bontinck, Jacopo Corno, Herbert De Gersem, Stefan Kurz, Andreas Pels, Sebastian Schöps, Felix Wolf, Carlo de Falco, Jürgen Dölz, Rafael Vázquez, Ulrich Römer

    Abstract: In this communication the advantages and drawbacks of the isogeometric analysis (IGA) are reviewed in the context of electromagnetic simulations. IGA extends the set of polynomial basis functions, commonly employed by the classical Finite Element Method (FEM). While identical to FEM with Nédélec's basis functions in the lowest order case, it is based on B-spline and Non-Uniform Rational B-spline b… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: submitted to the ICS Newsletter

    MSC Class: 78A30; 78A40; 74F15; 65N30; 65N25 ACM Class: G.1.8; F.2.1; J.2