
Showing 1–50 of 346 results for author: Martins, F

  1. arXiv:2407.03137  [pdf, other]

    astro-ph.SR astro-ph.GA astro-ph.IM

    X-Shooting ULLYSES: Massive Stars at low metallicity -- IV. Spectral analysis methods and exemplary results for O stars

    Authors: A. A. C. Sander, J. -C. Bouret, M. Bernini-Peron, J. Puls, F. Backs, S. R. Berlanas, J. M. Bestenlehner, S. A. Brands, A. Herrero, F. Martins, O. Maryeva, D. Pauli, V. Ramachandran, P. A. Crowther, V. M. A. Gómez-González, A. C. Gormaz-Matamala, W. -R. Hamann, D. J. Hillier, R. Kuiper, C. J. K. Larkin, R. R. Lefever, A. Mehner, F. Najarro, L. M. Oskinova, E. C. Schösser, et al. (4 additional authors not shown)

    Abstract: CONTEXT: The spectral analysis of hot, massive stars is a fundamental astrophysical method to obtain their intrinsic properties and their feedback. Quantitative spectroscopy for hot, massive stars requires detailed numerical modeling of the atmosphere and an iterative treatment to obtain the best solution within a given framework. AIMS: We present an overview of different techniques for the quanti…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 18+15 pages, 21+4 figures, under review at A&A, condensed abstract

  2. arXiv:2407.00436  [pdf, other]

    cs.CL

    A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models

    Authors: Peiqin Lin, André F. T. Martins, Hinrich Schütze

    Abstract: Recent studies have highlighted the potential of exploiting parallel corpora to enhance multilingual large language models, improving performance in both bilingual tasks, e.g., machine translation, and general-purpose tasks, e.g., text classification. Building upon these findings, our comprehensive study aims to identify the most effective strategies for leveraging parallel corpora. We investigate…

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2406.19482  [pdf, other]

    cs.CL

    xTower: A Multilingual LLM for Explaining and Correcting Translation Errors

    Authors: Marcos Treviso, Nuno M. Guerreiro, Sweta Agrawal, Ricardo Rei, José Pombal, Tania Vaz, Helena Wu, Beatriz Silva, Daan van Stigt, André F. T. Martins

    Abstract: While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user experience. This paper introduces xTower, an open large language model (LLM) built on top of TowerBase designed to provide free-text explanations for tr…

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.18403  [pdf, other]

    cs.CL

    LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

    Authors: Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, Desmond Elliott, Raquel Fernández, Albert Gatt, Esam Ghaleb, Mario Giulianelli, Michael Hanna, Alexander Koller, André F. T. Martins, Philipp Mondorf, Vera Neplenbroek, Sandro Pezzelle, Barbara Plank, David Schlangen, Alessandro Suglia, Aditya K Surikuchi, Ece Takmaz, Alberto Testoni

    Abstract: There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human anno…

    Submitted 26 June, 2024; originally announced June 2024.

  5. arXiv:2406.10913  [pdf, other]

    quant-ph

    Minimal evolution times for fast, pulse-based state preparation in silicon spin qubits

    Authors: Christopher K. Long, Nicholas J. Mayhall, Sophia E. Economou, Edwin Barnes, Crispin H. W. Barnes, Frederico Martins, David R. M. Arvidsson-Shukur, Normann Mertig

    Abstract: Standing as one of the most significant barriers to reaching quantum advantage, state-preparation fidelities on noisy intermediate-scale quantum processors suffer from quantum-gate errors, which accumulate over time. A potential remedy is pulse-based state preparation. We numerically investigate the minimal evolution times (METs) attainable by optimizing (microwave and exchange) pulses on silicon…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 9 + (7) pages, 6 figs, comments are welcomed

  6. arXiv:2406.09689  [pdf, other]

    cond-mat.dis-nn cond-mat.soft cond-mat.stat-mech

    Physical networks become what they learn

    Authors: Menachem Stern, Marcelo Guzman, Felipe Martins, Andrea J Liu, Vijay Balasubramanian

    Abstract: Physical networks can develop diverse responses, or functions, by design, evolution or learning. We focus on electrical networks of nodes connected by resistive edges. Such networks can learn by adapting edge conductances to lower a cost function that penalizes deviations from a desired response. The network must also satisfy Kirchhoff's law, balancing currents at nodes, or, equivalently, minimizi…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  7. arXiv:2406.00049  [pdf, other]

    cs.CL cs.LG

    QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

    Authors: Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins

    Abstract: An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an…

    Submitted 28 May, 2024; originally announced June 2024.

  8. arXiv:2405.18348  [pdf, other]

    cs.CL

    Can Automatic Metrics Assess High-Quality Translations?

    Authors: Sweta Agrawal, António Farinhas, Ricardo Rei, André F. T. Martins

    Abstract: Automatic metrics for evaluating translation quality are typically validated by measuring how well they correlate with human assessments. However, correlation methods tend to capture only the ability of metrics to differentiate between good and bad source-translation pairs, overlooking their reliability in distinguishing alternative translations for the same source. In this paper, we confirm that…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: work in progress

  9. arXiv:2405.05116  [pdf, other]

    cs.CL

    XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples

    Authors: Peiqin Lin, André F. T. Martins, Hinrich Schütze

    Abstract: Recent studies indicate that leveraging off-the-shelf or fine-tuned retrievers, capable of retrieving relevant in-context examples tailored to the input query, enhances few-shot in-context learning of English. However, adapting these methods to other languages, especially low-resource ones, poses challenges due to the scarcity of cross-lingual retrievers and annotated data. Thus, we introduce XAMP…

    Submitted 29 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  10. arXiv:2405.01976  [pdf, other]

    cs.CL cs.LG

    Conformal Prediction for Natural Language Processing: A Survey

    Authors: Margarida M. Campos, António Farinhas, Chrysoula Zerva, Mário A. T. Figueiredo, André F. T. Martins

    Abstract: The rapid proliferation of large language models and natural language processing (NLP) applications creates a crucial need for uncertainty quantification to mitigate risks such as hallucinations and to enhance decision-making reliability in critical applications. Conformal prediction is emerging as a theoretically sound and practically useful framework, combining flexibility with strong statistica…

    Submitted 3 May, 2024; originally announced May 2024.

  11. arXiv:2405.01267  [pdf, ps, other]

    astro-ph.SR astro-ph.GA

    X-Shooting ULLYSES: Massive stars at low metallicity -- V. Effect of metallicity on surface abundances of O stars

    Authors: F. Martins, J. -C. Bouret, D. J. Hillier, S. A. Brands, P. A. Crowther, A. Herrero, F. Najarro, D. Pauli, J. Puls, V. Ramachandran, A. A. C. Sander, J. S. Vink, the XshootU collaboration

    Abstract: Massive stars rotate faster, on average, than lower mass stars. Stellar rotation triggers hydrodynamical instabilities which transport angular momentum and chemical species from the core to the surface. Models of high-mass stars that include these processes predict that chemical mixing is stronger at lower metallicity. We aim to test this prediction by comparing the surface abundances of massive s…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 15 pages + appendix. Accepted in Astronomy & Astrophysics

  12. arXiv:2405.00085  [pdf]

    astro-ph.SR astro-ph.CO astro-ph.GA astro-ph.HE astro-ph.IM

    X-Shooting ULLYSES: Massive Stars at Low Metallicity

    Authors: Jorick S. Vink, Paul Crowther, Alex Fullerton, Miriam Garcia, Fabrice Martins, Nidia Morrell, Lida Oskinova, Nicole St. Louis, Asif ud-Doula, Andreas Sander, Hugues Sana, Jean-Claude Bouret, Brankica Kubatova, Pablo Marchant, Lucimara P. Martins, Aida Wofford, Jacco van Loon, O. Grace Telford, Ylva Götberg, Dominic Bowman, Christi Erba, Venu Kalari, The XShootU Collaboration

    Abstract: The Hubble Space Telescope has devoted 500 orbits to observing 250 massive stars with low metallicity in the ultraviolet (UV) range within the framework of the ULLYSES program. The X-Shooting ULLYSES (XShootU) project enhances the legacy value of this UV dataset by providing high-quality optical and near-infrared spectra, which are acquired using the wide-wavelength-coverage X-shooter spectrograph…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 6 pages, 6 figures. ESO Large Programme Overview

    Journal ref: ESO Messenger, 2024

  13. arXiv:2403.12888  [pdf, other]

    cond-mat.mes-hall quant-ph

    Electrical readout of spins in the absence of spin blockade

    Authors: Felix-Ekkehard von Horstig, Lorenzo Peri, Sylvain Barraud, Jason A. W. Robinson, Monica Benito, Frederico Martins, M. Fernando Gonzalez-Zalba

    Abstract: In semiconductor nanostructures, spin blockade (SB) is the most scalable mechanism for electrical spin readout requiring only two bound spins for its implementation which, in conjunction with charge sensing techniques, has led to high-fidelity readout of spins in semiconductor-based quantum processors. However, various mechanisms may lift SB, such as strong spin-orbit coupling (SOC) or low-lying e…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures

  14. arXiv:2403.08314  [pdf, other]

    cs.CL

    Is Context Helpful for Chat Translation Evaluation?

    Authors: Sweta Agrawal, Amin Farajian, Patrick Fernandes, Ricardo Rei, André F. T. Martins

    Abstract: Despite the recent success of automatic metrics for assessing translation quality, their application in evaluating the quality of machine-translated chats has been limited. Unlike more structured texts like news, chat conversations are often unstructured, short, and heavily reliant on contextual information. This poses questions about the reliability of existing sentence-level metrics in this doma…

    Submitted 13 March, 2024; originally announced March 2024.

  15. arXiv:2403.03923  [pdf, other]

    cs.CL

    Did Translation Models Get More Robust Without Anyone Even Noticing?

    Authors: Ben Peters, André F. T. Martins

    Abstract: Neural machine translation (MT) models achieve strong results across a variety of settings, but it is widely believed that they are highly sensitive to "noisy" inputs, such as spelling errors, abbreviations, and other formatting issues. In this paper, we revisit this insight in light of recent multilingual MT models and large language models (LLMs) applied to machine translation. Somewhat surprisi…

    Submitted 6 March, 2024; originally announced March 2024.

  16. arXiv:2403.03883  [pdf, other]

    cs.CL

    SaulLM-7B: A pioneering Large Language Model for Law

    Authors: Pierre Colombo, Telmo Pessoa Pires, Malik Boudiaf, Dominic Culver, Rui Melo, Caio Corro, Andre F. T. Martins, Fabrizio Esposito, Vera Lúcia Raposo, Sofia Morgado, Michael Desa

    Abstract: In this paper, we introduce SaulLM-7B, a large language model (LLM) tailored for the legal domain. With 7 billion parameters, SaulLM-7B is the first LLM designed explicitly for legal text comprehension and generation. Leveraging the Mistral 7B architecture as its foundation, SaulLM-7B is trained on an English legal corpus of over 30 billion tokens. SaulLM-7B exhibits state-of-the-art proficiency i…

    Submitted 7 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  17. arXiv:2402.17733  [pdf, other]

    cs.CL

    Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

    Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

    Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and pa…

    Submitted 27 February, 2024; originally announced February 2024.

  18. arXiv:2402.13725  [pdf, other]

    cs.LG

    Sparse and Structured Hopfield Networks

    Authors: Saul Santos, Vlad Niculae, Daniel McNamee, Andre F. T. Martins

    Abstract: Modern Hopfield networks have enjoyed recent interest due to their connection to attention in transformers. Our paper provides a unified framework for sparse Hopfield networks by establishing a link with Fenchel-Young losses. The result is a new family of Hopfield-Fenchel-Young energies whose update rules are end-to-end differentiable sparse transformations. We reveal a connection between loss mar…

    Submitted 4 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 20 pages, 4 figures

  19. arXiv:2402.00786  [pdf, other]

    cs.CL cs.LG

    CroissantLLM: A Truly Bilingual French-English Language Model

    Authors: Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António Loison, Duarte M. Alves, Caio Corro, Nicolas Boizard, João Alves, Ricardo Rei, Pedro H. Martins, Antoni Bigata Casademunt, François Yvon, André F. T. Martins, Gautier Viaud, Céline Hudelot, Pierre Colombo

    Abstract: We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a cust…

    Submitted 29 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  20. arXiv:2402.00707  [pdf, other]

    cs.CL cs.AI cs.LG

    Non-Exchangeable Conformal Language Generation with Nearest Neighbors

    Authors: Dennis Ulmer, Chrysoula Zerva, André F. T. Martins

    Abstract: Quantifying uncertainty in automatically generated text is important for letting humans check potential hallucinations and making systems more reliable. Conformal prediction is an attractive framework to provide predictions imbued with statistical guarantees, however, its application to text generation is challenging since any i.i.d. assumptions are not realistic. In this paper, we bridge this gap…

    Submitted 1 February, 2024; originally announced February 2024.

  21. Evidence for Very Massive Stars in extremely UV-bright star-forming galaxies at $z \sim 2.2-3.6$

    Authors: A. Upadhyaya, R. Marques-Chaves, D. Schaerer, F. Martins, I. Pérez-Fournon, A. Palacios, E. R. Stanway

    Abstract: We present a comprehensive analysis of the presence of very massive stars (VMS > $100 M_{\odot}$) in the integrated spectra of 13 UV-bright star-forming galaxies at $2.2 \lesssim z \lesssim 3.6$ taken with the Gran Telescopio Canarias (GTC). These galaxies have very high UV absolute magnitudes ($M_{\rm UV} \simeq -24$), intense star formation (SFR $ \simeq 100-1000$ $M_{\odot}$ yr$^{-1}$), and met…

    Submitted 3 April, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 20 pages, 11 Figures, Accepted for Publication in Astronomy & Astrophysics

    Journal ref: A&A 686, A185 (2024)

  22. arXiv:2401.13303  [pdf, other]

    cs.CL

    MaLA-500: Massive Language Adaptation of Large Language Models

    Authors: Peiqin Lin, Shaoxiong Ji, Jörg Tiedemann, André F. T. Martins, Hinrich Schütze

    Abstract: Large language models (LLMs) have advanced the state of the art in natural language processing. However, their predominant design for English or a limited set of languages creates a substantial gap in their effectiveness for low-resource languages. To bridge this gap, we introduce MaLA-500, a novel large language model designed to cover an extensive range of 534 languages. To train MaLA-500, we em…

    Submitted 3 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  23. arXiv:2312.00282  [pdf, other]

    econ.EM

    Stochastic volatility models with skewness selection

    Authors: Igor Ferreira Batista Martins, Hedibert Freitas Lopes

    Abstract: This paper expands traditional stochastic volatility models by allowing for time-varying skewness without imposing it. While dynamic asymmetry may capture the likely direction of future asset returns, it comes at the risk of leading to overparameterization. Our proposed approach mitigates this concern by leveraging sparsity-inducing priors to automatically select the skewness parameter as being d…

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 22 pages, 8 figures

  24. arXiv:2311.09132  [pdf, other]

    cs.CL

    Aligning Neural Machine Translation Models: Human Feedback in Training and Inference

    Authors: Miguel Moura Ramos, Patrick Fernandes, António Farinhas, André F. T. Martins

    Abstract: Reinforcement learning from human feedback (RLHF) is a recent technique to improve the quality of the text generated by a language model, making it closer to what humans would generate. A core ingredient in RLHF's success in aligning and improving large language models (LLMs) is its reward model, trained using human feedback on model outputs. In machine translation (MT), where metrics trained from…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 14 pages, work-in-progress

  25. arXiv:2310.13448  [pdf, other]

    cs.CL

    Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

    Authors: Duarte M. Alves, Nuno M. Guerreiro, João Alves, José Pombal, Ricardo Rei, José G. C. de Souza, Pierre Colombo, André F. T. Martins

    Abstract: Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capa…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 - Findings

  26. arXiv:2310.11430  [pdf, other]

    cs.CL

    An Empirical Study of Translation Hypothesis Ensembling with Large Language Models

    Authors: António Farinhas, José G. C. de Souza, André F. T. Martins

    Abstract: Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and A…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (main conference)

  27. arXiv:2310.10482  [pdf, other]

    cs.CL

    xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection

    Authors: Nuno M. Guerreiro, Ricardo Rei, Daan van Stigt, Luisa Coheur, Pierre Colombo, André F. T. Martins

    Abstract: Widely used learned metrics for machine translation evaluation, such as COMET and BLEURT, estimate the quality of a translation hypothesis by providing a single sentence-level score. As such, they offer little insight into translation errors (e.g., what are the errors and what is their severity). On the other hand, generative large language models (LLMs) are amplifying the adoption of more granula…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Work in progress

  28. arXiv:2310.06539  [pdf, ps, other]

    astro-ph.SR astro-ph.GA

    Surface chemical composition of single WNh stars

    Authors: Fabrice Martins

    Abstract: Wolf-Rayet (WR) stars of the WNh category contain a significant fraction of hydrogen at their surface. They can be hydrogen-burning, very massive stars or stars in a post-main sequence phase of evolution. Also, WNh stars are sometimes not included in population synthesis models. We aim to better characterise the properties of single WNh stars in the Galaxy and the Magellanic Clouds. In particular,…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 16 pages, 12 figures + appendix. Accepted in Astronomy & Astrophysics

    Journal ref: A&A 680, A22 (2023)

  29. arXiv:2310.01262  [pdf, other]

    cs.LG stat.ML

    Non-Exchangeable Conformal Risk Control

    Authors: António Farinhas, Chrysoula Zerva, Dennis Ulmer, André F. T. Martins

    Abstract: Split conformal prediction has recently sparked great interest due to its ability to provide formally guaranteed uncertainty sets or intervals for predictions made by black-box neural models, ensuring a predefined probability of containing the actual ground truth. While the original formulation assumes data exchangeability, some extensions handle non-exchangeable data, which is often the case in m…

    Submitted 26 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  30. arXiv:2309.11925  [pdf, other]

    cs.CL

    Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

    Abstract: We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks,…

    Submitted 21 September, 2023; originally announced September 2023.

  31. Inferring the presence of very massive stars in local star-forming regions

    Authors: Fabrice Martins, Daniel Schaerer, Rui Marques-Chaves, Ankur Upadhyaya

    Abstract: We present a study aiming at detecting VMS in local star-forming regions from the imprint they leave on the integrated UV and optical light. We analyzed a sample of 27 star-forming regions and galaxies in the local Universe. We selected sources with a metallicity close to that of the LMC. We defined empirical criteria to distinguish sources dominated by VMS and Wolf-Rayet stars (WR), using template…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 16 pages, 10 figures + appendix. Accepted in Astronomy and Astrophysics

    Journal ref: A&A 678, A159 (2023)

  32. arXiv:2308.07286  [pdf, other]

    cs.CL cs.LG

    The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

    Authors: Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André F. T. Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat

    Abstract: Automatic evaluation of machine translation (MT) is a critical tool driving the rapid iterative development of MT systems. While considerable progress has been made on estimating a single scalar quality score, current metrics lack the informativeness of more detailed schemes that annotate individual errors, such as Multidimensional Quality Metrics (MQM). In this paper, we help fill this gap by pro…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 19 pages

  33. arXiv:2307.10018  [pdf, other]

    cs.RO cs.AI

    RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023

    Authors: Aline Lima de Oliveira, Cauê Addae da Silva Gomes, Cecília Virginia Santos da Silva, Charles Matheus de Sousa Alves, Danilo Andrade Martins de Souza, Driele Pires Ferreira Araújo Xavier, Edgleyson Pereira da Silva, Felipe Bezerra Martins, Lucas Henrique Cavalcanti Santos, Lucas Dias Maciel, Matheus Paixão Gumercindo dos Santos, Matheus Lafayette Vasconcelos, Matheus Vinícius Teotonio do Nascimento Andrade, João Guilherme Oliveira Carvalho de Melo, João Pedro Souza Pereira de Moura, José Ronald da Silva, José Victor Silva Cruz, Pedro Henrique Santana de Morais, Pedro Paulo Salman de Oliveira, Riei Joaquim Matos Rodrigues, Roberto Costa Fernandes, Ryan Vinicius Santos Morais, Tamara Mayara Ramos Teobaldo, Washington Igor dos Santos Silva, Edna Natividade Silva Barros

    Abstract: RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou…

    Submitted 19 July, 2023; originally announced July 2023.

  34. arXiv:2306.06221  [pdf, other]

    cs.CL

    Conformalizing Machine Translation Evaluation

    Authors: Chrysoula Zerva, André F. T. Martins

    Abstract: Several uncertainty estimation methods have been recently proposed for machine translation evaluation. While these methods can provide a useful indication of when not to trust model predictions, we show in this paper that the majority of them tend to underestimate model uncertainty, and as a result they often produce misleading confidence intervals that do not cover the ground truth. We propose as…

    Submitted 9 June, 2023; originally announced June 2023.

  35. arXiv:2305.19348  [pdf, ps, other]

    cond-mat.stat-mech

    Topologically-constrained fluctuations and thermodynamics regulate nonequilibrium response

    Authors: Gabriela Fernandes Martins, Jordan M. Horowitz

    Abstract: Limits on a system's response to external perturbations inform our understanding of how physical properties can be shaped by microscopic characteristics. Here, we derive constraints on the steady-state nonequilibrium response of physical observables in terms of the topology of the microscopic state space and the strength of thermodynamic driving. Notably, evaluation of these limits requires no kin…

    Submitted 26 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: 25 pages, 13 figures

    Journal ref: Phys. Rev. E 108, 044113 (2023)

  36. arXiv:2305.19144  [pdf, other]

    cs.CL

    BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation

    Authors: Taisiya Glushkova, Chrysoula Zerva, André F. T. Martins

    Abstract: Although neural-based machine translation evaluation metrics, such as COMET or BLEURT, have achieved strong correlations with human judgements, they are sometimes unreliable in detecting certain phenomena that can be considered as critical errors, such as deviations in entities and numbers. In contrast, traditional evaluation metrics, such as BLEU or chrF, which measure lexical or character overla…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted at EAMT 2023

  37. arXiv:2305.17075  [pdf, other]

    cs.CL

    CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

    Authors: Marcos Treviso, Alexis Ross, Nuno M. Guerreiro, André F. T. Martins

    Abstract: Selective rationales and counterfactual examples have emerged as two effective, complementary classes of interpretability methods for analyzing and training NLP models. However, prior work has not explored how these methods can be integrated to combine their complementary advantages. We overcome this limitation by introducing CREST (ContRastive Edits with Sparse raTionalization), a joint framework…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 (main)

  38. arXiv:2305.13684  [pdf, other]

    cs.CL

    mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models

    Authors: Peiqin Lin, Chengzhi Hu, Zheyu Zhang, André F. T. Martins, Hinrich Schütze

    Abstract: Recent multilingual pretrained language models (mPLMs) have been shown to encode strong language-specific signals, which are not explicitly provided during pretraining. It remains an open question whether it is feasible to employ mPLMs to measure language similarity, and subsequently use the similarity results to select source languages for boosting cross-lingual transfer. To investigate this, we…

    Submitted 29 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EACL 2024 Findings

  39. Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages

    Authors: Ayyoob Imani, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, Nora Kassner, Chunlan Ma, Helmut Schmid, André F. T. Martins, François Yvon, Hinrich Schütze

    Abstract: The NLP community has mainly focused on scaling Large Language Models (LLMs) vertically, i.e., making them better for about 100 languages. We instead scale LLMs horizontally: we create, through continued pretraining, Glot500-m, an LLM that covers 511 predominantly low-resource languages. An important part of this effort is to collect and clean Glot500-c, a corpus that covers these 511 languages an…

    Submitted 26 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  40. arXiv:2305.11806  [pdf, other]

    cs.CL

    The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics

    Authors: Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie, André F. T. Martins

    Abstract: Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU. Yet, neural metrics are, to a great extent, "black boxes" returning a single sentence-level score without transparency about the decision-making process. In this work, we develop and…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  41. arXiv:2305.06376  [pdf, other]

    astro-ph.SR astro-ph.CO astro-ph.GA astro-ph.HE

    X-Shooting ULLYSES: massive stars at low metallicity. I. Project Description

    Authors: Jorick S. Vink, A. Mehner, P. A. Crowther, A. Fullerton, M. Garcia, F. Martins, N. Morrell, L. M. Oskinova, N. St-Louis, A. ud-Doula, A. A. C. Sander, H. Sana, J. -C. Bouret, B. Kubatova, P. Marchant, L. P. Martins, A. Wofford, J. Th. van Loon, O. Grace Telford, Y. Gotberg, D. M. Bowman, C. Erba, V. M. Kalari, M. Abdul-Masih, T. Alkousa, et al. (56 additional authors not shown)

    Abstract: Observations of individual massive stars, super-luminous supernovae, gamma-ray bursts, and gravitational-wave events involving spectacular black-hole mergers, indicate that the low-metallicity Universe is fundamentally different from our own Galaxy. Many transient phenomena will remain enigmatic until we achieve a firm understanding of the physics and evolution of massive stars at low metallicity…

    Submitted 1 June, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: Accepted in A&A - 35 Pages, 12 Figures, 4 Tables, 2 Large Tables

    Journal ref: A&A 675, A154 (2023)

  42. arXiv:2305.03182  [pdf, ps, other]

    math-ph hep-th nlin.SI

    The Darboux-KP system as an integrable Chern-Simons multiform theory in infinite dimensional space

    Authors: Joao Faria Martins, Frank W Nijhoff, Daniel Riccombeni

    Abstract: In a previous paper by one of the authors, a Lagrangian 3-form structure was established for a generalised Darboux system, originally describing orthogonal curvilinear coordinate systems, which encodes the Kadomtsev-Petviashvili (KP) hierarchy. Here a hierarchy of Lagrangian multiforms is established for the same system, viewed as a hierarchy of Chern-Simons actions in an infinite-dimensional spac…

    Submitted 4 May, 2023; originally announced May 2023.

  43. arXiv:2305.00955  [pdf, other]

    cs.CL cs.AI cs.LG

    Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

    Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

    Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod…

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Work in Progress

  44. arXiv:2304.13442  [pdf, other]

    cond-mat.mes-hall quant-ph

    Multi-module microwave assembly for fast read-out and charge noise characterization of silicon quantum dots

    Authors: Felix-Ekkehard von Horstig, David J. Ibberson, Giovanni A. Oakes, Laurence Cochrane, David F. Wise, Nadia Stelmashenko, Sylvain Barraud, Jason A. W. Robinson, Frederico Martins, M. Fernando Gonzalez-Zalba

    Abstract: Fast measurement of quantum devices is important in areas such as quantum sensing, quantum computing and nanodevice quality analysis. Here, we develop a superconductor-semiconductor multi-module microwave assembly to demonstrate charge state readout at the state-of-the-art. The assembly consists of a superconducting readout resonator interfaced to a silicon-on-insulator (SOI) chiplet containing qu…

    Submitted 2 May, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Main: 8 pages, 4 figures. Supplementary: 4 pages, 7 figures

  45. arXiv:2304.08457  [pdf, other]

    physics.soc-ph cs.SI physics.data-an

    Deep Learning Criminal Networks

    Authors: Haroldo V. Ribeiro, Diego D. Lopes, Arthur A. B. Pessa, Alvaro F. Martins, Bruno R. da Cunha, Sebastian Goncalves, Ervin K. Lenzi, Quentin S. Hanley, Matjaz Perc

    Abstract: Recent advances in deep learning methods have enabled researchers to develop and apply algorithms for the analysis and modeling of complex networks. These advances have sparked a surge of interest at the interface between network science and machine learning. Despite this, the use of machine learning methods to investigate criminal networks remains surprisingly scarce. Here, we explore the potenti…

    Submitted 4 June, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 14 two-column pages, 5 figures

    Journal ref: Chaos, Solitons & Fractals 172, 113579 (2023)

  46. arXiv:2303.16104  [pdf, other]

    cs.CL

    Hallucinations in Large Multilingual Translation Models

    Authors: Nuno M. Guerreiro, Duarte Alves, Jonas Waldendorf, Barry Haddow, Alexandra Birch, Pierre Colombo, André F. T. Martins

    Abstract: Large-scale multilingual machine translation systems have demonstrated remarkable ability to translate directly between numerous languages, making them increasingly appealing for real-world applications. However, when deployed in the wild, these models may generate hallucinated translations which have the potential to severely undermine user trust and raise safety concerns. Existing research on ha…

    Submitted 28 March, 2023; originally announced March 2023.

  47. arXiv:2303.00121  [pdf, other]

    physics.ins-det hep-ex

    Laser calibration of the ATLAS Tile Calorimeter during LHC Run 2

    Authors: M. N. Agaras, A. Ahmad, A. Blanco, D. Boumediene, R. Bonnefoy, D. Calvet, M. Calvetti, R. Chadelas, P. Conde Muino, A. Cortes Gonzalez, M. Crouau, C. Crozatier, F. Daudon, T. Davidek, G. Di Gregorio, L. Fiorini, B. Galhardo, Ph. Gris, P. Klimek, P. Lafarguette, D. Lambert, S. Leone, A. Maio, M. Marjanovic, F. Martins, et al. (15 additional authors not shown)

    Abstract: This article reports the laser calibration of the hadronic Tile Calorimeter of the ATLAS experiment in the LHC Run 2 data campaign. The upgraded Laser II calibration system is described. The system was commissioned during the first LHC Long Shutdown, exhibiting a stability better than 0.8% for the laser light monitoring. The methods employed to derive the detector calibration factors with data fro…

    Submitted 5 July, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

    Journal ref: JINST 18 (2023) 06, P06023

  48. arXiv:2301.07473  [pdf, other]

    cs.LG stat.ML

    Discrete Latent Structure in Neural Networks

    Authors: Vlad Niculae, Caio F. Corro, Nikita Nangia, Tsvetomila Mihaylova, André F. T. Martins

    Abstract: Many types of data from fields including natural language processing, computer vision, and bioinformatics, are well represented by discrete, compositional structures such as trees, sequences, or matchings. Latent structure models are a powerful tool for learning to extract such representations, offering a way to incorporate structural bias, discover insight about the data, and interpret decisions.…

    Submitted 18 January, 2023; originally announced January 2023.

    ACM Class: I.2.6

  49. Clues on the presence and segregation of very massive stars in the Sunburst Lyman-continuum cluster at z=2.37

    Authors: U. Mestric, E. Vanzella, A. Upadhyaya, F. Martins, R. Marques-Chaves, D. Schaerer, J. Guibert, A. Zanella, C. Grillo, P. Rosati, F. Calura, G. B. Caminha, A. Bolamperti, M. Meneghetti, P. Bergamini, A. Mercurio, M. Nonino, R. Pascale

    Abstract: We report the identification of very massive stars (VMS; mass $> 100\,M_\odot$) that may be segregated in the center of the young massive star cluster at $z = 2.37$ hosted in the lensed galaxy called the Sunburst galaxy. This result is based on two pieces of evidence: (1) VLT/MUSE spectra of several multiple images of the same star cluster show key spectral signatures of VMS, such as the He II broa…

    Submitted 22 March, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: 10 pages, 8 figures, accepted for publication in A&A

    Journal ref: A&A 673, A50 (2023)

  50. arXiv:2301.04653  [pdf, other]

    q-bio.GN cs.LG

    Optirank: classification for RNA-Seq data with optimal ranking reference genes

    Authors: Paola Malsot, Filipe Martins, Didier Trono, Guillaume Obozinski

    Abstract: Classification algorithms using RNA-Sequencing (RNA-Seq) data as input are used in a variety of biological applications. By nature, RNA-Seq data is subject to uncontrolled fluctuations both within and especially across datasets, which presents a major difficulty for a trained classifier to generalize to an external dataset. Replacing raw gene counts with the rank of gene counts inside an observati…

    Submitted 11 January, 2023; originally announced January 2023.