
arXiv:2406.19482 [pdf, other]

xTower: A Multilingual LLM for Explaining and Correcting Translation Errors

Authors: Marcos Treviso, Nuno M. Guerreiro, Sweta Agrawal, Ricardo Rei, José Pombal, Tania Vaz, Helena Wu, Beatriz Silva, Daan van Stigt, André F. T. Martins

Abstract: While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user experience. This paper introduces xTower, an open large language model (LLM) built on top of TowerBase designed to provide free-text explanations for translation errors in order to guide the generation of a corrected translation. The quality of the explanations generated by xTower is assessed via both intrinsic and extrinsic evaluation. We ask expert translators to evaluate the quality of the explanations across two dimensions: relatedness towards the error span being explained and helpfulness in error understanding and improving translation quality. Extrinsically, we test xTower across various experimental setups in generating translation corrections, demonstrating significant improvements in translation quality. Our findings highlight xTower's potential towards not only producing plausible and helpful explanations of automatic translations, but also leveraging them to suggest corrected translations.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2404.00104 [pdf, other]

Accurate PRD modeling of the forward-scattering Hanle effect in the chromospheric CaI 4227 Å line

Authors: Luca Belluzzi, Simone Riva, Gioele Janett, Nuno Guerreiro, Fabio Riva, Pietro Benedusi, Tanausú del Pino Alemán, Ernest Alsina Ballester, Javier Trujillo Bueno, Jiří Štěpán

Abstract: Measurable linear scattering polarization signals have been predicted and detected at the solar disk center in the core of chromospheric lines. These forward-scattering polarization signals, which are of high interest for magnetic field diagnostics, have always been modeled either under the assumption of complete frequency redistribution (CRD), or taking partial frequency redistribution (PRD) effects into account under the angle-averaged (AA) approximation. This work aims at assessing the suitability of the CRD and PRD-AA approximations for modeling the forward-scattering polarization signals produced by the presence of an inclined magnetic field, the so-called forward-scattering Hanle effect, in the chromospheric CaI 4227 Å line. Radiative transfer calculations are performed in semi-empirical 1D solar atmospheres, out of local thermodynamic equilibrium (LTE). A two-step solution strategy is applied: the non-LTE RT problem is first solved considering a multilevel atom and neglecting polarization phenomena. The same problem is then solved including polarization, considering a two-level atom and keeping fixed the lower-level population calculated at the previous step. The emergent linear polarization signals calculated under the CRD and PRD-AA approximations are analyzed and compared to those obtained by modeling PRD effects in their general angle-dependent (AD) formulation.
With respect to the PRD-AD case, the CRD and PRD-AA calculations significantly underestimate the amplitude of the line-center polarization signals produced by the forward-scattering Hanle effect. The results of this work suggest that a PRD-AD modeling is required in order to develop reliable diagnostic techniques exploiting the forward-scattering polarization signals observed in the CaI 4227 Å line. These results need to be confirmed by full 3D calculations including non-magnetic symmetry-breaking effects.

Submitted 29 March, 2024; originally announced April 2024.

arXiv:2402.17733 [pdf, other]

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

Authors: Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final model surpasses open alternatives on several tasks relevant to translation workflows and is competitive with general-purpose closed LLMs. To facilitate future research, we release the Tower models, our specialization dataset, an evaluation framework for LLMs focusing on the translation ecosystem, and a collection of model generations, including ours, on our benchmark.

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.13331 [pdf, other]

Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation

Authors: Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno M. Guerreiro

Abstract: Hallucinated translations pose significant threats and safety concerns when it comes to the practical deployment of machine translation systems. Previous research has identified that detectors exhibit complementary performance: different detectors excel at detecting different types of hallucinations. In this paper, we propose to address the limitations of individual detectors by combining them and introducing a straightforward method for aggregating multiple detectors. Our results demonstrate the efficacy of our aggregated detector, providing a promising step towards ever more reliable machine translation systems.

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.00786 [pdf, other]

CroissantLLM: A Truly Bilingual French-English Language Model

Authors: Manuel Faysse, Patrick Fernandes, Nuno M. Guerreiro, António Loison, Duarte M. Alves, Caio Corro, Nicolas Boizard, João Alves, Ricardo Rei, Pedro H. Martins, Antoni Bigata Casademunt, François Yvon, André F. T. Martins, Gautier Viaud, Céline Hudelot, Pierre Colombo

Abstract: We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a custom tokenizer, and bilingual finetuning datasets. We release the training dataset, notably containing a French split with manually curated, high-quality, and varied data sources. To assess performance outside of English, we craft a novel benchmark, FrenchBench, consisting of an array of classification and generation tasks, covering various orthogonal aspects of model performance in the French Language. Additionally, rooted in transparency and to foster further Large Language Model research, we release codebases, and dozens of checkpoints across various model sizes, training data distributions, and training steps, as well as fine-tuned Chat models, and strong translation models. We evaluate our model through the FMTI framework, and validate 81% of the transparency criteria, far beyond the scores of even most open initiatives. This work enriches the NLP landscape, breaking away from previous English-centric work in order to strengthen our understanding of multilinguality in language models.

Submitted 29 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

arXiv:2310.13448 [pdf, other]

Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

Authors: Duarte M. Alves, Nuno M. Guerreiro, João Alves, José Pombal, Ricardo Rei, José G. C. de Souza, Pierre Colombo, André F. T. Martins

Abstract: Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capabilities, due to overspecialization. In this paper, we provide a closer look at this problem. We start by showing that adapter-based finetuning with LoRA matches the performance of traditional finetuning while reducing the number of training parameters by a factor of 50. This method also outperforms few-shot prompting and eliminates the need for post-processing or in-context examples. However, we show that finetuning generally degrades few-shot performance, hindering adaptation capabilities. Finally, to obtain the best of both worlds, we propose a simple approach that incorporates few-shot examples during finetuning. Experiments on 10 language pairs show that our proposed approach recovers the original few-shot capabilities while keeping the added benefits of finetuning.

Submitted 20 October, 2023; originally announced October 2023.

Comments: Accepted at EMNLP 2023 - Findings

arXiv:2310.10482 [pdf, other]

xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection

Authors: Nuno M. Guerreiro, Ricardo Rei, Daan van Stigt, Luisa Coheur, Pierre Colombo, André F. T. Martins

Abstract: Widely used learned metrics for machine translation evaluation, such as COMET and BLEURT, estimate the quality of a translation hypothesis by providing a single sentence-level score. As such, they offer little insight into translation errors (e.g., what are the errors and what is their severity). On the other hand, generative large language models (LLMs) are amplifying the adoption of more granular strategies to evaluation, attempting to detail and categorize translation errors. In this work, we introduce xCOMET, an open-source learned metric designed to bridge the gap between these approaches. xCOMET integrates both sentence-level evaluation and error span detection capabilities, exhibiting state-of-the-art performance across all types of evaluation (sentence-level, system-level, and error span detection). Moreover, it does so while highlighting and categorizing error spans, thus enriching the quality assessment. We also provide a robustness analysis with stress tests, and show that xCOMET is largely capable of identifying localized critical errors and hallucinations.

Submitted 16 October, 2023; originally announced October 2023.

Comments: Work in progress

arXiv:2310.02021 [pdf, other]

doi 10.1051/0004-6361/202346615

Assessment of the CRD approximation for the observer's frame RIII redistribution matrix

Authors: Simone Riva, Nuno Guerreiro, Gioele Janett, Diego Rossinelli, Pietro Benedusi, Rolf Krause, Luca Belluzzi

Abstract: Approximated forms of the RII and RIII redistribution matrices are frequently applied to simplify the numerical solution of the radiative transfer problem for polarized radiation, taking partial frequency redistribution (PRD) effects into account. A widely used approximation for RIII is to consider its expression under the assumption of complete frequency redistribution (CRD) in the observer frame (RIII CRD). The adequacy of this approximation for modeling the intensity profiles has been firmly established. By contrast, its suitability for modeling scattering polarization signals has only been analyzed in a few studies, considering simplified settings. In this work, we aim at quantitatively assessing the impact and the range of validity of the RIII CRD approximation in the modeling of scattering polarization. Methods. We first present an analytic comparison between RIII and RIII CRD. We then compare the results of radiative transfer calculations, out of local thermodynamic equilibrium, performed with RIII and RIII CRD in realistic 1D atmospheric models. We focus on the chromospheric Ca i line at 4227 Å and on the photospheric Sr i line at 4607 Å.

Submitted 12 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

Journal ref: A&A 679, A87 (2023)

arXiv:2309.11925 [pdf, other]

Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

Authors: Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

Abstract: We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated in all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks, reaching state-of-the-art performance for quality estimation at word-, span- and sentence-level granularity. Compared to the previous state-of-the-art COMETKIWI-22, we show large improvements in correlation with human judgements (up to 10 Spearman points). Moreover, we surpass the second-best multilingual submission to the shared task by up to 3.8 absolute points.

Submitted 21 September, 2023; originally announced September 2023.

arXiv:2305.17075 [pdf, other]

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

Authors: Marcos Treviso, Alexis Ross, Nuno M. Guerreiro, André F. T. Martins

Abstract: Selective rationales and counterfactual examples have emerged as two effective, complementary classes of interpretability methods for analyzing and training NLP models. However, prior work has not explored how these methods can be integrated to combine their complementary advantages. We overcome this limitation by introducing CREST (ContRastive Edits with Sparse raTionalization), a joint framework for selective rationalization and counterfactual text generation, and show that this framework leads to improvements in counterfactual quality, model robustness, and interpretability. First, CREST generates valid counterfactuals that are more natural than those produced by previous methods, and subsequently can be used for data augmentation at scale, reducing the need for human-generated examples. Second, we introduce a new loss function that leverages CREST counterfactuals to regularize selective rationales and show that this regularization improves both model robustness and rationale quality, compared to methods that do not leverage CREST counterfactuals. Our results demonstrate that CREST successfully bridges the gap between selective rationales and counterfactual examples, addressing the limitations of existing methods and providing a more comprehensive view of a model's predictions.

Submitted 26 May, 2023; originally announced May 2023.

Comments: Accepted at ACL 2023 (main)

arXiv:2305.11806 [pdf, other]

The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics

Authors: Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie, André F. T. Martins

Abstract: Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU. Yet, neural metrics are, to a great extent, "black boxes" returning a single sentence-level score without transparency about the decision-making process. In this work, we develop and compare several neural explainability methods and demonstrate their effectiveness for interpreting state-of-the-art fine-tuned neural metrics. Our study reveals that these metrics leverage token-level information that can be directly attributed to translation errors, as assessed through comparison of token-level neural saliency maps with Multidimensional Quality Metrics (MQM) annotations and with synthetically-generated critical translation errors. To ease future research, we release our code at: https://github.com/Unbabel/COMET/tree/explainable-metrics.

Submitted 19 May, 2023; originally announced May 2023.

Comments: Accepted at ACL 2023

arXiv:2304.08654 [pdf, other]

Unleashing the Power of Sound: Revisiting the Physics of Notations for Modelling with auditory symbols

Authors: Nuno Guerreiro, Vasco Amaral, Miguel Goulão

Abstract: Sound - the oft-neglected sense for Software Engineering - is a crucial component of our daily lives, playing a vital role in how we interact with the world around us. In this paper, we challenge the traditional boundaries of Software Engineering by proposing a new approach based on sound design for using sound in modelling tools that is on par with visual design. By drawing upon the seminal work of Moody on the 'Physics' of Notations for visual design, we develop a comprehensive catalogue of principles that can guide the design of sound notations. Using these principles, we develop a catalogue of sounds for UML and report on an empirical study that supports their usefulness. Our study lays the foundation for building more sophisticated sound-based notations. The guidelines for designing symbolic sounds for software models are an essential starting point for a new research thread that could significantly and effectively enable the use of sound in modelling tools.

Submitted 17 April, 2023; originally announced April 2023.

arXiv:2303.16104 [pdf, other]

Hallucinations in Large Multilingual Translation Models

Authors: Nuno M. Guerreiro, Duarte Alves, Jonas Waldendorf, Barry Haddow, Alexandra Birch, Pierre Colombo, André F. T. Martins

Abstract: Large-scale multilingual machine translation systems have demonstrated remarkable ability to translate directly between numerous languages, making them increasingly appealing for real-world applications. However, when deployed in the wild, these models may generate hallucinated translations which have the potential to severely undermine user trust and raise safety concerns. Existing research on hallucinations has primarily focused on small bilingual models trained on high-resource languages, leaving a gap in our understanding of hallucinations in massively multilingual models across diverse translation scenarios. In this work, we fill this gap by conducting a comprehensive analysis on both the M2M family of conventional neural machine translation models and ChatGPT, a general-purpose large language model (LLM) that can be prompted for translation. Our investigation covers a broad spectrum of conditions, spanning over 100 translation directions across various resource levels and going beyond English-centric language pairs. We provide key insights regarding the prevalence, properties, and mitigation of hallucinations, paving the way towards more responsible and reliable machine translation systems.

Submitted 28 March, 2023; originally announced March 2023.

arXiv:2212.09631 [pdf, other]

Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation

Authors: Nuno M. Guerreiro, Pierre Colombo, Pablo Piantanida, André F. T. Martins

Abstract: Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It thus becomes crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit encoder-decoder attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ large models trained on millions of samples.

Submitted 19 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

Comments: Accepted at ACL 2023

arXiv:2209.06243 [pdf, other]

CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

Authors: Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

Abstract: We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated in all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it with a word-level sequence tagger and an explanation extractor. Our results suggest that incorporating references during pretraining improves performance across several language pairs on downstream tasks, and that jointly training with sentence and word-level objectives yields a further boost. Furthermore, combining attention and gradient information proved to be the top strategy for extracting good explanations of sentence-level QE models. Overall, our submissions achieved the best results for all three tasks for almost all language pairs by a considerable margin.

Submitted 13 September, 2022; originally announced September 2022.

Comments: WMT 2022 Quality Estimation shared task

arXiv:2208.05309 [pdf, other]

Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation

Authors: Nuno M. Guerreiro, Elena Voita, André F. T. Martins

Abstract: Although the problem of hallucinations in neural machine translation (NMT) has received some attention, research on this highly pathological phenomenon lacks solid ground. Previous work has been limited in several ways: it often resorts to artificial settings where the problem is amplified, it disregards some (common) types of hallucinations, and it does not validate the adequacy of detection heuristics. In this paper, we set foundations for the study of NMT hallucinations. First, we work in a natural setting, i.e., in-domain data without artificial noise in either training or inference. Next, we annotate a dataset of over 3.4k sentences indicating different kinds of critical errors and hallucinations. Then, we turn to detection methods and both revisit methods used previously and propose using glass-box uncertainty-based detectors. Overall, we show that for preventive settings, (i) previously used methods are largely inadequate, (ii) sequence log-probability works best and performs on par with reference-based methods. Finally, we propose DeHallucinator, a simple method for alleviating hallucinations at test time that significantly reduces the hallucinatory rate. To ease future research, we release our annotated dataset for WMT18 German-English data, along with the model, training data, and code.

Submitted 5 March, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

Comments: Accepted at EACL23 (main)

arXiv:2202.08659 [pdf, other]

doi 10.1051/0004-6361/202243350

Hanle rotation signatures in Sr I 4607 Å

Authors: Franziska Zeuner, Luca Belluzzi, Nuno Guerreiro, Renzo Ramelli, Michele Bianda

Abstract: Observations of scattering polarization and the Hanle effect in various spectral lines are increasingly used to complement traditional solar magnetic field determination techniques. One of the strongest scattering polarization signals in the photosphere is measured in the Sr I line at 4607.3 Å when observed close to the solar limb. Here, we present the first observational evidence of Hanle rotation in the linearly polarized spectrum of this line at several limb distances. We observed with the Zurich IMaging POLarimeter, ZIMPOL, at the IRSOL observatory, with exceptionally good seeing conditions, allowing for long integration times. We combined the fast modulating polarimeter with a slow modulator installed in front of the telescope. This combination allows the measurement of spectropolarimetric data that are highly precise and unprecedentedly accurate. Fixing the reference direction for positive Stokes $Q$ parallel to the limb, we detect singly-peaked $U/I$ signals well above the noise level. We can exclude instrumental origin for such $U/I$ signals. These signatures are exclusively found in the Sr I line, but not in the adjoining Fe I line, therefore eliminating the Zeeman effect as the mechanism responsible for their appearance. However, we find a clear spatial correlation between the circular polarization produced by the Zeeman effect and the $U/I$ amplitudes. This suggests that the detected $U/I$ signals are the signatures of Hanle rotation caused by a spatially resolved magnetic field.
A novel measurement technique allows for determining the absolute level of polarization with unprecedented precision. Using this technique, high-precision spectropolarimetric observations reveal for the first time unambiguous $U/I$ signals due to Hanle rotation in the Sr I line.

Submitted 6 May, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

Comments: 7 pages, 4 figures, accepted for publication in A&A

Journal ref: A&A 662, A46 (2022)

arXiv:2110.11990 [pdf, ps, other]

doi 10.1051/0004-6361/202141549

Modeling the scattering polarization of the solar Ca i 4227 Å line with angle-dependent partial frequency redistribution

Authors: Gioele Janett, Ernest Alsina Ballester, Nuno Guerreiro, Simone Riva, Luca Belluzzi, Tanausú del Pino Alemán, Javier Trujillo Bueno

Abstract: Context. The correct modeling of the scattering polarization signals observed in several strong resonance lines requires taking partial frequency redistribution (PRD) phenomena into account. Aims. This work aims at assessing the impact and the range of validity of the angle-averaged (AA) approximation with respect to the general angle-dependent (AD) treatment of PRD effects in the modeling of scattering polarization in strong resonance lines, with focus on the solar Ca i 4227 Å line. Methods. Spectral line polarization is modeled by solving the radiative transfer problem for polarized radiation, under nonlocal thermodynamic equilibrium conditions, taking PRD effects into account, in static one-dimensional semi-empirical atmospheric models presenting arbitrary magnetic fields. The problem is solved through a two-step approach. In step 1, the problem is solved for intensity only, considering a multi-level atom. In step 2, the problem is solved including polarization, considering a two-level atom with an unpolarized and infinitely sharp lower level, and fixing the lower level population calculated at step 1. Results. The results for the Ca i 4227 Å line show a good agreement between the AA and AD calculations for the Q/I and U/I wings signals. However, AA calculations reveal an artificial trough in the line-core peak of the linear polarization profiles, whereas AD calculations show a sharper peak in agreement with observations. Conclusions. An AD treatment of PRD effects is essential to correctly model the line-core peak of the scattering polarization signal of the Ca i 4227 Å line. By contrast, in the considered static case, the AA approximation seems to be suitable to model the wing scattering polarization lobes and their magnetic sensitivity through magneto-optical effects.

Submitted 22 October, 2021; originally announced October 2021.

arXiv:2109.04552 [pdf, other]

SPECTRA: Sparse Structured Text Rationalization

Authors: Nuno Miguel Guerreiro, André F. T. Martins

Abstract: Selective rationalization aims to produce decisions along with rationales (e.g., text highlights or word alignments between two sentences). Commonly, rationales are modeled as stochastic binary masks, requiring sampling-based gradient estimators, which complicates training and requires careful hyperparameter tuning. Sparse attention mechanisms are a deterministic alternative, but they lack a way to regularize the rationale extraction (e.g., to control the sparsity of a text highlight or the number of alignments). In this paper, we present a unified framework for deterministic extraction of structured explanations via constrained inference on a factor graph, forming a differentiable layer. Our approach greatly eases training and rationale regularization, generally outperforming previous work on what comes to performance and plausibility of the extracted rationales. We further provide a comparative study of stochastic and deterministic methods for rationale extraction for classification and natural language inference tasks, jointly assessing their predictive power, quality of the explanations, and model variability.

Submitted 9 September, 2021; originally announced September 2021.

Comments: Accepted to EMNLP 2021 (main conference)

arXiv:1612.02348 [pdf]

doi 10.5281/zenodo.183859

Lower solar atmosphere and magnetism at ultra-high spatial resolution

Authors: Remo Collet, Serena Criscuoli, Ilaria Ermolli, Damian Fabbian, Nuno Guerreiro, Margit Haberreiter, Courtney Peck, Tiago M. D. Pereira, Matthias Rempel, Sami K. Solanki, Sven Wedemeyer-Boehm

Abstract: We present the scientific case for a future space-based telescope aimed at very high spatial and temporal resolution imaging of the solar photosphere and chromosphere. Previous missions (e.g., HINODE, SUNRISE) have demonstrated the power of observing the solar photosphere and chromosphere at high spatial resolution without contamination from Earth's atmosphere. We argue here that increased spatial resolution (from currently 70 km to 25 km in the future) and high temporal cadence of the observations will vastly improve our understanding of the physical processes controlling solar magnetism and its characteristic scales. This is particularly important as the Sun's magnetic field drives solar activity and can significantly influence the Sun-Earth system. At the same time a better knowledge of solar magnetism can greatly improve our understanding of other astrophysical objects.

Submitted 7 December, 2016; originally announced December 2016.

arXiv:1508.07234 [pdf, other]

doi 10.1088/0004-637X/811/2/106

Numerical Simulations of Coronal Heating through Footpoint Braiding

Authors: Viggo Hansteen, Nuno Guerreiro, Bart De Pontieu, Mats Carlsson

Abstract: Advanced 3D radiative MHD simulations now reproduce many properties of the outer solar atmosphere. When including a domain from the convection zone into the corona, a hot chromosphere and corona are self-consistently maintained. Here we study two realistic models, with different simulated area, magnetic field strength and topology, and numerical resolution. These are compared in order to characterize the heating in the 3D-MHD simulations which self-consistently maintains the structure of the atmosphere. We analyze the heating at both large and small scales and find that heating is episodic and highly structured in space, but occurs along loop shaped structures, and moves along with the magnetic field. On large scales we find that the heating per particle is maximal near the transition region and that widely distributed opposite-polarity field in the photosphere leads to a greater heating scale height in the corona. On smaller scales, heating is concentrated in current sheets, the thicknesses of which are set by the numerical resolution. Some current sheets fragment in time, this process occurring more readily in the higher-resolution model leading to spatially highly intermittent heating. The large scale heating structures are found to fade in less than about five minutes, while the smaller, local, heating shows time scales of the order of 2 minutes in one model and 1 minute in the other, higher-resolution, model.

Submitted 28 August, 2015; originally announced August 2015.

Comments: 20 pages, accepted by ApJ

arXiv:0807.4373 [pdf, ps, other]

doi 10.1103/PhysRevD.78.067302

Dark matter from cosmic defects on galactic scales?

Authors: N. Guerreiro, P. P. Avelino, J. P. M. de Carvalho, C. J. A. P. Martins

Abstract: We discuss the possible dynamical role of extended cosmic defects on galactic scales, specifically focusing on the possibility that they may provide the dark matter suggested by the classical problem of galactic rotation curves. We emphasize that the more standard defects (such as Goto-Nambu strings) are unsuitable for this task, but show that more general models (such as transonic wiggly strings) could in principle have a better chance. In any case, we show that observational data severely restricts any such scenarios.

Submitted 12 September, 2008; v1 submitted 28 July, 2008; originally announced July 2008.

Comments: Submitted to Phys. Rev. D (Brief Reports). v2: Reference added and some typos corrected, matches published version

Journal ref: Phys.Rev.D78:067302,2008

Showing 1–22 of 22 results for author: Guerreiro, N