-
xTower: A Multilingual LLM for Explaining and Correcting Translation Errors
Authors:
Marcos Treviso,
Nuno M. Guerreiro,
Sweta Agrawal,
Ricardo Rei,
José Pombal,
Tania Vaz,
Helena Wu,
Beatriz Silva,
Daan van Stigt,
André F. T. Martins
Abstract:
While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user experience. This paper introduces xTower, an open large language model (LLM) built on top of TowerBase designed to provide free-text explanations for translation errors in order to guide the generation of a corrected translation. The quality of the explanations generated by xTower is assessed via both intrinsic and extrinsic evaluation. We ask expert translators to evaluate the quality of the explanations across two dimensions: relatedness towards the error span being explained and helpfulness in error understanding and improving translation quality. Extrinsically, we test xTower across various experimental setups in generating translation corrections, demonstrating significant improvements in translation quality. Our findings highlight xTower's potential towards not only producing plausible and helpful explanations of automatic translations, but also leveraging them to suggest corrected translations.
Submitted 27 June, 2024;
originally announced June 2024.
-
Accurate PRD modeling of the forward-scattering Hanle effect in the chromospheric CaI 4227 Å line
Authors:
Luca Belluzzi,
Simone Riva,
Gioele Janett,
Nuno Guerreiro,
Fabio Riva,
Pietro Benedusi,
Tanausú del Pino Alemán,
Ernest Alsina Ballester,
Javier Trujillo Bueno,
Jiří Štěpán
Abstract:
Measurable linear scattering polarization signals have been predicted and detected at the solar disk center in the core of chromospheric lines. These forward-scattering polarization signals, which are of high interest for magnetic field diagnostics, have always been modeled either under the assumption of complete frequency redistribution (CRD), or taking partial frequency redistribution (PRD) effects into account under the angle-averaged (AA) approximation. This work aims at assessing the suitability of the CRD and PRD-AA approximations for modeling the forward-scattering polarization signals produced by the presence of an inclined magnetic field, the so-called forward-scattering Hanle effect, in the chromospheric CaI 4227 Å line. Radiative transfer calculations are performed in semi-empirical 1D solar atmospheres, out of local thermodynamic equilibrium (LTE). A two-step solution strategy is applied: the non-LTE RT problem is first solved considering a multilevel atom and neglecting polarization phenomena. The same problem is then solved including polarization, considering a two-level atom and keeping fixed the lower-level population calculated at the previous step. The emergent linear polarization signals calculated under the CRD and PRD-AA approximations are analyzed and compared to those obtained by modeling PRD effects in their general angle-dependent (AD) formulation. With respect to the PRD-AD case, the CRD and PRD-AA calculations significantly underestimate the amplitude of the line-center polarization signals produced by the forward-scattering Hanle effect. The results of this work suggest that a PRD-AD modeling is required in order to develop reliable diagnostic techniques exploiting the forward-scattering polarization signals observed in the CaI 4227 Å line. These results need to be confirmed by full 3D calculations including non-magnetic symmetry-breaking effects.
Submitted 29 March, 2024;
originally announced April 2024.
-
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Authors:
Duarte M. Alves,
José Pombal,
Nuno M. Guerreiro,
Pedro H. Martins,
João Alves,
Amin Farajian,
Ben Peters,
Ricardo Rei,
Patrick Fernandes,
Sweta Agrawal,
Pierre Colombo,
José G. C. de Souza,
André F. T. Martins
Abstract:
While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final model surpasses open alternatives on several tasks relevant to translation workflows and is competitive with general-purpose closed LLMs. To facilitate future research, we release the Tower models, our specialization dataset, an evaluation framework for LLMs focusing on the translation ecosystem, and a collection of model generations, including ours, on our benchmark.
Submitted 27 February, 2024;
originally announced February 2024.
-
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation
Authors:
Anas Himmi,
Guillaume Staerman,
Marine Picot,
Pierre Colombo,
Nuno M. Guerreiro
Abstract:
Hallucinated translations pose significant threats and safety concerns when it comes to the practical deployment of machine translation systems. Previous research works have identified that detectors exhibit complementary performance: different detectors excel at detecting different types of hallucinations. In this paper, we propose to address the limitations of individual detectors by combining them and introducing a straightforward method for aggregating multiple detectors. Our results demonstrate the efficacy of our aggregated detector, providing a promising step towards ever more reliable machine translation systems.
Submitted 20 February, 2024;
originally announced February 2024.
-
CroissantLLM: A Truly Bilingual French-English Language Model
Authors:
Manuel Faysse,
Patrick Fernandes,
Nuno M. Guerreiro,
António Loison,
Duarte M. Alves,
Caio Corro,
Nicolas Boizard,
João Alves,
Ricardo Rei,
Pedro H. Martins,
Antoni Bigata Casademunt,
François Yvon,
André F. T. Martins,
Gautier Viaud,
Céline Hudelot,
Pierre Colombo
Abstract:
We introduce CroissantLLM, a 1.3B language model pretrained on a set of 3T English and French tokens, to bring to the research and industrial community a high-performance, fully open-sourced bilingual model that runs swiftly on consumer-grade local hardware. To that end, we pioneer the approach of training an intrinsically bilingual model with a 1:1 English-to-French pretraining data ratio, a custom tokenizer, and bilingual finetuning datasets. We release the training dataset, notably containing a French split with manually curated, high-quality, and varied data sources. To assess performance outside of English, we craft a novel benchmark, FrenchBench, consisting of an array of classification and generation tasks, covering various orthogonal aspects of model performance in the French language. Additionally, rooted in transparency and to foster further Large Language Model research, we release codebases and dozens of checkpoints across various model sizes, training data distributions, and training steps, as well as fine-tuned Chat models and strong translation models. We evaluate our model through the FMTI framework, and validate 81% of the transparency criteria, far beyond the scores of even most open initiatives. This work enriches the NLP landscape, breaking away from previous English-centric work in order to strengthen our understanding of multilinguality in language models.
Submitted 29 March, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning
Authors:
Duarte M. Alves,
Nuno M. Guerreiro,
João Alves,
José Pombal,
Ricardo Rei,
José G. C. de Souza,
Pierre Colombo,
André F. T. Martins
Abstract:
Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capabilities, due to overspecialization. In this paper, we provide a closer look at this problem. We start by showing that adapter-based finetuning with LoRA matches the performance of traditional finetuning while reducing the number of training parameters by a factor of 50. This method also outperforms few-shot prompting and eliminates the need for post-processing or in-context examples. However, we show that finetuning generally degrades few-shot performance, hindering adaptation capabilities. Finally, to obtain the best of both worlds, we propose a simple approach that incorporates few-shot examples during finetuning. Experiments on 10 language pairs show that our proposed approach recovers the original few-shot capabilities while keeping the added benefits of finetuning.
Submitted 20 October, 2023;
originally announced October 2023.
-
xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection
Authors:
Nuno M. Guerreiro,
Ricardo Rei,
Daan van Stigt,
Luisa Coheur,
Pierre Colombo,
André F. T. Martins
Abstract:
Widely used learned metrics for machine translation evaluation, such as COMET and BLEURT, estimate the quality of a translation hypothesis by providing a single sentence-level score. As such, they offer little insight into translation errors (e.g., what are the errors and what is their severity). On the other hand, generative large language models (LLMs) are amplifying the adoption of more granular strategies to evaluation, attempting to detail and categorize translation errors. In this work, we introduce xCOMET, an open-source learned metric designed to bridge the gap between these approaches. xCOMET integrates both sentence-level evaluation and error span detection capabilities, exhibiting state-of-the-art performance across all types of evaluation (sentence-level, system-level, and error span detection). Moreover, it does so while highlighting and categorizing error spans, thus enriching the quality assessment. We also provide a robustness analysis with stress tests, and show that xCOMET is largely capable of identifying localized critical errors and hallucinations.
Submitted 16 October, 2023;
originally announced October 2023.
-
Assessment of the CRD approximation for the observer's frame RIII redistribution matrix
Authors:
Simone Riva,
Nuno Guerreiro,
Gioele Janett,
Diego Rossinelli,
Pietro Benedusi,
Rolf Krause,
Luca Belluzzi
Abstract:
Approximated forms of the RII and RIII redistribution matrices are frequently applied to simplify the numerical solution of the radiative transfer problem for polarized radiation, taking partial frequency redistribution (PRD) effects into account. A widely used approximation for RIII is to consider its expression under the assumption of complete frequency redistribution (CRD) in the observer frame (RIII CRD). The adequacy of this approximation for modeling the intensity profiles has been firmly established. By contrast, its suitability for modeling scattering polarization signals has only been analyzed in a few studies, considering simplified settings.
In this work, we aim at quantitatively assessing the impact and the range of validity of the RIII CRD approximation in the modeling of scattering polarization. Methods. We first present an analytic comparison between RIII and RIII CRD. We then compare the results of radiative transfer calculations, out of local thermodynamic equilibrium, performed with RIII and RIII CRD in realistic 1D atmospheric models. We focus on the chromospheric Ca i line at 4227 Å and on the photospheric Sr i line at 4607 Å.
Submitted 12 November, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task
Authors:
Ricardo Rei,
Nuno M. Guerreiro,
José Pombal,
Daan van Stigt,
Marcos Treviso,
Luisa Coheur,
José G. C. de Souza,
André F. T. Martins
Abstract:
We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated in all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks, reaching state-of-the-art performance for quality estimation at word-, span- and sentence-level granularity. Compared to the previous state-of-the-art COMETKIWI-22, we show large improvements in correlation with human judgements (up to 10 Spearman points). Moreover, we surpass the second-best multilingual submission to the shared task by up to 3.8 absolute points.
Submitted 21 September, 2023;
originally announced September 2023.
-
CREST: A Joint Framework for Rationalization and Counterfactual Text Generation
Authors:
Marcos Treviso,
Alexis Ross,
Nuno M. Guerreiro,
André F. T. Martins
Abstract:
Selective rationales and counterfactual examples have emerged as two effective, complementary classes of interpretability methods for analyzing and training NLP models. However, prior work has not explored how these methods can be integrated to combine their complementary advantages. We overcome this limitation by introducing CREST (ContRastive Edits with Sparse raTionalization), a joint framework for selective rationalization and counterfactual text generation, and show that this framework leads to improvements in counterfactual quality, model robustness, and interpretability. First, CREST generates valid counterfactuals that are more natural than those produced by previous methods, and subsequently can be used for data augmentation at scale, reducing the need for human-generated examples. Second, we introduce a new loss function that leverages CREST counterfactuals to regularize selective rationales and show that this regularization improves both model robustness and rationale quality, compared to methods that do not leverage CREST counterfactuals. Our results demonstrate that CREST successfully bridges the gap between selective rationales and counterfactual examples, addressing the limitations of existing methods and providing a more comprehensive view of a model's predictions.
Submitted 26 May, 2023;
originally announced May 2023.
-
The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics
Authors:
Ricardo Rei,
Nuno M. Guerreiro,
Marcos Treviso,
Luisa Coheur,
Alon Lavie,
André F. T. Martins
Abstract:
Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU. Yet, neural metrics are, to a great extent, "black boxes" returning a single sentence-level score without transparency about the decision-making process. In this work, we develop and compare several neural explainability methods and demonstrate their effectiveness for interpreting state-of-the-art fine-tuned neural metrics. Our study reveals that these metrics leverage token-level information that can be directly attributed to translation errors, as assessed through comparison of token-level neural saliency maps with Multidimensional Quality Metrics (MQM) annotations and with synthetically-generated critical translation errors. To ease future research, we release our code at: https://github.com/Unbabel/COMET/tree/explainable-metrics.
Submitted 19 May, 2023;
originally announced May 2023.
-
Unleashing the Power of Sound: Revisiting the Physics of Notations for Modelling with auditory symbols
Authors:
Nuno Guerreiro,
Vasco Amaral,
Miguel Goulão
Abstract:
Sound - the oft-neglected sense for Software Engineering - is a crucial component of our daily lives, playing a vital role in how we interact with the world around us. In this paper, we challenge the traditional boundaries of Software Engineering by proposing a new approach based on sound design for using sound in modelling tools that is on par with visual design. By drawing upon the seminal work of Moody on the 'Physics' of Notations for visual design, we develop a comprehensive catalogue of principles that can guide the design of sound notations.
Using these principles, we develop a catalogue of sounds for UML and report on an empirical study that supports their usefulness. Our study lays the foundation for building more sophisticated sound-based notations. The guidelines for designing symbolic sounds for software models are an essential starting point for a new research thread that could significantly and effectively enable the use of sound in modelling tools.
Submitted 17 April, 2023;
originally announced April 2023.
-
Hallucinations in Large Multilingual Translation Models
Authors:
Nuno M. Guerreiro,
Duarte Alves,
Jonas Waldendorf,
Barry Haddow,
Alexandra Birch,
Pierre Colombo,
André F. T. Martins
Abstract:
Large-scale multilingual machine translation systems have demonstrated remarkable ability to translate directly between numerous languages, making them increasingly appealing for real-world applications. However, when deployed in the wild, these models may generate hallucinated translations which have the potential to severely undermine user trust and raise safety concerns. Existing research on hallucinations has primarily focused on small bilingual models trained on high-resource languages, leaving a gap in our understanding of hallucinations in massively multilingual models across diverse translation scenarios. In this work, we fill this gap by conducting a comprehensive analysis on both the M2M family of conventional neural machine translation models and ChatGPT, a general-purpose large language model (LLM) that can be prompted for translation. Our investigation covers a broad spectrum of conditions, spanning over 100 translation directions across various resource levels and going beyond English-centric language pairs. We provide key insights regarding the prevalence, properties, and mitigation of hallucinations, paving the way towards more responsible and reliable machine translation systems.
Submitted 28 March, 2023;
originally announced March 2023.
-
Optimal Transport for Unsupervised Hallucination Detection in Neural Machine Translation
Authors:
Nuno M. Guerreiro,
Pierre Colombo,
Pablo Piantanida,
André F. T. Martins
Abstract:
Neural machine translation (NMT) has become the de-facto standard in real-world machine translation applications. However, NMT models can unpredictably produce severely pathological translations, known as hallucinations, that seriously undermine user trust. It becomes thus crucial to implement effective preventive strategies to guarantee their proper functioning. In this paper, we address the problem of hallucination detection in NMT by following a simple intuition: as hallucinations are detached from the source content, they exhibit encoder-decoder attention patterns that are statistically different from those of good quality translations. We frame this problem with an optimal transport formulation and propose a fully unsupervised, plug-in detector that can be used with any attention-based NMT model. Experimental results show that our detector not only outperforms all previous model-based detectors, but is also competitive with detectors that employ large models trained on millions of samples.
Submitted 19 May, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task
Authors:
Ricardo Rei,
Marcos Treviso,
Nuno M. Guerreiro,
Chrysoula Zerva,
Ana C. Farinha,
Christine Maroti,
José G. C. de Souza,
Taisiya Glushkova,
Duarte M. Alves,
Alon Lavie,
Luisa Coheur,
André F. T. Martins
Abstract:
We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it with a word-level sequence tagger and an explanation extractor. Our results suggest that incorporating references during pretraining improves performance across several language pairs on downstream tasks, and that jointly training with sentence and word-level objectives yields a further boost. Furthermore, combining attention and gradient information proved to be the top strategy for extracting good explanations of sentence-level QE models. Overall, our submissions achieved the best results for all three tasks for almost all language pairs by a considerable margin.
Submitted 13 September, 2022;
originally announced September 2022.
-
Looking for a Needle in a Haystack: A Comprehensive Study of Hallucinations in Neural Machine Translation
Authors:
Nuno M. Guerreiro,
Elena Voita,
André F. T. Martins
Abstract:
Although the problem of hallucinations in neural machine translation (NMT) has received some attention, research on this highly pathological phenomenon lacks solid ground. Previous work has been limited in several ways: it often resorts to artificial settings where the problem is amplified, it disregards some (common) types of hallucinations, and it does not validate the adequacy of detection heuristics. In this paper, we set foundations for the study of NMT hallucinations. First, we work in a natural setting, i.e., in-domain data without artificial noise in either training or inference. Next, we annotate a dataset of over 3.4k sentences indicating different kinds of critical errors and hallucinations. Then, we turn to detection methods and both revisit methods used previously and propose using glass-box uncertainty-based detectors. Overall, we show that for preventive settings, (i) previously used methods are largely inadequate, (ii) sequence log-probability works best and performs on par with reference-based methods. Finally, we propose DeHallucinator, a simple method for alleviating hallucinations at test time that significantly reduces the hallucinatory rate. To ease future research, we release our annotated dataset for WMT18 German-English data, along with the model, training data, and code.
Submitted 5 March, 2023; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Hanle rotation signatures in Sr I 4607 Å
Authors:
Franziska Zeuner,
Luca Belluzzi,
Nuno Guerreiro,
Renzo Ramelli,
Michele Bianda
Abstract:
Observations of scattering polarization and the Hanle effect in various spectral lines are increasingly used to complement traditional solar magnetic field determination techniques. One of the strongest scattering polarization signals in the photosphere is measured in the Sr I line at 4607.3 Å when observed close to the solar limb. Here, we present the first observational evidence of Hanle rotation in the linearly polarized spectrum of this line at several limb distances. We observed with the Zurich IMaging POLarimeter, ZIMPOL, at the IRSOL observatory, with exceptionally good seeing conditions, allowing for long integration times. We combined the fast modulating polarimeter with a slow modulator installed in front of the telescope. This combination allows the measurement of highly precise and unprecedentedly accurate spectropolarimetric data. Fixing the reference direction for positive Stokes $Q$ parallel to the limb, we detect singly-peaked $U/I$ signals well above the noise level. We can exclude instrumental origin for such $U/I$ signals. These signatures are exclusively found in the Sr I line, but not in the adjoining Fe I line, therefore eliminating the Zeeman effect as the mechanism responsible for their appearance. However, we find a clear spatial correlation between the circular polarization produced by the Zeeman effect and the $U/I$ amplitudes. This suggests that the detected $U/I$ signals are the signatures of Hanle rotation caused by a spatially resolved magnetic field. A novel measurement technique allows for determining the absolute level of polarization with unprecedented precision. Using this technique, high-precision spectropolarimetric observations reveal for the first time unambiguous $U/I$ signals due to Hanle rotation in the Sr I line.
Submitted 6 May, 2022; v1 submitted 17 February, 2022;
originally announced February 2022.
-
Modeling the scattering polarization of the solar Ca i 4227 Å line with angle-dependent partial frequency redistribution
Authors:
Gioele Janett,
Ernest Alsina Ballester,
Nuno Guerreiro,
Simone Riva,
Luca Belluzzi,
Tanausú del Pino Alemán,
Javier Trujillo Bueno
Abstract:
Context. The correct modeling of the scattering polarization signals observed in several strong resonance lines requires taking partial frequency redistribution (PRD) phenomena into account. Aims. This work aims at assessing the impact and the range of validity of the angle-averaged (AA) approximation with respect to the general angle-dependent (AD) treatment of PRD effects in the modeling of scattering polarization in strong resonance lines, with focus on the solar Ca i 4227 Å line. Methods. Spectral line polarization is modeled by solving the radiative transfer problem for polarized radiation, under nonlocal thermodynamic equilibrium conditions, taking PRD effects into account, in static one-dimensional semi-empirical atmospheric models presenting arbitrary magnetic fields. The problem is solved through a two-step approach. In step 1, the problem is solved for intensity only, considering a multi-level atom. In step 2, the problem is solved including polarization, considering a two-level atom with an unpolarized and infinitely sharp lower level, and fixing the lower level population calculated at step 1. Results. The results for the Ca i 4227 Å line show a good agreement between the AA and AD calculations for the Q/I and U/I wing signals. However, AA calculations reveal an artificial trough in the line-core peak of the linear polarization profiles, whereas AD calculations show a sharper peak in agreement with observations. Conclusions. An AD treatment of PRD effects is essential to correctly model the line-core peak of the scattering polarization signal of the Ca i 4227 Å line. By contrast, in the considered static case, the AA approximation seems to be suitable to model the wing scattering polarization lobes and their magnetic sensitivity through magneto-optical effects.
Submitted 22 October, 2021;
originally announced October 2021.
-
SPECTRA: Sparse Structured Text Rationalization
Authors:
Nuno Miguel Guerreiro,
André F. T. Martins
Abstract:
Selective rationalization aims to produce decisions along with rationales (e.g., text highlights or word alignments between two sentences). Commonly, rationales are modeled as stochastic binary masks, requiring sampling-based gradient estimators, which complicates training and requires careful hyperparameter tuning. Sparse attention mechanisms are a deterministic alternative, but they lack a way to regularize the rationale extraction (e.g., to control the sparsity of a text highlight or the number of alignments). In this paper, we present a unified framework for deterministic extraction of structured explanations via constrained inference on a factor graph, forming a differentiable layer. Our approach greatly eases training and rationale regularization, generally outperforming previous work in terms of performance and plausibility of the extracted rationales. We further provide a comparative study of stochastic and deterministic methods for rationale extraction for classification and natural language inference tasks, jointly assessing their predictive power, quality of the explanations, and model variability.
Submitted 9 September, 2021;
originally announced September 2021.
-
Lower solar atmosphere and magnetism at ultra-high spatial resolution
Authors:
Remo Collet,
Serena Criscuoli,
Ilaria Ermolli,
Damian Fabbian,
Nuno Guerreiro,
Margit Haberreiter,
Courtney Peck,
Tiago M. D. Pereira,
Matthias Rempel,
Sami K. Solanki,
Sven Wedemeyer-Boehm
Abstract:
We present the scientific case for a future space-based telescope aimed at very high spatial and temporal resolution imaging of the solar photosphere and chromosphere. Previous missions (e.g., HINODE, SUNRISE) have demonstrated the power of observing the solar photosphere and chromosphere at high spatial resolution without contamination from Earth's atmosphere. We argue here that increased spatial resolution (from currently 70 km to 25 km in the future) and high temporal cadence of the observations will vastly improve our understanding of the physical processes controlling solar magnetism and its characteristic scales. This is particularly important as the Sun's magnetic field drives solar activity and can significantly influence the Sun-Earth system. At the same time a better knowledge of solar magnetism can greatly improve our understanding of other astrophysical objects.
Submitted 7 December, 2016;
originally announced December 2016.
-
Numerical Simulations of Coronal Heating through Footpoint Braiding
Authors:
Viggo Hansteen,
Nuno Guerreiro,
Bart De Pontieu,
Mats Carlsson
Abstract:
Advanced 3D radiative MHD simulations now reproduce many properties of the outer solar atmosphere. When including a domain from the convection zone into the corona, a hot chromosphere and corona are self-consistently maintained. Here we study two realistic models, with different simulated area, magnetic field strength and topology, and numerical resolution. These are compared in order to characterize the heating in the 3D-MHD simulations which self-consistently maintains the structure of the atmosphere. We analyze the heating at both large and small scales and find that heating is episodic and highly structured in space, but occurs along loop-shaped structures, and moves along with the magnetic field. On large scales we find that the heating per particle is maximal near the transition region and that widely distributed opposite-polarity field in the photosphere leads to a greater heating scale height in the corona. On smaller scales, heating is concentrated in current sheets, the thicknesses of which are set by the numerical resolution. Some current sheets fragment in time, this process occurring more readily in the higher-resolution model, leading to spatially highly intermittent heating. The large-scale heating structures are found to fade in less than about five minutes, while the smaller, local, heating shows time scales of the order of 2 minutes in one model and 1 minute in the other, higher-resolution, model.
Submitted 28 August, 2015;
originally announced August 2015.
-
Dark matter from cosmic defects on galactic scales?
Authors:
N. Guerreiro,
P. P. Avelino,
J. P. M. de Carvalho,
C. J. A. P. Martins
Abstract:
We discuss the possible dynamical role of extended cosmic defects on galactic scales, specifically focusing on the possibility that they may provide the dark matter suggested by the classical problem of galactic rotation curves. We emphasize that the more standard defects (such as Goto-Nambu strings) are unsuitable for this task, but show that more general models (such as transonic wiggly strings) could in principle have a better chance. In any case, we show that observational data severely restrict any such scenarios.
Submitted 12 September, 2008; v1 submitted 28 July, 2008;
originally announced July 2008.