Search | arXiv e-print repository

Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving

Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

Abstract: Natural language explanations have become a proxy for evaluating explainable and multi-step Natural Language Inference (NLI) models. However, assessing the validity of explanations for NLI is challenging as it typically involves the crowd-sourcing of apposite datasets, a process that is time-consuming and prone to logical errors. To address existing limitations, this paper investigates the verific… ▽ More Natural language explanations have become a proxy for evaluating explainable and multi-step Natural Language Inference (NLI) models. However, assessing the validity of explanations for NLI is challenging as it typically involves the crowd-sourcing of apposite datasets, a process that is time-consuming and prone to logical errors. To address existing limitations, this paper investigates the verification and refinement of natural language explanations through the integration of Large Language Models (LLMs) and Theorem Provers (TPs). Specifically, we present a neuro-symbolic framework, named Explanation-Refiner, that augments a TP with LLMs to generate and formalise explanatory sentences and suggest potential inference strategies for NLI. In turn, the TP is employed to provide formal guarantees on the logical validity of the explanations and to generate feedback for subsequent improvements. We demonstrate how Explanation-Refiner can be jointly used to evaluate explanatory reasoning, autoformalisation, and error correction mechanisms of state-of-the-art LLMs as well as to automatically enhance the quality of human-annotated explanations of variable complexity in different domains. △ Less

Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.04963 [pdf, other]

SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials

Authors: Mael Jullien, Marco Valentino, André Freitas

Abstract: Large Language Models (LLMs) are at the forefront of NLP achievements but fall short in dealing with shortcut learning, factual inconsistency, and vulnerability to adversarial inputs.These shortcomings are especially critical in medical contexts, where they can misrepresent actual model capabilities. Addressing this, we present SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Cl… ▽ More Large Language Models (LLMs) are at the forefront of NLP achievements but fall short in dealing with shortcut learning, factual inconsistency, and vulnerability to adversarial inputs.These shortcomings are especially critical in medical contexts, where they can misrepresent actual model capabilities. Addressing this, we present SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for ClinicalTrials. Our contributions include the refined NLI4CT-P dataset (i.e., Natural Language Inference for Clinical Trials - Perturbed), designed to challenge LLMs with interventional and causal reasoning tasks, along with a comprehensive evaluation of methods and results for participant submissions. A total of 106 participants registered for the task contributing to over 1200 individual submissions and 25 system overview papers. This initiative aims to advance the robustness and applicability of NLI models in healthcare, ensuring safer and more dependable AI assistance in clinical decision-making. We anticipate that the dataset, models, and outcomes of this task can support future research in the field of biomedical NLI. The dataset, competition leaderboard, and website are publicly available. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2404.02625 [pdf, other]

A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference

Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

Abstract: Integer Linear Programming (ILP) has been proposed as a formalism for encoding precise structural and semantic constraints for Natural Language Inference (NLI). However, traditional ILP frameworks are non-differentiable, posing critical challenges for the integration of continuous language representations based on deep learning. In this paper, we introduce a novel approach, named Diff-Comb Explain… ▽ More Integer Linear Programming (ILP) has been proposed as a formalism for encoding precise structural and semantic constraints for Natural Language Inference (NLI). However, traditional ILP frameworks are non-differentiable, posing critical challenges for the integration of continuous language representations based on deep learning. In this paper, we introduce a novel approach, named Diff-Comb Explainer, a neuro-symbolic architecture for explanation-based NLI based on Differentiable BlackBox Combinatorial Solvers (DBCS). Differently from existing neuro-symbolic solvers, Diff-Comb Explainer does not necessitate a continuous relaxation of the semantic constraints, enabling a direct, more precise, and efficient incorporation of neural representations into the ILP formulation. Our experiments demonstrate that Diff-Comb Explainer achieves superior performance when compared to conventional ILP solvers, neuro-symbolic black-box solvers, and Transformer-based encoders. Moreover, a deeper analysis reveals that Diff-Comb Explainer can significantly improve the precision, consistency, and faithfulness of the constructed explanations, opening new opportunities for research on neuro-symbolic architectures for explainable and transparent NLI in complex domains. △ Less

Submitted 3 April, 2024; originally announced April 2024.

Comments: Accepted to LREC-COLING 2024 - Camera Ready. arXiv admin note: substantial text overlap with arXiv:2208.03339

arXiv:2404.02622 [pdf, other]

Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models

Authors: Julia Rozanova, Marco Valentino, André Freitas

Abstract: Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to investigate specific patterns of reasoning with enough structure and regularity to identify and quantify… ▽ More Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to investigate specific patterns of reasoning with enough structure and regularity to identify and quantify systematic reasoning failures in widely-used models. In this vein, we pick a portion of the NLI task for which an explicit causal diagram can be systematically constructed: the case where across two sentences (the premise and hypothesis), two related words/terms occur in a shared context. In this work, we apply causal effect estimation strategies to measure the effect of context interventions (whose effect on the entailment label is mediated by the semantic monotonicity characteristic) and interventions on the inserted word-pair (whose effect on the entailment label is mediated by the relation between these words). Extending related work on causal analysis of NLP models in different settings, we perform an extensive interventional study on the NLI task to investigate robustness to irrelevant changes and sensitivity to impactful changes of Transformers. The results strongly bolster the fact that similar benchmark accuracy scores may be observed for models that exhibit very different behaviour. Moreover, our methodology reinforces previously suspected biases from a causal perspective, including biases in favour of upward-monotone contexts and ignoring the effects of negation markers. △ Less

Submitted 3 April, 2024; originally announced April 2024.

Comments: Accepted to LREC-COLING 2024 - Camera Ready. arXiv admin note: substantial text overlap with arXiv:2305.08572

arXiv:2402.10767 [pdf, other]

Inference to the Best Explanation in Large Language Models

Authors: Dhairya Dalal, Marco Valentino, André Freitas, Paul Buitelaar

Abstract: While Large Language Models (LLMs) have found success in real-world applications, their underlying explanatory process is still poorly understood. This paper proposes IBE-Eval, a framework inspired by philosophical accounts on Inference to the Best Explanation (IBE) to advance the interpretation and evaluation of LLMs' explanations. IBE-Eval estimates the plausibility of natural language explanati… ▽ More While Large Language Models (LLMs) have found success in real-world applications, their underlying explanatory process is still poorly understood. This paper proposes IBE-Eval, a framework inspired by philosophical accounts on Inference to the Best Explanation (IBE) to advance the interpretation and evaluation of LLMs' explanations. IBE-Eval estimates the plausibility of natural language explanations through a combination of explicit logical and linguistic features including: consistency, parsimony, coherence, and uncertainty. Extensive experiments are conducted on Causal Question Answering (CQA), where \textit{IBE-Eval} is tasked to select the most plausible causal explanation amongst competing ones generated by LLMs (i.e., GPT 3.5 and Llama 2). The experiments reveal that IBE-Eval can successfully identify the best explanation with up to 77\% accuracy ($\approx 27\%$ above random), improving upon a GPT 3.5-as-a-Judge baseline ($\approx+17\%$) while being intrinsically more efficient and interpretable. Additional analyses suggest that, despite model-specific variances, LLM-generated explanations tend to conform to IBE criteria and that IBE-Eval is significantly correlated with human judgment, opening up opportunities for future development of automated explanation verification tools. △ Less

Submitted 16 February, 2024; originally announced February 2024.

ACM Class: I.2.7

arXiv:2402.00745 [pdf, other]

Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement

Authors: Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas

Abstract: An increasing amount of research in Natural Language Inference (NLI) focuses on the application and evaluation of Large Language Models (LLMs) and their reasoning capabilities. Despite their success, however, LLMs are still prone to factual errors and inconsistencies in their explanations, offering limited control and interpretability for inference in complex domains. In this paper, we focus on et… ▽ More An increasing amount of research in Natural Language Inference (NLI) focuses on the application and evaluation of Large Language Models (LLMs) and their reasoning capabilities. Despite their success, however, LLMs are still prone to factual errors and inconsistencies in their explanations, offering limited control and interpretability for inference in complex domains. In this paper, we focus on ethical NLI, investigating how hybrid neuro-symbolic techniques can enhance the logical validity and alignment of ethical explanations produced by LLMs. Specifically, we present an abductive-deductive framework named Logic-Explainer, which integrates LLMs with an external backward-chaining solver to refine step-wise natural language explanations and jointly verify their correctness, reduce incompleteness and minimise redundancy. An extensive empirical analysis demonstrates that Logic-Explainer can improve explanations generated via in-context learning methods and Chain-of-Thought (CoT) on challenging ethical NLI tasks, while, at the same time, producing formal proofs describing and supporting models' reasoning. As ethical NLI requires commonsense reasoning to identify underlying moral violations, our results suggest the effectiveness of neuro-symbolic methods for multi-step NLI more broadly, opening new opportunities to enhance the logical consistency, reliability, and alignment of LLMs. △ Less

Submitted 1 February, 2024; originally announced February 2024.

Comments: Camera-ready for EACL 2024

arXiv:2402.00723 [pdf, other]

Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders

Authors: Yingji Zhang, Danilo S. Carvalho, Marco Valentino, Ian Pratt-Hartmann, Andre Freitas

Abstract: Achieving precise semantic control over the latent spaces of Variational AutoEncoders (VAEs) holds significant value for downstream tasks in NLP as the underlying generative mechanisms could be better localised, explained and improved upon. Recent research, however, has struggled to achieve consistent results, primarily due to the inevitable loss of semantic information in the variational bottlene… ▽ More Achieving precise semantic control over the latent spaces of Variational AutoEncoders (VAEs) holds significant value for downstream tasks in NLP as the underlying generative mechanisms could be better localised, explained and improved upon. Recent research, however, has struggled to achieve consistent results, primarily due to the inevitable loss of semantic information in the variational bottleneck and limited control over the decoding mechanism. To overcome these challenges, we investigate discrete latent spaces in Vector Quantized Variational AutoEncoders (VQVAEs) to improve semantic control and generation in Transformer-based VAEs. In particular, We propose T5VQVAE, a novel model that leverages the controllability of VQVAEs to guide the self-attention mechanism in T5 at the token-level, exploiting its full generalization capabilities. Experimental results indicate that T5VQVAE outperforms existing state-of-the-art VAE models, including Optimus, in terms of controllability and preservation of semantic information across different tasks such as auto-encoding of sentences and mathematical expressions, text transfer, and inference. Moreover, T5VQVAE exhibits improved inference capabilities, suggesting potential applications for downstream natural language and symbolic reasoning tasks. △ Less

Submitted 1 February, 2024; originally announced February 2024.

arXiv:2311.08579 [pdf, other]

Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders

Authors: Yingji Zhang, Marco Valentino, Danilo S. Carvalho, Ian Pratt-Hartmann, André Freitas

Abstract: The injection of syntactic information in Variational AutoEncoders (VAEs) has been shown to result in an overall improvement of performances and generalisation. An effective strategy to achieve such a goal is to separate the encoding of distributional semantic features and syntactic structures into heterogeneous latent spaces via multi-task learning or dual encoder architectures. However, existing… ▽ More The injection of syntactic information in Variational AutoEncoders (VAEs) has been shown to result in an overall improvement of performances and generalisation. An effective strategy to achieve such a goal is to separate the encoding of distributional semantic features and syntactic structures into heterogeneous latent spaces via multi-task learning or dual encoder architectures. However, existing works employing such techniques are limited to LSTM-based VAEs. In this paper, we investigate latent space separation methods for structural syntactic injection in Transformer-based VAE architectures (i.e., Optimus). Specifically, we explore how syntactic structures can be leveraged in the encoding stage through the integration of graph-based and sequential models, and how multiple, specialised latent representations can be injected into the decoder's attention mechanism via low-rank operators. Our empirical evaluation, carried out on natural language sentences and mathematical expressions, reveals that the proposed end-to-end VAE architecture can result in a better overall organisation of the latent space, alleviating the information loss occurring in standard VAE setups, resulting in enhanced performances on language modelling and downstream generation tasks. △ Less

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2311.01230 [pdf, other]

Multi-Operational Mathematical Derivations in Latent Space

Authors: Marco Valentino, Jordan Meadows, Lan Zhang, André Freitas

Abstract: This paper investigates the possibility of approximating multiple mathematical operations in latent space for expression derivation. To this end, we introduce different multi-operational representation paradigms, modelling mathematical operations as explicit geometric transformations. By leveraging a symbolic engine, we construct a large-scale dataset comprising 1.7M derivation steps stemming from… ▽ More This paper investigates the possibility of approximating multiple mathematical operations in latent space for expression derivation. To this end, we introduce different multi-operational representation paradigms, modelling mathematical operations as explicit geometric transformations. By leveraging a symbolic engine, we construct a large-scale dataset comprising 1.7M derivation steps stemming from 61K premises and 6 operators, analysing the properties of each paradigm when instantiated with state-of-the-art neural encoders. Specifically, we investigate how different encoding mechanisms can approximate expression manipulation in latent space, exploring the trade-off between learning different operators and specialising within single operations, as well as the ability to support multi-step derivations and out-of-distribution generalisation. Our empirical analysis reveals that the multi-operational paradigm is crucial for disentangling different operators, while discriminating the conclusions for a single operation is achievable in the original expression encoder. Moreover, we show that architectural choices can heavily affect the training dynamics, structural organisation, and generalisation of the latent space, resulting in significant variations across paradigms and classes of encoders. △ Less

Submitted 3 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: Accepted to NAACL 2024 - Camera Ready

arXiv:2310.03775 [pdf, other]

Hidden Markov Models for Stock Market Prediction

Authors: Luigi Catello, Ludovica Ruggiero, Lucia Schiavone, Mario Valentino

Abstract: The stock market presents a challenging environment for accurately predicting future stock prices due to its intricate and ever-changing nature. However, the utilization of advanced methodologies can significantly enhance the precision of stock price predictions. One such method is Hidden Markov Models (HMMs). HMMs are statistical models that can be used to model the behavior of a partially observ… ▽ More The stock market presents a challenging environment for accurately predicting future stock prices due to its intricate and ever-changing nature. However, the utilization of advanced methodologies can significantly enhance the precision of stock price predictions. One such method is Hidden Markov Models (HMMs). HMMs are statistical models that can be used to model the behavior of a partially observable system, making them suitable for modeling stock prices based on historical data. Accurate stock price predictions can help traders make better investment decisions, leading to increased profits. In this article, we trained and tested a Hidden Markov Model for the purpose of predicting a stock closing price based on its opening price and the preceding day's prices. The model's performance has been evaluated using two indicators: Mean Average Prediction Error (MAPE), which specifies the average accuracy of our model, and Directional Prediction Accuracy (DPA), a newly introduced indicator that accounts for the number of fractional change predictions that are correct in sign. △ Less

Submitted 5 October, 2023; originally announced October 2023.

arXiv:2307.09998 [pdf, other]

Generating Mathematical Derivations with Large Language Models

Authors: Jordan Meadows, Marco Valentino, Andre Freitas

Abstract: The derivation of mathematical results in specialised fields, using Large Language Models (LLMs), is an emerging research direction that can help identify models' limitations, and potentially support mathematical discovery. In this paper, we leverage a symbolic engine to generate derivations of equations at scale, and investigate the capabilities of LLMs when deriving goal equations from premises.… ▽ More The derivation of mathematical results in specialised fields, using Large Language Models (LLMs), is an emerging research direction that can help identify models' limitations, and potentially support mathematical discovery. In this paper, we leverage a symbolic engine to generate derivations of equations at scale, and investigate the capabilities of LLMs when deriving goal equations from premises. Specifically, we employ in-context learning for GPT and fine-tune a range of T5 models to compare the robustness and generalisation of pre-training strategies to specialised models. Empirical results show that fine-tuned FLAN-T5-large (MathT5) outperforms GPT models on all static and out-of-distribution test sets in conventional scores. However, an in-depth analysis reveals that the fine-tuned models are more sensitive to perturbations involving unseen symbols and (to a lesser extent) changes to equation structure. In addition, we analyse 1.7K equations, and over 200 derivations, to highlight common reasoning errors such as the inclusion of incorrect, irrelevant, and redundant equations. Finally, we explore the suitability of existing metrics for evaluating mathematical derivations and find evidence that, while they can capture general properties such as sensitivity to perturbations, they fail to highlight fine-grained reasoning errors and essential differences between models. Overall, this work demonstrates that training models on synthetic data may improve their math capabilities beyond much larger LLMs, but current metrics are not appropriately assessing the quality of generated mathematical text. △ Less

Submitted 8 August, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

Comments: 10 pages

arXiv:2305.12563 [pdf, other]

A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers

Authors: Jordan Meadows, Marco Valentino, Damien Teney, Andre Freitas

Abstract: This paper proposes a methodology for generating and perturbing detailed derivations of equations at scale, aided by a symbolic engine, to evaluate the generalisability of Transformers to out-of-distribution mathematical reasoning problems. Instantiating the framework in the context of sequence classification tasks, we compare the capabilities of GPT-4, GPT-3.5, and a canon of fine-tuned BERT mode… ▽ More This paper proposes a methodology for generating and perturbing detailed derivations of equations at scale, aided by a symbolic engine, to evaluate the generalisability of Transformers to out-of-distribution mathematical reasoning problems. Instantiating the framework in the context of sequence classification tasks, we compare the capabilities of GPT-4, GPT-3.5, and a canon of fine-tuned BERT models, exploring the relationship between specific operators and generalisation failure via the perturbation of reasoning aspects such as symmetry and variable surface forms. Surprisingly, our empirical evaluation reveals that the average in-distribution performance of fine-tuned models surpasses GPT-3.5, and rivals GPT-4. However, perturbations to input reasoning can reduce their performance by up to 80 F1 points. Overall, the results suggest that the in-distribution performance of smaller open-source models may potentially rival GPT by incorporating appropriately structured derivation dependencies during training, and highlight a shared weakness between BERT and GPT involving a relative inability to decode indirect references to mathematical entities. We release the full codebase, constructed datasets, and fine-tuned models to encourage future progress in the field. △ Less

Submitted 8 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

Comments: NAACL 2024

arXiv:2305.08572 [pdf, other]

Estimating the Causal Effects of Natural Logic Features in Neural NLI Models

Authors: Julia Rozanova, Marco Valentino, Andre Freitas

Abstract: Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to zone in on specific patterns of reasoning with enough structure and regularity to be able to identify and… ▽ More Rigorous evaluation of the causal effects of semantic features on language model predictions can be hard to achieve for natural language reasoning problems. However, this is such a desirable form of analysis from both an interpretability and model evaluation perspective, that it is valuable to zone in on specific patterns of reasoning with enough structure and regularity to be able to identify and quantify systematic reasoning failures in widely-used models. In this vein, we pick a portion of the NLI task for which an explicit causal diagram can be systematically constructed: in particular, the case where across two sentences (the premise and hypothesis), two related words/terms occur in a shared context. In this work, we apply causal effect estimation strategies to measure the effect of context interventions (whose effect on the entailment label is mediated by the semantic monotonicity characteristic) and interventions on the inserted word-pair (whose effect on the entailment label is mediated by the relation between these words.). Following related work on causal analysis of NLP models in different settings, we adapt the methodology for the NLI task to construct comparative model profiles in terms of robustness to irrelevant changes and sensitivity to impactful changes. △ Less

Submitted 15 May, 2023; originally announced May 2023.

arXiv:2305.07303 [pdf, other]

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

Authors: Marco Valentino, Danilo S. Carvalho, André Freitas

Abstract: Natural language definitions possess a recursive, self-explanatory semantic structure that can support representation learning methods able to preserve explicit conceptual relations and constraints in the latent space. This paper presents a multi-relational model that explicitly leverages such a structure to derive word embeddings from definitions. By automatically extracting the relations linking… ▽ More Natural language definitions possess a recursive, self-explanatory semantic structure that can support representation learning methods able to preserve explicit conceptual relations and constraints in the latent space. This paper presents a multi-relational model that explicitly leverages such a structure to derive word embeddings from definitions. By automatically extracting the relations linking defined and defining terms from dictionaries, we demonstrate how the problem of learning word embeddings can be formalised via a translational framework in Hyperbolic space and used as a proxy to capture the global semantic structure of definitions. An extensive empirical analysis demonstrates that the framework can help imposing the desired structural constraints while preserving the semantic map** required for controllable and interpretable traversal. Moreover, the experiments reveal the superiority of the Hyperbolic word embeddings over the Euclidean counterparts and demonstrate that the multi-relational approach can obtain competitive results when compared to state-of-the-art neural models, with the advantage of being intrinsically more efficient and interpretable. △ Less

Submitted 16 February, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: Accepted at the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), camera-ready

arXiv:2305.03598 [pdf, other]

NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports

Authors: Maël Jullien, Marco Valentino, Hannah Frost, Paul O'Regan, Donal Landers, André Freitas

Abstract: How can we interpret and retrieve medical evidence to support clinical decisions? Clinical trial reports (CTR) amassed over the years contain indispensable information for the development of personalized medicine. However, it is practically infeasible to manually inspect over 400,000+ clinical trial reports in order to find the best evidence for experimental treatments. Natural Language Inference… ▽ More How can we interpret and retrieve medical evidence to support clinical decisions? Clinical trial reports (CTR) amassed over the years contain indispensable information for the development of personalized medicine. However, it is practically infeasible to manually inspect over 400,000+ clinical trial reports in order to find the best evidence for experimental treatments. Natural Language Inference (NLI) offers a potential solution to this problem, by allowing the scalable computation of textual entailment. However, existing NLI models perform poorly on biomedical corpora, and previously published datasets fail to capture the full complexity of inference over CTRs. In this work, we present a novel resource to advance research on NLI for reasoning on CTRs. The resource includes two main tasks. Firstly, to determine the inference relation between a natural language statement, and a CTR. Secondly, to retrieve supporting facts to justify the predicted relation. We provide NLI4CT, a corpus of 2400 statements and CTRs, annotated for these tasks. Baselines on this corpus expose the limitations of existing NLI models, with 6 state-of-the-art NLI models achieving a maximum F1 score of 0.627. To the best of our knowledge, we are the first to design a task that covers the interpretation of full CTRs. To encourage further work on this challenging dataset, we make the corpus, competition leaderboard, website and code to replicate the baseline experiments available at: https://github.com/ai-systems/nli4ct △ Less

Submitted 28 October, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

Comments: EMNLP 2023 Camera-ready, 15 pages

arXiv:2305.02993 [pdf, other]

SemEval-2023 Task 7: Multi-Evidence Natural Language Inference for Clinical Trial Data

Authors: Maël Jullien, Marco Valentino, Hannah Frost, Paul O'Regan, Donal Landers, André Freitas

Abstract: This paper describes the results of SemEval 2023 task 7 -- Multi-Evidence Natural Language Inference for Clinical Trial Data (NLI4CT) -- consisting of 2 tasks, a Natural Language Inference (NLI) task, and an evidence selection task on clinical trial data. The proposed challenges require multi-hop biomedical and numerical reasoning, which are of significant importance to the development of systems… ▽ More This paper describes the results of SemEval 2023 task 7 -- Multi-Evidence Natural Language Inference for Clinical Trial Data (NLI4CT) -- consisting of 2 tasks, a Natural Language Inference (NLI) task, and an evidence selection task on clinical trial data. The proposed challenges require multi-hop biomedical and numerical reasoning, which are of significant importance to the development of systems capable of large-scale interpretation and retrieval of medical evidence, to provide personalized evidence-based care. Task 1, the entailment task, received 643 submissions from 40 participants, and Task 2, the evidence selection task, received 364 submissions from 23 participants. The tasks are challenging, with the majority of submitted systems failing to significantly outperform the majority class baseline on the entailment task, and we observe significantly better performance on the evidence selection task than on the entailment task. Increasing the number of model parameters leads to a direct increase in performance, far more significant than the effect of biomedical pre-training. Future works could explore the limitations of large models for generalization and numerical inference, and investigate methods to augment clinical datasets to allow for more rigorous testing and to facilitate fine-tuning. We envisage that the dataset, models, and results of this task will be useful to the biomedical NLI and evidence retrieval communities. The dataset, competition leaderboard, and website are publicly available. △ Less

Submitted 11 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

arXiv:2304.10346 [pdf, other]

Interventional Probing in High Dimensions: An NLI Case Study

Authors: Julia Rozanova, Marco Valentino, Lucas Cordeiro, Andre Freitas

Abstract: Probing strategies have been shown to detect the presence of various linguistic features in large language models; in particular, semantic features intermediate to the "natural logic" fragment of the Natural Language Inference task (NLI). In the case of natural logic, the relation between the intermediate features and the entailment label is explicitly known: as such, this provides a ripe setting… ▽ More Probing strategies have been shown to detect the presence of various linguistic features in large language models; in particular, semantic features intermediate to the "natural logic" fragment of the Natural Language Inference task (NLI). In the case of natural logic, the relation between the intermediate features and the entailment label is explicitly known: as such, this provides a ripe setting for interventional studies on the NLI models' representations, allowing for stronger causal conjectures and a deeper critical analysis of interventional probing methods. In this work, we carry out new and existing representation-level interventions to investigate the effect of these semantic features on NLI classification: we perform amnesic probing (which removes features as directed by learned linear probes) and introduce the mnestic probing variation (which forgets all dimensions except the probe-selected ones). Furthermore, we delve into the limitations of these methods and outline some pitfalls have been obscuring the effectivity of interventional probing studies. △ Less

Submitted 20 April, 2023; originally announced April 2023.

arXiv:2208.03339 [pdf, other]

Going Beyond Approximation: Encoding Constraints for Explainable Multi-hop Inference via Differentiable Combinatorial Solvers

Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

Abstract: Integer Linear Programming (ILP) provides a viable mechanism to encode explicit and controllable assumptions about explainable multi-hop inference with natural language. However, an ILP formulation is non-differentiable and cannot be integrated into broader deep learning architectures. Recently, Thayaparan et al. (2021a) proposed a novel methodology to integrate ILP with Transformers to achieve en… ▽ More Integer Linear Programming (ILP) provides a viable mechanism to encode explicit and controllable assumptions about explainable multi-hop inference with natural language. However, an ILP formulation is non-differentiable and cannot be integrated into broader deep learning architectures. Recently, Thayaparan et al. (2021a) proposed a novel methodology to integrate ILP with Transformers to achieve end-to-end differentiability for complex multi-hop inference. While this hybrid framework has been demonstrated to deliver better answer and explanation selection than transformer-based and existing ILP solvers, the neuro-symbolic integration still relies on a convex relaxation of the ILP formulation, which can produce sub-optimal solutions. To improve these limitations, we propose Diff-Comb Explainer, a novel neuro-symbolic architecture based on Differentiable BlackBox Combinatorial solvers (DBCS) (Pogančić et al., 2019). Unlike existing differentiable solvers, the presented model does not require the transformation and relaxation of the explicit semantic constraints, allowing for direct and more efficient integration of ILP formulations. Diff-Comb Explainer demonstrates improved accuracy and explainability over non-differentiable solvers, Transformers and existing differentiable constraint-based multi-hop inference frameworks. △ Less

Submitted 5 August, 2022; originally announced August 2022.

arXiv:2205.01809 [pdf, other]

Scientific Explanation and Natural Language: A Unified Epistemological-Linguistic Perspective for Explainable AI

Authors: Marco Valentino, André Freitas

Abstract: A fundamental research goal for Explainable AI (XAI) is to build models that are capable of reasoning through the generation of natural language explanations. However, the methodologies to design and evaluate explanation-based inference models are still poorly informed by theoretical accounts on the nature of explanation. As an attempt to provide an epistemologically grounded characterisation for… ▽ More A fundamental research goal for Explainable AI (XAI) is to build models that are capable of reasoning through the generation of natural language explanations. However, the methodologies to design and evaluate explanation-based inference models are still poorly informed by theoretical accounts on the nature of explanation. As an attempt to provide an epistemologically grounded characterisation for XAI, this paper focuses on the scientific domain, aiming to bridge the gap between theory and practice on the notion of a scientific explanation. Specifically, the paper combines a detailed survey of the modern accounts of scientific explanation in Philosophy of Science with a systematic analysis of corpora of natural language explanations, clarifying the nature and function of explanatory arguments from both a top-down (categorical) and a bottom-up (corpus-based) perspective. Through a mixture of quantitative and qualitative methodologies, the presented study allows deriving the following main conclusions: (1) Explanations cannot be entirely characterised in terms of inductive or deductive arguments as their main function is to perform unification; (2) An explanation must cite causes and mechanisms that are responsible for the occurrence of the event to be explained; (3) While natural language explanations possess an intrinsic causal-mechanistic nature, they are not limited to causes and mechanisms, also accounting for pragmatic elements such as definitions, properties and taxonomic relations; (4) Patterns of unification naturally emerge in corpora of explanations even if not intentionally modelled; (5) Unification is realised through a process of abstraction, whose function is to provide the inference substrate for subsuming the event to be explained under recurring patterns and high-level regularities. △ Less

Submitted 5 May, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

arXiv:2203.08169 [pdf, other]

doi 10.1117/1.JATIS.8.1.014007

Design and Performance of the Prototype Schwarzschild-Couder Telescope Camera

Authors: Colin B. Adams, Giovanni Ambrosi, Michelangelo Ambrosio, Carla Aramo, Timothy Arlen, Wystan Benbow, Bruna Bertucci, Elisabetta Bissaldi, Jonathan Biteau, Massimiliano Bitossi, Alfonso Boiano, Carmela Bonavolontà, Richard Bose, Aurelien Bouvier, Mario Buscemi, Aryeh Brill, Anthony M. Brown, James H. Buckley, Rodolfo Canestrari, Massimo Capasso, Mirco Caprai, Paolo Coppi, Corbin E. Covault, Davide Depaoli, Leonardo Di Venere , et al. (64 additional authors not shown)

Abstract: The prototype Schwarzschild-Couder Telescope (pSCT) is a candidate for a medium-sized telescope in the Cherenkov Telescope Array. The pSCT is based on a novel dual mirror optics design which reduces the plate scale and allows for the use of silicon photomultipliers as photodetectors. The prototype pSCT camera currently has only the central sector instrumented with 25 camera modules (1600 pixels)… ▽ More The prototype Schwarzschild-Couder Telescope (pSCT) is a candidate for a medium-sized telescope in the Cherenkov Telescope Array. The pSCT is based on a novel dual mirror optics design which reduces the plate scale and allows for the use of silicon photomultipliers as photodetectors. The prototype pSCT camera currently has only the central sector instrumented with 25 camera modules (1600 pixels), providing a 2.68$^{\circ}$ field of view (FoV). The camera electronics are based on custom TARGET (TeV array readout with GSa/s sampling and event trigger) application specific integrated circuits. Field programmable gate arrays sample incoming signals at a gigasample per second. A single backplane provides camera-wide triggers. An upgrade of the pSCT camera is in progress, which will fully populate the focal plane. This will increase the number of pixels to 11,328, the number of backplanes to 9, and the FoV to 8.04$^{\circ}$. Here we give a detailed description of the pSCT camera, including the basic concept, mechanical design, detectors, electronics, current status and first light. △ Less

Submitted 15 March, 2022; originally announced March 2022.

Journal ref: J. Astron. Telesc. Instrum. Syst. 8(1), 014007 (2022)

arXiv:2201.10262 [pdf, other]

Do Transformers Encode a Foundational Ontology? Probing Abstract Classes in Natural Language

Authors: Mael Jullien, Marco Valentino, Andre Freitas

Abstract: With the methodological support of probing (or diagnostic classification), recent studies have demonstrated that Transformers encode syntactic and semantic information to some extent. Following this line of research, this paper aims at taking semantic probing to an abstraction extreme with the goal of answering the following research question: can contemporary Transformer-based models reflect an u… ▽ More With the methodological support of probing (or diagnostic classification), recent studies have demonstrated that Transformers encode syntactic and semantic information to some extent. Following this line of research, this paper aims at taking semantic probing to an abstraction extreme with the goal of answering the following research question: can contemporary Transformer-based models reflect an underlying Foundational Ontology? To this end, we present a systematic Foundational Ontology (FO) probing methodology to investigate whether Transformers-based models encode abstract semantic information. Following different pre-training and fine-tuning regimes, we present an extensive evaluation of a diverse set of large-scale language models over three distinct and complementary FO tagging experiments. Specifically, we present and discuss the following conclusions: (1) The probing results indicate that Transformer-based models incidentally encode information related to Foundational Ontologies during the pre-training pro-cess; (2) Robust FO taggers (accuracy of 90 percent)can be efficiently built leveraging on this knowledge. △ Less

Submitted 25 January, 2022; originally announced January 2022.

arXiv:2112.08289 [pdf, other]

Decomposing Natural Logic Inferences in Neural NLI

Authors: Julia Rozanova, Deborah Ferreira, Marco Valentino, Mokanrarangan Thayaparan, Andre Freitas

Abstract: In the interest of interpreting neural NLI models and their reasoning strategies, we carry out a systematic probing study which investigates whether these models capture the crucial semantic features central to natural logic: monotonicity and concept inclusion. Correctly identifying valid inferences in downward-monotone contexts is a known stumbling block for NLI performance, subsuming linguistic… ▽ More In the interest of interpreting neural NLI models and their reasoning strategies, we carry out a systematic probing study which investigates whether these models capture the crucial semantic features central to natural logic: monotonicity and concept inclusion. Correctly identifying valid inferences in downward-monotone contexts is a known stumbling block for NLI performance, subsuming linguistic phenomena such as negation scope and generalized quantifiers. To understand this difficulty, we emphasize monotonicity as a property of a context and examine the extent to which models capture monotonicity information in the contextual embeddings which are intermediate to their decision making process. Drawing on the recent advancement of the probing paradigm, we compare the presence of monotonicity features across various models. We find that monotonicity information is notably weak in the representations of popular NLI models which achieve high scores on benchmarks, and observe that previous improvements to these models based on fine-tuning strategies have introduced stronger monotonicity features together with their improved performance on challenge sets. △ Less

Submitted 8 November, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

arXiv:2110.07463 [pdf, other]

Prototype Schwarzschild-Couder Telescope for the Cherenkov Telescope Array: Commissioning the Optical System

Authors: C. B. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, P. I. Batista, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolontà, R. Bose, A. Brill, J. H. Buckley, R. A. Cameron, R. Canestrari, M. Capasso, M. Caprai, C. E. Covault, D. Depaoli, L. Di Venere, M. Errando, S. Fegan, Q. Feng, E. Fiandrini , et al. (47 additional authors not shown)

Abstract: A prototype Schwarzschild-Couder Telescope (pSCT) has been constructed at the Fred Lawrence Whipple Observatory as a candidate for the medium-sized telescopes of the Cherenkov Telescope Array Observatory (CTAO). CTAO is currently entering early construction phase of the project and once completed it will vastly improve very high energy gamma-ray detection component in multi-wavelength and multi-me… ▽ More A prototype Schwarzschild-Couder Telescope (pSCT) has been constructed at the Fred Lawrence Whipple Observatory as a candidate for the medium-sized telescopes of the Cherenkov Telescope Array Observatory (CTAO). CTAO is currently entering early construction phase of the project and once completed it will vastly improve very high energy gamma-ray detection component in multi-wavelength and multi-messenger observations due to significantly improved sensitivity, angular resolution and field of view comparing to the current generation of the ground-based gamma-ray observatories H.E.S.S., MAGIC and VERITAS. The pSCT uses a dual aspheric mirror design with a $9.7$ m primary mirror and $5.4$ m secondary mirror, both of which are segmented. The Schwarzschild-Couder (SC) optical system (OS) selected for the prototype telescope achieves wide field of view of $8$ degrees and simultaneously reduces the focal plane plate scale allowing an unprecedented compact ($0.78$m diameter) implementation of the high-resolution camera ($6$mm/ $0.067$deg per imaging pixel with $11,328$ pixels) based on the silicon photo-multipliers (SiPMs). The OS of the telescope is designed to eliminate spherical and comatic aberrations and minimize astigmatism to radically improve off-axis imaging and consequently angular resolution across all the field of view with respect to the conventional single-mirror telescopes. Fast and high imaging resolution OS of the pSCT comes with the challenging submillimeter-precision custom alignment system, which was successfully demonstrated with an on-axis point spread function (PSF) of $2.9$ arcmin prior to the first-light detection of the Crab Nebula in 2020. Ongoing and future commissioning activities are reported. △ Less

Submitted 14 October, 2021; originally announced October 2021.

Journal ref: Proceedings of Science, PoS(ICRC2021)717

arXiv:2109.06225 [pdf, other]

doi 10.22323/1.395.0830

Detection of the Crab Nebula by the prototype Schwarzschild-Couder Telescope

Authors: C. B. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, P. I. Batista, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolontà, R. Bose, A. Brill, A. M. Brown, J. H. Buckley, R. A. Cameron, R. Canestrari, M. Capasso, M. Caprai, C. E. Covault, D. Depaoli, L. Di Venere, M. Errando, S. Fegan, Q. Feng , et al. (49 additional authors not shown)

Abstract: The Schwarzschild-Couder Telescope (SCT) is a medium-sized telescope technology proposed for the Cherenkov Telescope Array. It uses a novel dual-mirror optical design that removes comatic aberrations across its entire field of view. The SCT camera employs high-resolution silicon photomultiplier (SiPM) sensors with a pixel size of 4 arcminutes. A prototype SCT (pSCT) has been constructed at the Fre… ▽ More The Schwarzschild-Couder Telescope (SCT) is a medium-sized telescope technology proposed for the Cherenkov Telescope Array. It uses a novel dual-mirror optical design that removes comatic aberrations across its entire field of view. The SCT camera employs high-resolution silicon photomultiplier (SiPM) sensors with a pixel size of 4 arcminutes. A prototype SCT (pSCT) has been constructed at the Fred Lawrence Whipple Observatory in Arizona, USA. An observing campaign in 2020, with a partial camera of 1600 pixels (2.7 degrees by 2.7 degrees field of view) resulted in detection of the Crab Nebula at 8.6 sigma statistical significance. Work on the pSCT camera and optical system is ongoing to improve performance and prepare for an upcoming camera upgrade. The pSCT camera upgrade will replace the current camera modules with improved SiPMs and readout electronics and will expand the camera to its full design field of view of 8 degrees in diameter (11,328 pixels). The fully upgraded pSCT will enable next-generation very-high-energy gamma-ray astrophysics through excellent background rejection and angular resolution. In this presentation we describe first results from the successful operation of the pSCT and future plans. △ Less

Submitted 13 September, 2021; originally announced September 2021.

Comments: 9 pages, 3 figures, 2 tables, contribution to ICRC 2021, similar to 10.1016/j.astropartphys.2021.102562 (arXiv:2012.08448)

Journal ref: Proceedings of Science, PoS(ICRC2021)830

arXiv:2109.05127 [pdf, other]

doi 10.22323/1.395.0748

Design and performance of the prototype Schwarzschild-Couder Telescope camera

Authors: C. B. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, P. I. Batista, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolonta, R. Bose, A. Brill, A. M. Brown, J. H. Buckley, R. A. Cameron, M. Capasso, M. Caprai, C. E. Covault, D. Depaoli, L. Di Venere, M. Errando, S. Fegan, Q. Feng, E. Fiandrini , et al. (49 additional authors not shown)

Abstract: The Cherenkov Telescope Array (CTA) is the next-generation ground-based observatory for very-high-energy gamma-ray astronomy. An innovative 9.7 m aperture, dual-mirror Schwarzschild-Couder Telescope (SCT) design is a candidate design for CTA Medium-Sized Telescopes. A prototype SCT (pSCT) has been constructed at the Fred Lawrence Whipple Observatory in Arizona, USA. Its camera is currently partial… ▽ More The Cherenkov Telescope Array (CTA) is the next-generation ground-based observatory for very-high-energy gamma-ray astronomy. An innovative 9.7 m aperture, dual-mirror Schwarzschild-Couder Telescope (SCT) design is a candidate design for CTA Medium-Sized Telescopes. A prototype SCT (pSCT) has been constructed at the Fred Lawrence Whipple Observatory in Arizona, USA. Its camera is currently partially instrumented with 1600 pixels covering a field of view of 2.7 degrees square. The small plate scale of the optical system allows densely packed silicon photomultipliers to be used, which combined with high-density trigger and waveform readout electronics enable the high-resolution camera. The camera's electronics are capable of imaging air shower development at a rate of one billion samples per second. We describe the commissioning and performance of the pSCT camera, including trigger and waveform readout performance, calibration, and absolute GPS time stam**. We also present the upgrade to the camera, which is currently underway. The upgrade will fully populate the focal plane, increasing the field of view to 8 degree diameter, and lower the front-end electronics noise, enabling a lower trigger threshold and improved reconstruction and background rejection. △ Less

Submitted 10 September, 2021; originally announced September 2021.

Comments: 8 pages, 5 figures, Proceedings of the 37th International Cosmic Ray Conference (ICRC 2021), Berlin, Germany

arXiv:2107.11879 [pdf, other]

Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration

Authors: Marco Valentino, Mokanarangan Thayaparan, Deborah Ferreira, André Freitas

Abstract: Regenerating natural language explanations in the scientific domain has been proposed as a benchmark to evaluate complex multi-hop and explainable inference. In this context, large language models can achieve state-of-the-art performance when employed as cross-encoder architectures and fine-tuned on human-annotated explanations. However, while much attention has been devoted to the quality of the… ▽ More Regenerating natural language explanations in the scientific domain has been proposed as a benchmark to evaluate complex multi-hop and explainable inference. In this context, large language models can achieve state-of-the-art performance when employed as cross-encoder architectures and fine-tuned on human-annotated explanations. However, while much attention has been devoted to the quality of the explanations, the problem of performing inference efficiently is largely under-studied. Cross-encoders, in fact, are intrinsically not scalable, possessing limited applicability to real-world scenarios that require inference on massive facts banks. To enable complex multi-hop reasoning at scale, this paper focuses on bi-encoder architectures, investigating the problem of scientific explanation regeneration at the intersection of dense and sparse models. Specifically, we present SCAR (for Scalable Autoregressive Inference), a hybrid framework that iteratively combines a Transformer-based bi-encoder with a sparse model of explanatory power, designed to leverage explicit inference patterns in the explanations. Our experiments demonstrate that the hybrid framework significantly outperforms previous sparse models, achieving performance comparable with that of state-of-the-art cross-encoders while being approx 50 times faster and scalable to corpora of millions of facts. Further analyses on semantic drift and multi-hop question answering reveal that the proposed hybridisation boosts the quality of the most challenging explanations, contributing to improved performance on downstream inference tasks. △ Less

Submitted 6 December, 2021; v1 submitted 25 July, 2021; originally announced July 2021.

Comments: To appear at the 36th AAAI Conference on Artificial Intelligence (AAAI-22)

arXiv:2105.08008 [pdf, other]

Supporting Context Monotonicity Abstractions in Neural NLI Models

Authors: Julia Rozanova, Deborah Ferreira, Mokanarangan Thayaparan, Marco Valentino, André Freitas

Abstract: Natural language contexts display logical regularities with respect to substitutions of related concepts: these are captured in a functional order-theoretic property called monotonicity. For a certain class of NLI problems where the resulting entailment label depends only on the context monotonicity and the relation between the substituted concepts, we build on previous techniques that aim to impr… ▽ More Natural language contexts display logical regularities with respect to substitutions of related concepts: these are captured in a functional order-theoretic property called monotonicity. For a certain class of NLI problems where the resulting entailment label depends only on the context monotonicity and the relation between the substituted concepts, we build on previous techniques that aim to improve the performance of NLI models for these problems, as consistent performance across both upward and downward monotone contexts still seems difficult to attain even for state-of-the-art models. To this end, we reframe the problem of context monotonicity classification to make it compatible with transformer-based pre-trained NLI models and add this task to the training pipeline. Furthermore, we introduce a sound and complete simplified monotonicity logic formalism which describes our treatment of contexts as abstract units. Using the notions in our formalism, we adapt targeted challenge sets to investigate whether an intermediate context monotonicity classification task can aid NLI models' performance on examples exhibiting monotonicity reasoning. △ Less

Submitted 17 May, 2021; originally announced May 2021.

Comments: NALOMA'21 (NAtural LOgic Meets MAchine Learning) @IWCS 2021

arXiv:2105.05737 [pdf, other]

Encoding Explanatory Knowledge for Zero-shot Science Question Answering

Authors: Zili Zhou, Marco Valentino, Donal Landers, Andre Freitas

Abstract: This paper describes N-XKT (Neural encoding based on eXplanatory Knowledge Transfer), a novel method for the automatic transfer of explanatory knowledge through neural encoding mechanisms. We demonstrate that N-XKT is able to improve accuracy and generalization on science Question Answering (QA). Specifically, by leveraging facts from background explanatory knowledge corpora, the N-XKT model shows… ▽ More This paper describes N-XKT (Neural encoding based on eXplanatory Knowledge Transfer), a novel method for the automatic transfer of explanatory knowledge through neural encoding mechanisms. We demonstrate that N-XKT is able to improve accuracy and generalization on science Question Answering (QA). Specifically, by leveraging facts from background explanatory knowledge corpora, the N-XKT model shows a clear improvement on zero-shot QA. Furthermore, we show that N-XKT can be fine-tuned on a target QA dataset, enabling faster convergence and more accurate results. A systematic analysis is conducted to quantitatively analyze the performance of the N-XKT model and the impact of different categories of knowledge on the zero-shot generalization task. △ Less

Submitted 19 May, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

arXiv:2105.03417 [pdf, other]

Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-hop Inference

Authors: Mokanarangan Thayaparan, Marco Valentino, Deborah Ferreira, Julia Rozanova, André Freitas

Abstract: This paper presents Diff-Explainer, the first hybrid framework for explainable multi-hop inference that integrates explicit constraints with neural architectures through differentiable convex optimization. Specifically, Diff-Explainer allows for the fine-tuning of neural representations within a constrained optimization framework to answer and explain multi-hop questions in natural language. To de… ▽ More This paper presents Diff-Explainer, the first hybrid framework for explainable multi-hop inference that integrates explicit constraints with neural architectures through differentiable convex optimization. Specifically, Diff-Explainer allows for the fine-tuning of neural representations within a constrained optimization framework to answer and explain multi-hop questions in natural language. To demonstrate the efficacy of the hybrid framework, we combine existing ILP-based solvers for multi-hop Question Answering (QA) with Transformer-based representations. An extensive empirical evaluation on scientific and commonsense QA tasks demonstrates that the integration of explicit constraints in an end-to-end differentiable framework can significantly improve the performance of non-differentiable ILP solvers (8.91% - 13.3%). Moreover, additional analysis reveals that Diff-Explainer is able to achieve strong performance when compared to standalone Transformers and previous multi-hop approaches while still providing structured explanations in support of its predictions. △ Less

Submitted 22 June, 2022; v1 submitted 7 May, 2021; originally announced May 2021.

arXiv:2105.01974 [pdf, other]

Do Natural Language Explanations Represent Valid Logical Arguments? Verifying Entailment in Explainable NLI Gold Standards

Authors: Marco Valentino, Ian Pratt-Hartmann, André Freitas

Abstract: An emerging line of research in Explainable NLP is the creation of datasets enriched with human-annotated explanations and rationales, used to build and evaluate models with step-wise inference and explanation generation capabilities. While human-annotated explanations are used as ground-truth for the inference, there is a lack of systematic assessment of their consistency and rigour. In an attemp… ▽ More An emerging line of research in Explainable NLP is the creation of datasets enriched with human-annotated explanations and rationales, used to build and evaluate models with step-wise inference and explanation generation capabilities. While human-annotated explanations are used as ground-truth for the inference, there is a lack of systematic assessment of their consistency and rigour. In an attempt to provide a critical quality assessment of Explanation Gold Standards (XGSs) for NLI, we propose a systematic annotation methodology, named Explanation Entailment Verification (EEV), to quantify the logical validity of human-annotated explanations. The application of EEV on three mainstream datasets reveals the surprising conclusion that a majority of the explanations, while appearing coherent on the surface, represent logically invalid arguments, ranging from being incomplete to containing clearly identifiable logical errors. This conclusion confirms that the inferential properties of explanations are still poorly formalised and understood, and that additional work on this line of research is necessary to improve the way Explanation Gold Standards are constructed. △ Less

Submitted 15 May, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

Comments: To appear in IWCS 2021 proceedings

arXiv:2104.05807 [pdf, other]

Does My Representation Capture X? Probe-Ably

Authors: Deborah Ferreira, Julia Rozanova, Mokanarangan Thayaparan, Marco Valentino, André Freitas

Abstract: Probing (or diagnostic classification) has become a popular strategy for investigating whether a given set of intermediate features is present in the representations of neural models. Probing studies may have misleading results, but various recent works have suggested more reliable methodologies that compensate for the possible pitfalls of probing. However, these best practices are numerous and fa… ▽ More Probing (or diagnostic classification) has become a popular strategy for investigating whether a given set of intermediate features is present in the representations of neural models. Probing studies may have misleading results, but various recent works have suggested more reliable methodologies that compensate for the possible pitfalls of probing. However, these best practices are numerous and fast-evolving. To simplify the process of running a set of probing experiments in line with suggested methodologies, we introduce Probe-Ably: an extendable probing framework which supports and automates the application of probing methods to the user's inputs. △ Less

Submitted 30 September, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

Comments: ACL 2021 (System Demonstrations)

arXiv:2012.08480 [pdf, other]

Atkin-Lehner theory for Drinfeld modular forms and applications

Authors: Maria Valentino

Abstract: The present paper deals with Atkin-Lehner theory for Drinfeld modular forms. We provide an equivalent definition of $\mathfrak{p}$-newforms (which makes computations easier) and commutativity results between Hecke operators and Atkin-Lehner involutions. As applications we show a criterion for a direct sum decomposition of cusp forms, we exibit $\mathfrak{p}$-newforms arising from lower levels and… ▽ More The present paper deals with Atkin-Lehner theory for Drinfeld modular forms. We provide an equivalent definition of $\mathfrak{p}$-newforms (which makes computations easier) and commutativity results between Hecke operators and Atkin-Lehner involutions. As applications we show a criterion for a direct sum decomposition of cusp forms, we exibit $\mathfrak{p}$-newforms arising from lower levels and we provide $\mathfrak{p}$-adic Drinfeld modular forms of level greater than 1. △ Less

Submitted 15 December, 2020; originally announced December 2020.

MSC Class: 11F52; 11F25

arXiv:2012.08448 [pdf, other]

doi 10.1016/j.astropartphys.2021.102562

Detection of the Crab Nebula with the 9.7 m Prototype Schwarzschild-Couder Telescope

Authors: C. B. Adams, R. Alfaro, G. Ambrosi, M. Ambrosio, C. Aramo, T. Arlen, P. I. Batista, W. Benbow, B. Bertucci, E. Bissaldi, J. Biteau, M. Bitossi, A. Boiano, C. Bonavolontà, R. Bose, A. Bouvier, A. Brill, A. M. Brown, J. H. Buckley, K. Byrum, R. A. Cameron, R. Canestrari, M. Capasso, M. Caprai, C. E. Covault , et al. (83 additional authors not shown)

Abstract: The Schwarzschild-Couder Telescope (SCT) is a telescope concept proposed for the Cherenkov Telescope Array. It employs a dual-mirror optical design to remove comatic aberrations over an $8^{\circ}$ field of view, and a high-density silicon photomultiplier camera (with a pixel resolution of 4 arcmin) to record Cherenkov emission from cosmic ray and gamma-ray initiated particle cascades in the atmos… ▽ More The Schwarzschild-Couder Telescope (SCT) is a telescope concept proposed for the Cherenkov Telescope Array. It employs a dual-mirror optical design to remove comatic aberrations over an $8^{\circ}$ field of view, and a high-density silicon photomultiplier camera (with a pixel resolution of 4 arcmin) to record Cherenkov emission from cosmic ray and gamma-ray initiated particle cascades in the atmosphere. The prototype SCT (pSCT), comprising a 9.7 m diameter primary mirror and a partially instrumented camera with 1536 pixels, has been constructed at the Fred Lawrence Whipple Observatory. The telescope was inaugurated in January 2019, with commissioning continuing throughout 2019. We describe the first campaign of observations with the pSCT, conducted in January and February of 2020, and demonstrate the detection of gamma-ray emission from the Crab Nebula with a statistical significance of $8.6σ$. △ Less

Submitted 15 December, 2020; originally announced December 2020.

Comments: 13 pages, 12 figures, 3 tables, submitted to Astroparticle Physics

arXiv:2012.05935 [pdf, other]

doi 10.3847/1538-4357/abce66

The Fundamental Plane of Massive Quiescent Galaxies at z~2

Authors: Mikkel Stockmann, Inger Jørgensen, Sune Toft, Christopher J. Conselice, Andreas Faisst, Berta Margalef-Bentabol, Anna Gallazzi, Stefano Zibetti, Gabriel B. Brammer, Carlos Gómez-Guijarro, Michaela Hirschmann, Claudia D. Lagos, Francesco M. Valentino, Johannes Zabl

Abstract: We examine the Fundamental Plane (FP) and mass-to-light ratio ($M/L$) scaling relations using the largest sample of massive quiescent galaxies at $1.5<z<2.5$ to date. The FP ($r_{e}, σ_{e}, I_{e}$) is established using $19$ $UVJ$ quiescent galaxies from COSMOS with $Hubble$ $Space$ $Telescope$ $(HST)$ $H_{F160W}$ rest-frame optical sizes and X-shooter absorption line measured stellar velocity disp… ▽ More We examine the Fundamental Plane (FP) and mass-to-light ratio ($M/L$) scaling relations using the largest sample of massive quiescent galaxies at $1.5<z<2.5$ to date. The FP ($r_{e}, σ_{e}, I_{e}$) is established using $19$ $UVJ$ quiescent galaxies from COSMOS with $Hubble$ $Space$ $Telescope$ $(HST)$ $H_{F160W}$ rest-frame optical sizes and X-shooter absorption line measured stellar velocity dispersions. For a very massive, ${\rm{log}}(M_{\ast}/M_{\odot})>11.26$, subset of 8 quiescent galaxies at $z>2$, from Stockmann et al. (2020), we show that they cannot passively evolve to the local Coma cluster relation alone and must undergo significant structural evolution to mimic the sizes of local massive galaxies. The evolution of the FP and $M/L$ scaling relations, from $z=2$ to present-day, for this subset are consistent with passive aging of the stellar population and minor merger structural evolution into the most massive galaxies in the Coma cluster and other massive elliptical galaxies from the MASSIVE Survey. Modeling the luminosity evolution from minor merger added stellar populations favors a history of merging with "dry" quiescent galaxies. △ Less

Submitted 10 December, 2020; originally announced December 2020.

Comments: 17 pages, 6 figures

arXiv:2010.13128 [pdf, other]

ExplanationLP: Abductive Reasoning for Explainable Science Question Answering

Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

Abstract: We propose a novel approach for answering and explaining multiple-choice science questions by reasoning on grounding and abstract inference chains. This paper frames question answering as an abductive reasoning problem, constructing plausible explanations for each choice and then selecting the candidate with the best explanation as the final answer. Our system, ExplanationLP, elicits explanations… ▽ More We propose a novel approach for answering and explaining multiple-choice science questions by reasoning on grounding and abstract inference chains. This paper frames question answering as an abductive reasoning problem, constructing plausible explanations for each choice and then selecting the candidate with the best explanation as the final answer. Our system, ExplanationLP, elicits explanations by constructing a weighted graph of relevant facts for each candidate answer and extracting the facts that satisfy certain structural and semantic constraints. To extract the explanations, we employ a linear programming formalism designed to select the optimal subgraph. The graphs' weighting function is composed of a set of parameters, which we fine-tune to optimize answer selection performance. We carry out our experiments on the WorldTree and ARC-Challenge corpus to empirically demonstrate the following conclusions: (1) Grounding-Abstract inference chains provides the semantic control to perform explainable abductive reasoning (2) Efficiency and robustness in learning with a fewer number of parameters by outperforming contemporary explainable and transformer-based approaches in a similar setting (3) Generalisability by outperforming SOTA explainable approaches on general science question sets. △ Less

Submitted 25 October, 2020; originally announced October 2020.

arXiv:2010.13027 [pdf, other]

doi 10.1117/12.2568134

Verification of the Optical System of the 9.7-m Prototype Schwarzschild-Couder Telescope

Authors: C. Adams, R. Alfaro, G. Ambrosi, M. Ambrosio, C. Aramo, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolontà, R. Bose, A. Brill, J. H. Buckley, K. Byrum, R. A. Cameron, M. Capasso, M. Caprai, C. E. Covault, L. Di Venere, S. Fegan, Q. Feng, E. Fiandrini, A. Furniss, M. Garczarczyk , et al. (55 additional authors not shown)

Abstract: For the first time in the history of ground-based $γ$-ray astronomy, the on-axis performance of the dual mirror, aspheric, aplanatic Schwarzschild-Couder optical system has been demonstrated in a $9.7$-m aperture imaging atmospheric Cherenkov telescope. The novel design of the prototype Schwarzschild-Couder Telescope (pSCT) is motivated by the need of the next-generation Cherenkov Telescope Array… ▽ More For the first time in the history of ground-based $γ$-ray astronomy, the on-axis performance of the dual mirror, aspheric, aplanatic Schwarzschild-Couder optical system has been demonstrated in a $9.7$-m aperture imaging atmospheric Cherenkov telescope. The novel design of the prototype Schwarzschild-Couder Telescope (pSCT) is motivated by the need of the next-generation Cherenkov Telescope Array (CTA) observatory to have the ability to perform wide ($\geq 8^{\circ}$) field-of-view observations simultaneously with superior imaging of atmospheric cascades (resolution of $0.067^{\circ}$ per pixel or better). The pSCT design, if implemented in the CTA installation, has the potential to improve significantly both the $γ$-ray angular resolution and the off-axis sensitivity of the observatory, reaching nearly the theoretical limit of the technique and thereby making a major impact on the CTA observatory sky survey programs, follow-up observations of multi-messenger transients with poorly known initial localization, as well as on the spatially resolved spectroscopic studies of extended $γ$-ray sources. This contribution reports on the initial alignment procedures and point-spread-function results for the challenging segmented aspheric primary and secondary mirrors of the pSCT. △ Less

Submitted 25 October, 2020; originally announced October 2020.

Comments: 19 pages, 11 figures, proceedings for SPIE Optical Engineering + Applications, 2020, Online Only

arXiv:2010.01349 [pdf, other]

doi 10.1088/1475-7516/2021/02/048

Sensitivity of the Cherenkov Telescope Array for probing cosmology and fundamental physics with gamma-ray propagation

Authors: The Cherenkov Telescope Array Consortium, :, H. Abdalla, H. Abe, F. Acero, A. Acharyya, R. Adam, I. Agudo, A. Aguirre-Santaella, R. Alfaro, J. Alfaro, C. Alispach, R. Aloisio, R. Alves B, L. Amati, E. Amato, G. Ambrosi, E. O. Angüner, A. Araudo, T. Armstrong, F. Arqueros, L. Arrabito, K. Asano, Y. Ascasíbar, M. Ashley , et al. (474 additional authors not shown)

Abstract: The Cherenkov Telescope Array (CTA), the new-generation ground-based observatory for $γ$-ray astronomy, provides unique capabilities to address significant open questions in astrophysics, cosmology, and fundamental physics. We study some of the salient areas of $γ$-ray cosmology that can be explored as part of the Key Science Projects of CTA, through simulated observations of active galactic nucle… ▽ More The Cherenkov Telescope Array (CTA), the new-generation ground-based observatory for $γ$-ray astronomy, provides unique capabilities to address significant open questions in astrophysics, cosmology, and fundamental physics. We study some of the salient areas of $γ$-ray cosmology that can be explored as part of the Key Science Projects of CTA, through simulated observations of active galactic nuclei (AGN) and of their relativistic jets. Observations of AGN with CTA will enable a measurement of $γ$-ray absorption on the extragalactic background light with a statistical uncertainty below 15% up to a redshift $z=2$ and to constrain or detect $γ$-ray halos up to intergalactic-magnetic-field strengths of at least 0.3pG. Extragalactic observations with CTA also show promising potential to probe physics beyond the Standard Model. The best limits on Lorentz invariance violation from $γ$-ray astronomy will be improved by a factor of at least two to three. CTA will also probe the parameter space in which axion-like particles could constitute a significant fraction, if not all, of dark matter. We conclude on the synergies between CTA and other upcoming facilities that will foster the growth of $γ$-ray cosmology. △ Less

Submitted 26 February, 2021; v1 submitted 3 October, 2020; originally announced October 2020.

Comments: 71 pages (including affiliations and references), 13 figures, 6 tables. Accepted in JCAP; matches published version. Corresponding authors: Jonathan Biteau, Julien Lefaucheur, Humberto Martinez-Huerta, Manuel Meyer, Santiago Pita, Ievgen Vovk

Journal ref: JCAP 02 (2021) 048

arXiv:2010.00389 [pdf, other]

A Survey on Explainability in Machine Reading Comprehension

Authors: Mokanarangan Thayaparan, Marco Valentino, André Freitas

Abstract: This paper presents a systematic review of benchmarks and approaches for explainability in Machine Reading Comprehension (MRC). We present how the representation and inference challenges evolved and the steps which were taken to tackle these challenges. We also present the evaluation methodologies to assess the performance of explainable systems. In addition, we identify persisting open research q… ▽ More This paper presents a systematic review of benchmarks and approaches for explainability in Machine Reading Comprehension (MRC). We present how the representation and inference challenges evolved and the steps which were taken to tackle these challenges. We also present the evaluation methodologies to assess the performance of explainable systems. In addition, we identify persisting open research questions and highlight critical directions for future work. △ Less

Submitted 1 October, 2020; originally announced October 2020.

arXiv:2009.14539 [pdf, other]

Case-Based Abductive Natural Language Inference

Authors: Marco Valentino, Mokanarangan Thayaparan, André Freitas

Abstract: Most of the contemporary approaches for multi-hop Natural Language Inference (NLI) construct explanations considering each test case in isolation. However, this paradigm is known to suffer from semantic drift, a phenomenon that causes the construction of spurious explanations leading to wrong conclusions. In contrast, this paper proposes an abductive framework for multi-hop NLI exploring the retri… ▽ More Most of the contemporary approaches for multi-hop Natural Language Inference (NLI) construct explanations considering each test case in isolation. However, this paradigm is known to suffer from semantic drift, a phenomenon that causes the construction of spurious explanations leading to wrong conclusions. In contrast, this paper proposes an abductive framework for multi-hop NLI exploring the retrieve-reuse-refine paradigm in Case-Based Reasoning (CBR). Specifically, we present Case-Based Abductive Natural Language Inference (CB-ANLI), a model that addresses unseen inference problems by analogical transfer of prior explanations from similar examples. We empirically evaluate the abductive framework on commonsense and scientific question answering tasks, demonstrating that CB-ANLI can be effectively integrated with sparse and dense pre-trained encoders to improve multi-hop inference, or adopted as an evidence retriever for Transformers. Moreover, an empirical analysis of semantic drift reveals that the CBR paradigm boosts the quality of the most challenging explanations, a feature that has a direct impact on robustness and accuracy in downstream inference tasks. △ Less

Submitted 10 September, 2022; v1 submitted 30 September, 2020; originally announced September 2020.

Comments: Accepted to the 29th International Conference on Computational Linguistics (COLING 2022) - Camera-ready

arXiv:2004.00061 [pdf, other]

Unification-based Reconstruction of Multi-hop Explanations for Science Questions

Authors: Marco Valentino, Mokanarangan Thayaparan, André Freitas

Abstract: This paper presents a novel framework for reconstructing multi-hop explanations in science Question Answering (QA). While existing approaches for multi-hop reasoning build explanations considering each question in isolation, we propose a method to leverage explanatory patterns emerging in a corpus of scientific explanations. Specifically, the framework ranks a set of atomic facts by integrating le… ▽ More This paper presents a novel framework for reconstructing multi-hop explanations in science Question Answering (QA). While existing approaches for multi-hop reasoning build explanations considering each question in isolation, we propose a method to leverage explanatory patterns emerging in a corpus of scientific explanations. Specifically, the framework ranks a set of atomic facts by integrating lexical relevance with the notion of unification power, estimated analysing explanations for similar questions in the corpus. An extensive evaluation is performed on the Worldtree corpus, integrating k-NN clustering and Information Retrieval (IR) techniques. We present the following conclusions: (1) The proposed method achieves results competitive with Transformers, yet being orders of magnitude faster, a feature that makes it scalable to large explanatory corpora (2) The unification-based mechanism has a key role in reducing semantic drift, contributing to the reconstruction of many hops explanations (6 or more facts) and the ranking of complex inference facts (+12.0 Mean Average Precision) (3) Crucially, the constructed explanations can support downstream QA models, improving the accuracy of BERT by up to 10% overall. △ Less

Submitted 10 February, 2021; v1 submitted 31 March, 2020; originally announced April 2020.

Comments: Accepted at EACL 2021

arXiv:2003.04642 [pdf, ps, other]

A Framework for Evaluation of Machine Reading Comprehension Gold Standards

Authors: Viktor Schlegel, Marco Valentino, André Freitas, Goran Nenadic, Riza Batista-Navarro

Abstract: Machine Reading Comprehension (MRC) is the task of answering a question over a paragraph of text. While neural MRC systems gain popularity and achieve noticeable performance, issues are being raised with the methodology used to establish their performance, particularly concerning the data design of gold standards that are used to evaluate them. There is but a limited understanding of the challenge… ▽ More Machine Reading Comprehension (MRC) is the task of answering a question over a paragraph of text. While neural MRC systems gain popularity and achieve noticeable performance, issues are being raised with the methodology used to establish their performance, particularly concerning the data design of gold standards that are used to evaluate them. There is but a limited understanding of the challenges present in this data, which makes it hard to draw comparisons and formulate reliable hypotheses. As a first step towards alleviating the problem, this paper proposes a unifying framework to systematically investigate the present linguistic features, required reasoning and background knowledge and factual correctness on one hand, and the presence of lexical cues as a lower bound for the requirement of understanding on the other hand. We propose a qualitative annotation schema for the first and a set of approximative metrics for the latter. In a first application of the framework, we analyse modern MRC gold standards and present our findings: the absence of features that contribute towards lexical ambiguity, the varying factual correctness of the expected answers and the presence of lexical cues, all of which potentially lower the reading comprehension complexity and quality of the evaluation data. △ Less

Submitted 10 March, 2020; originally announced March 2020.

Comments: In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020)

arXiv:1912.01619 [pdf, other]

doi 10.3847/1538-4357/ab5af4

X-shooter Spectroscopy and HST Imaging of 15 Ultra Massive Quiescent Galaxies at $z\gtrsim2$

Authors: Mikkel Stockmann, Sune Toft, Anna Gallazzi, Stefano Zibetti, Christopher J. Conselice, Berta Margalef-Bentabol, Johannes Zabl, Inger Jørgensen, Georgios E. Magdis, Carlos Gómez-Guijarro, Francesco M. Valentino, Gabriel B. Brammer, Daniel Ceverino, Isabella Cortzen, Iary Davidzon, Richardo Demarco, Andreas Faisst, Michaela Hirschmann, Jens-Kristian Krogager, Claudia D. Lagos, Allison W. S. Man, Carl J. Mundy, Yingjie Peng, Jonatan Selsing, Charles L. Steinhardt , et al. (1 additional authors not shown)

Abstract: We present a detailed analysis of a large sample of spectroscopically confirmed ultra-massive quiescent galaxies (${\rm{log}}(M_{\ast}/M_{\odot})\sim11.5$) at $z\gtrsim2$. This sample comprises 15 galaxies selected in the COSMOS and UDS fields by their bright K-band magnitudes and followed up with VLT/X-shooter spectroscopy and HST/WFC3 $H_{F160W}$ imaging. These observations allow us to unambiguo… ▽ More We present a detailed analysis of a large sample of spectroscopically confirmed ultra-massive quiescent galaxies (${\rm{log}}(M_{\ast}/M_{\odot})\sim11.5$) at $z\gtrsim2$. This sample comprises 15 galaxies selected in the COSMOS and UDS fields by their bright K-band magnitudes and followed up with VLT/X-shooter spectroscopy and HST/WFC3 $H_{F160W}$ imaging. These observations allow us to unambiguously confirm their redshifts ascertain their quiescent nature and stellar ages, and to reliably assess their internal kinematics and effective radii. We find that these galaxies are compact, consistent with the high mass end of the mass-size relation for quiescent galaxies at $z=2$. Moreover, the distribution of the measured stellar velocity dispersions of the sample is consistent with the most massive local early-type galaxies from the MASSIVE Survey showing that evolution in these galaxies, is dominated by changes in size. The HST images reveal, as surprisingly high, that $40\ \%$ of the sample have tidal features suggestive of mergers and companions in close proximity, including three galaxies experiencing ongoing major mergers. The absence of velocity dispersion evolution from $z=2$ to $0$, coupled with a doubling of the stellar mass, with a factor of four size increase and the observed disturbed stellar morphologies support dry minor mergers as the primary drivers of the evolution of the massive quiescent galaxies over the last 10 billion years. △ Less

Submitted 3 December, 2019; originally announced December 2019.

Comments: 30 pages, 10 figures, accepted in ApJ

arXiv:1910.00290 [pdf, other]

Identifying Supporting Facts for Multi-hop Question Answering with Document Graph Networks

Authors: Mokanarangan Thayaparan, Marco Valentino, Viktor Schlegel, Andre Freitas

Abstract: Recent advances in reading comprehension have resulted in models that surpass human performance when the answer is contained in a single, continuous passage of text. However, complex Question Answering (QA) typically requires multi-hop reasoning - i.e. the integration of supporting facts from different sources, to infer the correct answer. This paper proposes Document Graph Network (DGN), a messag… ▽ More Recent advances in reading comprehension have resulted in models that surpass human performance when the answer is contained in a single, continuous passage of text. However, complex Question Answering (QA) typically requires multi-hop reasoning - i.e. the integration of supporting facts from different sources, to infer the correct answer. This paper proposes Document Graph Network (DGN), a message passing architecture for the identification of supporting facts over a graph-structured representation of text. The evaluation on HotpotQA shows that DGN obtains competitive results when compared to a reading comprehension baseline operating on raw text, confirming the relevance of structured representations for supporting multi-hop reasoning. △ Less

Submitted 1 October, 2019; originally announced October 2019.

arXiv:1910.00133 [pdf, other]

Camera design and performance of the prototype Schwarzschild-Couder Telescope for the Cherenkov Telescope Array

Authors: C. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolonta, R. Bose, A. Brill, J. H. Buckley, M. Caprai, L. Di Venere, Q. Feng, E. Fiandrini, N. Giglietto, F. Giordano, O. Hervet, G. Hughes, T. B. Humensky, M. Ionica, W. **, P. Kaaret , et al. (27 additional authors not shown)

Abstract: The Schwarzschild-Couder Telescope (SCT) is a candidate technology for a medium-sized telescope within the Cherenkov Telescope Array, the next generation ground based observatory for very high energy gamma ray astronomy. The SCT uses a novel two-mirror design and is expected to yield improvements in field of view and image resolution compared to traditional Cherenkov telescopes based on single-mir… ▽ More The Schwarzschild-Couder Telescope (SCT) is a candidate technology for a medium-sized telescope within the Cherenkov Telescope Array, the next generation ground based observatory for very high energy gamma ray astronomy. The SCT uses a novel two-mirror design and is expected to yield improvements in field of view and image resolution compared to traditional Cherenkov telescopes based on single-mirror-dish optics. To match the improved optical resolution, challenging requirements of high channel count and density at low power consumption must be overcome by the camera. The prototype camera, currently commissioned and tested on the prototype SCT, has been developed based on millimeter scale SiPM pixels and a custom high density digitizer ASIC, TARGET, to provide 1600 pixels spanning a 2.7 degree field of view while being able to sample nanosecond photon pulses. It is mechanically designed to allow for an upgrade to 11,328 pixels covering a field of view of 8 degrees and demonstrating the full potential of the technology. The camera was installed on the telescope in 2018. We will present its design and performance including first light data. △ Less

Submitted 30 September, 2019; originally announced October 2019.

Comments: ICRC 2019 Proceeding

arXiv:1909.11403 [pdf, other]

Prototype Schwarzschild-Couder Telescope for the Cherenkov Telescope Array: Commissioning Status of the Optical System

Authors: C. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolontà, R. Bose, A. Brill, J. H. Buckley, M. Caprai, C. E. Covault, L. Di Venere, S. Fegan, Q. Feng, E. Fiandrini, A. Gent, N. Giglietto, F. Giordano, R. Halliday, O. Hervet, G. Hughes , et al. (34 additional authors not shown)

Abstract: The Cherenkov Telescope Array (CTA), with more than 100 telescopes, will be the largest ever ground-based gamma-ray observatory and is expected to greatly improve on both gamma-ray detection sensitivity and energy coverage compared to current-generation detectors. The 9.7-m Schwarzschild-Couder telescope (SCT) is one of the two candidates for the medium size telescope (MST) design for CTA. The nov… ▽ More The Cherenkov Telescope Array (CTA), with more than 100 telescopes, will be the largest ever ground-based gamma-ray observatory and is expected to greatly improve on both gamma-ray detection sensitivity and energy coverage compared to current-generation detectors. The 9.7-m Schwarzschild-Couder telescope (SCT) is one of the two candidates for the medium size telescope (MST) design for CTA. The novel aplanatic dual-mirror SCT design offers a wide field-of-view with a compact plate scale, allowing for a large number of camera pixels that improves the angular resolution and reduce the night sky background noise per pixel compared to the traditional single-mirror Davies-Cotton (DC) design of ground-based gamma-ray telescopes. The production, installation, and the alignment of the segmented aspherical mirrors are the main challenges for the realization of the SCT optical system. In this contribution, we report on the commissioning status, the alignment procedures, and initial alignment results during the initial commissioning phase of the optical system of the prototype SCT. △ Less

Submitted 25 September, 2019; originally announced September 2019.

Comments: 8 pages, PoS proceedings 36th ICRC 2019 Madison

arXiv:1909.08361 [pdf, other]

Development and operations of INFN optical modules for the SCT Telescope camera proposed for the Cherenkov Telescope Array Observatory

Authors: C. Adams, G. Ambrosi, M. Ambrosio, C. Aramo, W. Benbow, B. Bertucci, E. Bissaldi, M. Bitossi, A. Boiano, C. Bonavolontà, R. Bose, A. Brill, J. H. Buckley, M. Caprai, C. E. Covault, L. Di Venere, Q. Feng, E. Fiandrini, A. Gent, N. Giglietto, F. Giordano, R. Halliday, O. Hervet, G. Hughes, T. B. Humensky , et al. (32 additional authors not shown)

Abstract: The Schwarzschild-Couder Telescope (SCT) is a proposal for the Medium Size Telescopes of the Cherenkov Telescope Array. Its concept is based on a two-mirror optical system designed to improve the telescope field of view and image resolution with respect to the single mirror Davies-Cotton solution. The SCT camera is planned to be instrumented with 177 photodetection modules, each composed of 64 Sil… ▽ More The Schwarzschild-Couder Telescope (SCT) is a proposal for the Medium Size Telescopes of the Cherenkov Telescope Array. Its concept is based on a two-mirror optical system designed to improve the telescope field of view and image resolution with respect to the single mirror Davies-Cotton solution. The SCT camera is planned to be instrumented with 177 photodetection modules, each composed of 64 Silicon Photomultiplier (SiPM) pixels. The third generation of $6 x 6~mm^2$ high density NUV SiPMs (NUV-HD3) produced by Fondazione Bruno Kessler (FBK) in collaboration with INFN has been used to equip optical units to be integrated on the upgrade of the camera of the SCT prototype (pSCT). Each optical unit is composed of an array of 16 NUV-HD3 SiPMs coupled with the front-end electronics, which is designed for full-waveform nanosecond readout and digitization using the TARGET-7 ASIC. Several optical units have been assembled and tested in the laboratories of INFN and have been integrated on the camera of the pSCT telescope, that is currently operating at the Fred Lawrence Whipple Observatory. In this contribution we report on the development, assembly and calibration of the optical units that are currently taking data on the pSCT camera. △ Less

Submitted 18 September, 2019; originally announced September 2019.

Comments: 8 pages, proceeding ICRC

arXiv:1908.09768 [pdf, other]

On Drinfeld cusp forms of prime level

Authors: Andrea Bandini, Maria Valentino

Abstract: Let $(P_d)$ be any prime of $\mathbb{F}_q[t]$ of degree $d$ and consider the space of Drinfeld cusp forms of level $P_d$, i.e. for the modular group $Γ_0(P_d)$. We provide a definition for oldforms and newforms of level $P_d$. Moreover, when the dimension of the vector space of oldforms is one and $P_1=t$ we prove that the space of cuspforms of level $t$ is the direct sum of oldforms and newforms… ▽ More Let $(P_d)$ be any prime of $\mathbb{F}_q[t]$ of degree $d$ and consider the space of Drinfeld cusp forms of level $P_d$, i.e. for the modular group $Γ_0(P_d)$. We provide a definition for oldforms and newforms of level $P_d$. Moreover, when the dimension of the vector space of oldforms is one and $P_1=t$ we prove that the space of cuspforms of level $t$ is the direct sum of oldforms and newforms and that the Hecke operator $\mathbf{T}_t$ acting on Drinfeld cusp forms of level 1 is injective, thus providing more evidence for the conjectures presented and stated in [2] and [3]. △ Less

Submitted 26 August, 2019; originally announced August 2019.

arXiv:1904.01426 [pdf, other]

doi 10.1016/j.astropartphys.2019.04.001

Monte Carlo studies for the optimisation of the Cherenkov Telescope Array layout

Authors: A. Acharyya, I. Agudo, E. O. Angüner, R. Alfaro, J. Alfaro, C. Alispach, R. Aloisio, R. Alves Batista, J. -P. Amans, L. Amati, E. Amato, G. Ambrosi, L. A. Antonelli, C. Aramo, T. Armstrong, F. Arqueros, L. Arrabito, K. Asano, H. Ashkar, C. Balazs, M. Balbo, B. Balmaverde, P. Barai, A. Barbano, M. Barkov , et al. (445 additional authors not shown)

Abstract: The Cherenkov Telescope Array (CTA) is the major next-generation observatory for ground-based very-high-energy gamma-ray astronomy. It will improve the sensitivity of current ground-based instruments by a factor of five to twenty, depending on the energy, greatly improving both their angular and energy resolutions over four decades in energy (from 20 GeV to 300 TeV). This achievement will be possi… ▽ More The Cherenkov Telescope Array (CTA) is the major next-generation observatory for ground-based very-high-energy gamma-ray astronomy. It will improve the sensitivity of current ground-based instruments by a factor of five to twenty, depending on the energy, greatly improving both their angular and energy resolutions over four decades in energy (from 20 GeV to 300 TeV). This achievement will be possible by using tens of imaging Cherenkov telescopes of three successive sizes. They will be arranged into two arrays, one per hemisphere, located on the La Palma island (Spain) and in Paranal (Chile). We present here the optimised and final telescope arrays for both CTA sites, as well as their foreseen performance, resulting from the analysis of three different large-scale Monte Carlo productions. △ Less

Submitted 2 April, 2019; originally announced April 2019.

Comments: 48 pages, 16 figures, accepted for publication in Astroparticle Physics

arXiv:1812.02032 [pdf, other]

doi 10.1080/10586458.2019.1671921

On the structure and slopes of Drinfeld cusp forms

Authors: Andrea Bandini, Maria Valentino

Abstract: We define oldforms and newforms for Drinfeld cusp forms of level $t$ and conjecture that their direct sum is the whole space of cusp forms. Moreover we describe explicitly the matrix $U$ associated to the action of the Atkin operator $\mathbf{U}_t$ on cusp forms of level $t$ and use it to compute tables of slopes of eigenforms. Building on such data, we formulate conjectures on bounds for slopes,… ▽ More We define oldforms and newforms for Drinfeld cusp forms of level $t$ and conjecture that their direct sum is the whole space of cusp forms. Moreover we describe explicitly the matrix $U$ associated to the action of the Atkin operator $\mathbf{U}_t$ on cusp forms of level $t$ and use it to compute tables of slopes of eigenforms. Building on such data, we formulate conjectures on bounds for slopes, on the diagonalizability of $\mathbf{U}_t$ and on various other issues. Via the explicit form of the matrix $U$ we are then able to verify our conjectures in various cases (mainly in small weights). △ Less

Submitted 21 September, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

Comments: Final version, to appear in Exp. Math

MSC Class: 11F52 (Primary); 15B99 (Secondary)

arXiv:1807.04350 [pdf, other]

doi 10.3847/2041-8213/aad33e

Near-infrared emission lines in starburst galaxies at 0.5 < z < 0.9 : Discovery of a merger sequence of extreme obscurations

Authors: Antonello Calabrò, Emanuele Daddi, Paolo Cassata, Masato Onodera, Raphael Gobat, Annagrazia Puglisi, Shuowen **, Daizhong Liu, Ricardo Amorín, Nobuo Arimoto, Médéric Boquien, Rosamaria Carraro, David Elbaz, Eduardo Ibar, Stéphanie Juneau, Filippo Mannucci, Hugo Méndez Hernánez, Ernesto Oliva, Giulia Rodighiero, Francesco M. Valentino, Anita Zanella

Abstract: We obtained optical/near-IR rest-frame Magellan FIRE spectra (including Pa$β$ and Pa$γ$) of 25 starburst galaxies at 0.5<z<0.9, with average star formation rates (SFR) x7 above the Main Sequence (MS). We find that Paschen-to-Balmer line ratios saturate around a constant value corresponding to $A_{\rm V}\sim$2-3 mag, while line to IR luminosity ratios suggest a large range of more extreme obscurati… ▽ More We obtained optical/near-IR rest-frame Magellan FIRE spectra (including Pa$β$ and Pa$γ$) of 25 starburst galaxies at 0.5<z<0.9, with average star formation rates (SFR) x7 above the Main Sequence (MS). We find that Paschen-to-Balmer line ratios saturate around a constant value corresponding to $A_{\rm V}\sim$2-3 mag, while line to IR luminosity ratios suggest a large range of more extreme obscurations and appear to be uncorrelated to the former. This behavior is not consistent with standard attenuation laws derived for local and distant galaxies, while being remarkably consistent with observations of starburst cores in which young stars and dust are homogeneously mixed. This model implies $A_{\rm V}=$2-30 mag attenuation to the center of starburst cores, with a median of ~9 mag (a factor of 4000). X-ray hardness ratios for 6 AGNs in our sample and column densities derived from observed dust masses and radio sizes independently confirm this level of attenuation. In these conditions observed optical/near-IR emission comes from surface regions, while inner starburst cores are invisible. We thus attribute the high [NII]/H$α$ ratios to widespread shocks from accretion, turbulence and dynamic disturbances rather than to AGNs. The large range of optical depths demonstrates that substantial diversity is present within the starburst population, possibly connected to different merger phases or progenitor properties. The majority of our targets are, in fact, morphologically classified as mergers. We argue that the extreme obscuration provides in itself smoking gun evidence of their merger origin, and a powerful tool for identifying mergers at even higher redshifts. △ Less

Submitted 11 July, 2018; originally announced July 2018.

Comments: ApJ Letters in press; the key result is in Figure 4 (left)

Showing 1–50 of 64 results for author: Valentino, M