-
A LOFAR prompt search for radio emission accompanying X-ray flares in GRB 210112A
Authors:
A. Hennessy,
R. L. C. Starling,
A. Rowlinson,
I. de Ruiter,
A. Kumar,
R. A. J. Eyles-Ferris,
A. K. Ror,
G. E. Anderson,
K. Gourdji,
A. J. van der Horst,
S. B. Pandey,
T. W. Shimwell,
D. Steeghs,
N. Stylianou,
S. ter Veen,
K. Wiersema,
R. A. M. J. Wijers
Abstract:
The composition of relativistic gamma-ray burst (GRB) jets and their emission mechanisms are still debated, and the jets could be matter dominated or magnetically dominated. One way to distinguish these mechanisms arises because a Poynting-flux-dominated jet may produce low-frequency radio emission during the energetic prompt phase, through magnetic reconnection at the shock front. We present a search for radio emission coincident with three GRB X-ray flares with the LOw Frequency ARray (LOFAR), in a 2 hour rapid response mode follow-up of long GRB 210112A (at z~2), where our observations began 511 seconds after the initial Swift-BAT trigger. Using time-sliced imaging at 120-168 MHz, we obtain upper limits at 3 sigma confidence of 42 mJy averaging over 320 second snapshot images, and 87 mJy averaging over 60 second snapshot images. LOFAR's fast response time means that all three potential radio counterparts to X-ray flares are observable after accounting for dispersion at the estimated source redshift. Furthermore, the radio pulse in the magnetic wind model was expected to be detectable at our observing frequency and flux density limits, which allows us to disfavour a region of parameter space for this GRB. However, we note that stricter constraints on redshift and the fraction of energy in the magnetic field are required to further test jet characteristics across the GRB population.
Submitted 19 October, 2023; v1 submitted 30 August, 2023;
originally announced August 2023.
-
The sensitivity of GPz estimates of photo-z posterior PDFs to realistically complex training set imperfections
Authors:
Natalia Stylianou,
Alex I. Malz,
Peter Hatfield,
John Franklin Crenshaw,
Julia Gschwend
Abstract:
The accurate estimation of photometric redshifts is crucial to many upcoming galaxy surveys, for example the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). Almost all Rubin extragalactic and cosmological science requires accurate and precise calculation of photometric redshifts; many diverse approaches to this problem are currently being developed, validated, and tested. In this work, we use the photometric redshift code GPz to examine two realistically complex training set imperfection scenarios for machine-learning-based photometric redshift calculation: i) where the spectroscopic training set has a very different distribution in colour-magnitude space to the test set, and ii) where the effect of emission line confusion causes a fraction of the training spectroscopic sample to not have the true redshift. By evaluating the sensitivity of GPz to a range of increasingly severe imperfections, with a range of metrics (both of photo-z point estimates as well as posterior probability distribution functions, PDFs), we quantify the degree to which predictions get worse with higher degrees of degradation. In particular, we find that there is a substantial drop-off in photo-z quality when line-confusion goes above ~1%, and sample incompleteness below a redshift of 1.5, for an experimental setup using data from the Buzzard Flock synthetic sky catalogues.
Submitted 25 February, 2022;
originally announced February 2022.
-
CoreLM: Coreference-aware Language Model Fine-Tuning
Authors:
Nikolaos Stylianou,
Ioannis Vlahavas
Abstract:
Language Models underpin all modern Natural Language Processing (NLP) tasks. The introduction of the Transformer architecture has contributed significantly to making Language Modeling very effective across many NLP tasks, leading to significant advancements in the field. However, Transformers come with a large computational cost, which grows quadratically with respect to the input length. This presents a challenge, as understanding long texts requires a lot of context. In this paper, we propose a Fine-Tuning framework, named CoreLM, that extends the architecture of current Pretrained Language Models so that they incorporate explicit entity information. By introducing entity representations, we make available information outside the contextual space of the model, which results in a better Language Model for a fraction of the computational cost. We implement our approach using GPT2 and compare the fine-tuned model to the original. Our proposed model achieves a lower Perplexity on the GUMBY and LAMBADA datasets when compared to GPT2 and a fine-tuned version of GPT2 without any changes. We also compare the models' performance in terms of Accuracy on LAMBADA and Children's Book Test, with and without the use of model-created coreference annotations.
Submitted 4 November, 2021;
originally announced November 2021.
-
E.T.: Entity-Transformers. Coreference augmented Neural Language Model for richer mention representations via Entity-Transformer blocks
Authors:
Nikolaos Stylianou,
Ioannis Vlahavas
Abstract:
In the last decade, the field of Neural Language Modelling has witnessed enormous changes, with the development of novel models through the use of Transformer architectures. However, even these models struggle to model long sequences due to memory constraints and increasing computational complexity. Coreference annotations over the training data can provide context far beyond the modelling limitations of such language models. In this paper, we present an extension of the Transformer-block architecture used in neural language models, specifically in GPT2, in order to incorporate entity annotations during training. Our model, GPT2E, extends the Transformer layer architecture of GPT2 to Entity-Transformers, an architecture designed to handle coreference information when present. As a result, we achieve richer representations for entity mentions, at insignificant training cost. We show the comparative model performance between GPT2 and GPT2E in terms of Perplexity on the CoNLL 2012 and LAMBADA datasets, as well as the key differences in the entity representations and their effects in downstream tasks such as Named Entity Recognition. Furthermore, our approach can be adopted by the majority of Transformer-based language models.
Submitted 10 November, 2020;
originally announced November 2020.
-
A Neural Entity Coreference Resolution Review
Authors:
Nikolaos Stylianou,
Ioannis Vlahavas
Abstract:
Entity Coreference Resolution is the task of resolving all mentions in a document that refer to the same real-world entity and is considered one of the most difficult tasks in natural language understanding. It is of great importance for downstream natural language processing tasks such as entity linking, machine translation, summarization, chatbots, etc. This work aims to give a detailed review of current progress on solving Coreference Resolution using neural-based approaches. It also provides a detailed appraisal of the datasets and evaluation metrics in the field, as well as the subtask of Pronoun Resolution, which has seen various improvements in recent years. We highlight the advantages and disadvantages of the approaches, the challenges of the task, and the lack of agreed-upon standards in the task, and propose a way to further expand the boundaries of the field.
Submitted 9 December, 2020; v1 submitted 21 October, 2019;
originally announced October 2019.