ATAT: Astronomical Transformer for time series And Tabular data
Authors:
G. Cabrera-Vives,
D. Moreno-Cartagena,
N. Astorga,
I. Reyes-Jainaga,
F. Förster,
P. Huijse,
J. Arredondo,
A. M. Muñoz Arancibia,
A. Bayo,
M. Catelan,
P. A. Estévez,
P. Sánchez-Sáez,
A. Álvarez,
P. Castellanos,
P. Gallardo,
A. Moya,
D. Rodriguez-Mancini
Abstract:
The advent of next-generation survey instruments, such as the Vera C. Rubin Observatory and its Legacy Survey of Space and Time (LSST), is opening a window for new research in time-domain astronomy. The Extended LSST Astronomical Time-Series Classification Challenge (ELAsTiCC) was created to test the capacity of brokers to deal with a simulated LSST stream. We describe ATAT, the Astronomical Trans…
▽ More
The advent of next-generation survey instruments, such as the Vera C. Rubin Observatory and its Legacy Survey of Space and Time (LSST), is opening a window for new research in time-domain astronomy. The Extended LSST Astronomical Time-Series Classification Challenge (ELAsTiCC) was created to test the capacity of brokers to deal with a simulated LSST stream. We describe ATAT, the Astronomical Transformer for time series And Tabular data, a classification model conceived by the ALeRCE alert broker to classify light-curves from next-generation alert streams. ATAT was tested in production during the first round of the ELAsTiCC campaigns. ATAT consists of two Transformer models that encode light curves and features using novel time modulation and quantile feature tokenizer mechanisms, respectively. ATAT was trained on different combinations of light curves, metadata, and features calculated over the light curves. We compare ATAT against the current ALeRCE classifier, a Balanced Hierarchical Random Forest (BHRF) trained on human-engineered features derived from light curves and metadata. When trained on light curves and metadata, ATAT achieves a macro F1-score of 82.9 +- 0.4 in 20 classes, outperforming the BHRF model trained on 429 features, which achieves a macro F1-score of 79.4 +- 0.1. The use of Transformer multimodal architectures, combining light curves and tabular data, opens new possibilities for classifying alerts from a new generation of large etendue telescopes, such as the Vera C. Rubin Observatory, in real-world brokering scenarios.
△ Less
Submitted 16 May, 2024; v1 submitted 5 May, 2024;
originally announced May 2024.
Multi-scale stamps for real-time classification of alert streams
Authors:
Ignacio Reyes-Jainaga,
Francisco Förster,
Alejandra M. Muñoz Arancibia,
Guillermo Cabrera-Vives,
Amelia Bayo,
Franz E. Bauer,
Javier Arredondo,
Esteban Reyes,
Giuliano Pignata,
A. M. Mourão,
Javier Silva-Farfán,
Lluís Galbany,
Alex Álvarez,
Nicolás Astorga,
Pablo Castellanos,
Pedro Gallardo,
Alberto Moya,
Diego Rodríguez
Abstract:
In recent years, automatic classifiers of image cutouts (also called "stamps") have shown to be key for fast supernova discovery. The Vera C. Rubin Observatory will distribute about ten million alerts with their respective stamps each night, enabling the discovery of approximately one million supernovae each year. A growing source of confusion for these classifiers is the presence of satellite gli…
▽ More
In recent years, automatic classifiers of image cutouts (also called "stamps") have shown to be key for fast supernova discovery. The Vera C. Rubin Observatory will distribute about ten million alerts with their respective stamps each night, enabling the discovery of approximately one million supernovae each year. A growing source of confusion for these classifiers is the presence of satellite glints, sequences of point-like sources produced by rotating satellites or debris. The currently planned Rubin stamps will have a size smaller than the typical separation between these point sources. Thus, a larger field of view stamp could enable the automatic identification of these sources. However, the distribution of larger stamps would be limited by network bandwidth restrictions. We evaluate the impact of using image stamps of different angular sizes and resolutions for the fast classification of events (AGNs, asteroids, bogus, satellites, SNe, and variable stars), using data from the Zwicky Transient Facility. We compare four scenarios: three with the same number of pixels (small field of view with high resolution, large field of view with low resolution, and a multi-scale proposal) and a scenario with the full stamp that has a larger field of view and higher resolution. Compared to small field of view stamps, our multi-scale strategy reduces misclassifications of satellites as asteroids or supernovae, performing on par with high-resolution stamps that are 15 times heavier. We encourage Rubin and its Science Collaborations to consider the benefits of implementing multi-scale stamps as a possible update to the alert specification.
△ Less
Submitted 14 July, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.