-
Efficient multi-prompt evaluation of LLMs
Authors:
Felipe Maia Polo,
Ronald Xu,
Lucas Weber,
Mírian Silva,
Onkar Bhardwaj,
Leshem Choshen,
Allysson Flavio Melo de Oliveira,
Yuekai Sun,
Mikhail Yurochkin
Abstract:
Most popular benchmarks for comparing LLMs rely on a limited set of prompt templates, which may not fully capture the LLMs' abilities and can affect the reproducibility of results on leaderboards. Many recent works empirically verify prompt sensitivity and advocate for changes in LLM evaluation. In this paper, we consider the problem of estimating the performance distribution across many prompt va…
▽ More
Most popular benchmarks for comparing LLMs rely on a limited set of prompt templates, which may not fully capture the LLMs' abilities and can affect the reproducibility of results on leaderboards. Many recent works empirically verify prompt sensitivity and advocate for changes in LLM evaluation. In this paper, we consider the problem of estimating the performance distribution across many prompt variants instead of finding a single prompt to evaluate with. We introduce PromptEval, a method for estimating performance across a large set of prompts borrowing strength across prompts and examples to produce accurate estimates under practical evaluation budgets. The resulting distribution can be used to obtain performance quantiles to construct various robust performance metrics (e.g., top 95% quantile or median). We prove that PromptEval consistently estimates the performance distribution and demonstrate its efficacy empirically on three prominent LLM benchmarks: MMLU, BIG-bench Hard, and LMentry. For example, PromptEval can accurately estimate performance quantiles across 100 prompt templates on MMLU with a budget equivalent to two single-prompt evaluations. Our code and data can be found at https://github.com/felipemaiapolo/prompt-eval.
△ Less
Submitted 7 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
A statistical framework for weak-to-strong generalization
Authors:
Seamus Somerstep,
Felipe Maia Polo,
Moulinath Banerjee,
Ya'acov Ritov,
Mikhail Yurochkin,
Yuekai Sun
Abstract:
Modern large language model (LLM) alignment techniques rely on human feedback, but it is unclear whether the techniques fundamentally limit the capabilities of aligned LLMs. In particular, it is unclear whether it is possible to align (stronger) LLMs with superhuman capabilities with (weaker) human feedback without degrading their capabilities. This is an instance of the weak-to-strong generalizat…
▽ More
Modern large language model (LLM) alignment techniques rely on human feedback, but it is unclear whether the techniques fundamentally limit the capabilities of aligned LLMs. In particular, it is unclear whether it is possible to align (stronger) LLMs with superhuman capabilities with (weaker) human feedback without degrading their capabilities. This is an instance of the weak-to-strong generalization problem: using weaker (less capable) feedback to train a stronger (more capable) model. We prove that weak-to-strong generalization is possible by eliciting latent knowledge from pre-trained LLMs. In particular, we cast the weak-to-strong generalization problem as a transfer learning problem in which we wish to transfer a latent concept from a weak model to a strong pre-trained model. We prove that a naive fine-tuning approach suffers from fundamental limitations, but an alternative refinement-based approach suggested by the problem structure provably overcomes the limitations of fine-tuning. Finally, we demonstrate the practical applicability of the refinement approach with three LLM alignment tasks.
△ Less
Submitted 25 May, 2024;
originally announced May 2024.
-
Discovery of a dormant 33 solar-mass black hole in pre-release Gaia astrometry
Authors:
Gaia Collaboration,
P. Panuzzo,
T. Mazeh,
F. Arenou,
B. Holl,
E. Caffau,
A. Jorissen,
C. Babusiaux,
P. Gavras,
J. Sahlmann,
U. Bastian,
Ł. Wyrzykowski,
L. Eyer,
N. Leclerc,
N. Bauchet,
A. Bombrun,
N. Mowlavi,
G. M. Seabroke,
D. Teyssier,
E. Balbinot,
A. Helmi,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne
, et al. (390 additional authors not shown)
Abstract:
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is exp…
▽ More
Gravitational waves from black-hole merging events have revealed a population of extra-galactic BHs residing in short-period binaries with masses that are higher than expected based on most stellar evolution models - and also higher than known stellar-origin black holes in our Galaxy. It has been proposed that those high-mass BHs are the remnants of massive metal-poor stars. Gaia astrometry is expected to uncover many Galactic wide-binary systems containing dormant BHs, which may not have been detected before. The study of this population will provide new information on the BH-mass distribution in binaries and shed light on their formation mechanisms and progenitors. As part of the validation efforts in preparation for the fourth Gaia data release (DR4), we analysed the preliminary astrometric binary solutions, obtained by the Gaia Non-Single Star pipeline, to verify their significance and to minimise false-detection rates in high-mass-function orbital solutions. The astrometric binary solution of one source, Gaia BH3, implies the presence of a 32.70 \pm 0.82 M\odot BH in a binary system with a period of 11.6 yr. Gaia radial velocities independently validate the astrometric orbit. Broad-band photometric and spectroscopic data show that the visible component is an old, very metal-poor giant of the Galactic halo, at a distance of 590 pc. The BH in the Gaia BH3 system is more massive than any other Galactic stellar-origin BH known thus far. The low metallicity of the star companion supports the scenario that metal-poor massive stars are progenitors of the high-mass BHs detected by gravitational-wave telescopes. The Galactic orbit of the system and its metallicity indicate that it might belong to the Sequoia halo substructure. Alternatively, and more plausibly, it could belong to the ED-2 stream, which likely originated from a globular cluster that had been disrupted by the Milky Way.
△ Less
Submitted 19 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
tinyBenchmarks: evaluating LLMs with fewer examples
Authors:
Felipe Maia Polo,
Lucas Weber,
Leshem Choshen,
Yuekai Sun,
Gongjun Xu,
Mikhail Yurochkin
Abstract:
The versatility of large language models (LLMs) led to the creation of diverse benchmarks that thoroughly test a variety of language models' abilities. These benchmarks consist of tens of thousands of examples making evaluation of LLMs very expensive. In this paper, we investigate strategies to reduce the number of evaluations needed to assess the performance of an LLM on several key benchmarks. F…
▽ More
The versatility of large language models (LLMs) led to the creation of diverse benchmarks that thoroughly test a variety of language models' abilities. These benchmarks consist of tens of thousands of examples making evaluation of LLMs very expensive. In this paper, we investigate strategies to reduce the number of evaluations needed to assess the performance of an LLM on several key benchmarks. For example, we show that to accurately estimate the performance of an LLM on MMLU, a popular multiple-choice QA benchmark consisting of 14K examples, it is sufficient to evaluate this LLM on 100 curated examples. We release evaluation tools and tiny versions of popular benchmarks: Open LLM Leaderboard, MMLU, HELM, and AlpacaEval 2.0. Our empirical analysis demonstrates that these tools and tiny benchmarks are sufficient to reliably and efficiently reproduce the original evaluation results.
△ Less
Submitted 26 May, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
Performance and first measurements of the MAGIC Stellar Intensity Interferometer
Authors:
MAGIC Collaboration,
S. Abe,
J. Abhir,
V. A. Acciari,
A. Aguasca-Cabot,
I. Agudo,
T. Aniello,
S. Ansoldi,
L. A. Antonelli,
A. Arbet Engels,
C. Arcaro,
M. Artero,
K. Asano,
A. Babić,
A. Baquero,
U. Barres de Almeida,
J. A. Barrio,
I. Batković,
A. Bautista,
J. Baxter,
J. Becerra González,
E. Bernardini,
M. Bernardos,
J. Bernete,
A. Berti
, et al. (195 additional authors not shown)
Abstract:
In recent years, a new generation of optical intensity interferometers has emerged, leveraging the existing infrastructure of Imaging Atmospheric Cherenkov Telescopes (IACTs). The MAGIC telescopes host the MAGIC-SII system (Stellar Intensity Interferometer), implemented to investigate the feasibility and potential of this technique on IACTs. After the first successful measurements in 2019, the sys…
▽ More
In recent years, a new generation of optical intensity interferometers has emerged, leveraging the existing infrastructure of Imaging Atmospheric Cherenkov Telescopes (IACTs). The MAGIC telescopes host the MAGIC-SII system (Stellar Intensity Interferometer), implemented to investigate the feasibility and potential of this technique on IACTs. After the first successful measurements in 2019, the system was upgraded and now features a real-time, dead-time-free, 4-channel, GPU-based correlator. These hardware modifications allow seamless transitions between MAGIC's standard very-high-energy gamma-ray observations and optical interferometry measurements within seconds. We establish the feasibility and potential of employing IACTs as competitive optical Intensity Interferometers with minimal hardware adjustments. The measurement of a total of 22 stellar diameters are reported, 9 corresponding to reference stars with previous comparable measurements, and 13 with no prior measurements. A prospective implementation involving telescopes from the forthcoming Cherenkov Telescope Array Observatory's northern hemisphere array, such as the first prototype of its Large-Sized Telescopes, LST-1, is technically viable. This integration would significantly enhance the sensitivity of the current system and broaden the UV-plane coverage. This advancement would enable the system to achieve competitive sensitivity with the current generation of long-baseline optical interferometers over blue wavelengths.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Estimating Fréchet bounds for validating programmatic weak supervision
Authors:
Felipe Maia Polo,
Mikhail Yurochkin,
Moulinath Banerjee,
Subha Maity,
Yuekai Sun
Abstract:
We develop methods for estimating Fréchet bounds on (possibly high-dimensional) distribution classes in which some variables are continuous-valued. We establish the statistical correctness of the computed bounds under uncertainty in the marginal constraints and demonstrate the usefulness of our algorithms by evaluating the performance of machine learning (ML) models trained with programmatic weak…
▽ More
We develop methods for estimating Fréchet bounds on (possibly high-dimensional) distribution classes in which some variables are continuous-valued. We establish the statistical correctness of the computed bounds under uncertainty in the marginal constraints and demonstrate the usefulness of our algorithms by evaluating the performance of machine learning (ML) models trained with programmatic weak supervision (PWS). PWS is a framework for principled learning from weak supervision inputs (e.g., crowdsourced labels, knowledge bases, pre-trained models on related tasks, etc), and it has achieved remarkable success in many areas of science and engineering. Unfortunately, it is generally difficult to validate the performance of ML models trained with PWS due to the absence of labeled data. Our algorithms address this issue by estimating sharp lower and upper bounds for performance metrics such as accuracy/recall/precision/F1 score.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Performance of the joint LST-1 and MAGIC observations evaluated with Crab Nebula data
Authors:
H. Abe,
K. Abe,
S. Abe,
V. A. Acciari,
A. Aguasca-Cabot,
I. Agudo,
N. Alvarez Crespo,
T. Aniello,
S. Ansoldi,
L. A. Antonelli,
C. Aramo,
A. Arbet-Engels,
C. Arcaro,
M. Artero,
K. Asano,
P. Aubert,
D. Baack,
A. Babić,
A. Baktash,
A. Bamba,
A. Baquero Larriva,
L. Baroncelli,
U. Barres de Almeida,
J. A. Barrio,
I. Batković
, et al. (344 additional authors not shown)
Abstract:
Aims. LST-1, the prototype of the Large-Sized Telescope for the upcoming Cherenkov Telescope Array Observatory, is concluding its commissioning in Observatorio del Roque de los Muchachos on the island of La Palma. The proximity of LST-1 (Large-Sized Telescope 1) to the two MAGIC (Major Atmospheric Gamma Imaging Cherenkov) telescopes permits observations of the same gamma-ray events with both syste…
▽ More
Aims. LST-1, the prototype of the Large-Sized Telescope for the upcoming Cherenkov Telescope Array Observatory, is concluding its commissioning in Observatorio del Roque de los Muchachos on the island of La Palma. The proximity of LST-1 (Large-Sized Telescope 1) to the two MAGIC (Major Atmospheric Gamma Imaging Cherenkov) telescopes permits observations of the same gamma-ray events with both systems. Methods. We describe the joint LST-1+MAGIC analysis pipeline and use simultaneous Crab Nebula observations and Monte Carlo simulations to assess the performance of the three-telescope system. The addition of the LST-1 telescope allows the recovery of events in which one of the MAGIC images is too dim to survive analysis quality cuts. Results. Thanks to the resulting increase in the collection area and stronger background rejection, we find a significant improvement in sensitivity, allowing the detection of 30% weaker fluxes in the energy range between 200 GeV and 3 TeV. The spectrum of the Crab Nebula, reconstructed in the energy range ~60 GeV to ~10 TeV, is in agreement with previous measurements.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Fusing Models with Complementary Expertise
Authors:
Hongyi Wang,
Felipe Maia Polo,
Yuekai Sun,
Souvik Kundu,
Eric Xing,
Mikhail Yurochkin
Abstract:
Training AI models that generalize across tasks and domains has long been among the open problems driving AI research. The emergence of Foundation Models made it easier to obtain expert models for a given task, but the heterogeneity of data that may be encountered at test time often means that any single expert is insufficient. We consider the Fusion of Experts (FoE) problem of fusing outputs of e…
▽ More
Training AI models that generalize across tasks and domains has long been among the open problems driving AI research. The emergence of Foundation Models made it easier to obtain expert models for a given task, but the heterogeneity of data that may be encountered at test time often means that any single expert is insufficient. We consider the Fusion of Experts (FoE) problem of fusing outputs of expert models with complementary knowledge of the data distribution and formulate it as an instance of supervised learning. Our method is applicable to both discriminative and generative tasks and leads to significant performance improvements in image and text classification, text summarization, multiple-choice QA, and automatic evaluation of generated text. We also extend our method to the "frugal" setting where it is desired to reduce the number of expert model evaluations at test time. Our implementation is publicly available at https://github.com/hwang595/FoE-ICLR2024.
△ Less
Submitted 9 May, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
Prospects for $γ$-ray observations of the Perseus galaxy cluster with the Cherenkov Telescope Array
Authors:
The Cherenkov Telescope Array Consortium,
:,
K. Abe,
S. Abe,
F. Acero,
A. Acharyya,
R. Adam,
A. Aguasca-Cabot,
I. Agudo,
A. Aguirre-Santaella,
J. Alfaro,
R. Alfaro,
N. Alvarez-Crespo,
R. Alves Batista,
J. -P. Amans,
E. Amato,
E. O. Angüner,
L. A. Antonelli,
C. Aramo,
M. Araya,
C. Arcaro,
L. Arrabito,
K. Asano,
Y. Ascasíbar,
J. Aschersleben
, et al. (542 additional authors not shown)
Abstract:
Galaxy clusters are expected to be dark matter (DM) reservoirs and storage rooms for the cosmic-ray protons (CRp) that accumulate along the cluster's formation history. Accordingly, they are excellent targets to search for signals of DM annihilation and decay at gamma-ray energies and are predicted to be sources of large-scale gamma-ray emission due to hadronic interactions in the intracluster med…
▽ More
Galaxy clusters are expected to be dark matter (DM) reservoirs and storage rooms for the cosmic-ray protons (CRp) that accumulate along the cluster's formation history. Accordingly, they are excellent targets to search for signals of DM annihilation and decay at gamma-ray energies and are predicted to be sources of large-scale gamma-ray emission due to hadronic interactions in the intracluster medium. We estimate the sensitivity of the Cherenkov Telescope Array (CTA) to detect diffuse gamma-ray emission from the Perseus galaxy cluster. We perform a detailed spatial and spectral modelling of the expected signal for the DM and the CRp components. For each, we compute the expected CTA sensitivity. The observing strategy of Perseus is also discussed. In the absence of a diffuse signal (non-detection), CTA should constrain the CRp to thermal energy ratio within the radius $R_{500}$ down to about $X_{500}<3\times 10^{-3}$, for a spatial CRp distribution that follows the thermal gas and a CRp spectral index $α_{\rm CRp}=2.3$. Under the optimistic assumption of a pure hadronic origin of the Perseus radio mini-halo and depending on the assumed magnetic field profile, CTA should measure $α_{\rm CRp}$ down to about $Δα_{\rm CRp}\simeq 0.1$ and the CRp spatial distribution with 10% precision. Regarding DM, CTA should improve the current ground-based gamma-ray DM limits from clusters observations on the velocity-averaged annihilation cross-section by a factor of up to $\sim 5$, depending on the modelling of DM halo substructure. In the case of decay of DM particles, CTA will explore a new region of the parameter space, reaching models with $τ_χ>10^{27}$s for DM masses above 1 TeV. These constraints will provide unprecedented sensitivity to the physics of both CRp acceleration and transport at cluster scale and to TeV DM particle models, especially in the decay scenario.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Conditional independence testing under misspecified inductive biases
Authors:
Felipe Maia Polo,
Yuekai Sun,
Moulinath Banerjee
Abstract:
Conditional independence (CI) testing is a fundamental and challenging task in modern statistics and machine learning. Many modern methods for CI testing rely on powerful supervised learning methods to learn regression functions or Bayes predictors as an intermediate step; we refer to this class of tests as regression-based tests. Although these methods are guaranteed to control Type-I error when…
▽ More
Conditional independence (CI) testing is a fundamental and challenging task in modern statistics and machine learning. Many modern methods for CI testing rely on powerful supervised learning methods to learn regression functions or Bayes predictors as an intermediate step; we refer to this class of tests as regression-based tests. Although these methods are guaranteed to control Type-I error when the supervised learning methods accurately estimate the regression functions or Bayes predictors of interest, their behavior is less understood when they fail due to misspecified inductive biases; in other words, when the employed models are not flexible enough or when the training algorithm does not induce the desired predictors. Then, we study the performance of regression-based CI tests under misspecified inductive biases. Namely, we propose new approximations or upper bounds for the testing errors of three regression-based tests that depend on misspecification errors. Moreover, we introduce the Rao-Blackwellized Predictor Test (RBPT), a regression-based CI test robust against misspecified inductive biases. Finally, we conduct experiments with artificial and real data, showcasing the usefulness of our theory and methods.
△ Less
Submitted 27 October, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
Observations of the Crab Nebula and Pulsar with the Large-Sized Telescope Prototype of the Cherenkov Telescope Array
Authors:
CTA-LST Project,
:,
H. Abe,
K. Abe,
S. Abe,
A. Aguasca-Cabot,
I. Agudo,
N. Alvarez Crespo,
L. A. Antonelli,
C. Aramo,
A. Arbet-Engels,
C. Arcaro,
M. Artero,
K. Asano,
P. Aubert,
A. Baktash,
A. Bamba,
A. Baquero Larriva,
L. Baroncelli,
U. Barres de Almeida,
J. A. Barrio,
I. Batkovic,
J. Baxter,
J. Becerra González,
E. Bernardini
, et al. (467 additional authors not shown)
Abstract:
CTA (Cherenkov Telescope Array) is the next generation ground-based observatory for gamma-ray astronomy at very-high energies. The Large-Sized Telescope prototype (LST-1) is located at the Northern site of CTA, on the Canary Island of La Palma. LSTs are designed to provide optimal performance in the lowest part of the energy range covered by CTA, down to $\simeq 20$ GeV. LST-1 started performing a…
▽ More
CTA (Cherenkov Telescope Array) is the next generation ground-based observatory for gamma-ray astronomy at very-high energies. The Large-Sized Telescope prototype (LST-1) is located at the Northern site of CTA, on the Canary Island of La Palma. LSTs are designed to provide optimal performance in the lowest part of the energy range covered by CTA, down to $\simeq 20$ GeV. LST-1 started performing astronomical observations in November 2019, during its commissioning phase, and it has been taking data since then. We present the first LST-1 observations of the Crab Nebula, the standard candle of very-high energy gamma-ray astronomy, and use them, together with simulations, to assess the basic performance parameters of the telescope. The data sample consists of around 36 hours of observations at low zenith angles collected between November 2020 and March 2022. LST-1 has reached the expected performance during its commissioning period - only a minor adjustment of the preexisting simulations was needed to match the telescope behavior. The energy threshold at trigger level is estimated to be around 20 GeV, rising to $\simeq 30$ GeV after data analysis. Performance parameters depend strongly on energy, and on the strength of the gamma-ray selection cuts in the analysis: angular resolution ranges from 0.12 to 0.40 degrees, and energy resolution from 15 to 50%. Flux sensitivity is around 1.1% of the Crab Nebula flux above 250 GeV for a 50-h observation (12% for 30 minutes). The spectral energy distribution (in the 0.03 - 30 TeV range) and the light curve obtained for the Crab Nebula agree with previous measurements, considering statistical and systematic uncertainties. A clear periodic signal is also detected from the pulsar at the center of the Nebula.
△ Less
Submitted 19 July, 2023; v1 submitted 22 June, 2023;
originally announced June 2023.
-
A-BASE-DE-PROS: una implementación práctica de los Objetivos de Desarrollo Sostenible en la Universidad Politécnica de Madrid
Authors:
Patricia Almendros,
Silvia Otegui,
Alejandro Nares,
Laura del Fresno,
Javier Ablanque,
Irene Blanco,
Juan Ramón Ferrer,
Sonia Benito,
Carmen Lopez,
Sonia García,
León Fernández,
Sergio Zubelzu,
Raúl Sánchez,
Paloma Esteve,
Rosa María Benito,
Juan Carlos Losada,
Antonio Saa,
Gabriel Gascó,
Ana M. Méndez,
Mónica Montoya,
Marina de Francisco,
Jesús Ruiz,
Samuel Seoanez,
Sara Castilla,
Dámaris Fuente
, et al. (24 additional authors not shown)
Abstract:
The influence of the Sustainable Development Goals (SDGs) has been widely spread over the last years, establishing new public and privat policies. Education has also been experiencing this change by aligning with the previous goals. In this chapter, we briefly summarize the main activities conducted under the Grant APS22.2003 'Service-based learning of the SDGs related to a responsible production…
▽ More
The influence of the Sustainable Development Goals (SDGs) has been widely spread over the last years, establishing new public and privat policies. Education has also been experiencing this change by aligning with the previous goals. In this chapter, we briefly summarize the main activities conducted under the Grant APS22.2003 'Service-based learning of the SDGs related to a responsible production and consumption (A-BASE-DE-PROS)', which uses the SDG 12 as a guide line to raise the awareness of the importance of the 2030 Agenda among undergraduate and secondary-school students. In general, the service-based learning has increased the knowledge of the SDGs among the students. Furthermore, most of the (university and secondary) students found the service-learning
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Behavior patterns of 15-year-old students in a Digital Storytelling in Mathematics. A social network analysis
Authors:
Anna Concas,
Maria Polo
Abstract:
This paper presents the preliminary considerations of the application of a software to an experimental work conducted on Digital Storytelling in Mathematics, as part of the project Prin 2015 "Digital Interactive Storytelling in Mathematics: a competence-based social approach". An activity designed for promoting critical mathematical thinking among the students that foresees them to participate as…
▽ More
This paper presents the preliminary considerations of the application of a software to an experimental work conducted on Digital Storytelling in Mathematics, as part of the project Prin 2015 "Digital Interactive Storytelling in Mathematics: a competence-based social approach". An activity designed for promoting critical mathematical thinking among the students that foresees them to participate as active protagonists and as observers of the protagonists during the problem-solving activity, will be illustrated and then the outcomes will be examined from a numerical analysis point of view. In particular, the interactions between the participants will be investigated by using a Matlab software for solving the seriation problem.
△ Less
Submitted 14 December, 2022;
originally announced December 2022.
-
Multi-wavelength study of the galactic PeVatron candidate LHAASO J2108+5157
Authors:
S. Abe,
A. Aguasca-Cabot,
I. Agudo,
N. Alvarez Crespo,
L. A. Antonelli,
C. Aramo,
A. Arbet-Engels,
M. Artero,
K. Asano,
P. Aubert,
A. Baktash,
A. Bamba,
A. Baquero Larriva,
L. Baroncelli,
U. Barres de Almeida,
J. A. Barrio,
I. Batkovic,
J. Baxter,
J. Becerra González,
E. Bernardini,
M. I. Bernardos,
J. Bernete Medrano,
A. Berti,
P. Bhattacharjee,
N. Biederbeck
, et al. (245 additional authors not shown)
Abstract:
LHAASO J2108+5157 is one of the few known unidentified Ultra-High-Energy (UHE) gamma-ray sources with no Very-High-Energy (VHE) counterpart, recently discovered by the LHAASO collaboration. We observed LHAASO J2108+5157 in the X-ray band with XMM-Newton in 2021 for a total of 3.8 hours and at TeV energies with the Large-Sized Telescope prototype (LST-1), yielding 49 hours of good quality data. In…
▽ More
LHAASO J2108+5157 is one of the few known unidentified Ultra-High-Energy (UHE) gamma-ray sources with no Very-High-Energy (VHE) counterpart, recently discovered by the LHAASO collaboration. We observed LHAASO J2108+5157 in the X-ray band with XMM-Newton in 2021 for a total of 3.8 hours and at TeV energies with the Large-Sized Telescope prototype (LST-1), yielding 49 hours of good quality data. In addition, we analyzed 12 years of Fermi-LAT data, to better constrain emission of its High-Energy (HE) counterpart 4FGL J2108.0+5155. We found an excess (3.7 sigma) in the LST-1 data at energies E > 3 TeV. Further analysis in the whole LST-1 energy range assuming a point-like source, resulted in a hint (2.2 sigma) of hard emission which can be described with a single power law with photon index Gamma = 1.6 +- 0.2 between 0.3 - 100 TeV. We did not find any significant extended emission which could be related to a Supernova Remnant (SNR) or Pulsar Wind Nebula (PWN) in the XMM-Newton data, which puts strong constraints on possible synchrotron emission of relativistic electrons. The LST-1 and LHAASO observations can be explained as inverse Compton-dominated leptonic emission of relativistic electrons with a cutoff energy of $100^{+70}_{-30}$ TeV. The low magnetic field in the source imposed by the X-ray upper limits on synchrotron emission is compatible with a hypothesis of a PWN or a TeV halo. The lack of a pulsar in the neighborhood of the UHE source is a challenge to the PWN/TeV-halo scenario. The UHE gamma rays can also be explained as $π^0$ decay-dominated hadronic emission due to interaction of relativistic protons with one of the two known molecular clouds in the direction of the source. The hard spectrum in the LST-1 band is compatible with protons esca** a shock around a middle-aged SNR because of their high low-energy cut-off.
△ Less
Submitted 16 March, 2023; v1 submitted 3 October, 2022;
originally announced October 2022.
-
First measurements and upgrade plans of the MAGIC intensity interferometer
Authors:
Juan Cortina,
V. A. Acciari,
A. Biland,
E. Colombo,
C. da Costa,
C. Delgado,
C. Diaz,
M. Fiori,
D. Fink,
T. Hassan,
I. Jimenez-Martinez,
E. Lyard,
M. Mariotti,
G. Martinez,
R. Mirzoyan,
G. Naletto,
M. Polo,
N. Produit,
J. J. Rodriguez,
T. Schweizer,
R. Walter,
C. W. Wunderlich,
L. Zampieri,
the MAGIC,
LST collaborations
Abstract:
The two MAGIC 17-m diameter Imaging Atmospheric Cherenkov Telescopes have been equipped to work also as an intensity interferometer with a deadtime-free, 4-channel, GPU-based, real-time correlator. Operating with baselines between approx. 40 and 90 m the MAGIC interferometer is able to measure stellar diameters of 0.5-1 mas in the 400-440 nm wavelength range with a sensitivity roughly 10 times bet…
▽ More
The two MAGIC 17-m diameter Imaging Atmospheric Cherenkov Telescopes have been equipped to work also as an intensity interferometer with a deadtime-free, 4-channel, GPU-based, real-time correlator. Operating with baselines between approx. 40 and 90 m the MAGIC interferometer is able to measure stellar diameters of 0.5-1 mas in the 400-440 nm wavelength range with a sensitivity roughly 10 times better than that achieved in the 1970s by the Narrabri Stellar Intensity Interferometer. Besides, active mirror control allows to split the primary mirrors into sub-mirrors. This allows to make simultaneous calibration measurements of the zero-baseline correlation or to simultaneously collect six baselines below 17 m with almost arbitrary orientation, corresponding to angular scales of approx. 1-50 mas. We plan to perform test observations adding the nearby Cherenkov Telescope Array (CTA) LST-1 23 m diameter telescope by next year. All three telescope pairs will be correlated simultaneously. Adding LST-1 is expected to increase the sensitivity by at least 1 mag and significantly improve the u-v plane coverage. If successful, the proposed correlator setup is scalable enough to be implemented to the full CTA arrays.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Gaia Data Release 3: Summary of the content and survey properties
Authors:
Gaia Collaboration,
A. Vallenari,
A. G. A. Brown,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi,
S. A. Klioner,
U. L. Lammers,
L. Lindegren,
X. Luri,
F. Mignard,
C. Panem,
D. Pourbaix,
S. Randich,
P. Sartoretti,
C. Soubiran
, et al. (431 additional authors not shown)
Abstract:
We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photom…
▽ More
We present the third data release of the European Space Agency's Gaia mission, GDR3. The GDR3 catalogue is the outcome of the processing of raw data collected with the Gaia instruments during the first 34 months of the mission by the Gaia Data Processing and Analysis Consortium. The GDR3 catalogue contains the same source list, celestial positions, proper motions, parallaxes, and broad band photometry in the G, G$_{BP}$, and G$_{RP}$ pass-bands already present in the Early Third Data Release. GDR3 introduces an impressive wealth of new data products. More than 33 million objects in the ranges $G_{rvs} < 14$ and $3100 <T_{eff} <14500 $, have new determinations of their mean radial velocities based on data collected by Gaia. We provide G$_{rvs}$ magnitudes for most sources with radial velocities, and a line broadening parameter is listed for a subset of these. Mean Gaia spectra are made available to the community. The GDR3 catalogue includes about 1 million mean spectra from the radial velocity spectrometer, and about 220 million low-resolution blue and red prism photometer BPRP mean spectra. The results of the analysis of epoch photometry are provided for some 10 million sources across 24 variability types. GDR3 includes astrophysical parameters and source class probabilities for about 470 million and 1500 million sources, respectively, including stars, galaxies, and quasars. Orbital elements and trend parameters are provided for some $800\,000$ astrometric, spectroscopic and eclipsing binaries. More than $150\,000$ Solar System objects, including new discoveries, with preliminary orbital solutions and individual epoch observations are part of this release. Reflectance spectra derived from the epoch BPRP spectral data are published for about 60\,000 asteroids. Finally, an additional data set is provided, namely the Gaia Andromeda Photometric Survey (abridged)
△ Less
Submitted 30 July, 2022;
originally announced August 2022.
-
Gaia Data Release 3: Reflectance spectra of Solar System small bodies
Authors:
Gaia Collaboration,
L. Galluccio,
M. Delbo,
F. De Angeli,
T. Pauwels,
P. Tanga,
F. Mignard,
A. Cellino,
A. G. A. Brown,
K. Muinonen,
A. Penttila,
S. Jordan,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
L. Eyer,
R. Guerra,
A. Hutton,
C. Jordi
, et al. (422 additional authors not shown)
Abstract:
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was deriv…
▽ More
The Gaia mission of the European Space Agency (ESA) has been routinely observing Solar System objects (SSOs) since the beginning of its operations in August 2014. The Gaia data release three (DR3) includes, for the first time, the mean reflectance spectra of a selected sample of 60 518 SSOs, primarily asteroids, observed between August 5, 2014, and May 28, 2017. Each reflectance spectrum was derived from measurements obtained by means of the Blue and Red photometers (BP/RP), which were binned in 16 discrete wavelength bands. We describe the processing of the Gaia spectral data of SSOs, explaining both the criteria used to select the subset of asteroid spectra published in Gaia DR3, and the different steps of our internal validation procedures. In order to further assess the quality of Gaia SSO reflectance spectra, we carried out external validation against SSO reflectance spectra obtained from ground-based and space-borne telescopes and available in the literature. For each selected SSO, an epoch reflectance was computed by dividing the calibrated spectrum observed by the BP/RP at each transit on the focal plane by the mean spectrum of a solar analogue. The latter was obtained by averaging the Gaia spectral measurements of a selected sample of stars known to have very similar spectra to that of the Sun. Finally, a mean of the epoch reflectance spectra was calculated in 16 spectral bands for each SSO. The agreement between Gaia mean reflectance spectra and those available in the literature is good for bright SSOs, regardless of their taxonomic spectral class. We identify an increase in the spectral slope of S-type SSOs with increasing phase angle. Moreover, we show that the spectral slope increases and the depth of the 1 um absorption band decreases for increasing ages of S-type asteroid families.
△ Less
Submitted 24 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Map** the asymmetric disc of the Milky Way
Authors:
Gaia Collaboration,
R. Drimmel,
M. Romero-Gomez,
L. Chemin,
P. Ramos,
E. Poggio,
V. Ripepi,
R. Andrae,
R. Blomme,
T. Cantat-Gaudin,
A. Castro-Ginard,
G. Clementini,
F. Figueras,
M. Fouesneau,
Y. Fremat,
K. Jardine,
S. Khanna,
A. Lobel,
D. J. Marshall,
T. Muraveva,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou
, et al. (431 additional authors not shown)
Abstract:
With the most recent Gaia data release the number of sources with complete 6D phase space information (position and velocity) has increased to well over 33 million stars, while stellar astrophysical parameters are provided for more than 470 million sources, in addition to the identification of over 11 million variable stars. Using the astrophysical parameters and variability classifications provid…
▽ More
With the most recent Gaia data release the number of sources with complete 6D phase space information (position and velocity) has increased to well over 33 million stars, while stellar astrophysical parameters are provided for more than 470 million sources, in addition to the identification of over 11 million variable stars. Using the astrophysical parameters and variability classifications provided in Gaia DR3, we select various stellar populations to explore and identify non-axisymmetric features in the disc of the Milky Way in both configuration and velocity space. Using more about 580 thousand sources identified as hot OB stars, together with 988 known open clusters younger than 100 million years, we map the spiral structure associated with star formation 4-5 kpc from the Sun. We select over 2800 Classical Cepheids younger than 200 million years, which show spiral features extending as far as 10 kpc from the Sun in the outer disc. We also identify more than 8.7 million sources on the red giant branch (RGB), of which 5.7 million have line-of-sight velocities, allowing the velocity field of the Milky Way to be mapped as far as 8 kpc from the Sun, including the inner disc. The spiral structure revealed by the young populations is consistent with recent results using Gaia EDR3 astrometry and source lists based on near infrared photometry, showing the Local (Orion) arm to be at least 8 kpc long, and an outer arm consistent with what is seen in HI surveys, which seems to be a continuation of the Perseus arm into the third quadrant. Meanwhile, the subset of RGB stars with velocities clearly reveals the large scale kinematic signature of the bar in the inner disc, as well as evidence of streaming motions in the outer disc that might be associated with spiral arms or bar resonances. (abridged)
△ Less
Submitted 5 August, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Pulsations in main sequence OBAF-type stars
Authors:
Gaia Collaboration,
J. De Ridder,
V. Ripepi,
C. Aerts,
L. Palaversa,
L. Eyer,
B. Holl,
M. Audard,
L. Rimoldini,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
C. Ducourant,
D. W. Evans,
R. Guerra,
A. Hutton,
C. Jordi,
S. A. Klioner,
U. L. Lammers,
L. Lindegren
, et al. (423 additional authors not shown)
Abstract:
The third Gaia data release provides photometric time series covering 34 months for about 10 million stars. For many of those stars, a characterisation in Fourier space and their variability classification are also provided. This paper focuses on intermediate- to high-mass (IHM) main sequence pulsators M >= 1.3 Msun) of spectral types O, B, A, or F, known as beta Cep, slowly pulsating B (SPB), del…
▽ More
The third Gaia data release provides photometric time series covering 34 months for about 10 million stars. For many of those stars, a characterisation in Fourier space and their variability classification are also provided. This paper focuses on intermediate- to high-mass (IHM) main sequence pulsators M >= 1.3 Msun) of spectral types O, B, A, or F, known as beta Cep, slowly pulsating B (SPB), delta Sct, and gamma Dor stars. These stars are often multi-periodic and display low amplitudes, making them challenging targets to analyse with sparse time series. All datasets used in this analysis are part of the Gaia DR3 data release. The photometric time series were used to perform a Fourier analysis, while the global astrophysical parameters necessary for the empirical instability strips were taken from the Gaia DR3 gspphot tables, and the vsini data were taken from the Gaia DR3 esphs tables. We show that for nearby OBAF-type pulsators, the Gaia DR3 data are precise and accurate enough to pinpoint them in the Hertzsprung-Russell diagram. We find empirical instability strips covering broader regions than theoretically predicted. In particular, our study reveals the presence of fast rotating gravity-mode pulsators outside the strips, as well as the co-existence of rotationally modulated variables inside the strips as reported before in the literature. We derive an extensive period-luminosity relation for delta Sct stars and provide evidence that the relation features different regimes depending on the oscillation period. Finally, we demonstrate how stellar rotation attenuates the amplitude of the dominant oscillation mode of delta Sct stars.
△ Less
Submitted 16 August, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: A Golden Sample of Astrophysical Parameters
Authors:
Gaia Collaboration,
O. L. Creevey,
L. M. Sarro,
A. Lobel,
E. Pancino,
R. Andrae,
R. L. Smart,
G. Clementini,
U. Heiter,
A. J. Korn,
M. Fouesneau,
Y. Frémat,
F. De Angeli,
A. Vallenari,
D. L. Harrison,
F. Thévenin,
C. Reylé,
R. Sordo,
A. Garofalo,
A. G. A. Brown,
L. Eyer,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux
, et al. (423 additional authors not shown)
Abstract:
Gaia Data Release 3 (DR3) provides a wealth of new data products for the astronomical community to exploit, including astrophysical parameters for a half billion stars. In this work we demonstrate the high quality of these data products and illustrate their use in different astrophysical contexts. We query the astrophysical parameter tables along with other tables in Gaia DR3 to derive the samples…
▽ More
Gaia Data Release 3 (DR3) provides a wealth of new data products for the astronomical community to exploit, including astrophysical parameters for a half billion stars. In this work we demonstrate the high quality of these data products and illustrate their use in different astrophysical contexts. We query the astrophysical parameter tables along with other tables in Gaia DR3 to derive the samples of the stars of interest. We validate our results by using the Gaia catalogue itself and by comparison with external data. We have produced six homogeneous samples of stars with high quality astrophysical parameters across the HR diagram for the community to exploit. We first focus on three samples that span a large parameter space: young massive disk stars (~3M), FGKM spectral type stars (~3M), and UCDs (~20K). We provide these sources along with additional information (either a flag or complementary parameters) as tables that are made available in the Gaia archive. We furthermore identify 15740 bone fide carbon stars, 5863 solar-analogues, and provide the first homogeneous set of stellar parameters of the Spectro Photometric Standard Stars. We use a subset of the OBA sample to illustrate its usefulness to analyse the Milky Way rotation curve. We then use the properties of the FGKM stars to analyse known exoplanet systems. We also analyse the ages of some unseen UCD-companions to the FGKM stars. We additionally predict the colours of the Sun in various passbands (Gaia, 2MASS, WISE) using the solar-analogue sample.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: The extragalactic content
Authors:
Gaia Collaboration,
C. A. L. Bailer-Jones,
D. Teyssier,
L. Delchambre,
C. Ducourant,
D. Garabato,
D. Hatzidimitriou,
S. A. Klioner,
L. Rimoldini,
I. Bellas-Velidis,
R. Carballo,
M. I. Carnerero,
C. Diener,
M. Fouesneau,
L. Galluccio,
P. Gavras,
A. Krone-Martins,
C. M. Raiteri,
R. Teixeira,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
F. Arenou,
C. Babusiaux
, et al. (422 additional authors not shown)
Abstract:
The Gaia Galactic survey mission is designed and optimized to obtain astrometry, photometry, and spectroscopy of nearly two billion stars in our Galaxy. Yet as an all-sky multi-epoch survey, Gaia also observes several million extragalactic objects down to a magnitude of G~21 mag. Due to the nature of the Gaia onboard selection algorithms, these are mostly point-source-like objects. Using data prov…
▽ More
The Gaia Galactic survey mission is designed and optimized to obtain astrometry, photometry, and spectroscopy of nearly two billion stars in our Galaxy. Yet as an all-sky multi-epoch survey, Gaia also observes several million extragalactic objects down to a magnitude of G~21 mag. Due to the nature of the Gaia onboard selection algorithms, these are mostly point-source-like objects. Using data provided by the satellite, we have identified quasar and galaxy candidates via supervised machine learning methods, and estimate their redshifts using the low resolution BP/RP spectra. We further characterise the surface brightness profiles of host galaxies of quasars and of galaxies from pre-defined input lists. Here we give an overview of the processing of extragalactic objects, describe the data products in Gaia DR3, and analyse their properties. Two integrated tables contain the main results for a high completeness, but low purity (50-70%), set of 6.6 million candidate quasars and 4.8 million candidate galaxies. We provide queries that select purer sub-samples of these containing 1.9 million probable quasars and 2.9 million probable galaxies (both 95% purity). We also use high quality BP/RP spectra of 43 thousand high probability quasars over the redshift range 0.05-4.36 to construct a composite quasar spectrum spanning restframe wavelengths from 72-100 nm.
△ Less
Submitted 12 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Stellar multiplicity, a teaser for the hidden treasure
Authors:
Gaia Collaboration,
F. Arenou,
C. Babusiaux,
M. A. Barstow,
S. Faigler,
A. Jorissen,
P. Kervella,
T. Mazeh,
N. Mowlavi,
P. Panuzzo,
J. Sahlmann,
S. Shahaf,
A. Sozzetti,
N. Bauchet,
Y. Damerdji,
P. Gavras,
P. Giacobbe,
E. Gosset,
J. -L. Halbwachs,
B. Holl,
M. G. Lattanzi,
N. Leclerc,
T. Morel,
D. Pourbaix,
P. Re Fiorentin
, et al. (425 additional authors not shown)
Abstract:
The Gaia DR3 Catalogue contains for the first time about eight hundred thousand solutions with either orbital elements or trend parameters for astrometric, spectroscopic and eclipsing binaries, and combinations of them. This paper aims to illustrate the huge potential of this large non-single star catalogue. Using the orbital solutions together with models of the binaries, a catalogue of tens of t…
▽ More
The Gaia DR3 Catalogue contains for the first time about eight hundred thousand solutions with either orbital elements or trend parameters for astrometric, spectroscopic and eclipsing binaries, and combinations of them. This paper aims to illustrate the huge potential of this large non-single star catalogue. Using the orbital solutions together with models of the binaries, a catalogue of tens of thousands of stellar masses, or lower limits, partly together with consistent flux ratios, has been built. Properties concerning the completeness of the binary catalogues are discussed, statistical features of the orbital elements are explained and a comparison with other catalogues is performed. Illustrative applications are proposed for binaries across the H-R diagram. The binarity is studied in the RGB/AGB and a search for genuine SB1 among long-period variables is performed. The discovery of new EL CVn systems illustrates the potential of combining variability and binarity catalogues. Potential compact object companions are presented, mainly white dwarf companions or double degenerates, but one candidate neutron star is also presented. Towards the bottom of the main sequence, the orbits of previously-suspected binary ultracool dwarfs are determined and new candidate binaries are discovered. The long awaited contribution of Gaia to the analysis of the substellar regime shows the brown dwarf desert around solar-type stars using true, rather than minimum, masses, and provides new important constraints on the occurrence rates of substellar companions to M dwarfs. Several dozen new exoplanets are proposed, including two with validated orbital solutions and one super-Jupiter orbiting a white dwarf, all being candidates requiring confirmation. Beside binarity, higher order multiple systems are also found.
△ Less
Submitted 11 June, 2022;
originally announced June 2022.
-
Gaia Data Release 3: Chemical cartography of the Milky Way
Authors:
Gaia Collaboration,
A. Recio-Blanco,
G. Kordopatis,
P. de Laverny,
P. A. Palicio,
A. Spagna,
L. Spina,
D. Katz,
P. Re Fiorentin,
E. Poggio,
P. J. McMillan,
A. Vallenari,
M. G. Lattanzi,
G. M. Seabroke,
L. Casamiquela,
A. Bragaglia,
T. Antoja,
C. A. L. Bailer-Jones,
R. Andrae,
M. Fouesneau,
M. Cropper,
T. Cantat-Gaudin,
U. Heiter,
A. Bijaoui,
A. G. A. Brown
, et al. (425 additional authors not shown)
Abstract:
Gaia DR3 opens a new era of all-sky spectral analysis of stellar populations thanks to the nearly 5.6 million stars observed by the RVS and parametrised by the GSP-spec module. The all-sky Gaia chemical cartography allows a powerful and precise chemo-dynamical view of the Milky Way with unprecedented spatial coverage and statistical robustness. First, it reveals the strong vertical symmetry of the…
▽ More
Gaia DR3 opens a new era of all-sky spectral analysis of stellar populations thanks to the nearly 5.6 million stars observed by the RVS and parametrised by the GSP-spec module. The all-sky Gaia chemical cartography allows a powerful and precise chemo-dynamical view of the Milky Way with unprecedented spatial coverage and statistical robustness. First, it reveals the strong vertical symmetry of the Galaxy and the flared structure of the disc. Second, the observed kinematic disturbances of the disc -- seen as phase space correlations -- and kinematic or orbital substructures are associated with chemical patterns that favour stars with enhanced metallicities and lower [alpha/Fe] abundance ratios compared to the median values in the radial distributions. This is detected both for young objects that trace the spiral arms and older populations. Several alpha, iron-peak elements and at least one heavy element trace the thin and thick disc properties in the solar cylinder. Third, young disc stars show a recent chemical impoverishment in several elements. Fourth, the largest chemo-dynamical sample of open clusters analysed so far shows a steepening of the radial metallicity gradient with age, which is also observed in the young field population. Finally, the Gaia chemical data have the required coverage and precision to unveil galaxy accretion debris and heated disc stars on halo orbits through their [alpha/Fe] ratio, and to allow the study of the chemo-dynamical properties of globular clusters. Gaia DR3 chemo-dynamical diagnostics open new horizons before the era of ground-based wide-field spectroscopic surveys. They unveil a complex Milky Way that is the outcome of an eventful evolution, sha** it to the present day (abridged).
△ Less
Submitted 11 June, 2022;
originally announced June 2022.
-
A unified framework for dataset shift diagnostics
Authors:
Felipe Maia Polo,
Rafael Izbicki,
Evanildo Gomes Lacerda Jr,
Juan Pablo Ibieta-Jimenez,
Renato Vicente
Abstract:
Supervised learning techniques typically assume training data originates from the target population. Yet, in reality, dataset shift frequently arises, which, if not adequately taken into account, may decrease the performance of their predictors. In this work, we propose a novel and flexible framework called DetectShift that quantifies and tests for multiple dataset shifts, encompassing shifts in t…
▽ More
Supervised learning techniques typically assume training data originates from the target population. Yet, in reality, dataset shift frequently arises, which, if not adequately taken into account, may decrease the performance of their predictors. In this work, we propose a novel and flexible framework called DetectShift that quantifies and tests for multiple dataset shifts, encompassing shifts in the distributions of $(X, Y)$, $X$, $Y$, $X|Y$, and $Y|X$. DetectShift equips practitioners with insights into data shifts, facilitating the adaptation or retraining of predictors using both source and target data. This proves extremely valuable when labeled samples in the target domain are limited. The framework utilizes test statistics with the same nature to quantify the magnitude of the various shifts, making results more interpretable. It is versatile, suitable for regression and classification tasks, and accommodates diverse data forms - tabular, text, or image. Experimental results demonstrate the effectiveness of DetectShift in detecting dataset shifts even in higher dimensions.
△ Less
Submitted 12 September, 2023; v1 submitted 17 May, 2022;
originally announced May 2022.
-
Gaia Early Data Release 3: The celestial reference frame (Gaia-CRF3)
Authors:
Gaia Collaboration,
S. A. Klioner,
L. Lindegren,
F. Mignard,
J. Hernández,
M. Ramos-Lerate,
U. Bastian,
M. Biermann,
A. Bombrun,
A. de Torres,
E. Gerlach,
R. Geyer,
T. Hilger,
D. Hobbs,
U. L. Lammers,
P. J. McMillan,
H. Steidelmüller,
D. Teyssier,
C. M. Raiteri,
S. Bartolomé,
M. Bernet,
J. Castañeda,
M. Clotet,
M. Davidson,
C. Fabricius
, et al. (426 additional authors not shown)
Abstract:
Gaia-CRF3 is the celestial reference frame for positions and proper motions in the third release of data from the Gaia mission, Gaia DR3 (and for the early third release, Gaia EDR3, which contains identical astrometric results). The reference frame is defined by the positions and proper motions at epoch 2016.0 for a specific set of extragalactic sources in the (E)DR3 catalogue.
We describe the c…
▽ More
Gaia-CRF3 is the celestial reference frame for positions and proper motions in the third release of data from the Gaia mission, Gaia DR3 (and for the early third release, Gaia EDR3, which contains identical astrometric results). The reference frame is defined by the positions and proper motions at epoch 2016.0 for a specific set of extragalactic sources in the (E)DR3 catalogue.
We describe the construction of Gaia-CRF3, and its properties in terms of the distributions in magnitude, colour, and astrometric quality.
Compact extragalactic sources in Gaia DR3 were identified by positional cross-matching with 17 external catalogues of quasars (QSO) and active galactic nuclei (AGN), followed by astrometric filtering designed to remove stellar contaminants. Selecting a clean sample was favoured over including a higher number of extragalactic sources. For the final sample, the random and systematic errors in the proper motions are analysed, as well as the radio-optical offsets in position for sources in the third realisation of the International Celestial Reference Frame (ICRF3).
The Gaia-CRF3 comprises about 1.6 million QSO-like sources, of which 1.2 million have five-parameter astrometric solutions in Gaia DR3 and 0.4 million have six-parameter solutions. The sources span the magnitude range G = 13 to 21 with a peak density at 20.6 mag, at which the typical positional uncertainty is about 1 mas. The proper motions show systematic errors on the level of 12 $μ$as yr${}^{-1}$ on angular scales greater than 15 deg. For the 3142 optical counterparts of ICRF3 sources in the S/X frequency bands, the median offset from the radio positions is about 0.5 mas, but exceeds 4 mas in either coordinate for 127 sources. We outline the future of the Gaia-CRF in the next Gaia data releases.
△ Less
Submitted 30 October, 2022; v1 submitted 26 April, 2022;
originally announced April 2022.
-
Calibration and performance of the readout system based on switched capacitor arrays for the Large-Sized Telescope of the Cherenkov Telescope Array
Authors:
Seiya Nozaki,
Kyosuke Awai,
Aya Bamba,
Juan Abel Barrio,
Maria Isabel Bernardos,
Oscar Blanch,
Joan Boix,
Franca Cassol,
Yuki Choushi,
Carlos Delgado,
Carlos Diaz,
Nadia Fouque,
Lluis Freixas,
Pawel Gliwny,
Shunichi Gunji,
Daniela Hadasch,
Dirk Hoffmann,
Julien Houles,
Yusuke Inome,
Yuki Iwamura,
Léa Jouvin,
Hideaki Katagiri,
Kiomei Kawamura,
Daniel Kerszberg,
Yusuke Konno
, et al. (37 additional authors not shown)
Abstract:
The Cherenkov Telescope Array (CTA) is the next-generation ground-based very-high-energy gamma-ray observatory. The Large-Sized Telescope (LST) of CTA is designed to detect gamma rays between 20 GeV and a few TeV with a 23-meter diameter mirror. We have developed the focal plane camera of the first LST, which has 1855 photomultiplier tubes (PMTs) and the readout system which samples a PMT waveform…
▽ More
The Cherenkov Telescope Array (CTA) is the next-generation ground-based very-high-energy gamma-ray observatory. The Large-Sized Telescope (LST) of CTA is designed to detect gamma rays between 20 GeV and a few TeV with a 23-meter diameter mirror. We have developed the focal plane camera of the first LST, which has 1855 photomultiplier tubes (PMTs) and the readout system which samples a PMT waveform at GHz with switched capacitor arrays, Domino Ring Sampler ver4 (DRS4). To measure the precise pulse charge and arrival time of Cherenkov signals, we developed a method to calibrate the output voltage of DRS4 and the sampling time interval, as well as an analysis method to correct the spike noise of DRS4. Since the first LST was inaugurated in 2018, we have performed the commissioning tests and calibrated the camera. We characterised the camera in terms of the charge pedestal under various conditions of the night sky background, the charge resolution of each pixel, the charge uniformity of the whole camera, and the time resolutions with a test pulse and calibration laser.
△ Less
Submitted 13 March, 2022;
originally announced March 2022.
-
LegalNLP -- Natural Language Processing methods for the Brazilian Legal Language
Authors:
Felipe Maia Polo,
Gabriel Caiaffa Floriano Mendonça,
Kauê Capellato J. Parreira,
Lucka Gianvechio,
Peterson Cordeiro,
Jonathan Batista Ferreira,
Leticia Maria Paz de Lima,
Antônio Carlos do Amaral Maia,
Renato Vicente
Abstract:
We present and make available pre-trained language models (Phraser, Word2Vec, Doc2Vec, FastText, and BERT) for the Brazilian legal language, a Python package with functions to facilitate their use, and a set of demonstrations/tutorials containing some applications involving them. Given that our material is built upon legal texts coming from several Brazilian courts, this initiative is extremely he…
▽ More
We present and make available pre-trained language models (Phraser, Word2Vec, Doc2Vec, FastText, and BERT) for the Brazilian legal language, a Python package with functions to facilitate their use, and a set of demonstrations/tutorials containing some applications involving them. Given that our material is built upon legal texts coming from several Brazilian courts, this initiative is extremely helpful for the Brazilian legal field, which lacks other open and specific tools and language models. Our main objective is to catalyze the use of natural language processing tools for legal texts analysis by the Brazilian industry, government, and academia, providing the necessary tools and accessible material.
△ Less
Submitted 5 October, 2021;
originally announced October 2021.
-
Commissioning of the camera of the first Large Size Telescope of the Cherenkov Telescope Array
Authors:
T. Saito,
C. Delgado,
O. Blanch,
M. Artero,
J. A. Barrio,
F. Cassol,
C. Diaz,
D. Hadasch,
D. Hoffmann,
J. Houles,
Y. Inome,
M. Iori,
L. Jouvin,
D. Kerszberg,
Y. Kobayashi,
H. Kubo,
G. Martinez,
D. Mazin,
E. Moretti,
T. Nakamori,
S. Nozaki,
T. Oka,
A. Okumura,
M. Palatiello,
M. Polo
, et al. (8 additional authors not shown)
Abstract:
The first Large Size Telescope (LST-1) of the Cherenkov Telescope Array has been operational since October 2018 at La Palma, Spain. We report on the results obtained during the camera commissioning. The noise level of the readout is determined as a 0.2 p.e. level. The gain of PMTs are well equalized within 2\% variation, using the calibration flash system. The effect of the night sky background on…
▽ More
The first Large Size Telescope (LST-1) of the Cherenkov Telescope Array has been operational since October 2018 at La Palma, Spain. We report on the results obtained during the camera commissioning. The noise level of the readout is determined as a 0.2 p.e. level. The gain of PMTs are well equalized within 2\% variation, using the calibration flash system. The effect of the night sky background on the signal readout noise as well as the PMT gain estimation are also well evaluated. Trigger thresholds are optimized for the lowest possible gamma-ray energy threshold and the trigger distribution synchronization has been achieved within 1~ns precision. Automatic rate control realizes the stable observation with 1.5\% rate variation over 3 hours. The performance of the novel DAQ system demonstrates a less than 10\% dead time for 15 kHz trigger rate even with sophisticated online data correction.
△ Less
Submitted 4 August, 2021;
originally announced August 2021.
-
Effects of personality traits in predicting grade retention of Brazilian students
Authors:
Carmen Melo Toledo,
Guilherme Mendes Bassedon,
Jonathan Batista Ferreira,
Lucka de Godoy Gianvechio,
Carlos Guatimosim,
Felipe Maia Polo,
Renato Vicente
Abstract:
Student's grade retention is a key issue faced by many education systems, especially those in develo** countries. In this paper, we seek to gauge the relevance of students' personality traits in predicting grade retention in Brazil. For that, we used data collected in 2012 and 2017, in the city of Sertaozinho, countryside of the state of Sao Paulo, Brazil. The surveys taken in Sertaozinho includ…
▽ More
Student's grade retention is a key issue faced by many education systems, especially those in develo** countries. In this paper, we seek to gauge the relevance of students' personality traits in predicting grade retention in Brazil. For that, we used data collected in 2012 and 2017, in the city of Sertaozinho, countryside of the state of Sao Paulo, Brazil. The surveys taken in Sertaozinho included several socioeconomic questions, standardized tests, and a personality test. Moreover, students were in grades 4, 5, and 6 in 2012. Our approach was based on training machine learning models on the surveys' data to predict grade retention between 2012 and 2017 using information from 2012 or before, and then using some strategies to quantify personality traits' predictive power. We concluded that, besides proving to be fairly better than a random classifier when isolated, personality traits contribute to prediction even when using socioeconomic variables and standardized tests results.
△ Less
Submitted 12 July, 2021;
originally announced July 2021.
-
OCDE: Odds Conditional Density Estimator
Authors:
Alex Akira Okuno,
Felipe Maia Polo
Abstract:
Conditional density estimation (CDE) models can be useful for many statistical applications, especially because the full conditional density is estimated instead of traditional regression point estimates, revealing more information about the uncertainty of the random variable of interest. In this paper, we propose a new methodology called Odds Conditional Density Estimator (OCDE) to estimate condi…
▽ More
Conditional density estimation (CDE) models can be useful for many statistical applications, especially because the full conditional density is estimated instead of traditional regression point estimates, revealing more information about the uncertainty of the random variable of interest. In this paper, we propose a new methodology called Odds Conditional Density Estimator (OCDE) to estimate conditional densities in a supervised learning scheme. The main idea is that it is very difficult to estimate $p_{x,y}$ and $p_{x}$ in order to estimate the conditional density $p_{y|x}$, but by introducing an instrumental distribution, we transform the CDE problem into a problem of odds estimation, or similarly, training a binary probabilistic classifier. We demonstrate how OCDE works using simulated data and then test its performance against other known state-of-the-art CDE methods in real data. Overall, OCDE is competitive compared with these methods in real datasets.
△ Less
Submitted 23 June, 2021;
originally announced July 2021.
-
Gaia Early Data Release 3: The Galactic anticentre
Authors:
Gaia Collaboration,
T. Antoja,
P. McMillan,
G. Kordopatis,
P. Ramos,
A. Helmi,
E. Balbinot,
T. Cantat-Gaudin,
L. Chemin,
F. Figueras,
C. Jordi,
S. Khanna,
M. Romero-Gomez,
G. Seabroke,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
D. W. Evans,
L. Eyer,
A. Hutton,
F. Jansen
, et al. (395 additional authors not shown)
Abstract:
We aim to demonstrate the scientific potential of the Gaia Early Data Release 3 (EDR3) for the study of the Milky Way structure and evolution. We used astrometric positions, proper motions, parallaxes, and photometry from EDR3 to select different populations and components and to calculate the distances and velocities in the direction of the anticentre. We explore the disturbances of the current d…
▽ More
We aim to demonstrate the scientific potential of the Gaia Early Data Release 3 (EDR3) for the study of the Milky Way structure and evolution. We used astrometric positions, proper motions, parallaxes, and photometry from EDR3 to select different populations and components and to calculate the distances and velocities in the direction of the anticentre. We explore the disturbances of the current disc, the spatial and kinematical distributions of early accreted versus in-situ stars, the structures in the outer parts of the disc, and the orbits of open clusters Berkeley 29 and Saurer 1. We find that: i) the dynamics of the Galactic disc are very complex with vertical asymmetries, and new correlations, including a bimodality with disc stars with large angular momentum moving vertically upwards from below the plane, and disc stars with slightly lower angular momentum moving preferentially downwards; ii) we resolve the kinematic substructure (diagonal ridges) in the outer parts of the disc for the first time; iii) the red sequence that has been associated with the proto-Galactic disc that was present at the time of the merger with Gaia-Enceladus-Sausage is currently radially concentrated up to around 14 kpc, while the blue sequence that has been associated with debris of the satellite extends beyond that; iv) there are density structures in the outer disc, both above and below the plane, most probably related to Monoceros, the Anticentre Stream, and TriAnd, for which the Gaia data allow an exhaustive selection of candidate member stars and dynamical study; and v) the open clusters Berkeley~29 and Saurer~1, despite being located at large distances from the Galactic centre, are on nearly circular disc-like orbits. We demonstrate how, once again, the Gaia are crucial for our understanding of the different pieces of our Galaxy and their connection to its global structure and history.
△ Less
Submitted 26 April, 2021; v1 submitted 14 January, 2021;
originally announced January 2021.
-
Gaia Early Data Release 3: The Gaia Catalogue of Nearby Stars
Authors:
Gaia Collaboration,
R. L. Smart,
L. M. Sarro,
J. Rybizki,
C. Reylé,
A. C. Robin,
N. C. Hambly,
U. Abbas,
M. A. Barstow,
J. H. J. de Bruijne,
B. Bucciarelli,
J. M. Carrasco,
W. J. Cooper,
S. T. Hodgkin,
E. Masana,
D. Michalik,
J. Sahlmann,
A. Sozzetti,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
D. W. Evans
, et al. (398 additional authors not shown)
Abstract:
We produce a clean and well-characterised catalogue of objects within 100\,pc of the Sun from the \G\ Early Data Release 3. We characterise the catalogue through comparisons to the full data release, external catalogues, and simulations. We carry out a first analysis of the science that is possible with this sample to demonstrate its potential and best practices for its use.
The selection of obj…
▽ More
We produce a clean and well-characterised catalogue of objects within 100\,pc of the Sun from the \G\ Early Data Release 3. We characterise the catalogue through comparisons to the full data release, external catalogues, and simulations. We carry out a first analysis of the science that is possible with this sample to demonstrate its potential and best practices for its use.
The selection of objects within 100\,pc from the full catalogue used selected training sets, machine-learning procedures, astrometric quantities, and solution quality indicators to determine a probability that the astrometric solution is reliable. The training set construction exploited the astrometric data, quality flags, and external photometry. For all candidates we calculated distance posterior probability densities using Bayesian procedures and mock catalogues to define priors. Any object with reliable astrometry and a non-zero probability of being within 100\,pc is included in the catalogue.
We have produced a catalogue of \NFINAL\ objects that we estimate contains at least 92\% of stars of stellar type M9 within 100\,pc of the Sun. We estimate that 9\% of the stars in this catalogue probably lie outside 100\,pc, but when the distance probability function is used, a correct treatment of this contamination is possible. We produced luminosity functions with a high signal-to-noise ratio for the main-sequence stars, giants, and white dwarfs. We examined in detail the Hyades cluster, the white dwarf population, and wide-binary systems and produced candidate lists for all three samples. We detected local manifestations of several streams, superclusters, and halo objects, in which we identified 12 members of \G\ Enceladus. We present the first direct parallaxes of five objects in multiple systems within 10\,pc of the Sun.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
Gaia Early Data Release 3: Acceleration of the solar system from Gaia astrometry
Authors:
Gaia Collaboration,
S. A. Klioner,
F. Mignard,
L. Lindegren,
U. Bastian,
P. J. McMillan,
J. Hernández,
D. Hobbs,
M. Ramos-Lerate,
M. Biermann,
A. Bombrun,
A. de Torres,
E. Gerlach,
R. Geyer,
T. Hilger,
U. Lammers,
H. Steidelmüller,
C. A. Stephenson,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
C. Babusiaux,
O. L. Creevey,
D. W. Evans
, et al. (392 additional authors not shown)
Abstract:
Context. Gaia Early Data Release 3 (Gaia EDR3) provides accurate astrometry for about 1.6 million compact (QSO-like) extragalactic sources, 1.2 million of which have the best-quality five-parameter astrometric solutions.
Aims. The proper motions of QSO-like sources are used to reveal a systematic pattern due to the acceleration of the solar system barycentre with respect to the rest frame of the…
▽ More
Context. Gaia Early Data Release 3 (Gaia EDR3) provides accurate astrometry for about 1.6 million compact (QSO-like) extragalactic sources, 1.2 million of which have the best-quality five-parameter astrometric solutions.
Aims. The proper motions of QSO-like sources are used to reveal a systematic pattern due to the acceleration of the solar system barycentre with respect to the rest frame of the Universe. Apart from being an important scientific result by itself, the acceleration measured in this way is a good quality indicator of the Gaia astrometric solution. Methods. The effect of the acceleration is obtained as a part of the general expansion of the vector field of proper motions in Vector Spherical Harmonics (VSH). Various versions of the VSH fit and various subsets of the sources are tried and compared to get the most consistent result and a realistic estimate of its uncertainty. Additional tests with the Gaia astrometric solution are used to get a better idea on possible systematic errors in the estimate.
Results. Our best estimate of the acceleration based on Gaia EDR3 is $(2.32 \pm 0.16) \times 10^{-10}$ m s${}^{-2}$ (or $7.33 \pm 0.51$ km s$^{-1}$ Myr${}^{-1}$) towards $α= 269.1^\circ \pm 5.4^\circ$, $δ= -31.6^\circ \pm 4.1^\circ$, corresponding to a proper motion amplitude of $5.05 \pm 0.35$ $μ$as yr${}^{-1}$. This is in good agreement with the acceleration expected from current models of the Galactic gravitational potential. We expect that future Gaia data releases will provide estimates of the acceleration with uncertainties substantially below 0.1 $μ$as yr${}^{-1}$.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
Gaia Early Data Release 3: Structure and properties of the Magellanic Clouds
Authors:
Gaia Collaboration,
X. Luri,
L. Chemin,
G. Clementini,
H. E. Delgado,
P. J. McMillan,
M. Romero-Gómez,
E. Balbinot,
A. Castro-Ginard,
R. Mor,
V. Ripepi,
L. M. Sarro,
M. -R. L. Cioni,
C. Fabricius,
A. Garofalo,
A. Helmi,
T. Muraveva,
A. G. A. Brown,
A. Vallenari,
T. Prusti,
J. H. J. de,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
D. W. Evans
, et al. (395 additional authors not shown)
Abstract:
We compare the Gaia DR2 and Gaia EDR3 performances in the study of the Magellanic Clouds and show the clear improvements in precision and accuracy in the new release. We also show that the systematics still present in the data make the determination of the 3D geometry of the LMC a difficult endeavour; this is at the very limit of the usefulness of the Gaia EDR3 astrometry, but it may become feasib…
▽ More
We compare the Gaia DR2 and Gaia EDR3 performances in the study of the Magellanic Clouds and show the clear improvements in precision and accuracy in the new release. We also show that the systematics still present in the data make the determination of the 3D geometry of the LMC a difficult endeavour; this is at the very limit of the usefulness of the Gaia EDR3 astrometry, but it may become feasible with the use of additional external data.
We derive radial and tangential velocity maps and global profiles for the LMC for the several subsamples we defined. To our knowledge, this is the first time that the two planar components of the ordered and random motions are derived for multiple stellar evolutionary phases in a galactic disc outside the Milky Way, showing the differences between younger and older phases. We also analyse the spatial structure and motions in the central region, the bar, and the disc, providing new insights into features and kinematics.
Finally, we show that the Gaia EDR3 data allows clearly resolving the Magellanic Bridge, and we trace the density and velocity flow of the stars from the SMC towards the LMC not only globally, but also separately for young and evolved populations. This allows us to confirm an evolved population in the Bridge that is slightly shift from the younger population. Additionally, we were able to study the outskirts of both Magellanic Clouds, in which we detected some well-known features and indications of new ones.
△ Less
Submitted 4 January, 2021; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Gaia Early Data Release 3: Summary of the contents and survey properties
Authors:
Gaia Collaboration,
A. G. A Brown,
A. Vallenari,
T. Prusti,
J. H. J. de Bruijne,
C. Babusiaux,
M. Biermann,
O. L. Creevey,
D. W. Evans,
L. Eyer,
A. Hutton,
F. Jansen,
C. Jordi,
S. A. Klioner,
U. Lammers,
L. Lindegren,
X. Luri,
F. Mignard,
C. Panem,
D. Pourbaix,
S. Randich,
P. Sartoretti,
C. Soubiran,
N. A. Walton,
F. Arenou
, et al. (401 additional authors not shown)
Abstract:
We present the early installment of the third Gaia data release, Gaia EDR3, consisting of astrometry and photometry for 1.8 billion sources brighter than magnitude 21, complemented with the list of radial velocities from Gaia DR2. Gaia EDR3 contains celestial positions and the apparent brightness in G for approximately 1.8 billion sources. For 1.5 billion of those sources, parallaxes, proper motio…
▽ More
We present the early installment of the third Gaia data release, Gaia EDR3, consisting of astrometry and photometry for 1.8 billion sources brighter than magnitude 21, complemented with the list of radial velocities from Gaia DR2. Gaia EDR3 contains celestial positions and the apparent brightness in G for approximately 1.8 billion sources. For 1.5 billion of those sources, parallaxes, proper motions, and the (G_BP-G_RP) colour are also available. The passbands for G, G_BP, and G_RP are provided as part of the release. For ease of use, the 7 million radial velocities from Gaia DR2 are included in this release, after the removal of a small number of spurious values. New radial velocities will appear as part of Gaia DR3. Finally, Gaia EDR3 represents an updated materialisation of the celestial reference frame (CRF) in the optical, the Gaia-CRF3, which is based solely on extragalactic sources. The creation of the source list for Gaia EDR3 includes enhancements that make it more robust with respect to high proper motion stars, and the disturbing effects of spurious and partially resolved sources. The source list is largely the same as that for Gaia DR2, but it does feature new sources and there are some notable changes. The source list will not change for Gaia DR3. Gaia EDR3 represents a significant advance over Gaia DR2, with parallax precisions increased by 30 percent, proper motion precisions increased by a factor of 2, and the systematic errors in the astrometry suppressed by 30--40 percent for the parallaxes and by a factor ~2.5 for the proper motions. The photometry also features increased precision, but above all much better homogeneity across colour, magnitude, and celestial position. A single passband for G, G_BP, and G_RP is valid over the entire magnitude and colour range, with no systematics above the 1 percent level.
△ Less
Submitted 9 June, 2021; v1 submitted 2 December, 2020;
originally announced December 2020.
-
Sensitivity of the Cherenkov Telescope Array for probing cosmology and fundamental physics with gamma-ray propagation
Authors:
The Cherenkov Telescope Array Consortium,
:,
H. Abdalla,
H. Abe,
F. Acero,
A. Acharyya,
R. Adam,
I. Agudo,
A. Aguirre-Santaella,
R. Alfaro,
J. Alfaro,
C. Alispach,
R. Aloisio,
R. Alves B,
L. Amati,
E. Amato,
G. Ambrosi,
E. O. Angüner,
A. Araudo,
T. Armstrong,
F. Arqueros,
L. Arrabito,
K. Asano,
Y. Ascasíbar,
M. Ashley
, et al. (474 additional authors not shown)
Abstract:
The Cherenkov Telescope Array (CTA), the new-generation ground-based observatory for $γ$-ray astronomy, provides unique capabilities to address significant open questions in astrophysics, cosmology, and fundamental physics. We study some of the salient areas of $γ$-ray cosmology that can be explored as part of the Key Science Projects of CTA, through simulated observations of active galactic nucle…
▽ More
The Cherenkov Telescope Array (CTA), the new-generation ground-based observatory for $γ$-ray astronomy, provides unique capabilities to address significant open questions in astrophysics, cosmology, and fundamental physics. We study some of the salient areas of $γ$-ray cosmology that can be explored as part of the Key Science Projects of CTA, through simulated observations of active galactic nuclei (AGN) and of their relativistic jets. Observations of AGN with CTA will enable a measurement of $γ$-ray absorption on the extragalactic background light with a statistical uncertainty below 15% up to a redshift $z=2$ and to constrain or detect $γ$-ray halos up to intergalactic-magnetic-field strengths of at least 0.3pG. Extragalactic observations with CTA also show promising potential to probe physics beyond the Standard Model. The best limits on Lorentz invariance violation from $γ$-ray astronomy will be improved by a factor of at least two to three. CTA will also probe the parameter space in which axion-like particles could constitute a significant fraction, if not all, of dark matter. We conclude on the synergies between CTA and other upcoming facilities that will foster the growth of $γ$-ray cosmology.
△ Less
Submitted 26 February, 2021; v1 submitted 3 October, 2020;
originally announced October 2020.
-
Effective Sample Size, Dimensionality, and Generalization in Covariate Shift Adaptation
Authors:
Felipe Maia Polo,
Renato Vicente
Abstract:
In supervised learning, training and test datasets are often sampled from distinct distributions. Domain adaptation techniques are thus required. Covariate shift adaptation yields good generalization performance when domains differ only by the marginal distribution of features. Covariate shift adaptation is usually implemented using importance weighting, which may fail, according to common wisdom,…
▽ More
In supervised learning, training and test datasets are often sampled from distinct distributions. Domain adaptation techniques are thus required. Covariate shift adaptation yields good generalization performance when domains differ only by the marginal distribution of features. Covariate shift adaptation is usually implemented using importance weighting, which may fail, according to common wisdom, due to small effective sample sizes (ESS). Previous research argues this scenario is more common in high-dimensional settings. However, how effective sample size, dimensionality, and model performance/generalization are formally related in supervised learning, considering the context of covariate shift adaptation, is still somewhat obscure in the literature. Thus, a main challenge is presenting a unified theory connecting those points. Hence, in this paper, we focus on building a unified view connecting the ESS, data dimensionality, and generalization in the context of covariate shift adaptation. Moreover, we also demonstrate how dimensionality reduction or feature selection can increase the ESS, and argue that our results support dimensionality reduction before covariate shift adaptation as a good practice.
△ Less
Submitted 8 January, 2022; v1 submitted 2 October, 2020;
originally announced October 2020.
-
Predicting Legal Proceedings Status: Approaches Based on Sequential Text Data
Authors:
Felipe Maia Polo,
Itamar Ciochetti,
Emerson Bertolo
Abstract:
The objective of this paper is to develop predictive models to classify Brazilian legal proceedings in three possible classes of status: (i) archived proceedings, (ii) active proceedings, and (iii) suspended proceedings. This problem's resolution is intended to assist public and private institutions in managing large portfolios of legal proceedings, providing gains in scale and efficiency. In this…
▽ More
The objective of this paper is to develop predictive models to classify Brazilian legal proceedings in three possible classes of status: (i) archived proceedings, (ii) active proceedings, and (iii) suspended proceedings. This problem's resolution is intended to assist public and private institutions in managing large portfolios of legal proceedings, providing gains in scale and efficiency. In this paper, legal proceedings are made up of sequences of short texts called "motions." We combined several natural language processing (NLP) and machine learning techniques to solve the problem. Although working with Portuguese NLP, which can be challenging due to lack of resources, our approaches performed remarkably well in the classification task, achieving maximum accuracy of .93 and top average F1 Scores of .89 (macro) and .93 (weighted). Furthermore, we could extract and interpret the patterns learned by one of our models besides quantifying how those patterns relate to the classification task. The interpretability step is important among machine learning legal applications and gives us an exciting insight into how black-box models make decisions.
△ Less
Submitted 22 June, 2021; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Skills to not fall behind in school
Authors:
Felipe Maia Polo
Abstract:
Many recent studies emphasize how important the role of cognitive and social-emotional skills can be in determining people's quality of life. Although skills are of great importance in many aspects, in this paper we will focus our efforts to better understand the relationship between several types of skills with academic progress delay. Our dataset contains the same students in 2012 and 2017, and…
▽ More
Many recent studies emphasize how important the role of cognitive and social-emotional skills can be in determining people's quality of life. Although skills are of great importance in many aspects, in this paper we will focus our efforts to better understand the relationship between several types of skills with academic progress delay. Our dataset contains the same students in 2012 and 2017, and we consider that there was a academic progress delay for a specific student if he or she progressed less than expected in school grades. Our methodology primarily includes the use of a Bayesian logistic regression model and our results suggest that both cognitive and social-emotional skills may impact the conditional probability of falling behind in school, and the magnitude of the impact between the two types of skills can be comparable.
△ Less
Submitted 28 January, 2020;
originally announced January 2020.
-
Optical intensity interferometry observations using the MAGIC imaging atmospheric Cherenkov telescopes
Authors:
V. A. Acciari,
M. I. Bernardos,
E. Colombo,
J. L. Contreras,
J. Cortina,
C. Delgado,
C. Diaz,
D. Fink,
M. Mariotti,
S. Mangano,
R. Mirzoyan,
M. Polo,
T. Schweizer,
M. Will
Abstract:
Imaging Atmospheric Cherenkov Telescopes (IACTs) currently in operation feature large mirrors and order of 1 ns time response to signals of a few photo-electrons produced by optical photons. This means that they are ideally suited for optical interferometry observations. Thanks to their sensitivity to visible wavelengths and long baselines optical intensity interferometry with IACTs allows reachin…
▽ More
Imaging Atmospheric Cherenkov Telescopes (IACTs) currently in operation feature large mirrors and order of 1 ns time response to signals of a few photo-electrons produced by optical photons. This means that they are ideally suited for optical interferometry observations. Thanks to their sensitivity to visible wavelengths and long baselines optical intensity interferometry with IACTs allows reaching angular resolutions of tens to microarcsec. We have installed a simple optical setup on top of the cameras of the two 17 m diameter MAGIC IACTs and observed coherent fluctuations in the photon intensity measured at the two telescopes for three different stars. The sensitivity is roughly 10 times better than that achieved in the 1970s with the Narrabri interferometer.
△ Less
Submitted 14 November, 2019;
originally announced November 2019.
-
Study of uncertainty and repeatability in structured-light 3D scanners
Authors:
María-Eugenia Polo,
Aurora Cuartero,
Ángel M. Felicísimo
Abstract:
Structured-light 3D scanners create 3D models with high accuracy, but controlling the accuracy and repeatability of the scanner is essential. The objective of this paper is to analyze the repeatability and accuracy of two structured-light 3D scanners (Go!SCAN 20TM and Go!SCAN 50TM). The method used scans steel gauge blocks several times with different resolutions to analyze the scanned data and to…
▽ More
Structured-light 3D scanners create 3D models with high accuracy, but controlling the accuracy and repeatability of the scanner is essential. The objective of this paper is to analyze the repeatability and accuracy of two structured-light 3D scanners (Go!SCAN 20TM and Go!SCAN 50TM). The method used scans steel gauge blocks several times with different resolutions to analyze the scanned data and to test the correlation between uncertainty, resolution, and scanner model. The primary results include: 1) a systematic error of magnitude similar to nominal accuracy exists and must be corrected and 2) the global uncertainty is approximately 0.05 mm without significant differences between the two scanner models. A strategy for scanning is proposed based on the results at different resolutions.
△ Less
Submitted 29 October, 2019;
originally announced October 2019.
-
Monte Carlo studies for the optimisation of the Cherenkov Telescope Array layout
Authors:
A. Acharyya,
I. Agudo,
E. O. Angüner,
R. Alfaro,
J. Alfaro,
C. Alispach,
R. Aloisio,
R. Alves Batista,
J. -P. Amans,
L. Amati,
E. Amato,
G. Ambrosi,
L. A. Antonelli,
C. Aramo,
T. Armstrong,
F. Arqueros,
L. Arrabito,
K. Asano,
H. Ashkar,
C. Balazs,
M. Balbo,
B. Balmaverde,
P. Barai,
A. Barbano,
M. Barkov
, et al. (445 additional authors not shown)
Abstract:
The Cherenkov Telescope Array (CTA) is the major next-generation observatory for ground-based very-high-energy gamma-ray astronomy. It will improve the sensitivity of current ground-based instruments by a factor of five to twenty, depending on the energy, greatly improving both their angular and energy resolutions over four decades in energy (from 20 GeV to 300 TeV). This achievement will be possi…
▽ More
The Cherenkov Telescope Array (CTA) is the major next-generation observatory for ground-based very-high-energy gamma-ray astronomy. It will improve the sensitivity of current ground-based instruments by a factor of five to twenty, depending on the energy, greatly improving both their angular and energy resolutions over four decades in energy (from 20 GeV to 300 TeV). This achievement will be possible by using tens of imaging Cherenkov telescopes of three successive sizes. They will be arranged into two arrays, one per hemisphere, located on the La Palma island (Spain) and in Paranal (Chile). We present here the optimised and final telescope arrays for both CTA sites, as well as their foreseen performance, resulting from the analysis of three different large-scale Monte Carlo productions.
△ Less
Submitted 2 April, 2019;
originally announced April 2019.
-
Science with the Cherenkov Telescope Array
Authors:
The Cherenkov Telescope Array Consortium,
:,
B. S. Acharya,
I. Agudo,
I. Al Samarai,
R. Alfaro,
J. Alfaro,
C. Alispach,
R. Alves Batista,
J. -P. Amans,
E. Amato,
G. Ambrosi,
E. Antolini,
L. A. Antonelli,
C. Aramo,
M. Araya,
T. Armstrong,
F. Arqueros,
L. Arrabito,
K. Asano,
M. Ashley,
M. Backes,
C. Balazs,
M. Balbo,
O. Ballester
, et al. (558 additional authors not shown)
Abstract:
The Cherenkov Telescope Array, CTA, will be the major global observatory for very high energy gamma-ray astronomy over the next decade and beyond. The scientific potential of CTA is extremely broad: from understanding the role of relativistic cosmic particles to the search for dark matter. CTA is an explorer of the extreme universe, probing environments from the immediate neighbourhood of black ho…
▽ More
The Cherenkov Telescope Array, CTA, will be the major global observatory for very high energy gamma-ray astronomy over the next decade and beyond. The scientific potential of CTA is extremely broad: from understanding the role of relativistic cosmic particles to the search for dark matter. CTA is an explorer of the extreme universe, probing environments from the immediate neighbourhood of black holes to cosmic voids on the largest scales. Covering a huge range in photon energy from 20 GeV to 300 TeV, CTA will improve on all aspects of performance with respect to current instruments.
The observatory will operate arrays on sites in both hemispheres to provide full sky coverage and will hence maximize the potential for the rarest phenomena such as very nearby supernovae, gamma-ray bursts or gravitational wave transients. With 99 telescopes on the southern site and 19 telescopes on the northern site, flexible operation will be possible, with sub-arrays available for specific tasks. CTA will have important synergies with many of the new generation of major astronomical and astroparticle observatories. Multi-wavelength and multi-messenger approaches combining CTA data with those from other instruments will lead to a deeper understanding of the broad-band non-thermal properties of target sources.
The CTA Observatory will be operated as an open, proposal-driven observatory, with all data available on a public archive after a pre-defined proprietary period. Scientists from institutions worldwide have combined together to form the CTA Consortium. This Consortium has prepared a proposal for a Core Programme of highly motivated observations. The programme, encompassing approximately 40% of the available observing time over the first ten years of CTA operation, is made up of individual Key Science Projects (KSPs), which are presented in this document.
△ Less
Submitted 21 January, 2018; v1 submitted 22 September, 2017;
originally announced September 2017.
-
Random lasing in an organic light-emitting crystal and its interplay with vertical cavity feedback
Authors:
Andrea Camposeo,
Marco Polo,
Pompilio Del Carro,
Leonardo Silvestri,
Silvia Tavazzi,
Dario Pisignano
Abstract:
The simultaneous vertical-cavity and random lasing emission properties of a blue-emitting molecular crystal are investigated. The 1,1,4,4-tetraphenyl-1,3-butadiene samples, grown by physical vapour transport, feature room-temperature stimulated emission peaked at about 430 nm. Fabry-Pérot and random resonances are primed by the interfaces of the crystal with external media and by defect scatterers…
▽ More
The simultaneous vertical-cavity and random lasing emission properties of a blue-emitting molecular crystal are investigated. The 1,1,4,4-tetraphenyl-1,3-butadiene samples, grown by physical vapour transport, feature room-temperature stimulated emission peaked at about 430 nm. Fabry-Pérot and random resonances are primed by the interfaces of the crystal with external media and by defect scatterers, respectively. The analysis of the resulting lasing spectra evidences the existence of narrow peaks due to both the built-in vertical Fabry-Pérot cavity and random lasing in a novel, surface-emitting configuration and threshold around 500 microJ cm^-2. The anti-correlation between different modes is also highlighted, due to competition for gain. Molecular crystals with optical gain candidate as promising photonic media inherently supporting multiple lasing mechanisms.
△ Less
Submitted 22 September, 2014; v1 submitted 19 May, 2014;
originally announced May 2014.
-
Polarized superradiance from delocalized exciton transitions in tetracene single crystals
Authors:
Andrea Camposeo,
Marco Polo,
Silvia Tavazzi,
Leonardo Silvestri,
Peter Spearman,
Roberto Cingolani,
Dario Pisignano
Abstract:
Polarized superradiant emission and exciton delocalization in tetracene single crystals are reported. Polarization-, time-, and temperature-resolved spectroscopy evidence the complete polarization of the zero-phonon line of the intrinsic tetracene emission from both the lower (F state) and the upper (thermally activated) Davydov excitons. The superradiance of the F emission is substantiated by a n…
▽ More
Polarized superradiant emission and exciton delocalization in tetracene single crystals are reported. Polarization-, time-, and temperature-resolved spectroscopy evidence the complete polarization of the zero-phonon line of the intrinsic tetracene emission from both the lower (F state) and the upper (thermally activated) Davydov excitons. The superradiance of the F emission is substantiated by a nearly linear decrease of the radiative lifetime with temperature, being fifteen times shorter at 30 K compared to the isolated molecule, with an exciton delocalization of about 40 molecules.
△ Less
Submitted 30 April, 2013;
originally announced April 2013.