-
Euclid preparation. XXXI. The effect of the variations in photometric passbands on photometric-redshift accuracy
Authors:
Euclid Collaboration,
Stéphane Paltani,
J. Coupon,
W. G. Hartley,
A. Alvarez-Ayllon,
F. Dubath,
J. J. Mohr,
M. Schirmer,
J. -C. Cuillandre,
G. Desprez,
O. Ilbert,
K. Kuijken,
N. Aghanim,
B. Altieri,
A. Amara,
N. Auricchio,
M. Baldi,
R. Bender,
C. Bodendorf,
D. Bonino,
E. Branchini,
M. Brescia,
J. Brinchmann,
S. Camera,
V. Capobianco
, et al. (192 additional authors not shown)
Abstract:
The technique of photometric redshifts has become essential for the exploitation of multi-band extragalactic surveys. While the requirements on photo-zs for the study of galaxy evolution mostly pertain to the precision and to the fraction of outliers, the most stringent requirement in their use in cosmology is on the accuracy, with a level of bias at the sub-percent level for the Euclid cosmology…
▽ More
The technique of photometric redshifts has become essential for the exploitation of multi-band extragalactic surveys. While the requirements on photo-zs for the study of galaxy evolution mostly pertain to the precision and to the fraction of outliers, the most stringent requirement in their use in cosmology is on the accuracy, with a level of bias at the sub-percent level for the Euclid cosmology mission. A separate, and challenging, calibration process is needed to control the bias at this level of accuracy. The bias in photo-zs has several distinct origins that may not always be easily overcome. We identify here one source of bias linked to the spatial or time variability of the passbands used to determine the photometric colours of galaxies. We first quantified the effect as observed on several well-known photometric cameras, and found in particular that, due to the properties of optical filters, the redshifts of off-axis sources are usually overestimated. We show using simple simulations that the detailed and complex changes in the shape can be mostly ignored and that it is sufficient to know the mean wavelength of the passbands of each photometric observation to correct almost exactly for this bias; the key point is that this mean wavelength is independent of the spectral energy distribution of the source}. We use this property to propose a correction that can be computationally efficiently implemented in some photo-z algorithms, in particular template-fitting. We verified that our algorithm, implemented in the new photo-z code Phosphoros, can effectively reduce the bias in photo-zs on real data using the CFHTLS T007 survey, with an average measured bias Delta z over the redshift range 0.4<z<0.7 decreasing by about 0.02, specifically from Delta z~0.04 to Delta z~0.02 around z=0.5. Our algorithm is also able to produce corrected photometry for other applications.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Two-sample test based on Self-Organizing Maps
Authors:
Alejandro Álvarez-Ayllón,
Manuel Palomo-Duarte,
Juan-Manuel Dodero
Abstract:
Machine-learning classifiers can be leveraged as a two-sample statistical test. Suppose each sample is assigned a different label and that a classifier can obtain a better-than-chance result discriminating them. In this case, we can infer that both samples originate from different populations. However, many types of models, such as neural networks, behave as a black-box for the user: they can reje…
▽ More
Machine-learning classifiers can be leveraged as a two-sample statistical test. Suppose each sample is assigned a different label and that a classifier can obtain a better-than-chance result discriminating them. In this case, we can infer that both samples originate from different populations. However, many types of models, such as neural networks, behave as a black-box for the user: they can reject that both samples originate from the same population, but they do not offer insight into how both samples differ. Self-Organizing Maps are a dimensionality reduction initially devised as a data visualization tool that displays emergent properties, being also useful for classification tasks. Since they can be used as classifiers, they can be used also as a two-sample statistical test. But since their original purpose is visualization, they can also offer insights.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
Using the SourceXtractor++ package for data reduction
Authors:
M. Kümmel,
A. Álvarez-Ayllón,
E. Bertin,
P. Dubath,
R. Gavazzi,
W. Hartley,
M. Schefer
Abstract:
The Euclid satellite is an ESA mission scheduled for launch in September 2023. To optimally perform critical stages of the data reduction, such as object detection and morphology determination, a new and modern software package was required. We have developed SourceXtractor++ as open source software for detecting and measuring sources in astronomical images. It is a complete redesign of the origin…
▽ More
The Euclid satellite is an ESA mission scheduled for launch in September 2023. To optimally perform critical stages of the data reduction, such as object detection and morphology determination, a new and modern software package was required. We have developed SourceXtractor++ as open source software for detecting and measuring sources in astronomical images. It is a complete redesign of the original SExtractor, written mainly in C++. The package follows a modular approach and facilitates the analysis of multiple overlap** sources over many images with different pixel grids. SourceXtractor++ is already operational in many areas of the Euclid processing, and we demonstrate here the capabilities of the current version v0.19 on the basis of a set of typical use cases, which are available for download
△ Less
Submitted 5 December, 2022;
originally announced December 2022.
-
Inference of Common Multidimensional Equally-Distributed Attributes
Authors:
Alejandro Alvarez-Ayllon,
Manuel Palomo-Duarte,
Juan-Manuel Dodero
Abstract:
Given two relations containing multiple measurements - possibly with uncertainties - our objective is to find which sets of attributes from the first have a corresponding set on the second, using exclusively a sample of the data. This approach could be used even when the associated metadata is damaged, missing or incomplete, or when the volume is too big for exact methods. This problem is similar…
▽ More
Given two relations containing multiple measurements - possibly with uncertainties - our objective is to find which sets of attributes from the first have a corresponding set on the second, using exclusively a sample of the data. This approach could be used even when the associated metadata is damaged, missing or incomplete, or when the volume is too big for exact methods. This problem is similar to the search of Inclusion Dependencies (IND), a type of rule over two relations asserting that for a set of attributes X from the first, every combination of values appears on a set Y from the second. Existing IND can be found exploiting the existence of a partial order relation called specialization. However, this relation is based on set theory, requiring the values to be directly comparable. Statistical tests are an intuitive possible replacement, but it has not been studied how would they affect the underlying assumptions. In this paper we formally review the effect that a statistical approach has over the inference rules applied to IND discovery. Our results confirm the intuitive thought that statistical tests can be used, but not in a directly equivalent manner. We provide a workable alternative based on a "hierarchy of null hypotheses", allowing for the automatic discovery of multi-dimensional equally distributed sets of attributes.
△ Less
Submitted 19 July, 2022; v1 submitted 20 April, 2021;
originally announced April 2021.
-
Euclid preparation: X. The Euclid photometric-redshift challenge
Authors:
Euclid Collaboration,
G. Desprez,
S. Paltani,
J. Coupon,
I. Almosallam,
A. Alvarez-Ayllon,
V. Amaro,
M. Brescia,
M. Brodwin,
S. Cavuoti,
J. De Vicente-Albendea,
S. Fotopoulou,
P. W. Hatfield,
W. G. Hartley,
O. Ilbert,
M. J. Jarvis,
G. Longo,
R. Saha,
J. S. Speagle,
A. Tramacere,
M. Castellano,
F. Dubath,
A. Galametz,
M. Kuemmel,
C. Laigle
, et al. (148 additional authors not shown)
Abstract:
Forthcoming large photometric surveys for cosmology require precise and accurate photometric redshift (photo-z) measurements for the success of their main science objectives. However, to date, no method has been able to produce photo-$z$s at the required accuracy using only the broad-band photometry that those surveys will provide. An assessment of the strengths and weaknesses of current methods i…
▽ More
Forthcoming large photometric surveys for cosmology require precise and accurate photometric redshift (photo-z) measurements for the success of their main science objectives. However, to date, no method has been able to produce photo-$z$s at the required accuracy using only the broad-band photometry that those surveys will provide. An assessment of the strengths and weaknesses of current methods is a crucial step in the eventual development of an approach to meet this challenge. We report on the performance of 13 photometric redshift code single value redshift estimates and redshift probability distributions (PDZs) on a common set of data, focusing particularly on the 0.2--2.6 redshift range that the Euclid mission will probe. We design a challenge using emulated Euclid data drawn from three photometric surveys of the COSMOS field. The data are divided into two samples: one calibration sample for which photometry and redshifts are provided to the participants; and the validation sample, containing only the photometry, to ensure a blinded test of the methods. Participants were invited to provide a redshift single value estimate and a PDZ for each source in the validation sample, along with a rejection flag that indicates sources they consider unfit for use in cosmological analyses. The performance of each method is assessed through a set of informative metrics, using cross-matched spectroscopic and highly-accurate photometric redshifts as the ground truth. We show that the rejection criteria set by participants are efficient in removing strong outliers, sources for which the photo-z deviates by more than 0.15(1+z) from the spectroscopic-redshift (spec-z). We also show that, while all methods are able to provide reliable single value estimates, several machine-learning methods do not manage to produce useful PDZs. [abridged]
△ Less
Submitted 18 November, 2020; v1 submitted 25 September, 2020;
originally announced September 2020.