Optical and near-infrared spectroscopy of quasars at $z > 6.5$: public data release and composite spectrum

Silvia Onorato1, Joseph F. Hennawi1,2, Jan-Torge Schindler3, Jinyi Yang4, Feige Wang4, Aaron J. Barth5, Eduardo Bañados6, Anna-Christina Eilers7, Sarah E. I. Bosman6,8, Frederick B. Davies6, Bram P. Venemans1, Chiara Mazzucchelli9, Silvia Belladitta6,10, Fabio Vito10, Emanuele Paolo Farina11, Irham T. Andika12,13, Xiaohui Fan4, Fabian Walter6, Roberto Decarli10, Masafusa Onoue14,15,16, and Riccardo Nanni1
1Leiden Observatory, Leiden University, P.O. Box 9513, 2300 RA Leiden, The Netherlands
2Department of Physics, Broida Hall, University of California, Santa Barbara, Santa Barbara, CA 93106-9530, USA
3Hamburger Sternwarte, Universität Hamburg, Gojenbergsweg 112, D-21029 Hamburg, Germany
4Steward Observatory, University of Arizona, 933 N. Cherry Ave, Tucson, AZ 85721, USA
5Department of Physics and Astronomy, 4129 Frederick Reines Hall, University of California, Irvine, CA 92697-4575, USA
6Max Planck Institut für Astronomie, Königstuhl 17, D-69117, Heidelberg, Germany
7MIT Kavli Institute for Astrophysics and Space Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
8Institute for Theoretical Physics, Heidelberg University, Philosophenweg 12, D-69120, Heidelberg, Germany
9Instituto de Estudios Astrofísicos, Facultad de Ingeniería y Ciencias, Universidad Diego Portales, Avenida Ejercito Libertador 441, Santiago, Chile
10INAF – Osservatorio di Astrofisica e Scienza dello Spazio di Bologna, Via Gobetti 93/3, I-40129 Bologna, Italy
11Gemini Observatory, NSF’s NOIRLab, 670 N A’ohoku Place, Hilo, Hawai’i 96720, USA
12Technical University of Munich, TUM School of Natural Sciences, Department of Physics, James-Franck-Str. 1, D-85748 Garching, Germany
13Max-Planck-Institut für Astrophysik, Karl-Schwarzschild-Str. 1, D-85748 Garching, Germany
14Kavli Institute for the Physics and Mathematics of the Universe (Kavli IPMU, WPI), The University of Tokyo Institutes for Advanced Study,
The University of Tokyo, Kashiwa, Chiba 277-8583, Japan
15Center for Data-Driven Discovery, Kavli IPMU (WPI), UTIAS, The University of Tokyo, Kashiwa, Chiba 277-8583, Japan
16Kavli Institute for Astronomy and Astrophysics, Peking University, Beijing 100871, P.R. China
E-mail: [email protected]
(Accepted XXX. Received YYY; in original form ZZZ)
Abstract

We present optical and near-infrared (NIR) spectroscopic observations for a sample of 45 quasars at $6.50 < z \leq 7.64$ with absolute magnitudes at 1450 Å in the range $-28.82 \leq M_{1450} \leq -24.13$, together with their composite spectrum. The median redshift and $M_{1450}$ of the quasars in the sample are $z_{\rm median} = 6.71$ and $M_{1450,\rm median} \simeq -26.1$, respectively. The NIR spectra are taken with echelle spectrographs, complemented with additional data from optical long slit instruments, and then reduced consistently using the open-source Python-based spectroscopic data reduction pipeline PypeIt. The median values of the mean signal-to-noise ratios of the spectra in the J, H, and K bands (median $\langle \rm SNR_{\lambda} \rangle$) are: median $\langle \rm SNR_{J} \rangle = 9.7$, median $\langle \rm SNR_{H} \rangle = 10.3$, and median $\langle \rm SNR_{K} \rangle = 11.7$, demonstrating the good data quality. This work presents the largest medium/moderate-resolution sample of quasars at $z > 6.5$ from ground-based instruments.
Its homogeneity and reproducibility make it ideally suited for several scientific goals, e.g., the study of the quasar proximity zones and damping wings, the Ly$\alpha$ forest, the intergalactic medium's metal content, as well as other properties such as the distribution of supermassive black hole (SMBH) masses and Eddington ratios. Our composite spectrum is compared to others at both high and low $z$ from the literature, showing differences in the strengths of many emission lines, probably due to differences in luminosity among the samples, but a consistent continuum slope, which indicates that the same spectral features are preserved in quasars at different redshift ranges.

keywords:
quasars: supermassive black holes – galaxies: active – cosmology: early Universe – methods: data analysis – techniques: spectroscopic
pubyear: 2024

1 Introduction

Discoveries of high-redshift quasars ($z > 6$) have uncovered the presence of supermassive black holes (SMBHs) with masses ranging from approximately $10^{8}$ to $10^{10}$ $M_{\odot}$ in the early stages of the universe (Wu et al. 2015; Bañados et al. 2018; Matsuoka et al. 2019a; Onoue et al. 2019; Shen et al. 2019; Yang et al. 2020; Wang et al. 2021a; see Fan et al. 2023 for a recent review). This motivates inquiry into the rapid growth mechanisms enabling these SMBHs to reach billions of solar masses within an extraordinarily brief time frame, less than one billion years after the Big Bang.

Various theoretical models proposing different seed black hole masses ($10^{2}-10^{6}$ $M_{\odot}$) and highly efficient accretion modes present potential explanations for the formation and growth of early SMBHs (see Woods et al. 2019 and Inayoshi et al. 2020 for recent reviews). Rigorous observations of an extensive sample of $z > 6.5$ quasars are imperative to test and refine these models, advancing our comprehension of SMBH formation and evolution. Recent advancements in deep imaging surveys, coupled with the enhanced capabilities of near-infrared (NIR) spectroscopy on large telescopes, have substantially expanded the sample size of quasars with $z > 6$ to more than 200 (e.g., Fan et al. 2023). This has extended the quasar frontier beyond $z = 7.5$, moving deep into the epoch of reionization (EoR; e.g., Mortlock et al. 2011; Jiang et al. 2016; Mazzucchelli et al. 2017; Bañados et al. 2016, 2018; Fan et al. 2019; Reed et al. 2019; Matsuoka et al. 2019a, b; Venemans et al. 2013, 2015; Wang et al. 2018, 2019; Yang et al. 2019, 2020, 2021; Wang et al. 2021a). Furthermore, recent observations from the James Webb Space Telescope (JWST) have already uncovered new high-$z$ active galactic nuclei (AGNs) with lower luminosity and black hole mass, promising to deliver additional breakthroughs in our understanding (e.g., Labbe et al. 2023; Maiolino et al. 2023a, b; Larson et al. 2023; Fujimoto et al. 2023; Goulding et al. 2023; Furtak et al. 2023; Kokorev et al. 2023; Greene et al. 2023; Kokorev et al. 2024; Pérez-González et al. 2024; Matthee et al. 2024; Andika et al. 2024).

Our collection of high-quality spectra is the most extensive medium/moderate-resolution sample of quasars at very high redshifts obtained with ground-based instruments. Its uniformity and reproducibility make it exceptionally well-suited for various scientific objectives, including the exploration of the Ly$\alpha$ forest (Fan et al. 2006; Becker et al. 2015), the study of the final stages of the EoR (Bosman et al. 2022), the investigation of the intergalactic medium's (IGM) metal content (e.g., Davies et al. 2023) and of the broad line region (BLR) chemical abundances (Lai et al., 2022), and the analysis of properties such as the distribution of SMBH masses and Eddington ratios (Yang et al. 2021; Farina et al. 2022; Mazzucchelli et al. 2023). In addition, a composite spectrum can be extremely helpful in evaluating the continuum and line properties of quasar spectra, and in investigating the average rest-frame UV quasar spectral properties and their possible evolution with redshift (Vanden Berk et al. 2001; Selsing et al. 2016; Mazzucchelli et al. 2017; Meyer et al. 2019; Schindler et al. 2020; Yang et al. 2021). A composite spectrum also allows for detecting features that might be too weak to appear in individual spectra, identifying objects that differ from the template, and modeling or fitting spectral energy distributions (SEDs).

Other key analyses involving high-quality spectra aim to place constraints on the average fraction of neutral hydrogen ($\langle x_{\rm HI} \rangle$) at the EoR and on the radiative efficiency of the earliest SMBHs. They can be performed by starting with the reconstruction of the quasar's intrinsic blue side of the spectrum from the observed red side employing, e.g., principal component analysis (PCA) continuum modeling (e.g., Davies et al. 2018a, b, 2019; Bosman et al. 2021) so that we can systematically study quasar proximity zones and lifetimes $t_{\rm Q}$ (see Fan et al. 2006; Eilers et al. 2017; Eilers et al. 2020; Davies et al. 2020; Satyavolu et al. 2023), proximate damped Ly$\alpha$ systems (see Bañados et al. 2019; Andika et al. 2020, 2022), and Ly$\alpha$ damping wings (Miralda-Escudé 1998; Davies et al. 2018a; Ďurovčíková et al. 2024; Greig et al. 2024).

This paper is the first of a series aimed at exploiting the wealth of information that can be obtained from this sample (e.g., the study of the quasar proximity zone sizes in Paper II, Onorato et al. in prep). Here, we present and make publicly available the optical and NIR spectra of 45 quasars at $6.50 < z \leq 7.64$, and the composite rest-frame UV/optical spectrum. We describe the dataset, including the quasar sample, the instruments and their properties, and the data reduction, in Section 2. Section 3 details the spectral calibration procedure and presents the main properties of the sample. We show a comparison between this sample and other spectroscopic data releases of high-$z$ quasars in Section 4. We discuss the mean quasar composite spectrum in Section 5, and we conclude with a summary of the paper in Section 6. All results below refer to a $\Lambda$CDM cosmology with parameters $\Omega_{\Lambda} = 0.7$, $\Omega_{m} = 0.3$, and $h = 0.7$; all magnitudes are reported in the AB system.
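To make the adopted cosmology concrete, the following minimal sketch evaluates the age of the universe at the redshifts spanned by the sample. It assumes the closed-form expression for a flat matter plus $\Lambda$ universe (radiation neglected); the function name and unit constants are illustrative and not part of the released data products.

```python
import math

# Cosmological parameters quoted in the text (flat LambdaCDM).
H0_KM_S_MPC = 70.0   # h = 0.7
OMEGA_M = 0.3
OMEGA_LAMBDA = 0.7

# Unit conversions used to express 1/H0 in Gyr.
KM_PER_MPC = 3.0856775814913673e19
SEC_PER_GYR = 3.15576e16
HUBBLE_TIME_GYR = KM_PER_MPC / H0_KM_S_MPC / SEC_PER_GYR


def cosmic_age_gyr(z):
    """Age (Gyr) of a flat matter + Lambda universe at redshift z.

    Uses t(z) = 2 / (3 H0 sqrt(Omega_L)) * asinh(sqrt(Omega_L / Omega_m) * (1+z)^-1.5),
    which neglects the radiation density (an assumption of this sketch).
    """
    x = math.sqrt(OMEGA_LAMBDA / OMEGA_M) * (1.0 + z) ** -1.5
    return HUBBLE_TIME_GYR * 2.0 / (3.0 * math.sqrt(OMEGA_LAMBDA)) * math.asinh(x)


if __name__ == "__main__":
    # Lower bound, median, and upper bound of the sample redshift range.
    for z in (6.50, 6.71, 7.64):
        print(f"z = {z:.2f}: age = {cosmic_age_gyr(z):.2f} Gyr")
```

Under these assumptions every quasar in the sample is observed less than one billion years after the Big Bang, consistent with the time frame quoted above.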

Table 1: Properties of the instruments used to collect the data. The black line separates the NIR from the VIS arm.
| Instrument | Dither sequence | $\Delta\lambda$ ($\mu$m) | Slit width ($''$) | $R$ | FWHM (km s$^{-1}$) | $dv_{\rm pix}$ (km s$^{-1}$) |
| --- | --- | --- | --- | --- | --- | --- |
| GNIRS | ABBA | $0.81-2.52$ | 0.68/1.00 | $\simeq 1200$/$\simeq 1000$ | $\simeq 280$ | $\simeq 86$ |
| NIRES | ABBA | $0.94-2.47$ | 0.55 | $\simeq 2700$ | $\simeq 111$ | $\simeq 38$ |
| X-Shooter NIR | varies | $1.02-2.48$ | 0.6 | $\simeq 8100$ | $\simeq 41$ | $\simeq 13$ |
| X-Shooter VIS | - | $0.55-1.02$ | 0.9 | $\simeq 8900$ | $\simeq 44$ | $\simeq 11$ |
| GMOS | - | $0.65-1.15$ | 1.00 | $\simeq 3200$ | $\simeq 94$ | $\simeq 51$ |
| DEIMOS | - | $0.65-1.00$ | 1.00 | $\simeq 7900$ | $\simeq 38$ | $\simeq 17$ |
| LRIS | - | $0.70-1.03$ | 1.00 | $\simeq 1700$ | $\simeq 184$ | $\simeq 56$ |
| MODS | - | $0.42-1.12$ | 1.00/1.2 | $\simeq 2700$/$\simeq 2300$ | $\simeq 111$ | $\simeq 33$ |

2 The Data Set

2.1 Quasar Sample

We start this analysis by compiling a list of quasars, both known from the literature and unpublished, and applying a redshift cut at $z > 6.5$. We check the individual telescope archives looking for medium/moderate-resolution spectroscopy of these sources from echelle spectrographs, and then also from long slit instruments if the spectrum was not sufficiently covered in the Ly$\alpha$ region. We avoid collecting data of faint objects ($M_{1450} \gtrsim -25$), although we do not apply a strict $M_{1450}$ cut as long as the faintness of the source does not strongly compromise the data reduction. Thus, our catalog includes 45 quasars in the redshift range $6.50 < z \leq 7.64$ ($z_{\rm median} = 6.71$) and in the magnitude range $-28.82 \leq M_{1450} \leq -24.13$ ($M_{1450,\rm median} \simeq -26.1$, see Figure 1 and Table 3). The full names of the quasars are shown in Table 3, while short names are adopted throughout the paper. Eleven out of the 45 objects are classified in the literature as Broad Absorption Line (BAL) quasars, defined as those with absorption lines with FWHM $\gtrsim 2000$ km s$^{-1}$.
These BAL sources are: J0313-1806, J0038-1527, J1243+0100 (identified as a possible BAL in the discovery paper, Matsuoka et al. 2019a, although the spectroscopy quality does not allow a clear classification), J0839+3900, J2348-3054, J0246-5219, J0910-0414, J0923+0402, J0706+2921, J1526-2050, and J0439+1634, as flagged in Table 3. We also include three new unpublished high-$z$ quasars, J0410-0139 (Bañados et al. in prep), J1917+5003, and J0430-1445 (Belladitta et al. in prep). This last source also appears to be a possible BAL, leading to a final number of 12 BALs in this work.

2.2 Instruments

We obtain visible (VIS) and NIR spectroscopy of our sample from the data archives of three main echelle spectrographs: Gemini/GNIRS (Gemini Near-Infrared Spectrograph; Elias et al. 2006a; Elias et al. 2006b), Keck/NIRES (Near-Infrared Echellette Spectrometer; Wilson et al. 2004), and VLT/X-Shooter (Vernet et al., 2011), both VIS and NIR arms, except for J1342+0928, whose spectrum is entirely covered by the NIR arm. For some quasars, part of the spectrum redward of Ly$\alpha$ is in the optical, and hence not covered by the NIR arm, so we have complemented them using observations from additional long slit instruments, such as Gemini/GMOS (Gemini Multi-Object Spectrographs; Hook et al. 2004), Keck/DEIMOS (DEep Imaging Multi-Object Spectrograph; Faber et al. 2003), Keck/LRIS (Low-Resolution Imaging Spectrometer; Oke et al. 1995; Rockosi et al. 2010) and LBT/MODS (Multi-Object Double Spectrographs; Pogge et al. 2010). A complete list of the instruments used to observe each quasar is available in Table 3.

2.2.1 Dither sequences

All GNIRS and NIRES observations were executed following an ABBA dither sequence except for the last two frames of J0313-1806, which are AABB. The X-Shooter observations were acquired with different dither sequences (ABBA, ABAB, or AABB), but because of the long average exposure time per frame ($\simeq 600-1800$ s) and the consequent change of the sky conditions, they were treated as ABCD during the data reduction instead of performing the usual image differencing. The only X-Shooter frames for which an ABBA dither sequence was used in the reduction are those of J1110-1329, because of the short exposure time of 300 s. This applies mostly to the NIR arm, since VIS data are usually reduced using a sky model rather than image differencing.

2.2.2 Wavelength coverage, Slit widths, $R$, FWHM, and $dv_{\rm pix}$

The GNIRS data provide full wavelength coverage of the YJHK bands ($0.81-2.52$ $\mu$m). Slit widths of 0.68$''$ or 1.00$''$ were used, with mean resolution of $R \simeq 1200-1000$ or $\rm FWHM \simeq 280$ km s$^{-1}$ and pixel size of $dv_{\rm pix} \simeq 86$ km s$^{-1}$. The NIRES observations have a coverage of $0.94-2.47$ $\mu$m through a fixed 0.55$''$ slit, where the mean resolution is $R \simeq 2700$ or $\rm FWHM \simeq 111$ km s$^{-1}$ and pixel size of $dv_{\rm pix} \simeq 38$ km s$^{-1}$. The X-Shooter data cover the wavelength range $0.55-1.02$ $\mu$m in the VIS arm and $1.02-2.48$ $\mu$m in the NIR arm. The slit width varies among programs but is typically 0.9$''$ in the VIS and 0.6$''$ in the NIR arm. The mean resolution in VIS is $R \simeq 8500$ or $\rm FWHM \simeq 44$ km s$^{-1}$ and pixel size $dv_{\rm pix} \simeq 11$ km s$^{-1}$; while in NIR, $R \simeq 8100$ or $\rm FWHM \simeq 41$ km s$^{-1}$ and $dv_{\rm pix} \simeq 13$ km s$^{-1}$.
J0319-1008 is complemented with R400 grating GMOS-N observations, covering the wavelength range $0.65-1.15$ $\mu$m. The slit width is 1.00$''$, with a mean resolution of $R \simeq 3200$ or $\rm FWHM \simeq 94$ km s$^{-1}$, and pixel size of $dv_{\rm pix} \simeq 51$ km s$^{-1}$. J0706+2921 is complemented with 830G grating DEIMOS observations, covering the wavelength range $0.65-1.00$ $\mu$m. The slit width is 1.00$''$, with a mean resolution of $R \simeq 7900$ or $\rm FWHM \simeq 38$ km s$^{-1}$, and pixel size of $dv_{\rm pix} \simeq 17$ km s$^{-1}$. J0218+0007 is complemented using grating 600/10000 LRIS red observations, covering the wavelength range $0.70-1.03$ $\mu$m. The slit width is 1.00$''$, with a mean resolution of $R \simeq 1700$ or $\rm FWHM \simeq 184$ km s$^{-1}$, and pixel size of $dv_{\rm pix} \simeq 56$ km s$^{-1}$. J0411-0907, J1917+5003, and J1058+2930 are complemented by red grating MODS observations (both MODS1 and MODS2), covering the wavelength range $0.42-1.12$ $\mu$m.
The slit width is 1.00$''$ and 1.2$''$, with a mean resolution of $R \simeq 2700-2300$ or $\rm FWHM \simeq 111$ km s$^{-1}$, and pixel size of $dv_{\rm pix} \simeq 33$ km s$^{-1}$. All of these properties of the instruments are summarised in Table 1.
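The link between resolving power and velocity resolution can be illustrated with the standard approximation $\rm FWHM \simeq c/R$. The sketch below is only an illustration under that assumption: it reproduces the rounded values quoted for NIRES, GMOS, and DEIMOS, while the tabulated FWHM of some other instruments differs from $c/R$ by several km s$^{-1}$, so it should not be read as the procedure used to fill Table 1. The function names are hypothetical.

```python
C_KM_S = 299792.458  # speed of light in km/s


def fwhm_from_resolution(resolving_power):
    """Velocity FWHM (km/s) of a resolution element, assuming FWHM = c / R."""
    return C_KM_S / resolving_power


def pixels_per_resolution_element(resolving_power, dv_pix):
    """Number of pixels sampling one FWHM for a pixel size dv_pix (km/s)."""
    return fwhm_from_resolution(resolving_power) / dv_pix


if __name__ == "__main__":
    # (R, dv_pix) pairs taken from the text for three of the instruments.
    instruments = {"NIRES": (2700, 38), "GMOS": (3200, 51), "DEIMOS": (7900, 17)}
    for name, (R, dv_pix) in instruments.items():
        fwhm = fwhm_from_resolution(R)
        sampling = pixels_per_resolution_element(R, dv_pix)
        print(f"{name}: FWHM ~ {fwhm:.0f} km/s, {sampling:.1f} pixels per FWHM")
```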

2.3 Data Reduction

All spectra are reduced with the open-source Python-based spectroscopic data reduction pipeline PypeIt (https://github.com/pypeit/PypeIt), using versions between 1.7.1 and 1.14.1 (Prochaska et al., 2020). The pipeline performs image processing, including gain correction, bias subtraction, dark subtraction, and flat fielding. It uses supplied flat-field images to automatically trace the echelle orders and correct for the detector illumination. The wavelength solutions and the wavelength tilt models are constructed from either arc frames (for VIS instruments) or science frames (i.e., using sky OH lines, for NIR spectrographs). Cosmic rays are removed with the L.A.Cosmic algorithm (van Dokkum, 2001). The sky subtraction is based on the standard A–B differencing mode, followed by a B-spline fitting procedure that further cleans up the sky line residuals following Bochanski et al. (2009). Optimal extraction (Horne, 1986) is performed to generate 1D science spectra. The extracted spectra are flux-calibrated with sensitivity functions derived from the observations of spectroscopic standard stars. All flux-calibrated 1D spectra of each quasar are then co-added to achieve higher SNR and corrected for telluric absorption using PypeIt. A telluric model is fit to correct the absorbed science spectrum up to a best-fit PCA model (Davies et al., 2018b) of said spectrum. The telluric model is based on telluric model grids produced from the Line-By-Line Radiative Transfer Model (LBLRTM; Clough et al. 2005; Gullikson et al. 2014).

All the PypeIt files to reproduce the reduction are publicly available in a GitHub repository (https://github.com/enigma-igm/onorato24_hiz_qsos).
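As an illustration of the optimal extraction step (Horne, 1986) mentioned above, the following toy example applies the standard profile-weighted estimator to a single spatial cut through a sky-subtracted 2D spectrum. It is a minimal sketch under simplifying assumptions (known Gaussian spatial profile, uniform variance, no bad-pixel rejection), not the PypeIt implementation; all names are hypothetical.

```python
import numpy as np


def optimal_extract(data, var, profile):
    """Profile-weighted flux estimate for one wavelength column.

    data    : sky-subtracted counts along the spatial direction
    var     : per-pixel variance of `data`
    profile : spatial object profile (normalised to unit sum here)
    Returns (flux, flux_variance) following Horne (1986):
        f = sum(P * D / V) / sum(P^2 / V),   var(f) = 1 / sum(P^2 / V)
    """
    profile = profile / profile.sum()
    weights = profile / var
    denom = np.sum(profile * weights)
    flux = np.sum(weights * data) / denom
    return flux, 1.0 / denom


if __name__ == "__main__":
    # Toy spatial cut: Gaussian profile, noise-free data, uniform variance.
    x = np.arange(-7, 8, dtype=float)
    profile = np.exp(-0.5 * (x / 2.0) ** 2)
    profile /= profile.sum()
    true_flux = 100.0
    data = true_flux * profile
    var = np.full(x.shape, 4.0)

    flux, flux_var = optimal_extract(data, var, profile)
    print(f"extracted flux = {flux:.3f}, variance = {flux_var:.3f}")
```

For noise-free data the estimator returns the input flux exactly; with real data the weights down-weight the noisy wings of the profile, which is what makes the extraction optimal in the signal-to-noise sense.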

2.4 Co-add of spectra from different instruments (or arms)

As reported in Table 3 and in Subsection 2.2, some quasars are observed with more than one spectrograph and/or in more than one arm. Here we describe how we treat these spectra after the flux calibration with the sensitivity function, since they have different resolutions, wavelength grids, and pixel sizes.

  • X-Shooter VIS - NIR: two different approaches were used depending on the stage of development of PypeIt. We co-add the majority of the quasars with versions between 1.8.2 and 1.11.1, using the echelle combspec PypeIt routine, getting a unique spectrum in a wavelength grid of [5410, 24770] Å and pixel size of $dv_{\rm pix} \simeq 13$ km s$^{-1}$. A few quasars were reduced using recent PypeIt versions (after 1.12.3), which now support the option of editing a unique file (.coadd1d) to combine the 1D spectra from multiple exposures of the same object, in case we work with the same type of spectrograph (i.e. echelle), containing both arms together. The final spectrum has the same characteristics as those obtained with the first method.

  • GNIRS - NIRES: we co-add them using a single coadd1d file. J0313-1806 and J1007+2115 are co-added onto a common grid covering [9410, 24690] Å with a pixel size of $dv_{\rm pix} \simeq 90$ km s$^{-1}$. We co-add J0910-0414 and J2102-1458 by requiring a pixel size of $dv_{\rm pix} \simeq 90$ km s$^{-1}$, but without constraints on the wavelength range. We get a final spectrum that covers [8240, 25200] Å and has $dv_{\rm pix} \simeq 90$ km s$^{-1}$.

  • GNIRS - X-Shooter (VIS - NIR): we co-add them using a unique coadd1d file for all the GNIRS, X-Shooter VIS, and NIR frames. The final spectra cover [5410, 25200] Å with a pixel size of $dv_{\rm pix} \simeq 90$ km s$^{-1}$.

  • NIRES - GMOS: we co-add all the echelle - long slit spectra using the multi combspec PypeIt routine, which works with the final 1D spectrum in both instruments, getting an ultimate spectrum with wavelength coverage of [8000, 24700] Å and pixel size of $dv_{\rm pix} \simeq 55$ km s$^{-1}$.

  • NIRES - DEIMOS: using multi combspec as described above, we get a final spectrum covering [8000, 24700] Å with pixel size of $dv_{\rm pix} \simeq 40$ km s$^{-1}$.

  • NIRES - LRIS: using multi combspec as described above, we get a final spectrum covering [8000, 24700] Å with pixel size of $dv_{\rm pix} \simeq 60$ km s$^{-1}$.

  • NIRES - MODS: using multi combspec as described above, we get a final spectrum covering [8000, 24700] Å with pixel size of $dv_{\rm pix} \simeq 40$ km s$^{-1}$.

In the end, when echelle frames are co-added together, the telluric correction described above (at the end of Section 2.3) is applied to the final stacked spectrum; when echelle - long slit are co-added, the individual spectra of both instruments are already telluric corrected, and no further correction is required.
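The two ingredients common to the co-adds listed above, a wavelength grid with constant velocity pixel size and an inverse-variance weighted combination, can be sketched as follows. This is a simplified illustration and not the PypeIt co-addition routines, which additionally resample, rescale, and reject outliers; the function names are hypothetical and the spectra are assumed to be already sampled on the common grid.

```python
import numpy as np

C_KM_S = 299792.458  # speed of light in km/s


def velocity_grid(wave_min, wave_max, dv_pix):
    """Wavelength grid (Å) with a constant velocity step dv_pix (km/s).

    A constant velocity step corresponds to a constant step in ln(lambda),
    d ln(lambda) = dv_pix / c.
    """
    dlnlam = dv_pix / C_KM_S
    n_pix = int(np.floor(np.log(wave_max / wave_min) / dlnlam)) + 1
    return wave_min * np.exp(dlnlam * np.arange(n_pix))


def ivar_weighted_mean(fluxes, ivars):
    """Inverse-variance weighted mean of spectra sampled on the same grid.

    Pixels with zero total inverse variance are returned with zero flux.
    """
    fluxes = np.asarray(fluxes, dtype=float)
    ivars = np.asarray(ivars, dtype=float)
    ivar_tot = ivars.sum(axis=0)
    safe = np.where(ivar_tot > 0, ivar_tot, 1.0)
    flux = np.where(ivar_tot > 0, (fluxes * ivars).sum(axis=0) / safe, 0.0)
    return flux, ivar_tot


if __name__ == "__main__":
    # Grid matching the GNIRS - NIRES case described in the text.
    grid = velocity_grid(9410.0, 24690.0, 90.0)
    step = C_KM_S * np.log(grid[1] / grid[0])
    print(f"{grid.size} pixels, velocity step = {step:.1f} km/s")

    # Two toy single-pixel spectra with different noise levels.
    flux, ivar = ivar_weighted_mean([[1.0], [3.0]], [[1.0], [3.0]])
    print(f"co-added flux = {flux[0]:.2f}, ivar = {ivar[0]:.1f}")
```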

Figure 1: Panel (a): Distribution of J-band photometry used to scale the spectra as a function of $z$ for all the 45 quasars in this sample. A single legend, split between the two plots, shows the sources color-coded with decreasing $z$. The full symbol marks the quasars for which J-band photometry is available, while the open symbol marks those for which it is not (we report Y-band photometry for J1243+0100 and J0923+0753, and $\rm K_{p}$-band photometry for J1058+2930). The error bars show the uncertainties on both $z$ and the photometric measurements. Panel (b): Distribution of $M_{1450}$ as a function of $z$ for all the 45 quasars in this sample. The error bars show the uncertainties on $z$. The circle marks the $M_{1450}$ computed from the spectrum scaled with J, Y, or $\rm K_{p}$-band photometry. The star marks the quasars for which $M_{1450}$ from spectroscopy is not reliable because of the low SNR of the spectrum or the appearance of BAL features. In the case of J1243+0100, $M_{1450}$ is from the discovery paper (Matsuoka et al. 2019a), while for J0910-0414 and J0923+0402, we follow the method described in Appendix B. Panel (c): Histogram of the redshift distribution of the sample, with bins of size 0.05. The dashed red line represents the median redshift ($z_{\rm median} = 6.71$).
Panel (d): Histogram of the $M_{1450}$ distribution of the sample, with bins of size 0.2. The dashed red line represents the median $M_{1450}$ ($M_{1450,\rm median} \simeq -26.1$).
Table 2: Example of the internal structure of the FITS files, at a random row, for two of the spectra in this sample. The first column is the wavelength in the observed frame in units of Å. The second column is a new wavelength grid evaluated at the centers of the wavelength bins, also in units of Å (this is the quantity we usually put on the $x$-axis when plotting a spectrum). The third column is the flux array in units of $10^{-17}$ erg s$^{-1}$ cm$^{-2}$ Å$^{-1}$. The fourth column is the inverse variance array in units of ($10^{-17}$ erg s$^{-1}$ cm$^{-2}$ Å$^{-1}$)$^{-2}$. "sigma" is a column introduced only in the most recent PypeIt versions and represents the noise vector (if it is not present, it can always be computed as ivar$^{-1/2}$). The sixth column is a good pixel mask for the spectrum, and the seventh one is the telluric model. The "obj model" column is not present in the spectra reduced with the most recent PypeIt versions and represents the object model used for the telluric fitting. The last two columns are created after the flux scaling procedure and represent the new flux and ivar corrected with the $X$-band photometry of the quasar, where $X$ could be J, Y, or $\rm K_{p}$ depending on the available photometric information, in the same units as before.
| Quasar | wave | wave grid mid | flux | ivar | sigma | mask | telluric | obj model | flux scaled $X$ | ivar scaled $X$ |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| J1342+0928 | 10289.82613 | 10289.83458 | 0.31196 | 18.34511 | - | 1 | 0.99 | 0.61569 | 0.36666 | 13.27947 |
| J1058+2930 | 10240.48127 | 10240.45124 | 1.56780 | 2.58264 | 0.62226 | 1 | 1.0 | - | 0.54980 | 21.00057 |
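The relations between the columns of Table 2 can be checked numerically. The sketch below assumes that the flux scaling is a single multiplicative factor $s$ per spectrum, so that the flux scales as $s$ and the inverse variance as $s^{-2}$; this assumption is consistent with both example rows but is inferred here rather than taken from the reduction scripts. The function names are illustrative.

```python
import math


def sigma_from_ivar(ivar):
    """Noise vector element from the inverse variance: sigma = ivar^(-1/2)."""
    return 1.0 / math.sqrt(ivar) if ivar > 0 else float("inf")


def rescale(flux, ivar, scale):
    """Apply a multiplicative flux scale; the inverse variance scales as 1/scale^2."""
    return flux * scale, ivar / scale ** 2


if __name__ == "__main__":
    # Example rows of Table 2: (flux, ivar, flux scaled, ivar scaled).
    rows = {
        "J1342+0928": (0.31196, 18.34511, 0.36666, 13.27947),
        "J1058+2930": (1.56780, 2.58264, 0.54980, 21.00057),
    }
    for name, (flux, ivar, flux_scaled, ivar_scaled) in rows.items():
        scale = flux_scaled / flux  # scale factor implied by the two flux columns
        _, ivar_pred = rescale(flux, ivar, scale)
        print(f"{name}: scale = {scale:.4f}, sigma = {sigma_from_ivar(ivar):.5f}, "
              f"predicted ivar scaled = {ivar_pred:.4f} (table: {ivar_scaled})")
```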

3 Quasar sample Properties

3.1 Redshift

In this paper, we account for uncertainties in the systemic redshift of every quasar depending on the emission line used to determine its redshift. Systemic redshifts are challenging quantities to determine because of various factors, like the broad widths of emission lines, Gunn-Peterson absorption (Gunn & Peterson, 1965), and offsets between different ionization lines (Gaskell 1982; Tytler & Fan 1992; Vanden Berk et al. 2001; Richards et al. 2002; Shen et al. 2016). Also, most quasars show winds and strong internal motions, which displace many of the emission lines far from the systemic redshift of the host galaxy. Thus, we decide to assign a redshift error of $\Delta v = 100$ km s$^{-1}$ for the most precise determinations of the location of the quasar, such as those using emission lines from the atomic gas reservoir of the host galaxy itself ([C II] lines). For quasars with a redshift measurement from low-ionization broad emission lines, such as Mg II, we assume a redshift error of $\Delta v = 270$ km s$^{-1}$, to account for the dispersion between the redshift of the Mg II line and the redshift of the host galaxy (e.g., Richards et al. 2002; Hennawi et al. 2006; Shen & Ménard 2012; Shen et al. 2016; Venemans et al. 2016; Schindler et al. 2020). Table 3 reports redshifts, methods, and references for every quasar.

These $\Delta v$ values will be adopted as uncertainties on the redshift (and thus on the position of the Ly$\alpha$ line) to achieve our next scientific goal of studying the proximity zones of the quasars in this sample (Onorato et al., in prep). From the literature ($z_{\rm{Ref}}$ in Table 3), we also obtain values of $\Delta z$ for every quasar, but we do not report them here. They are available online at the link at the end of Section 2.3, and we use them as uncertainties on $z$ in Figure 1.
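
To illustrate how a velocity uncertainty translates into a redshift uncertainty and into an uncertainty on the observed position of the Ly$\alpha$ line, the sketch below uses the first-order relation $\Delta z = (1+z)\,\Delta v / c$, valid for $\Delta v \ll c$. The function names and the adopted Ly$\alpha$ rest wavelength (1215.67 Å) are assumptions of this example.

```python
C_KMS = 299792.458   # speed of light [km/s]
LYA_REST = 1215.67   # Lya rest-frame wavelength [Å] (assumed value)


def delta_z(z, dv_kms):
    """First-order redshift uncertainty for a velocity uncertainty dv."""
    return (1.0 + z) * dv_kms / C_KMS


def delta_lya_obs(z, dv_kms):
    """Uncertainty on the observed-frame Lya wavelength [Å]."""
    return LYA_REST * delta_z(z, dv_kms)


# A [C II] redshift (dv = 100 km/s) and a Mg II redshift (dv = 270 km/s)
for z, dv in [(7.5413, 100.0), (7.07, 270.0)]:
    print(z, dv, delta_z(z, dv), delta_lya_obs(z, dv))
```

With these numbers, a [C II]-based redshift at $z = 7.5413$ corresponds to $\Delta z \approx 0.003$, while a Mg II-based redshift at $z = 7.07$ corresponds to $\Delta z \approx 0.007$.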

3.2 Absolute flux calibration

The flux calibration carried out on the spectra is relative: every spectrum is corrected using a spectroscopic standard star, but the flux values at the different wavelengths may still differ from the true flux, for example because of slit losses and non-photometric sky conditions at the time of the observations. We have tested absolute flux calibration in this analysis by tying the spectra to the photometric data in the Y, J, H, and K bands, where all or part of them are available. However, since a good match cannot be obtained in all the photometric bands at once, a definitive method is still under discussion.

For this paper, the reduced spectrum of each quasar is scaled using its J-band magnitude, or the Y-band or $\rm{K_p}$-band one if the J band is not available (as in the case of J1243+0100, J0923+0753, and J1058+2930), all in the AB system; an example for all bands is shown in Figure 8 and the approach is described in Appendix A. These NIR data come from different archival surveys: the UKIRT Infrared Deep Sky Survey (UKIDSS; Lawrence et al. 2007), the UKIRT Hemisphere Survey (UHS; Dye et al. 2018), in both cases mostly from WFCAM (Casali et al., 2007), and the VISTA Hemisphere Survey (VHS; McMahon et al. 2013). The Y-band photometry of J1243+0100 comes from the Hyper Suprime-Cam (HSC) Subaru Strategic Program (SSP) survey (Aihara et al., 2018), as reported in the discovery paper (Matsuoka et al., 2019a). The J-band photometry of J0410-0139, J0430-1445, and J2132+1217 is taken from SofI (Son of ISAAC; Moorwood et al. 1998), at the NTT in La Silla, and that of J1129+1846 and J1917+5003 comes from NOTCam, at the Nordic Optical Telescope (NOT); both the SofI and NOTCam data come from dedicated follow-up observations. The $\rm{K_p}$-band photometry of J1058+2930 is obtained from the acquisition image taken with NIRES, whose guider camera uses a Mauna Kea $K'$ ("K prime") filter (https://www2.keck.hawaii.edu/inst/nires/guider.html). Since the transmission curve of this filter is not available, we use the one from Keck/NIRC2 $\rm{K_p}$ to perform the flux scaling (see the bottom panel of Figure 8), as it is very similar to the Mauna Kea $K'$. Most of these magnitudes are collected from Ross & Cross (2020) and available online.
All J-band (or Y and $\rm{K_p}$) magnitudes are displayed in panel (a) of Figure 1 as a function of redshift $z$ for each quasar, and the final flux-scaled spectra are plotted as a function of wavelength in Figure 2.
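
The sketch below illustrates the arithmetic of such a flux scaling; it is not the procedure of Appendix A. It assumes that a synthetic magnitude `m_synth` has already been measured from the spectrum through the relevant filter, and that a single multiplicative factor is applied to the whole spectrum. The function names are hypothetical. The check uses the J1342+0928 row of Table 2, with the factor inferred from the ratio of the tabulated fluxes.

```python
def scale_factor(m_phot, m_synth):
    """Flux factor that brings the synthetic magnitude onto the photometric one."""
    return 10.0 ** (-0.4 * (m_phot - m_synth))


def apply_scale(flux, ivar, scale):
    """Scale fluxes by `scale`; inverse variances scale by 1 / scale**2."""
    flux_scaled = [f * scale for f in flux]
    ivar_scaled = [v / scale**2 for v in ivar]
    return flux_scaled, ivar_scaled


# Consistency check on the J1342+0928 row of Table 2
scale = 0.36666 / 0.31196          # factor inferred from the tabulated fluxes
flux_s, ivar_s = apply_scale([0.31196], [18.34511], scale)
print(scale, flux_s[0], ivar_s[0])  # ivar_s[0] is close to the tabulated 13.27947
```

The tabulated "ivar scaled" values in Table 2 are reproduced, within rounding, by dividing "ivar" by the square of the flux ratio, which is what a single multiplicative scaling implies.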

A detailed list of the filters used to measure every magnitude is available at the link given at the end of Section 2.3, while the final flux-calibrated spectra are available online as explained in the Data Availability Section. An example of the format of the FITS files, showing a random row for two spectra reduced with different PypeIt versions, is given in Table 2.

3.3 Absolute magnitudes at 1450 Å

We calculate $M_{1450}$ using the flux-scaled spectra of the quasars themselves. From every spectrum in the observed frame and knowing the redshift of each quasar, we compute the rest-frame wavelength.

We determine the apparent AB magnitude at $1450$ Å from the median flux between $1445$ and $1455$ Å, converted to Jansky ($f_{\nu,1450}$), using the Pogson law: $m_{1450} = -2.5 \cdot \log_{10}(f_{\nu,1450}) + 8.9$. We then calculate the luminosity per unit frequency ($L_{\nu}$) at $1450$ Å with Equation 1, where $d_L$ is the luminosity distance to the object at redshift $z$:

$$L_{\nu} = \frac{4\pi {d_L}^{2} \cdot f_{\nu,1450}}{1+z} \qquad (1)$$

We find the absolute AB magnitude at $1450$ Å using Equation 2 from the luminosity per unit frequency $L_{\nu}$, where $d_0$ is the reference distance ($10$ pc), and $3631$ Jy is the zero-point flux density in the AB system:

$$M_{1450} = -2.5 \cdot \log_{10}\left(\frac{L_{\nu}}{4\pi {d_0}^{2} \cdot 3631\,\text{Jy}}\right) \qquad (2)$$
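
The chain from the median flux to $M_{1450}$ can be sketched as follows. This is an illustration under stated assumptions rather than the code used for Table 3: the cosmology is a flat $\Lambda$CDM model with $H_0 = 70$ km s$^{-1}$ Mpc$^{-1}$ and $\Omega_m = 0.3$ (this section does not specify the adopted parameters), $f_\lambda$ is converted to $f_\nu$ at the observed wavelength $1450\,(1+z)$ Å, and all function names are hypothetical.

```python
import math
import statistics

C_ANG_S = 2.99792458e18          # speed of light [Å/s]
C_KMS = 299792.458               # speed of light [km/s]
PC_CM = 3.0856775814913673e18    # one parsec [cm]
JY = 1.0e-23                     # one Jansky [erg s^-1 cm^-2 Hz^-1]


def fnu_1450_jy(wave_obs, flam, z):
    """Median f_lambda in the rest-frame window 1445-1455 Å, converted to Jy.

    wave_obs in Å; flam in erg s^-1 cm^-2 Å^-1 (FITS flux times 1e-17).
    """
    window = [f for w, f in zip(wave_obs, flam) if 1445.0 <= w / (1.0 + z) <= 1455.0]
    wave_eff = 1450.0 * (1.0 + z)   # observed wavelength of rest-frame 1450 Å
    return statistics.median(window) * wave_eff**2 / C_ANG_S / JY


def apparent_mag(fnu_jy):
    """Pogson law in the AB system."""
    return -2.5 * math.log10(fnu_jy) + 8.9


def lum_dist_pc(z, h0=70.0, om=0.3, n=2000):
    """Luminosity distance [pc] in flat LCDM (assumed parameters), midpoint rule."""
    dz = z / n
    integral = sum(dz / math.sqrt(om * (1.0 + (i + 0.5) * dz) ** 3 + 1.0 - om)
                   for i in range(n))
    return (1.0 + z) * C_KMS / h0 * integral * 1.0e6


def absolute_mag(fnu_jy, z, **cosmo):
    d_l = lum_dist_pc(z, **cosmo) * PC_CM
    l_nu = 4.0 * math.pi * d_l**2 * fnu_jy * JY / (1.0 + z)                  # Equation 1
    d_0 = 10.0 * PC_CM
    return -2.5 * math.log10(l_nu / (4.0 * math.pi * d_0**2 * 3631.0 * JY))  # Equation 2


# Toy spectrum: three pixels around rest-frame 1450 Å at z = 7
fnu = fnu_1450_jy([11520.0, 11600.0, 11680.0], [9e-18, 1e-17, 2e-17], 7.0)
print(apparent_mag(fnu), absolute_mag(fnu, 7.0))
```

Combining Equations 1 and 2 gives $M_{1450} = m_{1450} - 5\log_{10}(d_L/d_0) + 2.5\log_{10}(1+z)$, so the result depends on the cosmology only through $d_L$.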

The results from this method are compared with other values from the literature, showing good agreement. For this reason, we trust the estimates we obtain for all quasars, with only three exceptions. In the case of J1243+0100, the spectrum is of poor quality, as the source is faint and has a low signal-to-noise ratio (SNR; see the values in the last three columns of Table 3). This makes the result difficult to trust, so we adopt the $M_{1450}$ from Matsuoka et al. (2019a), who measured it from the best-fit power-law continuum. In the case of J0910-0414 and J0923+0402, the BAL features visible in their spectra bias the measurement of $M_{1450}$. For this reason, we correct the estimate by matching these spectra with a "reference spectrum" with a trustworthy continuum shape, such as the composite spectrum created from the sample in this paper (shown in Figure 4). The match is performed by eye and shown in Figure 9, with a detailed description of the method reported in Appendix B.

We show the absolute magnitude values $M_{1450}$ as a function of redshift $z$ for our sample in panel (b) of Figure 1, with their histogram in panel (d).

Figure 2: Spectra of all the 45 quasars in this sample, sorted by decreasing $z$ (as reported in Table 3). Every subplot shows the spectrum (color-coded with $z$) and the associated error (grey) in the observed wavelength range [8245, 22980] Å, the short name, and $z$ of the quasar. The light-grey bands cover [13500, 14150] Å and [18200, 19300] Å, indicating the regions affected by strong telluric absorption, where the spectra are masked. All the spectra shown here are smoothed for display purposes. The box containing the name and the redshift of the quasars is colored in light beige to mark the BALs.
Table 3: Information on the 45 quasars in this sample, sorted by decreasing $z$. The details on the columns are provided at the end of Section 3.4.

| Name | Instrument (arms) | $\rm{t_{exp}(s)}$ | $z$ | $z_{\rm{method}}$ | $z_{\rm{Ref}}$ | $\rm{J_{AB}}$ | $M_{1450}$ | Discovery | $\langle\rm{SNR_{J}}\rangle$ | $\langle\rm{SNR_{H}}\rangle$ | $\langle\rm{SNR_{K}}\rangle$ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| J031343.839-180636.404$^{a}$ | GNIRS/NIRES | 27300/15840 | 7.6423 | [C II] | 1 | 20.92 ± 0.13 | -26.208 | 1 | 15.9 | 18.8 | 27.2 |
| J134208.110+092838.610 | X-Shooter (NIR) | (81600) | 7.5413 | [C II] | 2 | 20.64 ± 0.08 | -26.336 | 3 | 33.5 | 43.7 | 13 |
| J100758.264+211529.207 | GNIRS/NIRES | 21000/7920 | 7.5149 | [C II] | 4 | 20.20 ± 0.18 | -26.818 | 4 | 18.5 | 19.5 | 24.2 |
| J112001.480+064124.300 | X-Shooter (NIR/VIS) | (114000/113440) | 7.0851 | [C II] | 2 | 20.17 ± 0.07 | -26.565 | 5 | 44.7 | 53.7 | 308.7 |
| J124353.930+010038.500$^{a}$ | GNIRS | 10944 | 7.07 | Mg II | 26 | 23.57 ± 0.08$^{b}$ | -24.130$^{d}$ | 26 | 1.4 | 1 | 1.3 |
| J003836.097-152723.636$^{a}$ | GNIRS/X-Shooter (NIR/VIS) | 15300/(15600/17912) | 7.0340 | [C II] | 6 | 19.69 ± 0.07 | -27.030 | 7 | 24.8 | 24.6 | 22.6 |
| J025216.640-050331.810 | X-Shooter (NIR/VIS) | (28800/31200) | 7.0006 | [C II] | 6 | 20.19 ± 0.07 | -26.625 | 8 | 29.1 | 39.7 | 12.7 |
| J041009-013919$^{e}$ | NIRES | 9360 | 6.9964 | [C II] | 27 | 20.75 ± 0.07 | -25.858 | 27 | 6 | 5.9 | 8.1 |
| J083946.880+390011.440$^{a}$ | GNIRS | 16800 | 6.9046 | Mg II | 9 | 20.39 ± 0.20 | -26.214 | 10 | 27.2 | 29.9 | 43.9 |
| J234833.340-305410.000$^{a}$ | X-Shooter (NIR/VIS) | (9200/8783) | 6.9018 | [C II] | 11 | 21.10 ± 0.08 | -25.224 | 12 | 7.5 | 10.3 | 4.2 |
| J024655.902-521949.950$^{a}$ | X-Shooter (NIR/VIS) | (24000/24000) | 6.8876 | [C II] | 6 | 21.20 ± 0.14 | -25.301 | 8 | 6.1 | 11.7 | 4.9 |
| J191729.984+500313.540$^{e}$ | NIRES/MODS | 12720/26400 | 6.845 | Mg II | 28 | 20.66 ± 0.05 | -26.208 | 28 | 5.2 | 5.4 | 5.3 |
| J221100.601-632055.845 | X-Shooter (NIR/VIS) | (36000/37961) | 6.8449 | [C II] | 6 | 21.27 ± 0.18 | -25.470 | 8 | 9 | 12.9 | 4.8 |
| J002031.470-365341.800 | X-Shooter (NIR/VIS) | (4800/4800) | 6.834 | Mg II | 13 | 20.42 ± 0.10 | -25.999 | 13 | 6 | 8.3 | 3.6 |
| J031941.660-100846.000 | NIRES/GMOS | 18750/15300 | 6.8275 | [C II] | 6 | 20.88 ± 0.30 | -25.480 | 8 | 4.9 | 6.1 | 12.2 |
| J041128.628-090749.700 | NIRES/MODS | 5760/30000 | 6.8260 | [C II] | 6 | 20.02 ± 0.14 | -26.490 | 10 | 17.4 | 18 | 19.6 |
| J112925.368+184624.330 | X-Shooter (NIR/VIS) | (12000/12000) | 6.823 | Mg II | 9 | 20.90 ± 0.11 | -25.421 | 14 | 6.6 | 8.2 | 3.2 |
| J010953.130-304726.300 | X-Shooter (NIR/VIS) | (21600/21600) | 6.7909 | [C II] | 11 | 21.28 ± 0.14 | -25.200 | 12 | 6.7 | 9.3 | 3.7 |
| J082931.979+411740.870 | GNIRS | 12600 | 6.773 | Mg II | 9 | 20.26 ± 0.15 | -26.154 | 10 | 12.2 | 9.2 | 13.2 |
| J021847.040+000715.200 | NIRES/LRIS | 5760/3583 | 6.7700 | [C II] | 6 | 21.08 ± 0.30 | -25.896 | 9,15 | 2.8 | 3.3 | 3.6 |
| J110421.580+213428.850 | GNIRS | 7200 | 6.7662 | [C II] | 6 | 19.91 ± 0.11 | -26.506 | 10 | 16.1 | 14.3 | 17.6 |
| J091013.651+165630.180 | GNIRS | 13200 | 6.7289 | [C II] | 6 | 21.06 ± 0.13 | -25.346 | 10 | 8.9 | 7.3 | 10.7 |
| J043043.660-144541.210$^{a,e}$ | GNIRS | 12000 | 6.7142 | Mg II | 28 | 20.78 ± 0.18 | -25.656 | 28 | 8.4 | 7.5 | 11.7 |
| J083737.830+492900.600 | GNIRS | 15600 | 6.702 | Mg II | 9 | 20.21 ± 0.17 | -26.069 | 10 | 26.4 | 23.7 | 36.2 |
| J200241.594-301321.690 | GNIRS | 3600 | 6.6876 | [C II] | 6 | 19.97 ± 0.16 | -26.622 | 9 | 9.7 | 6.4 | 8 |
| J092358.997+075349.107 | GNIRS | 7200 | 6.6817 | [C II] | 6 | 21.25 ± 0.26$^{b}$ | -25.687 | 9 | 3.2 | 2 | 2.9 |
| J104819.090-010940.210 | X-Shooter (NIR/VIS) | (4800/4800) | 6.6759 | [C II] | 16 | 20.87 ± 0.06 | -25.864 | 17 | 5.1 | 6.7 | 2.9 |
| J223255.150+293032.040 | GNIRS/X-Shooter (NIR/VIS) | 4800/(14400/14400) | 6.666 | Mg II | 6 | 20.46 ± 0.12 | -25.831 | 18 | 10.6 | 9.2 | 8.5 |
| J210219.230-145853.860 | GNIRS/NIRES | 10200/5760 | 6.6645 | [C II] | 6 | 21.15 ± 0.20 | -25.421 | 10 | 9.4 | 8.5 | 10.4 |
| J121627.584+451910.675 | GNIRS | 3600 | 6.66 | Mg II | 9 | 21.02 ± 0.13 | -25.518 | 10 | 4.8 | 3.6 | 5.3 |
| J091054.535-041406.846$^{a}$ | GNIRS/NIRES | 3600/3600 | 6.6363 | [C II] | 6 | 20.23 ± 0.07 | -26.397$^{f}$ | 10 | 9.8 | 11.4 | 15.6 |
| J092347.117+040254.580$^{a}$ | X-Shooter (NIR/VIS) | (43200/43200) | 6.6330 | [C II] | 6 | 20.14 ± 0.08 | -26.524$^{f}$ | 10,19 | 24.4 | 48.5 | 23.1 |
| J002429.772+391318.980 | GNIRS | 13800 | 6.621 | [C II] | 20 | 21.28 ± 0.48 | -25.316 | 21 | 8.5 | 6.1 | 9.7 |
| J030516.920-315056.000 | X-Shooter (NIR/VIS) | (16800/16800) | 6.6145 | [C II] | 11 | 20.60 ± 0.05 | -25.690 | 12 | 12.3 | 15.6 | 6.3 |
| J070626.379+292105.460$^{a}$ | NIRES/DEIMOS | 14960/17800 | 6.6037 | [C II] | 6 | 19.14 ± 0.05 | -27.410 | 10 | 24.3 | 25.6 | 34.2 |
| J213233.190+121755.260 | X-Shooter (NIR/VIS) | (33600/35809) | 6.5881 | [C II] | 20 | 19.74 ± 0.03 | -26.914 | 20 | 30.6 | 35.3 | 16.5 |
| J152637.840-205000.660$^{a}$ | X-Shooter (NIR/VIS) | (43200/46234) | 6.5864 | [C II] | 16 | 19.60 ± 0.08 | -27.030 | 20 | 33 | 45.6 | 18.9 |
| J113508.918+501132.600 | GNIRS | 7200 | 6.5851 | [C II] | 6 | 20.41 ± 0.16 | -26.075 | 10 | 12.8 | 11.6 | 16.2 |
| J105807.720+293041.703 | NIRES/MODS | 3600/28800 | 6.5846 | [C II] | 22 | 20.56 ± 0.05$^{c}$ | -26.039 | 9 | 5 | 4.9 | 5.7 |
| J233807.032+214358.170 | GNIRS | 4500 | 6.565 | Mg II | 9 | 20.75 ± 0.30 | -25.816 | 9 | 3.4 | 2.4 | 2.8 |
| J092120.560+000722.900 | GNIRS | 9600 | 6.5646 | [C II] | 6 | 21.11 ± 0.11 | -25.319 | 19 | 9 | 8.1 | 12 |
| J022601.870+030259.280 | X-Shooter (NIR/VIS) | (14400/23520) | 6.5412 | [C II] | 23 | 19.40 ± 0.09 | -27.192 | 18 | 26.5 | 30.6 | 11.7 |
| J022426.540-471129.400 | X-Shooter (NIR/VIS) | (31200/33360) | 6.5222 | [C II] | 6 | 19.73 ± 0.05 | -26.663 | 24 | 27.2 | 40.7 | 20 |
| J043947.098+163415.819$^{a,g}$ | X-Shooter (NIR/VIS) | (54720/53656) | 6.5192 | [C II] | 6 | 17.47 ± 0.02 | -28.822 | 25 | 207.7 | 285.8 | 149.5 |
| J111033.960-132945.600 | X-Shooter (NIR/VIS) | (34800/29520) | 6.5148 | [C II] | 16 | 20.59 ± 0.18 | -26.079 | 18 | 4.8 | 7.6 | 3.7 |

  • Ref: 1 - Wang et al. (2021a); 2 - Venemans et al. (2017); 3 - Bañados et al. (2018); 4 - Yang et al. (2020); 5 - Mortlock et al. (2011); 6 - Wang et al. (2021b); 7 - Wang et al. (2018); 8 - Yang et al. (2019); 9 - Yang et al. (2021); 10 - Wang et al. (2019); 11 - Venemans et al. (2016); 12 - Venemans et al. (2013); 13 - Reed et al. (2019); 14 - Bañados et al. (2021); 15 - Matsuoka et al. (2022); 16 - Decarli et al. (2018); 17 - Wang et al. (2017); 18 - Venemans et al. (2015); 19 - Matsuoka et al. (2018); 20 - Mazzucchelli et al. (2017); 21 - Tang et al. (2017); 22 - Wang et al. (2024); 23 - Bañados et al. (2015); 24 - Reed et al. (2017); 25 - Fan et al. (2019); 26 - Matsuoka et al. (2019a); 27 - Bañados et al. in prep; 28 - Belladitta et al. in prep

  • $^{a,\,b,\,c,\,d,\,e,\,f,\,g}$ Respectively: BAL quasar, $\rm{Y_{AB}}$ magnitude, $\rm{K_{p,AB}}$ magnitude, $M_{1450}$ from Matsuoka et al. (2019a), unpublished quasar, $M_{1450}$ calculated as described in Appendix B, lensed quasar.

3.4 SNR of the spectra

We compute the SNR of the spectra in this sample considering three different wavelength ranges (the J, H, and K bands). We want to properly sample the spectra and avoid possible biases introduced by telluric absorption regions. For these reasons, we define the ranges in which we compute the SNR in a very conservative manner, avoiding regions affected by absorption: [11000, 13400], [14500, 17950], and [19650, 22400] Å.

To make the calculation and have a comparison among the quality of the spectra that is as fair as possible, we follow these steps:

  • We move the spectra to the rest-frame, dividing the wavelengths by $(1+z)$;

  • We define a new wavelength grid: [1040, 3332] Å in steps of $dv_{\rm{pix}} = 110$ km s$^{-1}$ (set to be coarser than the coarsest pixel scale, in velocity, among all the instruments used to create the sample);

  • We re-bin the spectra into the new wavelength grid;

  • We finally compute the mean SNR in the three different wavelength ranges ($\langle\rm{SNR}_{\lambda}\rangle$, where $\lambda =$ J, H, K), also shifted to the rest-frame according to the redshift of the quasar considered.
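
The steps above can be sketched as follows. The first rows of Table 5 (1040.00000, 1040.38167, 1040.76347 Å) are consistent with a grid that is uniform in log-wavelength, $\lambda_i = \lambda_0 \exp(i\,dv_{\rm{pix}}/c)$, which is what this example assumes. The re-binning step is omitted (the input is assumed to be already on the new grid), the per-pixel SNR is taken as flux $\times\,\sqrt{\rm{ivar}}$, and the function names are hypothetical.

```python
import math

C_KMS = 299792.458  # speed of light [km/s]


def velocity_grid(wave_min=1040.0, wave_max=3332.0, dv_kms=110.0):
    """Rest-frame wavelength grid [Å] with constant velocity step dv_kms."""
    step = dv_kms / C_KMS
    n = int(math.floor(math.log(wave_max / wave_min) / step)) + 1
    return [wave_min * math.exp(i * step) for i in range(n)]


def mean_snr(wave_rest, flux, ivar, band_obs, z):
    """Mean per-pixel SNR inside an observed-frame band shifted to the rest-frame."""
    lo, hi = band_obs[0] / (1.0 + z), band_obs[1] / (1.0 + z)
    snr = [f * math.sqrt(v) for w, f, v in zip(wave_rest, flux, ivar)
           if lo <= w <= hi and v > 0]
    return sum(snr) / len(snr) if snr else float("nan")


grid = velocity_grid()
print(len(grid), grid[:3])

# Toy example: three rest-frame pixels inside the J-band window at z = 7
snr_j = mean_snr([1400.0, 1500.0, 1600.0], [2.0, 4.0, 6.0], [1.0, 4.0, 0.25],
                 (11000.0, 13400.0), 7.0)
print(snr_j)
```

With a constant velocity step, every pixel spans the same $dv_{\rm{pix}}$ regardless of the instrument, which is what makes the SNR values comparable across spectra.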

Two examples showing the method described above, considering the highest and lowest redshift quasar in the sample, are visible in Figure 10. The results of this analysis are shown in the last three columns of Table 3, and in the histograms in Figure 3, where we generate 25 logarithmically spaced bins between the minimum and maximum of all the $\langle\rm{SNR}_{\lambda}\rangle$ values in the sample. The majority of the spectra have $\langle\rm{SNR}_{J,H}\rangle$ in the range [5, 10], and $\langle\rm{SNR}_{K}\rangle$ between $0-5$ and $10-15$. To quantify the overall data quality we compute the median $\langle\rm{SNR}_{\lambda}\rangle$ in the three wavelength ranges. We have median $\langle\rm{SNR}_{J}\rangle = 9.7$, median $\langle\rm{SNR}_{H}\rangle = 10.3$, and median $\langle\rm{SNR}_{K}\rangle = 11.7$ (reported as red dashed vertical lines in Figure 3).
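
As a short check of the numbers quoted above, the following sketch takes the three SNR columns of Table 3 (in table order), builds 25 logarithmically spaced bins between the global minimum and maximum, and computes the medians. The variable and function names are hypothetical; plotting is left out.

```python
import statistics

# <SNR_J>, <SNR_H>, <SNR_K> columns of Table 3, in table order
SNR_J = [15.9, 33.5, 18.5, 44.7, 1.4, 24.8, 29.1, 6.0, 27.2, 7.5, 6.1, 5.2, 9.0,
         6.0, 4.9, 17.4, 6.6, 6.7, 12.2, 2.8, 16.1, 8.9, 8.4, 26.4, 9.7, 3.2, 5.1,
         10.6, 9.4, 4.8, 9.8, 24.4, 8.5, 12.3, 24.3, 30.6, 33.0, 12.8, 5.0, 3.4,
         9.0, 26.5, 27.2, 207.7, 4.8]
SNR_H = [18.8, 43.7, 19.5, 53.7, 1.0, 24.6, 39.7, 5.9, 29.9, 10.3, 11.7, 5.4, 12.9,
         8.3, 6.1, 18.0, 8.2, 9.3, 9.2, 3.3, 14.3, 7.3, 7.5, 23.7, 6.4, 2.0, 6.7,
         9.2, 8.5, 3.6, 11.4, 48.5, 6.1, 15.6, 25.6, 35.3, 45.6, 11.6, 4.9, 2.4,
         8.1, 30.6, 40.7, 285.8, 7.6]
SNR_K = [27.2, 13.0, 24.2, 308.7, 1.3, 22.6, 12.7, 8.1, 43.9, 4.2, 4.9, 5.3, 4.8,
         3.6, 12.2, 19.6, 3.2, 3.7, 13.2, 3.6, 17.6, 10.7, 11.7, 36.2, 8.0, 2.9, 2.9,
         8.5, 10.4, 5.3, 15.6, 23.1, 9.7, 6.3, 34.2, 16.5, 18.9, 16.2, 5.7, 2.8,
         12.0, 11.7, 20.0, 149.5, 3.7]


def log_bin_edges(values, n_bins=25):
    """Edges of n_bins logarithmically spaced bins spanning min..max of values."""
    lo, hi = min(values), max(values)
    ratio = hi / lo
    return [lo * ratio ** (i / n_bins) for i in range(n_bins + 1)]


edges = log_bin_edges(SNR_J + SNR_H + SNR_K)
medians = {band: statistics.median(vals)
           for band, vals in (("J", SNR_J), ("H", SNR_H), ("K", SNR_K))}
print(medians)  # {'J': 9.7, 'H': 10.3, 'K': 11.7}, as quoted in the text
```

The same `edges` list can be passed to any histogram routine so that the three panels of Figure 3 share a common binning.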

Figure 3: Top: Histogram of the $\langle\rm{SNR}_{\lambda}\rangle$ of the quasars in this sample, reported in Table 3, in the J-band. We generate 25 logarithmically spaced bins between the minimum and maximum of all the $\langle\rm{SNR}_{\lambda}\rangle$ values in the sample. The median $\langle\rm{SNR}_{\lambda}\rangle$ is shown as a red dashed vertical line. Middle: Same, but for the H-band. Bottom: Same, but for the K-band.

The properties of every quasar are reported in Table 3, which lists: the name of the quasar, the instruments (and arms), the exposure time in every instrument, the redshift, the method adopted to determine the redshift, the reference for this measurement, the J-band photometry in the AB system, the absolute magnitude ($M_{1450}$), the reference for the discovery of the quasar, and the mean SNR of the spectrum in the three different wavelength ranges. The references and some notes are listed at the bottom of the table.

4 Comparison with other quasar samples

It is useful to place this work in the context of other spectroscopic data releases in the literature, such as Ďurovčíková et al. (2024), D’Odorico et al. (2023), Yang et al. (2021), Schindler et al. (2020), and Shen et al. (2019). The most relevant quantities used to characterize the samples are reported in Table 4. We stress that, for all the objects whose spectra were previously published, the reductions presented here are not the same as in earlier publications, as all data are re-reduced in this work.

The sample that shows the most similarities with this work is the one in Yang et al. (2021), with 31 quasars in common in the redshift range $6.50 < z < 7.65$ out of the 37 total sources. The major differences are in the instruments used to collect the spectra: in Yang et al. (2021) the data mostly come from Gemini/GNIRS and Keck/NIRES, while in this work, a higher fraction of the data comes from VLT/X-Shooter, which provides better spectral resolution (see the $R$ values of each instrument reported in Table 1) and, consequently, higher quality spectra. Most spectra in Yang et al. (2021) have wavelength coverage only in the NIR arm. Indeed, because of their different scientific goals, which consist of investigating the central BH masses from the Mg II emission lines and the Eddington ratio distribution, and studying rest-frame UV properties like the chemical abundances from the Fe II/Mg II ratio, Yang et al. (2021) do not aim to have coverage in the optical band. In contrast, we create a sample that can be used for additional scientific goals, such as the study of the proximity zones (Onorato et al., in prep), or the Ly$\alpha$ forest, requiring wavelength coverage also in the VIS arm, as we provide in this work.

The other three samples, which have only a few sources in common with this work, are: Ďurovčíková et al. (2024) with 8 sources in common in the redshift range $6.53 < z < 7.09$ out of the 18 total ones, Schindler et al. (2020) with 8 quasars in common at $6.57 < z < 7.65$ out of the 38 total ones, and D’Odorico et al. (2023) with only 6 sources in common at $6.50 < z < 6.64$ out of the 42 total ones. In the first sample mentioned, the spectra come almost entirely from Magellan/FIRE ($R = 6000$ if the slit is 0.6′′ wide), which has a better data quality than the GNIRS and NIRES/MODS spectra in common, but worse than the X-Shooter ones. The other two samples have all the spectra acquired with VLT/X-Shooter and the same data quality as this sample.

Finally, there are no sources in common with Shen et al. (2019), as the redshift range ($5.71-6.42$) of their 50 quasars does not overlap with the one in this work. All their spectra come from Gemini/GNIRS.

Table 4: Main properties of the sample in this paper, compared with those of other spectroscopic samples from the literature. The columns show respectively: the reference of the sample, the redshift range, the $M_{1450}$ range, the total number of quasars included in the sample (with the objects in common with this work), and the instruments used to take the spectra.

| Sample | $z$ range | $M_{1450}$ range | $\rm{N_{sources}}$ (in common) | Instruments |
|---|---|---|---|---|
| Onorato et al. (2024 - this work) | $6.50-7.65$ | $[-28.8, -24.1]$ | 45 | GNIRS/NIRES/X-Shooter/GMOS/LRIS/DEIMOS/MODS |
| Ďurovčíková et al. (2024) | $6.03-7.08$ | $[-28.0, -26.5]$ | 18 (8) | FIRE/X-Shooter/MOSFIRE/ESI |
| D’Odorico et al. (2023) | $5.77-6.63$ | $[-27.8, -25.8]$ | 42 (6) | X-Shooter |
| Yang et al. (2021) | $6.30-7.65$ | $[-27.4, -25.2]$ | 37 (31) | GNIRS/NIRES/X-Shooter/FIRE/F2 |
| Schindler et al. (2020) | $5.78-7.54$ | $[-29.0, -24.4]$ | 38 (8) | X-Shooter |
| Shen et al. (2019) | $5.71-6.42$ | $[-27.8, -25.2]$ | 50 | GNIRS |

5 Composite Spectrum

In this section, we present a $z > 6.5$ quasar composite spectrum based on this sample and compare it with other composite spectra from the literature (see Figure 4 and Table 6). We aim to study the average UV spectral properties of quasars and their possible evolution with redshift. We exclude all the quasars that show BAL features in their spectra (flagged in Table 3) and generate the composite with 33 out of the 45 $z > 6.5$ quasars in the sample. This choice comes from the fact that BAL features can distort the shape of the main emission lines, producing a prominent C IV and many other high-ionization features, such as Si IV and N V.

We generate the composite spectrum following Selsing et al. (2016) as a guideline:

  • We move the spectra to the rest-frame, dividing the wavelengths by $(1+z)$;

  • We define a new wavelength grid: [1040, 3332] Å in steps of $dv_{\rm{pix}} = 110$ km s$^{-1}$ (set to be coarser than the coarsest pixel scale, in velocity, among all the instruments used to create the sample);

  • We re-bin the spectra into the new wavelength grid;

  • We normalize the spectra to the continuum flux at 1450 Å rest-frame, where there are no strong broad lines or iron emission;

  • Only for $\lambda > 1225$ Å, we apply the following masks to improve the overall quality of the composite, without affecting its natural shape in the Ly$\alpha$ region:

    1. Telluric transmission $> 0.5$: we use the individual PypeIt telluric model fits to mask out the telluric regions;

    2. SNR $> 0.5$: to mask the flux where the SNR is very low (and hence the noise is high);

    3. $\sigma$ [$10^{-17}$ erg s$^{-1}$ cm$^{-2}$ Å$^{-1}$] $< 1.5$: to mask the flux where the noise is high (i.e., in those regions close to the right edge of the wavelength coverage of the spectra, where the noise increases exponentially);

    4. Flux [$10^{-17}$ erg s$^{-1}$ cm$^{-2}$ Å$^{-1}$] $< 40$: to mask out the outliers (e.g., hot pixels or sky lines not well subtracted during the spectra reduction).

  • We create the composite spectrum as a weighted mean of the individual spectra. The weights for each spectrum at each wavelength are calculated as shown in Equation 3, where the different factors are: the good pixel mask (gpm) used to exclude bad pixels from the calculation, the combination of the four masks defined above ($\rm{masks_{tot}}$), and the number of spectra used at every wavelength ($n_{\rm{used}}$). Thus, the composite spectrum at every wavelength is the sum of the fluxes of the individual spectra at that wavelength, each multiplied by its weight, and divided by the sum of the weights.

    $$\rm{weights} = \frac{\rm{gpm} \cdot \rm{masks_{tot}}}{n_{\rm{used}}} \qquad (3)$$
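
The masking and stacking steps listed above can be sketched as follows. This is an illustration under stated assumptions, not the code used to build the composite: the spectra are assumed to be already shifted to the rest-frame, re-binned onto the common grid, and normalized; the mask thresholds are applied to values expressed in the units quoted in the list; and the function names are hypothetical.

```python
def total_mask(wave_rest, flux, sigma, telluric):
    """Combine the four masks; pixels at wave_rest <= 1225 Å are always kept."""
    out = []
    for w, f, s, t in zip(wave_rest, flux, sigma, telluric):
        if w <= 1225.0:
            out.append(1)
            continue
        snr = f / s if s > 0 else 0.0
        keep = t > 0.5 and snr > 0.5 and s < 1.5 and f < 40.0
        out.append(1 if keep else 0)
    return out


def composite(fluxes, gpms, masks):
    """Weighted mean of the spectra with the weights of Equation 3.

    fluxes, gpms, masks: one list per spectrum, all on the same grid.
    Returns the composite and the number of spectra used at each pixel.
    """
    comp, n_used = [], []
    for j in range(len(fluxes[0])):
        good = [g[j] * m[j] for g, m in zip(gpms, masks)]
        n = sum(good)
        n_used.append(n)
        if n == 0:
            comp.append(float("nan"))   # no spectrum contributes here
            continue
        weights = [g / n for g in good]                       # Equation 3
        comp.append(sum(f[j] * w for f, w in zip(fluxes, weights)) / sum(weights))
    return comp, n_used


# Toy example with two spectra and three pixels
comp, n_used = composite(
    fluxes=[[1.0, 2.0, 3.0], [3.0, 4.0, 100.0]],
    gpms=[[1, 1, 1], [1, 0, 1]],
    masks=[[1, 1, 1], [1, 1, 0]],
)
print(comp, n_used)
```

Since $n_{\rm{used}}$ is the same for all spectra at a given wavelength, it cancels in the ratio, so with Equation 3 as written the composite reduces to the plain average of the unmasked fluxes at that wavelength.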
Figure 4: Bottom panel: Composite spectrum of the 33 non-BAL quasars in the sample (black) with its noise vector (grey), compared with several other composites from the literature (colored curves). All the composites are normalized to the continuum flux at 1450 Å. The grey band for $\lambda < 1225$ Å highlights the mask-free region, where none of the masks described in the main text is applied. The main emission lines are shown as dashed blue lines. The inset panel shows a zoom-in of the region [1175, 1580] Å, where the composites differ more. Middle panel: Number of spectra that are contributing to the composite at each wavelength for this work and the only two available from the literature (same colors as in the bottom panel). Top panel: Mean redshift that contributes to the composite at each wavelength for this work.
Table 5: Composite spectrum of the 33 non-BAL quasars in the sample. Wavelengths are in the rest-frame and in units of Å. Flux density units are arbitrary, normalized to the rest-frame 1450 Å continuum flux. The third column is the noise vector. The fourth column indicates the number of quasar spectra contributing to the composite at each wavelength. The last column is the mean redshift that contributes to the composite at each wavelength. The entire table data is available online.

| Wavelengths [Å] | Flux [$\rm{F_{\lambda}}$] | Error | N spec | Mean $z$ |
|---|---|---|---|---|
| 1040.00000 | 0.07235 | 0.03021 | 18 | 6.73951 |
| 1040.38167 | 0.14979 | 0.02575 | 18 | 6.73951 |
| 1040.76347 | 0.24631 | 0.02212 | 18 | 6.73951 |
| 1600.55798 | 1.36016 | 0.04412 | 30 | 6.72212 |
| 2096.78621 | 0.97827 | 0.04707 | 33 | 6.76699 |
| 3250.71769 | 0.78563 | 0.14352 | 13 | 6.62963 |

The composite spectrum that we obtain is available online as explained in the Data Availability Section, and an example of the format of the FITS file is shown in Table 5. For $\lambda \in$ [1700, 1900] Å and $\lambda \in$ [2300, 2600] Å (rest-frame), the quality of the composite decreases because of the presence of strong telluric absorption at $\lambda \in$ [13500, 14150] Å and $\lambda \in$ [18200, 19300] Å (observed frame). Indeed, at the wavelengths of the highest absorption, only $\simeq 5-10$ spectra contribute to the composite because of the narrow redshift range of our sample. The quality also decreases for $\lambda > 3100$ Å, where the number of contributing spectra starts to drop.
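
The correspondence between the observed-frame telluric bands and the affected rest-frame regions follows from dividing by $(1+z)$. The sketch below evaluates it at $z = 6.75$, an illustrative value close to the mean redshifts listed in Table 5; the function name is hypothetical.

```python
TELLURIC_OBS = [(13500.0, 14150.0), (18200.0, 19300.0)]  # Å, observed frame


def to_rest(band, z):
    """Shift an observed-frame wavelength interval [Å] to the rest-frame."""
    return tuple(w / (1.0 + z) for w in band)


rest = [to_rest(band, 6.75) for band in TELLURIC_OBS]
for obs, (lo, hi) in zip(TELLURIC_OBS, rest):
    print(obs, "->", round(lo, 1), round(hi, 1))
```

At this redshift the two bands map to roughly 1742 to 1826 Å and 2348 to 2490 Å, inside the rest-frame intervals quoted above. Because the redshift range of the sample is narrow, the masked bands of different quasars largely overlap in the rest-frame, which is why few spectra contribute there.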

In the bottom panel of Figure 4, we compare our composite with those from D’Odorico et al. (2023), Yang et al. (2021), Shen et al. (2019), Selsing et al. (2016), Telfer et al. (2002), and Vanden Berk et al. (2001). For comparison purposes, all the composites are normalized at 1450 Å; their continuum shapes match better for $\lambda < 1580$ Å than at longer wavelengths, where there is a visible offset between the various continua. We also mark the main emission lines in this wavelength range, and show an inset panel with a zoom-in of the region [1175, 1580] Å, where we can see differences in the Ly$\alpha$ and C IV lines, despite the good overlap among the continuum shapes. In the middle panel of Figure 4, we display the number of spectra contributing to the composite at each wavelength for our work, Yang et al. (2021), and Shen et al. (2019) (the only two for which this information is available). Finally, in the top panel, we show the mean redshift that contributes to the composite at each wavelength. We summarize the properties of our composite and of the comparison ones in Table 6.

Table 6: Main properties of the non-BAL composite spectrum created from the sample in this paper, compared with those of other composites. The columns show, respectively: the reference of the composite, the redshift range of the sample, the $M_{1450}$ range of the sample, the total number of quasars included in the sample, and the instruments used to take the spectra.

| Composite | $z$ range | $M_{1450}$ range | $\rm{N_{tot}}$ quasars | Instruments |
|---|---|---|---|---|
| Onorato et al. (2024, this work) | $6.50-7.55$ | $[-27.4, -25.2]$ | 33 | GNIRS/NIRES/X-Shooter/GMOS/LRIS/MODS |
| D’Odorico et al. (2023) | $5.77-6.63$ | $[-27.8, -25.8]$ | 42 | X-Shooter |
| Yang et al. (2021) | $6.50-7.65$ | $[-27.4, -25.2]$ | 38$^{\rm{a}}$ | GNIRS/NIRES/X-Shooter/FIRE/F2 |
| Shen et al. (2019) | $5.71-6.42$ | $[-27.8, -25.2]$ | 50 | GNIRS |
| Selsing et al. (2016) | $1.00-2.10$ | $[-28.1, -27.2]^{\rm{b}}$ | 7 | X-Shooter |
| Telfer et al. (2002) | $0.33-3.60$ | $[-27.7, -24.7]^{\rm{b}}$ | 184 | FOS/GHRS/STIS |
| Vanden Berk et al. (2001) | $0.044-4.789$ | $[-25.8, -22.8]^{\rm{c}}$ | 2204 | SDSS |

  • a Where 31 come from Yang et al. (2021), and 7 from Schindler et al. (2020).

  • b From Lusso et al. (2015), $M_{1450} = M_{i}(z=2) + 1.28$.

  • c Converted from $17.5 < r^{\prime} < 20.5$, with $z_{\rm{median}} = 1.253$.

The discrepancies highlighted in the inset panel of Figure 4 are most likely due to differences in luminosity among the samples (see the $M_{1450}$ column in Table 6), since many emission-line properties are functions of quasar luminosity. This is the case for C IV, a high-ionization line visible in the composites, whose equivalent width (EW) is expected to decrease with increasing luminosity, according to the so-called Baldwin effect (Baldwin, 1977). The biggest differences are in the strength of the C IV emission line when comparing our composite with those from the brightest samples, such as Selsing et al. (2016), Shen et al. (2019), and D’Odorico et al. (2023). This effect is also confirmed by the test described in Section 5.3, where we divide the sample of non-BAL quasars into two luminosity bins and create a separate composite from each one. In Figure 7, the Ly$\alpha$, C IV, and Mg II emission lines show different EWs because of the difference in $M_{1450}$ between the two sub-samples. The different strength of the Ly$\alpha$ line between our sample (or high-$z$ samples in general) and low-$z$ ones is due to the increasing absorption from the IGM toward higher redshift. The overall continuum slope of the composite is consistent with those from the literature, showing that the same spectral features are preserved in quasars at different $z$ ranges.
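For reference, the EW of an emission line is the integral of $(F_\lambda / F_{\rm cont} - 1)$ over the line. The sketch below illustrates this definition with a toy Gaussian line on a flat continuum; it is not a measurement performed in this paper, and the function name, the integration limits, and the line parameters are assumptions made for illustration.

```python
import numpy as np


def equivalent_width(wave_rest, flux, continuum, lo, hi):
    """Rest-frame EW (Å) of an emission line integrated over [lo, hi].

    Positive values correspond to emission above the continuum.
    """
    wave_rest = np.asarray(wave_rest, dtype=float)
    flux = np.asarray(flux, dtype=float)
    continuum = np.asarray(continuum, dtype=float)
    sel = (wave_rest >= lo) & (wave_rest <= hi)
    w = wave_rest[sel]
    excess = flux[sel] / continuum[sel] - 1.0
    # Trapezoidal integration written out explicitly.
    return float(0.5 * np.sum((excess[1:] + excess[:-1]) * np.diff(w)))


# Toy example: Gaussian line (assumed amplitude and width) near C IV.
wave = np.linspace(1400.0, 1700.0, 3001)
continuum = np.ones_like(wave)
amplitude, sigma, center = 2.0, 10.0, 1549.0
flux = continuum + amplitude * np.exp(-0.5 * ((wave - center) / sigma) ** 2)
ew = equivalent_width(wave, flux, continuum, 1450.0, 1650.0)
```

For a Gaussian line on a flat continuum the result approaches $A\,\sigma\sqrt{2\pi}$, which gives a quick sanity check on the integration.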

In the next sections, we discuss an alternative version of the composite obtained by including the BAL quasars, and two tests performed by dividing the $33$ non-BAL quasars first into two redshift bins and then into two $M_{1450}$ bins.

5.1 Including BAL quasars

As already stated at the beginning of Section 5, BAL features can affect the shape of a quasar spectrum. For this reason, we exclude all the BAL quasars in the sample from the final composite spectrum. However, for completeness, we also create another version of the composite that uses all the quasars in the sample, BALs included. We show the comparison between the two versions in Figure 5, where the blue curve is the composite created from the $33$ non-BAL quasars in the sample, and the orange curve is the one obtained by also including the $12$ BALs flagged in Table 3 and listed in Section 2.1. The rest of the plot shows the same quantities already described in Figure 4. From this comparison, we see no strong differences between the two versions of the composite. Both D’Odorico et al. (2023) and Yang et al. (2021) carry out the same investigation of the BAL contribution to their composite spectra. They agree with this study that BALs do not cause significant differences but, unlike this work, they include them in their final composites anyway.

Figure 5: Bottom panel: Composite spectrum of the $33$ non-BAL quasars in the sample (dark blue curve, which is the same as the black curve in Figure 4), compared with the composite spectrum obtained including the $12$ BAL quasars (orange curve), which are flagged in Table 3 and also listed in Section 2.1. There are no significant differences between the two versions of the composite. The rest of the plot shows the same quantities already described in Figure 4.

5.2 Dividing the sample into two redshift bins

To check whether we are averaging out any features in the final composite spectrum because of the relatively wide redshift distribution of the quasars in the sample, we perform another test, dividing the $33$ non-BAL quasars into two redshift bins and creating a composite spectrum from the spectra in each bin. If any "$z$-related" features are present, they might emerge when dividing the sample into two groups based on redshift. To keep enough spectra in each bin, we split at the median redshift of the $33$ non-BALs ($z_{\rm{median,non-BAL}} = 6.70$) and create one version of the composite from the $17$ spectra with $z \geq 6.70$ and another from the $16$ spectra with $z < 6.70$. The two versions are shown in Figure 6, where the blue curve is the composite created from the quasars at $z \geq 6.70$, and the orange curve is the one obtained from the quasars at $z < 6.70$. The rest of the plot shows the same quantities already described in Figure 4.
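The median split described above can be sketched as follows. This is an illustration only: the function name is hypothetical and the redshifts are synthetic placeholders, not the values of the quasars in the sample. The same convention (values at or above the median in one bin, values below it in the other) applies to a split in $M_{1450}$.

```python
import numpy as np


def split_at_median(values):
    """Return the median and the indices of the two bins.

    Entries with value >= median go in the first bin, the rest in the
    second, so an odd-sized sample with distinct values splits as
    (N + 1) / 2 and (N - 1) / 2.
    """
    values = np.asarray(values, dtype=float)
    median = np.median(values)
    upper = np.flatnonzero(values >= median)
    lower = np.flatnonzero(values < median)
    return median, upper, lower


# Synthetic placeholder redshifts for 33 objects (not the real sample).
z_fake = np.linspace(6.50, 7.55, 33)
z_med, high_z, low_z = split_at_median(z_fake)
print(len(high_z), len(low_z))
```

With 33 distinct values this convention yields bins of 17 and 16 objects, matching the bin sizes quoted in the text.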

From this comparison, we notice that the high-$z$ composite has a shallower slope and thus a redder continuum. A possible interpretation is a higher incidence of "subtle" BAL absorption at higher redshift. Bischetti et al. (2022, 2023) found that the BAL fraction in $z \geq 6$ quasars is $2-3$ times higher than in quasars at $z \simeq 2-4.5$. The presence of BALs correlates with a redder continuum, probably due to dust attenuation. If this idea is correct, it would explain why the Ly$\alpha$ and C IV lines look weaker. Alternatively, the weaker Ly$\alpha$ could be explained by more neutral hydrogen in the IGM, which causes more absorption and suggests the presence of statistical IGM damping wings (Ďurovčíková et al., 2024). Finally, the difference in the Mg II lines most likely reflects a change in the average black hole mass or accretion rate of the sub-samples.

Figure 6: Bottom panel: Composite spectrum of the $17$ spectra with $z \geq 6.70$ (dark blue curve), compared with the composite spectrum obtained from the $16$ spectra with $z < 6.70$ (orange curve), where $z_{\rm{median,non-BAL}} = 6.70$ is the median redshift of the $33$ non-BAL quasars. The rest of the plot shows the same quantities already described in Figure 4.

5.3 Dividing the sample into two magnitude bins

The last test we perform on the composite consists of dividing the $33$ non-BAL quasars into two $M_{1450}$ bins and creating a composite spectrum from the spectra in each bin. As before, to keep enough spectra in each bin, we split at the median $M_{1450}$ of the $33$ non-BALs ($M_{1450,\rm{median,non-BAL}} = -26.0$) and create one version of the composite from the $16$ spectra with $M_{1450} < -26.0$ and another from the $17$ spectra with $M_{1450} \geq -26.0$. The two versions are shown in Figure 7, where the blue curve is the composite created from the quasars with $M_{1450} < -26.0$, and the orange curve is the one obtained from the quasars with $M_{1450} \geq -26.0$. The rest of the plot shows the same quantities already described in Figure 4.

The differences in the strength of the emission lines are consistent with the Baldwin effect (Baldwin, 1977), already discussed in Section 5, according to which quasar luminosity anti-correlates with emission-line strength. This test shows that the effect is still visible at early times.

Figure 7: Bottom panel: Composite spectrum of the $16$ spectra with $M_{1450} < -26.0$ (dark blue curve), compared with the composite spectrum obtained from the $17$ spectra with $M_{1450} \geq -26.0$ (orange curve), where $M_{1450,\rm{median,non-BAL}} = -26.0$ is the median $M_{1450}$ of the $33$ non-BAL quasars. The rest of the plot shows the same quantities already described in Figure 4.

6 Summary

In this section, we discuss the conclusions of this paper, the quantitative analysis performed on the sample, and future work. We summarize the two main results below.

  • We report and release a sample of $45$ quasars at $6.50 < z \leq 7.64$ ($z_{\rm{median}} = 6.71$), in the magnitude range $-28.82 \leq M_{1450} \leq -24.13$ ($M_{1450,\rm{median}} \simeq -26.1$), where $12$ out of the $45$ quasars show BAL features in their spectrum, and $3$ are new unpublished quasars (Bañados et al. in prep, Belladitta et al. in prep). This represents the largest medium/moderate-resolution released sample of quasars at high redshift from ground-based instruments. The optical and NIR spectroscopy was obtained using the Gemini/GNIRS, Keck/NIRES, VLT/X-Shooter, Gemini/GMOS, Keck/DEIMOS, Keck/LRIS, and LBT/MODS instruments. The data in this sample allow us to estimate the $M_{1450}$ of the quasars directly from the spectra, and to determine their quality from the SNR:

    1. After scaling the flux and inverse variance of each spectrum to the J, Y, or $\rm{K_{p}}$ band magnitude of the quasar, we calculate $M_{1450}$ from these flux-scaled spectra (see Section 3.3 and Appendix B). These values may be relevant for the analysis of quasar proximity zones (Onorato et al., in prep), in case the measurements have to be corrected for the luminosity of the quasar.

    2. We compute the SNR of the spectra in the sample in three wavelength ranges that exclude the telluric regions: $[11000, 13400]$, $[14500, 17950]$, and $[19650, 22400]$ Å (see Section 3.4 and Appendix C). This quantifies the quality of the spectra and indicates how much a given spectrum can be trusted.

  • We create a $z > 6.5$ quasar composite spectrum using $33$ out of the $45$ quasars of this sample: we exclude the BAL quasars to avoid biases in the analysis due to absorption features in their spectra. We compare the composite with others from low- and high-redshift quasar samples from the literature. No significant redshift evolution is found for either broad UV emission lines or quasar continuum slopes (see Section 5). An alternative version of the composite, created by also including the BAL quasars, is described in Section 5.1. There are no strong differences between the two versions. Two further tests are discussed in Sections 5.2 and 5.3. In the first one, we divide the sample of $33$ non-BAL quasars into two redshift bins ($17$ quasars at $z \geq 6.70$ and $16$ at $z < 6.70$) and create a composite from the spectra in each bin. There are small differences in the continuum shape and in the strength of the Ly$\alpha$ and C IV emission lines, visible in Figure 6, that we tentatively interpret as due to the presence of subtle BAL absorption at higher $z$ (Bischetti et al., 2022, 2023). Alternatively, statistical damping wings (Ďurovčíková et al., 2024) could also explain the weaker Ly$\alpha$ observed at higher $z$, while the difference in Mg II could arise from changes in the average black hole mass or accretion rate. In the second test, we divide the sample of $33$ non-BAL quasars into two $M_{1450}$ bins ($17$ quasars at $M_{1450} \geq -26.0$ and $16$ at $M_{1450} < -26.0$) and create a composite from the spectra in each bin. In Figure 7, we see that weaker emission lines are associated with brighter quasars, confirming the presence of the Baldwin effect (Baldwin, 1977), which still holds at early times.
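The SNR measurement in the three telluric-free windows listed in the first bullet above can be sketched as follows. This is not the code used in this work: the function name is hypothetical, and taking the median per-pixel SNR (flux times the square root of the inverse variance) in each window is an assumption; the statistic actually adopted is the one defined in Section 3.4.

```python
import numpy as np

# Observed-frame windows (Å) quoted in the text.
SNR_WINDOWS = [(11000.0, 13400.0), (14500.0, 17950.0), (19650.0, 22400.0)]


def window_snr(wave_obs, flux, ivar, windows=SNR_WINDOWS):
    """Median per-pixel SNR in each wavelength window (assumed statistic).

    Pixels with non-positive inverse variance are treated as masked.
    Returns NaN for a window with no valid pixels.
    """
    wave_obs = np.asarray(wave_obs, dtype=float)
    flux = np.asarray(flux, dtype=float)
    ivar = np.asarray(ivar, dtype=float)
    snr_pix = flux * np.sqrt(np.clip(ivar, 0.0, None))
    results = []
    for lo, hi in windows:
        sel = (wave_obs >= lo) & (wave_obs <= hi) & (ivar > 0)
        results.append(float(np.median(snr_pix[sel])) if sel.any() else float("nan"))
    return results


# Toy spectrum: constant flux 2 with sigma = 0.2 per pixel, so SNR = 10.
wave = np.linspace(10000.0, 23000.0, 1301)
flux = np.full_like(wave, 2.0)
ivar = np.full_like(wave, 25.0)
snr_values = window_snr(wave, flux, ivar)
```

Reporting one value per window, rather than a single number, keeps the measurement informative for spectra that cover only part of the NIR range.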

Starting from the sample presented in this paper, in subsequent works we will reconstruct the quasars’ intrinsic blue side from the observed red side with PCA continuum modeling (e.g., Davies et al. 2018a, b; Bosman et al. 2021), to study their proximity zones (Onorato et al., in prep) and Ly$\alpha$ damping wings (Miralda-Escudé 1998; Davies et al. 2018a; Ďurovčíková et al. 2024; Greig et al. 2024). In particular, Hennawi et al. in prep introduce a new inference approach for analyzing the IGM damping wings, deriving a single Bayesian likelihood for the entire spectrum, while Kist et al. in prep quantify the precision with which IGM damping wings analyzed with the new method can measure astrophysical parameters such as $\langle x_{\rm{HI}} \rangle$ and $t_{\rm{Q}}$, and the dependence of this precision on the properties of the spectra analyzed. Our final goal will be to impose more stringent constraints on $\langle x_{\rm{HI}} \rangle$ during the EoR and on the radiative efficiency of the earliest SMBHs.

Acknowledgements

This work is based in part on observations obtained at the international Gemini Observatory, a program of NSF NOIRLab, which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the U.S. National Science Foundation on behalf of the Gemini Observatory partnership: the U.S. National Science Foundation (United States), National Research Council (Canada), Agencia Nacional de Investigación y Desarrollo (Chile), Ministerio de Ciencia, Tecnología e Innovación (Argentina), Ministério da Ciência, Tecnologia, Inovações e Comunicações (Brazil), and Korea Astronomy and Space Science Institute (Republic of Korea).

Some of the data presented herein were obtained at Keck Observatory, which is a private 501(c)3 non-profit organization operated as a scientific partnership among the California Institute of Technology, the University of California, and the National Aeronautics and Space Administration. The Observatory was made possible by the generous financial support of the W. M. Keck Foundation. The authors wish to recognize and acknowledge the very significant cultural role and reverence that the summit of Maunakea has always had within the Native Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain. This research has made use of the Keck Observatory Archive (KOA), which is operated by the W. M. Keck Observatory and the NASA Exoplanet Science Institute (NExScI), under contract with the National Aeronautics and Space Administration.

This work is based in part on observations made with ESO telescopes at the La Silla Paranal Observatory.

This paper also uses data based on observations obtained at the LBT, an international collaboration among institutions in the United States, Italy and Germany. The LBT Corporation partners are: The University of Arizona on behalf of the Arizona university system; Istituto Nazionale di Astrofisica, Italy; LBT Beteiligungsgesellschaft, Germany, representing the Max Planck Society, the Astrophysical Institute Potsdam, and Heidelberg University; The Ohio State University; The Research Corporation, on behalf of The University of Notre Dame, University of Minnesota and University of Virginia.

We acknowledge the use of the PypeIt data reduction package.

We acknowledge helpful conversations with the ENIGMA group at UC Santa Barbara and Leiden University. SO is grateful to Ben Wang and Suk Sien Tie for help with some of the data reductions, and to Elia Pizzati and Caitlin Doughty for comments on an early version of the manuscript.

JFH acknowledges support from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No 885301), from the National Science Foundation (NSF) under Grant No. 2307180, and from NASA under the Astrophysics Data Analysis Program (ADAP, Grant No. 80NSSC21K1568). JTS is supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - Project number 518006966. CM acknowledges support from Fondecyt Iniciacion grant 11240336 and the ANID BASAL project FB210003. E.P.F. is supported by the international Gemini Observatory, a program of NSF’s NOIRLab, which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation, on behalf of the Gemini partnership of Argentina, Brazil, Canada, Chile, the Republic of Korea, and the United States of America.

Data Availability

Data archives

The raw Gemini data (both GNIRS and GMOS) can be searched and downloaded from the Gemini Observatory Archive at this link: https://archive.gemini.edu/searchform. The user needs to set the Instrument used and the coordinates of the target (RA and Dec). Additionally, we note that GNIRS data sometimes show superimposed features such as vertical striping, horizontal banding, and quadrant offsets. We use the CLEANIR Python routine (https://www.gemini.edu/instrumentation/niri/data-reduction) to remove these artifacts.

The raw Keck data (NIRES, DEIMOS, and LRIS) can be downloaded from a Basic Search on the Keck Observatory Archive (KOA) at this link: https://koa.ipac.caltech.edu/cgi-bin/KOA/nph-KOAlogin. The user needs to select the Instrument used, set the Object Name or Location, and decide how to return the results. To download multiple targets, the user can open More Search Options, select the desired Instruments, and add a file in the Multiple Object Table File section. A code to create such a table is provided in the GitHub repository storing the relevant tables and codes used in this paper, whose link is provided in Section 2.3.

The raw ESO data (VLT/X-Shooter) can be downloaded from the ESO Observatory Archive at this link: https://archive.eso.org/eso/eso_archive_main.html. The user can type the coordinates of the target (RA and Dec), set the Instrument used, and select the category of data desired. To download multiple targets, the user can add a file in the List of Targets section. A code to create such a file is provided in the GitHub repository already mentioned.

The raw LBT data (MODS) come from private projects present in the LBT Archive at this link: http://archive.lbto.org/. The user needs to set the coordinates of the target (RA and Dec), the Instrument used, and to authenticate for the project.

In all of these archives, the search can be made more or less restrictive by filling in more or fewer search fields.

Data reduction and analysis

The files to reduce the spectroscopic data with PypeIt (.pypeit, .flux, .coadd1d, and .tell), the .csv tables (containing information such as the redshift of the quasars, photometric details or SNR), and the relevant codes to reproduce the results in this paper are all stored in the GitHub repository previously mentioned.

Final FITS files (spectra and composites)

All the FITS files produced in this work (45 flux-scaled spectra and 6 different versions of the composite spectrum) are available at CDS via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via this link: https://cdsarc.cds.unistra.fr/viz-bin/cat/J/MNRAS, after the publication of the paper in MNRAS.

References

  • Aihara et al. (2018) Aihara H., et al., 2018, PASJ, 70, S4
  • Andika et al. (2020) Andika I. T., et al., 2020, ApJ, 903, 34
  • Andika et al. (2022) Andika I. T., et al., 2022, AJ, 163, 251
  • Andika et al. (2024) Andika I. T., et al., 2024, A&A, 685, A25
  • Bañados et al. (2015) Bañados E., Decarli R., Walter F., Venemans B. P., Farina E. P., Fan X., 2015, ApJ, 805, L8
  • Bañados et al. (2016) Bañados E., et al., 2016, ApJS, 227, 11
  • Bañados et al. (2018) Bañados E., et al., 2018, Nature, 553, 473
  • Bañados et al. (2019) Bañados E., et al., 2019, ApJ, 885, 59
  • Bañados et al. (2021) Bañados E., et al., 2021, ApJ, 909, 80
  • Baldwin (1977) Baldwin J. A., 1977, ApJ, 214, 679
  • Becker et al. (2015) Becker G. D., Bolton J. S., Madau P., Pettini M., Ryan-Weber E. V., Venemans B. P., 2015, MNRAS, 447, 3402
  • Bischetti et al. (2022) Bischetti M., et al., 2022, Nature, 605, 244
  • Bischetti et al. (2023) Bischetti M., et al., 2023, ApJ, 952, 44
  • Bochanski et al. (2009) Bochanski J. J., et al., 2009, PASP, 121, 1409
  • Bosman et al. (2021) Bosman S. E. I., Ďurovčíková D., Davies F. B., Eilers A.-C., 2021, MNRAS, 503, 2077
  • Bosman et al. (2022) Bosman S. E. I., et al., 2022, MNRAS, 514, 55
  • Casali et al. (2007) Casali M., et al., 2007, A&A, 467, 777
  • Clough et al. (2005) Clough S. A., Shephard M. W., Mlawer E. J., Delamere J. S., Iacono M. J., Cady-Pereira K., Boukabara S., Brown P. D., 2005, J. Quant. Spectrosc. Radiative Transfer, 91, 233
  • D’Odorico et al. (2023) D’Odorico V., et al., 2023, MNRAS, 523, 1399
  • Davies et al. (2018a) Davies F. B., et al., 2018a, ApJ, 864, 142
  • Davies et al. (2018b) Davies F. B., et al., 2018b, ApJ, 864, 143
  • Davies et al. (2019) Davies F. B., Hennawi J. F., Eilers A.-C., 2019, ApJ, 884, L19
  • Davies et al. (2020) Davies F. B., Hennawi J. F., Eilers A.-C., 2020, MNRAS, 493, 1330
  • Davies et al. (2023) Davies R. L., et al., 2023, MNRAS, 521, 289
  • Decarli et al. (2018) Decarli R., et al., 2018, ApJ, 854, 97
  • Dye et al. (2018) Dye S., et al., 2018, MNRAS, 473, 5113
  • Eilers et al. (2017) Eilers A.-C., Davies F. B., Hennawi J. F., Prochaska J. X., Lukić Z., Mazzucchelli C., 2017, ApJ, 840, 24
  • Eilers et al. (2020) Eilers A.-C., et al., 2020, ApJ, 900, 37
  • Elias et al. (2006a) Elias J. H., Rodgers B., Joyce R. R., Lazo M., Doppmann G., Winge C., Rodríguez-Ardila A., 2006a, in McLean I. S., Iye M., eds, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series Vol. 6269, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series. p. 626914, doi:10.1117/12.671765
  • Elias et al. (2006b) Elias J. H., Joyce R. R., Liang M., Muller G. P., Hileman E. A., George J. R., 2006b, in McLean I. S., Iye M., eds, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series Vol. 6269, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series. p. 62694C, doi:10.1117/12.671817
  • Faber et al. (2003) Faber S. M., et al., 2003, in Iye M., Moorwood A. F. M., eds, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series Vol. 4841, Instrument Design and Performance for Optical/Infrared Ground-based Telescopes. pp 1657–1669, doi:10.1117/12.460346
  • Fan et al. (2006) Fan X., et al., 2006, AJ, 132, 117
  • Fan et al. (2019) Fan X., et al., 2019, ApJ, 870, L11
  • Fan et al. (2023) Fan X., Bañados E., Simcoe R. A., 2023, ARA&A, 61, 373
  • Farina et al. (2022) Farina E. P., et al., 2022, ApJ, 941, 106
  • Fujimoto et al. (2023) Fujimoto S., et al., 2023, arXiv e-prints, p. arXiv:2308.11609
  • Furtak et al. (2023) Furtak L. J., et al., 2023, arXiv e-prints, p. arXiv:2308.05735
  • Gaskell (1982) Gaskell C. M., 1982, ApJ, 263, 79
  • Goulding et al. (2023) Goulding A. D., et al., 2023, ApJ, 955, L24
  • Greene et al. (2023) Greene J. E., et al., 2023, arXiv e-prints, p. arXiv:2309.05714
  • Greig et al. (2024) Greig B., et al., 2024, MNRAS, 530, 3208
  • Gullikson et al. (2014) Gullikson K., Dodson-Robinson S., Kraus A., 2014, AJ, 148, 53
  • Gunn & Peterson (1965) Gunn J. E., Peterson B. A., 1965, ApJ, 142, 1633
  • Hennawi et al. (2006) Hennawi J. F., et al., 2006, ApJ, 651, 61
  • Hook et al. (2004) Hook I. M., Jørgensen I., Allington-Smith J. R., Davies R. L., Metcalfe N., Murowinski R. G., Crampton D., 2004, PASP, 116, 425
  • Horne (1986) Horne K., 1986, PASP, 98, 609
  • Inayoshi et al. (2020) Inayoshi K., Visbal E., Haiman Z., 2020, ARA&A, 58, 27
  • Jiang et al. (2016) Jiang L., et al., 2016, ApJ, 833, 222
  • Kokorev et al. (2023) Kokorev V., et al., 2023, ApJ, 957, L7
  • Kokorev et al. (2024) Kokorev V., et al., 2024, arXiv e-prints, p. arXiv:2401.09981
  • Labbe et al. (2023) Labbe I., et al., 2023, arXiv e-prints, p. arXiv:2306.07320
  • Lai et al. (2022) Lai S., et al., 2022, MNRAS, 513, 1801
  • Larson et al. (2023) Larson R. L., et al., 2023, ApJ, 953, L29
  • Lawrence et al. (2007) Lawrence A., et al., 2007, MNRAS, 379, 1599
  • Lusso et al. (2015) Lusso E., Worseck G., Hennawi J. F., Prochaska J. X., Vignali C., Stern J., O’Meara J. M., 2015, MNRAS, 449, 4204
  • Maiolino et al. (2023a) Maiolino R., et al., 2023a, arXiv e-prints, p. arXiv:2305.12492
  • Maiolino et al. (2023b) Maiolino R., et al., 2023b, arXiv e-prints, p. arXiv:2306.00953
  • Matsuoka et al. (2018) Matsuoka Y., et al., 2018, ApJS, 237, 5
  • Matsuoka et al. (2019a) Matsuoka Y., et al., 2019a, ApJ, 872, L2
  • Matsuoka et al. (2019b) Matsuoka Y., et al., 2019b, ApJ, 883, 183
  • Matsuoka et al. (2022) Matsuoka Y., et al., 2022, ApJS, 259, 18
  • Matthee et al. (2024) Matthee J., et al., 2024, ApJ, 963, 129
  • Mazzucchelli et al. (2017) Mazzucchelli C., et al., 2017, ApJ, 849, 91
  • Mazzucchelli et al. (2023) Mazzucchelli C., et al., 2023, A&A, 676, A71
  • McMahon et al. (2013) McMahon R. G., Banerji M., Gonzalez E., Koposov S. E., Bejar V. J., Lodieu N., Rebolo R., VHS Collaboration 2013, The Messenger, 154, 35
  • Meyer et al. (2019) Meyer R. A., Bosman S. E. I., Ellis R. S., 2019, MNRAS, 487, 3305
  • Miralda-Escudé (1998) Miralda-Escudé J., 1998, ApJ, 501, 15
  • Moorwood et al. (1998) Moorwood A., Cuby J. G., Lidman C., 1998, The Messenger, 91, 9
  • Mortlock et al. (2011) Mortlock D. J., et al., 2011, Nature, 474, 616
  • Oke et al. (1995) Oke J. B., et al., 1995, PASP, 107, 375
  • Onoue et al. (2019) Onoue M., et al., 2019, ApJ, 880, 77
  • Pérez-González et al. (2024) Pérez-González P. G., et al., 2024, arXiv e-prints, p. arXiv:2401.08782
  • Pogge et al. (2010) Pogge R. W., et al., 2010, in McLean I. S., Ramsay S. K., Takami H., eds, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series Vol. 7735, Ground-based and Airborne Instrumentation for Astronomy III. p. 77350A, doi:10.1117/12.857215
  • Prochaska et al. (2020) Prochaska J., et al., 2020, The Journal of Open Source Software, 5, 2308
  • Reed et al. (2017) Reed S. L., et al., 2017, MNRAS, 468, 4702
  • Reed et al. (2019) Reed S. L., et al., 2019, MNRAS, 487, 1874
  • Richards et al. (2002) Richards G. T., Vanden Berk D. E., Reichard T. A., Hall P. B., Schneider D. P., SubbaRao M., Thakar A. R., York D. G., 2002, AJ, 124, 1
  • Rockosi et al. (2010) Rockosi C., et al., 2010, in McLean I. S., Ramsay S. K., Takami H., eds, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series Vol. 7735, Ground-based and Airborne Instrumentation for Astronomy III. p. 77350R, doi:10.1117/12.856818
  • Ross & Cross (2020) Ross N. P., Cross N. J. G., 2020, MNRAS, 494, 789
  • Satyavolu et al. (2023) Satyavolu S., et al., 2023, MNRAS, 522, 4918
  • Schindler et al. (2020) Schindler J.-T., et al., 2020, ApJ, 905, 51
  • Selsing et al. (2016) Selsing J., Fynbo J. P. U., Christensen L., Krogager J. K., 2016, A&A, 585, A87
  • Shen & Ménard (2012) Shen Y., Ménard B., 2012, ApJ, 748, 131
  • Shen et al. (2016) Shen Y., et al., 2016, ApJ, 831, 7
  • Shen et al. (2019) Shen Y., et al., 2019, ApJ, 873, 35
  • Tang et al. (2017) Tang J.-J., et al., 2017, MNRAS, 466, 4568
  • Telfer et al. (2002) Telfer R. C., Zheng W., Kriss G. A., Davidsen A. F., 2002, ApJ, 565, 773
  • Tytler & Fan (1992) Tytler D., Fan X.-M., 1992, ApJS, 79, 1
  • Vanden Berk et al. (2001) Vanden Berk D. E., et al., 2001, AJ, 122, 549
  • Venemans et al. (2013) Venemans B. P., et al., 2013, ApJ, 779, 24
  • Venemans et al. (2015) Venemans B. P., et al., 2015, ApJ, 801, L11
  • Venemans et al. (2016) Venemans B. P., Walter F., Zschaechner L., Decarli R., De Rosa G., Findlay J. R., McMahon R. G., Sutherland W. J., 2016, ApJ, 816, 37
  • Venemans et al. (2017) Venemans B. P., et al., 2017, ApJ, 851, L8
  • Vernet et al. (2011) Vernet J., et al., 2011, A&A, 536, A105
  • Wang et al. (2017) Wang F., et al., 2017, ApJ, 839, 27
  • Wang et al. (2018) Wang F., et al., 2018, ApJ, 869, L9
  • Wang et al. (2019) Wang F., et al., 2019, ApJ, 884, 30
  • Wang et al. (2021a) Wang F., et al., 2021a, ApJ, 907, L1
  • Wang et al. (2021b) Wang F., et al., 2021b, ApJ, 908, 53
  • Wang et al. (2024) Wang F., et al., 2024, arXiv e-prints, p. arXiv:2404.15413
  • Wilson et al. (2004) Wilson J. C., et al., 2004, in Moorwood A. F. M., Iye M., eds, Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series Vol. 5492, Ground-based Instrumentation for Astronomy. pp 1295–1305, doi:10.1117/12.550925
  • Woods et al. (2019) Woods T. E., et al., 2019, Publ. Astron. Soc. Australia, 36, e027
  • Wu et al. (2015) Wu X.-B., et al., 2015, Nature, 518, 512
  • Yang et al. (2019) Yang J., et al., 2019, AJ, 157, 236
  • Yang et al. (2020) Yang J., et al., 2020, ApJ, 897, L14
  • Yang et al. (2021) Yang J., et al., 2021, ApJ, 923, 262
  • Ďurovčíková et al. (2024) Ďurovčíková D., et al., 2024, arXiv e-prints, p. arXiv:2401.10328
  • van Dokkum (2001) van Dokkum P. G., 2001, PASP, 113, 1420

Appendix A Flux scaling with J, Y, or $\rm{K_{p}}$ band photometry

As mentioned in Section 3.2, the reduced spectrum of every quasar in this sample is scaled using J, Y, or $\rm{K_{p}}$ band photometry. This is necessary because the flux calibration performed with PypeIt is only relative. While we are still looking for a definitive absolute flux calibration method, we tie the spectroscopic measurements to the photometric ones, whose fluxes are more accurate.

To correct the spectra, we scale them to a given magnitude in a specified passband filter. First, we calculate the AB magnitude of the spectrum in that passband filter ($m_{\rm{spec}}$) using the Python library Speclite (https://speclite.readthedocs.io/en/latest/). Then we calculate a scale factor ($SF$) from the difference between the magnitude computed from the spectrum and the true magnitude from the photometry ($m_{\rm{phot}}$), using the following formula derived from the Pogson law:

$$SF = 10^{0.4\,(m_{\rm{spec}} - m_{\rm{phot}})} \tag{4}$$

This scale factor is used to adjust the flux and inverse variance of the spectrum to match the true magnitude in the given passband.
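As an illustration of Equation 4, the following is a minimal sketch in plain Python rather than the pipeline used for the data release. The function names and the example magnitudes are hypothetical. It also assumes that the flux is multiplied by $SF$ and that the inverse variance is divided by $SF^{2}$, which is the usual error propagation for a multiplicative rescaling; the text above does not spell out this step.

```python
import math


def scale_factor(m_spec, m_phot):
    """Scale factor of Equation 4: SF = 10^(0.4 * (m_spec - m_phot))."""
    return 10.0 ** (0.4 * (m_spec - m_phot))


def apply_scale(flux, ivar, sf):
    """Rescale a spectrum by sf.

    Assumption: the flux is multiplied by sf and the inverse variance is
    divided by sf**2, so the per-pixel SNR is left unchanged.
    """
    scaled_flux = [f * sf for f in flux]
    scaled_ivar = [w / sf**2 for w in ivar]
    return scaled_flux, scaled_ivar


# Hypothetical magnitudes: the spectrum is 0.5 mag fainter than the photometry.
m_spec, m_phot = 20.8, 20.3
sf = scale_factor(m_spec, m_phot)

# Toy spectrum (arbitrary flux units) and its inverse variance.
flux = [1.0, 2.0, 1.5]
ivar = [4.0, 4.0, 1.0]
new_flux, new_ivar = apply_scale(flux, ivar, sf)

# Magnitude shift produced by the rescaling; it equals m_phot - m_spec.
delta_m = -2.5 * math.log10(sf)
```

With these conventions, a spectrum that is fainter than the photometry ($m_{\rm{spec}} > m_{\rm{phot}}$) gives $SF > 1$ and is scaled up.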

Figure 8: Top: Example spectrum of J1342+0928 before (blue) and after (orange) applying the scale factor ($SF$) calculated from the J-band magnitude of this quasar (as described in Equation 4), together with the noise vectors. The lower plot shows the transmission curve of the passband filter used to obtain the photometry. Middle: Same, but for J0923+0754 and its Y-band photometry. Bottom: Same, but for J1058+2930 and its $\rm{K_{p}}$-band photometry.

Appendix B Absolute magnitude of J0910-0414 and J0923+0402 (BAL quasars)

This Appendix describes the approach followed to calculate $M_{1450}$ in the two problematic cases of J0910-0414 and J0923+0402. These quasars are affected by strong BAL features, which mostly appear around rest-frame 1450 Å, making the measurement of $M_{1450}$ with the method described in Section 3.3 difficult to trust (note that the black dashed line at $\lambda=1450$ Å in Figure 9 falls in the area affected by the BAL). To correct this problem, we want to estimate what the continuum would be in a quasar with the same shape, but without BAL features. To this end, we compare each quasar with the composite spectrum created from the sample excluding the BAL quasars (described in Section 5 and shown in Figure 4). We first shift the observed spectrum (already flux-scaled using its J-band magnitude) to the rest frame; the composite spectrum is already in the rest frame. We then scale the composite to the observed spectrum at a chosen wavelength ($\lambda_{\rm{norm}}=2000$ Å produces a good result in both cases) by multiplying it by a factor $A$, which is the ratio between the flux of the BAL spectrum and the flux of the composite spectrum at the wavelength $\lambda_{\rm{norm}}$. Finally, we apply a power-law correction with an exponent $\alpha$ to tilt the composite so that it better matches the slope of the observed spectrum, according to the following formula:

$$f_{\rm{composite,new}} = f_{\rm{composite}} \cdot A \left(\frac{\lambda_{\rm{composite}}}{\lambda_{\rm{norm}}}\right)^{\alpha} \tag{5}$$

We set $\alpha=0.3$ for J0910-0414 and $\alpha=0.8$ for J0923+0402.
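The rescaling in Equation 5 can be sketched as follows. This is an illustrative example only and not the code used for the paper: the helper names, the linear interpolation used to evaluate the composite at $\lambda_{\rm{norm}}$, and all numerical values except $\lambda_{\rm{norm}}=2000$ Å and $\alpha=0.3$ are assumptions made for the example.

```python
def interp(x, xs, ys):
    """Linearly interpolate the tabulated values ys(xs) at x (xs ascending)."""
    for i in range(len(xs) - 1):
        if xs[i] <= x <= xs[i + 1]:
            t = (x - xs[i]) / (xs[i + 1] - xs[i])
            return ys[i] + t * (ys[i + 1] - ys[i])
    raise ValueError("x outside the tabulated range")


def match_composite(wave_comp, flux_comp, flux_bal_norm, lam_norm, alpha):
    """Apply Equation 5 to the composite spectrum.

    A is the ratio between the BAL-quasar flux and the composite flux at
    lam_norm; the power law in (wavelength / lam_norm) tilts the composite
    around that wavelength.
    """
    a = flux_bal_norm / interp(lam_norm, wave_comp, flux_comp)
    new_flux = [f * a * (w / lam_norm) ** alpha
                for w, f in zip(wave_comp, flux_comp)]
    return a, new_flux


# Hypothetical rest-frame composite (wavelength in Angstrom, arbitrary flux units).
wave_comp = [1200.0, 1450.0, 2000.0, 2500.0]
flux_comp = [2.0, 1.6, 1.0, 0.8]

# Hypothetical BAL-quasar flux at the normalization wavelength.
flux_bal_norm = 3.0

a, flux_new = match_composite(wave_comp, flux_comp, flux_bal_norm,
                              lam_norm=2000.0, alpha=0.3)

# Continuum level of the rescaled composite at 1450 Angstrom.
f_1450 = interp(1450.0, wave_comp, flux_new)
```

By construction the rescaled composite equals the observed flux at $\lambda_{\rm{norm}}$, and a positive $\alpha$ lowers the composite blueward of $\lambda_{\rm{norm}}$ relative to a pure multiplicative scaling.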

Finally, we calculate $M_{1450}$ (reported in Table 3) from the scaled composite spectrum following the method described in Section 3.3, assuming that the magenta star in Figure 9 represents the "new" continuum level for the BAL quasar at 1450 Å. Figure 9 shows the observed spectra in blue and the composite in orange, with their noise vectors reported at the bottom of each plot; the black dashed lines mark $\lambda=1450$ Å and the green dashed lines mark the normalization wavelengths, both set at 2000 Å.

Figure 9: Top: Match between the spectrum of J0910-0414 and the composite spectrum obtained from the sample excluding the BALs, already corrected to get a better estimate of $M_{1450}$. The spectrum of the BAL quasar is shown in blue and the composite in orange, with their noise vectors reported at the bottom of the plot. The black dashed line falls at $\lambda=1450$ Å and the green dashed line marks the normalization wavelength at 2000 Å. The magenta star is the "new" continuum level assumed for the BAL quasar at 1450 Å. Bottom: Same, but for J0923+0402.

Appendix C SNR

In Figure 10, we plot the distribution of the SNR along the rest-frame wavelength grid defined in Section 3.4 for J0313-1806 and J1110-1329 (dark red curves); the J, H, and K bands (also indicated in Section 3.4) are shown in different colors. These two spectra are good examples of medium/high and medium/low SNR, respectively.

Figure 10: Top: Example of the distribution of the SNR along the wavelength grid [1040, 3332] Å for the highest-$z$ quasar in the sample, J0313-1806 (dark red). The bands (J, H, and K) in which we compute the SNR are indicated in different colors. Bottom: Same, but for the lowest-$z$ quasar in the sample, J1110-1329.