¹¹institutetext: Centro de Astrobiología (CAB), CSIC-INTA, Camino Bajo del Castillo s/n, 28692 Villanueva de la Cañada, Madrid, Spain
¹¹email: [email protected] ²²institutetext: Departamento de Ingeniería Mecánica. Universidad de la Rioja, San José de Calasanz 31, 26004 Logroño, La Rioja, Spain ³³institutetext: Instituto de Astrofísica de Canarias, c/ Vía Láctea s/n, 38205 La Laguna, Tenerife, Spain ⁴⁴institutetext: Departamento de Astrofísica, Universidad de La Laguna, 38206 La Laguna, Tenerife, Spain ⁵⁵institutetext: Hamburger Sternwarte, Gojenbergsweg 112, 21029 Hamburg, Germany ⁶⁶institutetext: Departamento de Física de la Tierra y Astrofísica & IPARCOS-UCM (Instituto de Física de Partículas y del Cosmos de la UCM), Facultad de Ciencias Físicas, Universidad Complutense de Madrid, 28040 Madrid, Spain ⁷⁷institutetext: Departamento de Ingeniería de Organización, Administración de Empresas y Estadística, Universidad Politécnica de Madrid, c/ José Gutiérrez Abascal 2, 28006 Madrid, Spain ⁸⁸institutetext: Departamento de Construcción e Ingeniería de Fabricación, Universidad de Oviedo, Pedro Puig Adam, Sede Departamental Oeste, Módulo 7, 1^a planta, 33203 Gijón, Spain

Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/

P. Mas-Buitrago Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ A. González-Marcos Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ E. Solano Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ V. M. Passegger Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ M. Cortés-Contreras Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ J. Ordieres-Meré Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ A. Bello-García Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ J. A. Caballero Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ A. Schweitzer Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ H. M. Tabernero Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ D. Montes Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/ and C. Cifuentes Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/Using autoencoders and deep transfer learning to determine the stellar parameters of 286 CARMENES M dwarfs ^†^†thanks: Table 5 is only available in electronic form at the CDS via anonymous ftp to cdsarc.u-strasbg.fr(130.79.128.5) or via http://cdsweb.u-strasbg.fr/cgi-bin/qcat?J/A+A/

(Received 05 March 2024 / Accepted 02 May 2024)

Abstract

Context. Deep learning (DL) techniques are a promising approach among the set of methods used in the ever-challenging determination of stellar parameters in M dwarfs. In this context, transfer learning could play an important role in mitigating uncertainties in the results due to the synthetic gap (i.e. difference in feature distributions between observed and synthetic data).

Aims. We propose a feature-based deep transfer learning (DTL) approach based on autoencoders to determine stellar parameters from high-resolution spectra. Using this methodology, we provide new estimations for the effective temperature, surface gravity, metallicity, and projected rotational velocity for 286 M dwarfs observed by the CARMENES survey.

Methods. Using autoencoder architectures, we projected synthetic PHOENIX-ACES spectra and observed CARMENES spectra onto a new feature space of lower dimensionality in which the differences between the two domains are reduced. We used this low-dimensional new feature space as input for a convolutional neural network to obtain the stellar parameter determinations.

Results. We performed an extensive analysis of our estimated stellar parameters, ranging from 3050 to 4300 K, 4.7 to 5.1 dex, and $-$ 0.53 to 0.25 dex for $T_{\rm eff}$ , log g, and [Fe/H], respectively. Our results are broadly consistent with those of recent studies using CARMENES data, with a systematic deviation in our $T_{\rm eff}$ scale towards hotter values for estimations above 3 750 K. Furthermore, our methodology mitigates the deviations in metallicity found in previous DL techniques due to the synthetic gap.

Conclusions. We consolidated a DTL-based methodology to determine stellar parameters in M dwarfs from synthetic spectra, with no need for high-quality measurements involved in the knowledge transfer. These results suggest the great potential of DTL to mitigate the differences in feature distributions between the observations and the PHOENIX-ACES spectra.

Key Words.:

methods: data analysis – techniques: spectroscopic – stars: fundamental parameters – stars: late-type – stars: low-mass

1 Introduction

Low-mass dwarfs are the most common type of stars in the Galaxy, constituting approximately 70% of the stellar population (Henry et al., 1994; Reid et al., 1995; Reylé et al., 2021). In particular, M dwarfs, which are smaller, cooler, and fainter than Sun-like stars are of great importance in the study of exoplanets because of their prevalence, longevity, and proximity. Their small size and lower luminosity make it easier to detect Earth-sized planets in their habitable zones. As a result, several programs have been established with the goal of identifying potentially habitable planets orbiting M dwarfs. Notable examples include ground-based instruments like the Echelle Spectrograph for Rocky Exoplanet and Stable Spectroscopic Observations (ESPRESSO, Pepe et al., 2021) and its predecessor, the High-Accuracy Radial velocity Planet Searcher (HARPS, Mayor et al., 2003; Bonfils et al., 2013), or the Calar Alto high-Resolution search for M dwarfs with Exoearths with Near-infrared and optical Echelle Spectrographs (CARMENES, Quirrenbach et al., 2016, 2020).

The precise determination of the stellar parameters of M dwarfs is crucial to improve our understanding of planetary formation and evolution, which depends fundamentally on the thorough characterisation of their host stars (Cifuentes et al., 2020). However, well-established photometric and spectroscopic methods for determining these parameters encounter particular challenges, mainly due to the inherent faintness of M dwarfs and their frequent manifestation of strong stellar activity. Specifically for spectroscopic analyses, establishing the spectral continuum can be a difficult task. Despite these problems, numerous efforts have been devoted to estimating photospheric parameters in M dwarfs, including effective temperature ( $T_{\rm eff}$ ), surface gravity (log g), and metallicity ([M/H]). Several methods have proven successful in inferring these parameters, such as fitting synthetic spectra, as in Passegger et al. (2019, hereafter Pass19) and Marfil et al. (2021, hereafter Mar21), pseudo-equivalent widths (pEWs) (e.g. Mann et al., 2013a, 2014; Neves et al., 2014), spectral indices (e.g. Rojas-Ayala et al., 2010, 2012), empirical calibrations (e.g. Casagrande et al., 2008; Neves et al., 2012), interferometry (e.g. Boyajian et al., 2012; Rabus et al., 2019), and machine learning (e.g. Antoniadis-Karnavas et al., 2020; Passegger et al., 2020, hereafter Pass20).

The approaches based on pEWs, measurements of the strength of absorption lines in a spectrum, and spectral indices, calculated from carefully chosen spectral regions –and often derived from absorption lines or bands–, leverage their sensitivity and correlation with stellar parameters (mainly, $T_{\rm eff}$ and [Fe/H]). As a recent example of these approaches, Khata et al. (2020) determined $T_{\rm eff}$ and metallicities, among other parameters, for 53 M dwarfs using $H$ - and $K$ -band pEWs and H₂O indices. Another approach relies on empirical calibrations based on observations of M dwarfs that have an F, G, or K binary companion with known metallicity. This is grounded in the idea that the metallicity of an M dwarf is comparable to that of the hotter primary star, assuming the system originated from the same proto-stellar cloud (Neves et al., 2012; Montes et al., 2018; Duque-Arribas et al., 2024). For example, Rodríguez Martínez et al. (2019) employed the relationships of Newton et al. (2015) and Mann et al. (2013b) to derive $T_{\rm eff}$ and metallicity, respectively, from moderate-resolution spectra of 35 M dwarfs from the K2 mission. Numerous spectral indices have also been empirically calibrated. For instance, Veyette et al. (2017) determined $T_{\rm eff}$ , [Fe/H], and [Ti/H] from high-resolution Y-band spectra of 29 M dwarfs by combining spectral synthesis with empirically calibrated indices and pEWs using FGK+M systems (Bonfils et al., 2005; Mann et al., 2013a).

Interferometric measurements have also proven useful for deriving index-based calibrations for $T_{\rm eff}$ (Mann et al., 2013b), performing empirical calibrations for $T_{\rm eff}$ (Maldonado et al., 2015; Newton et al., 2015), or determining $T_{\rm eff}$ from interferometric observations in combination with parallaxes and bolometric fluxes (Boyajian et al., 2012; von Braun et al., 2014; Rabus et al., 2019). However, their application is limited to a relatively small number of stars due to the requirement that they must be bright and nearby.

The fitting of synthetic spectra relies on a minimisation algorithm to find the synthetic spectrum that best matches the observed spectrum. Variations exist in terms of the synthetic grid employed (e.g. BT-Settl, PHOENIX-ACES, MARCS), using high or low spectral resolution, and the number and wavelength of features selected for comparison. For example, the BT-Settl models (Allard et al., 2012, 2013) were used by Gaidos & Mann (2014) and Mann et al. (2015) to derive $T_{\rm eff}$ values for M dwarfs with low-resolution visible SNIFS (Supernova Integral Field Spectrograph) spectra, and by Rajpurohit et al. (2018) to compute $T_{\rm eff}$ , log g, and [Fe/H] for 292 M dwarfs using high-resolution CARMENES spectra (Reiners et al., 2018). Kuznetsov et al. (2019) applied BT-Settl models to intermediate-resolution spectra from the visible arm of VLT/X-shooter (intermediate resolution, high-efficiency spectrograph, Vernet et al., 2011) to determine $T_{\rm eff}$ , log g, [Fe/H], and $v\sin{i}$ for 153 M dwarfs. More recently, Hejazi et al. (2020) derived $T_{\rm eff}$ , $\log{g}$ , metallicity [M/H], and alpha-enhancement [ $\alpha$ /Fe] of 1 544 M dwarfs and subdwarfs from low- to medium-resolution spectra collected at the Michigan-Dartmouth-MIT observatory, Lick Observatory, Kitt Peak National Observatory, and Cerro Tololo Interamerican Observatory. Additionally, Mar21 determined $T_{\rm eff}$ , log g, and [Fe/H] for a sample of 343 M dwarfs observed with CARMENES using a Bayesian implementation of the spectral synthesis technique, the SteParSyn¹¹1https://github.com/hmtabernero/SteParSyn code (Tabernero et al., 2022).

Based on the PHOENIX-ACES library (Husser et al., 2013), Birky et al. (2017) derived $T_{\rm eff}$ , log g, and [Fe/H] for late-M and early-L dwarfs from high-resolution near-infrared APOGEE spectra (Wilson et al., 2010). Similarly, Passegger et al. (2018) and Schweitzer et al. (2019, hereafter Schw19) determined these parameters for M dwarfs observed with CARMENES in the visible wavelength region. Building upon these works, Pass19 extended the analysis by determining $T_{\rm eff}$ , log g, and [Fe/H] not only from the visible range covered with CARMENES but also from the near-infrared and the combination of visible and near-infrared data. The comparison conducted in Pass19 led to the conclusion that utilising both spectral ranges for parameter determination maximises the amount of available spectral information while minimising possible effects caused by imperfect modelling. The MARCS model atmospheres (Gustafsson et al., 2008) have also been employed to compute photospheric parameters. For instance, in a recent study by Souto et al. (2020), $T_{\rm eff}$ , log g, and [Fe/H] were determined for 21 M dwarf mid-resolution APOGEE H-band spectra using MARCS models and the turbospectrum code (Plez, 2012) through the bacchus wrapper (Masseron et al., 2016). Similarly, Sarmento et al. (2021) derived $T_{\rm eff}$ , log g, [M/H], and microturbulent velocity $v_{\rm mic}$ for 313 M dwarfs from APOGEE H-band spectra using MARCS models, turbospectrum, and iSpec python code (Blanco-Cuaresma et al., 2014).

As large surveys release extensive databases containing thousands of stars, there is a need for flexible and automated methods capable of handling vast amounts of data to infer stellar atmospheric parameters. In this sense, machine learning (ML) techniques have also been used for determining photospheric parameters for M dwarfs from stellar spectra. For example, Sarro et al. (2018) proposed an automated procedure based on genetic algorithms to identify pEWs and integrated flux ratios from BT-Settl models that yield good estimations of $T_{\rm eff}$ , log g, and [M/H] for spectra from the NASA Infrared Telescope Facility (IRTF). Also based on pEWs, Antoniadis-Karnavas et al. (2020) present an ML tool, named ODUSSEAS, to derive $T_{\rm eff}$ and [Fe/H] of M dwarf stars from 1D spectra for different resolutions. In Birky et al. (2020), The Cannon (Ness et al., 2015; Casey et al., 2016), a data-driven spectral-modelling and parameter-inference framework, is used to estimate $T_{\rm eff}$ and [Fe/H] for 5 875 M dwarfs in the APOGEE (Abolfathi et al., 2018) and Gaia DR2 (Gaia Collaboration et al., 2018) surveys. Using the Stellar LAbel Machine (SLAM, Zhang et al., 2020), Li et al. (2021a) trained a model with APOGEE stellar labels and synthetic spectra from the BT-Settl model, resulting in the determination of $T_{\rm eff}$ and [M/H] for M dwarfs from the LAMOST DR6²²2http://dr6.lamost.org/ catalogue.

This study extends previous works on applying deep learning (DL) to predict stellar parameters from high-resolution spectra observed with CARMENES. Pass20 presented a DL approach where convolutional neural networks (CNNs) were trained on synthetic PHOENIX-ACES models to estimate $T_{\rm eff}$ , log g, [M/H], and $v\sin{i}$ for 50 M dwarfs observed with CARMENES. After a thorough analysis of their methodology, in which different architectures and spectral windows were tested, they found that all DL models were able to estimate stellar parameters from synthetic spectra in a precise and accurate way. However, when testing these models on the CARMENES spectra, they found significant deviations for the metallicity because of the synthetic gap (Fabbro et al., 2018; Tabernero et al., 2022), which is the difference in feature distributions between synthetic and observed data. In a more recent study, Bello-García et al. (2023, hereafter Bello23) employed a deep transfer learning (DTL) approach to mitigate the uncertainties associated with the synthetic gap (see their Figs. 1 and 2). Following the training of DL models on a large set of synthetic spectra from the PHOENIX-ACES model, the models underwent fine-tuning based on external knowledge about stellar parameters. This external knowledge included 14 stars from the CARMENES survey with interferometric angular diameters measured by Boyajian et al. (2012), von Braun et al. (2014), and references therein. Additionally, it was supplemented with five mid-to-late M dwarf stars from Passegger et al. (2022). They achieved the determination of new $T_{\rm eff}$ and [M/H] values for 286 M dwarfs from the CARMENES survey, and although this approach improved the estimation of $T_{\rm eff}$ and [M/H] for M dwarfs from high-resolution spectra obtained with CARMENES, the lack of sufficiently large number of reference stars to transfer knowledge is a limitation for the technique. If the reference dataset is limited in size, diversity, or representation across the parameter space, the models may not generalise well to a broader range of M dwarfs.

In this work, we present a novel transfer learning approach for estimating photospheric parameters in M dwarfs based on their stellar spectra. The primary goal of the proposed method is to address the aforementioned limitation identified by Bello23 by eliminating the requirement for interferometric values in the knowledge transfer process. To achieve this, instead of employing a model-based transfer learning approach, as in Bello23, where the transferred knowledge is encoded into model parameters, priors or model architectures, we propose a feature-based transfer learning. In this approach, the knowledge to be transferred can be considered as the learned feature representation. The idea is to learn a ‘good’ feature representation so that, by projecting data onto the new representation, the differences between domains (source and target, i.e. synthetic and observed spectra in our case) can be reduced. This allows the source domain labelled data (synthetic spectra with known parameters) to be used to train a precise model for the target domain constituted by the observed spectra (Yang et al., 2020).

In Section 2, we provide details on the CARMENES sample and the PHOENIX-ACES synthetic model grid used in this study. The proposed methodology, based on autoencoders and transfer learning, is outlined in Section 3. The derived stellar atmospheric parameters are then analysed and compared with existing literature in Section 4. Finally, Section 5 summarises the main conclusions of this work.

2 Data

The proposed approach was tested using the same sample spectra as Pass19. This sample, listed in their Table B.1, comprise 282 M dwarfs observed with CARMENES. Additionally, four more stars from an independent interferometric sample, as described by Bello23, were included.

CARMENES is installed at the Calar Alto Observatory, located in Spain, and stands as one of the leading instruments in the quest for searching for Earth-like planets within the habitable zones around M dwarfs. It comprises two separate spectrographs: one for the visible (VIS) wavelength range (from 520 to 960 nm) and the other for the near-infrared (NIR) range (from 960 to 1710 nm), each offering high-spectral resolutions of R $\approx$ 94 600 and 80 500, respectively (Quirrenbach et al., 2020; Reiners et al., 2018).

A detailed description of the data reduction procedure is available in Zechmeister et al. (2014), Caballero et al. (2016b), and Pass19. Similar to the latter, we used a high signal-to-noise (S/N) template spectra for each star. These templates are generated as byproducts of the CARMENES radial-velocity pipeline, known as serval (SpEctrum Radial Velocity AnaLyser; Zechmeister et al., 2018). In the standard data flow, the code constructs a template for each target star from a minimum of five individual spectra to derive the radial velocities through least-square fitting to the template. The S/N of the observed CARMENES sample used in this work was above 150. Concerning the wavelength window, we adopted the range 8 800–8 835 Å, consistent with Bello23, as this window displayed the smallest mean squared error among all the investigated windows in Pass20.

To train the neural network models, we utilised the PHOENIX-ACES spectra library³³3https://phoenix.astro.physik.uni-goettingen.de/ (Husser et al., 2013). This library is chosen for its consideration of spectral features present in cool dwarfs. Furthermore, the use of synthetic models enables the generation of a large number of spectra with known parameters, eliminating the need for limited samples of observations with well-known stellar parameters. We used the same PHOENIX-ACES grid as in previous works (Pass20; Bello23), which was generated by linearly interpolating between the existing grid points using pyterpol (Nemravová et al., 2016). The complete dataset contains a grid of 449 806 synthetic high-resolution spectra between 8 800 Å and 8 835 Å with $T_{\rm eff}$ between 2 300 and 4 500 K (step 25 K), log g between 4.2 and 5.5 dex (step 0.1 dex), [M/H] between -1.0 and 0.8 dex (step 0.1 dex), and $v\sin{i}$ between 1.5 and 60.0 km s^-1 (with a variable step of 0.5, 1.0, 2.0 or 5.0; see Table 1 in Pass20). A degeneracy between $T_{\rm eff}$ , log g, and [Fe/H] was described by Passegger et al. (2018), who found exceptionally high values of log g and [Fe/H] for well-fitting PHOENIX-ACES models. This degeneracy was further underscored by Pass19 and Pass20 during the application of DL models to the observed CARMENES spectra, and the latter imposed additional constraints to the grid leveraging the PARSEC v1.2S evolutionary models (Bressan et al., 2012; Chen et al., 2014, 2015; Tang et al., 2014). Degeneracies between stellar parameters are often found when fitting synthetic spectra, and some authors have explored several ways to help break them (Buzzoni et al., 2001; Brewer et al., 2015). The refinement performed by Pass20 aimed to exclude parameter combinations for M dwarfs that do not fit the main sequence, as discussed in Section 4.2 of their work. Notably, Pass20 demonstrated that the imposition of these constraints on the synthetic model grid used in the training of the DL models is capable of breaking the observed parameter degeneracy. After applying these restrictions, the grid includes 22 933 PHOENIX-ACES spectra.

Due to the negligible presence of telluric features in the investigated range, telluric correction was not applied to the VIS spectra. For normalisation, we employed the Gaussian Inflection Spline Interpolation Continuum (GISIC⁴⁴4https://pypi.org/project/GISIC/), the same method and routine used by Pass20 and developed by D. D. Whitten, designed for spectra with strong molecular features. Following the same approach as Bello23, we applied this procedure to both observed and synthetic spectra within the spectral window 8800–8835 Å with an additional 5 Å on each side to mitigate potential edge effects. Moreover, the observed spectra underwent radial velocity correction to align with the rest frame of the synthetic spectra, achieved through cross-correlation (crosscorrRV from PyAstronomy, Czesla et al., 2019) between a PHOENIX model spectrum and the observed spectrum. To ensure a universal wavelength grid, essential for applying the proposed method, the wavelength grid of the observed spectra was linearly interpolated with the grid of the synthetic spectra.

In spite of the performed spectra preparation, differences in the feature distributions of the synthetic and observed sets of spectra (i.e. synthetic gap) were identified. We used the Uniform Manifold Approximation and Projection (UMAP; McInnes et al., 2018), with a metric that considers the correlation between the spectra, to project the high-dimensional input space (3 500 flux values for each spectrum) into a two-dimensional space while preserving inter-distances. As shown in Fig. 1, akin to Pass20 and Bello23, most of the CARMENES spectra (grey triangles) do not align precisely within the synthetic spectra (colour-coded dots). Thus, a transfer learning approach appears appropriate to extend the applicability of the regression models trained with the synthetic spectra to the observed spectra.

Refer to caption — Figure 1: Two-dimensional UMAP projection of PHOENIX-ACES (dots colour-coded by $T_{\rm eff}$ ) and CARMENES (grey triangles) spectra from the 8 800–8 835 $\AA$ window. Almost all CARMENES spectra are isolated from the PHOENIX-ACES family feature space.

3 Methodology

The DTL approach proposed in this paper can be summarised as follows. Initially, we extract a low-dimensional representation of synthetic spectra based on the PHOENIX-ACES library using autoencoders (AEs), a special kind of neural network initially proposed for dimensionality reduction (Hinton & Salakhutdinov, 2006). Then, the knowledge transfer process is performed by fine-tuning these AEs with high-resolution spectra observed with the CARMENES instrument. It must be noted that no stellar parameters were used during this re-training. With the low-dimensional representations of the synthetic spectra resulting from the initial step, we trained CNNs. Finally, using these CNNs, we estimated the stellar parameters ( $T_{\rm eff}$ , log g, [M/H], and $v\sin{i}$ ) for 286 CARMENES M dwarfs by using their low-dimensional representations obtained after the fine-tuning step.

3.1 Feature extraction using an autoencoder

In this study, we explore unsupervised feature extraction from stellar spectra using AEs to facilitate feature-based transfer learning and leverage the new representations for estimating photospheric parameters. Belonging to representation learning –a subfield of machine learning–, AEs have the capability to capture the underlying factors hidden in the observed data (Bengio et al., 2013; Goodfellow et al., 2016). They have been succesfully used in various astrophysical applications, including unsupervised feature learning from galaxy spectral energy distribution (Frontera-Pons et al., 2017), learning of non-linear representations from rest-frame spectroscopic data for redshift estimation (Frontera-Pons et al., 2019), galaxy classification (Cheng et al., 2021), astrophysical component separation (Milosevic et al., 2021), reconstruction of missing magnitudes from observed objects before classifying them into stars, galaxies, and quasars (Khramtsov et al., 2021), and telluric correction (Kjærsgaard et al., 2023). In addition, some authors have used AEs to estimate stellar atmospheric parameters from spectra (Yang & Li, 2015; Li et al., 2017). However, their approach is different from our proposal since the training of the models was performed in a supervised manner: spectra from SDSS/SEGUE DR7 (Abazajian et al., 2009) were used, and $T_{\rm eff}$ , log g, and [Fe/H] were obtained from the SDSS/SEGUE Spectroscopic Parameter Pipeline (SSPP; Lee et al., 2008b, a; Prieto et al., 2008; Smolinski et al., 2011) for stars in the temperature range 4088-9747 K (earlier than our CARMENES targets). In our case, we are interested in the use of AEs to enable transfer learning, as representation learning enables the transfer of knowledge when there are features useful for different settings or tasks that correspond to underlying factors appearing in more than one setting (Goodfellow et al., 2016).

The rationale behind the first step of our methodology is to find a meaningful low-dimensional representation, referred to as the latent space, of the synthetic spectra. To accomplish this, we employed an AE, which consists of an ‘encoder’ trained to transform the high-dimensional spectrum into a low-dimensional code, and a ‘decoder’ trained to reconstruct the original spectrum as accurate as possible from its lower-dimensional latent space (see Fig. 2).

First, we divided the grid of synthetic spectra into a training set (70 %) and a test set (30 %). We considered multiple AE architectures, develo** a python code to create a flexible AE structure. The number of neurons on each layer, the L1 regularisation term for the dense layers (used to prevent overfitting), and the learning rate for the Adam optimisation (a computationally efficient stochastic gradient descent method, Kingma & Ba, 2014) were passed as parameters. For this code, we relied on the Keras⁵⁵5https://keras.io/about/ (Chollet, 2015) deep learning API, which runs on top of the Tensorflow⁶⁶6https://www.tensorflow.org/ (Abadi et al., 2015) machine learning platform. Next, we created a grid for these hyperparameters and performed an exhaustive search using the GridSearchCV class from the scikit-learn⁷⁷7https://scikit-learn.org/stable/ package, which optimises the hyperparameters of an estimator through k-fold cross-validation, using any scoring metric to evaluate the model. In our case, we used 4-fold cross-validation and the mean squared error between the reconstructed and the original validation data as the scoring metric. To integrate our python code into a scikit-learn workflow, we used the KerasRegressor wrapper from the scikeras⁸⁸8https://adriangb.com/scikeras/stable/ python package.

After this search for the best hyperparameter combinations, we only kept those with a mean cross-validation score below the median, evaluated using the entire grid. We trained an AE for each of these architectures, adding a contractive regularisation term in the loss function, consisting of the squared Frobenius norm of the Jacobian matrix of the encoder activations with respect to the input:

\left\|\,J_{f}\,(x)\,\right\|_{F}^{2}=\sum_{ij}\left(\frac{\partial h_{j}\,(x)% }{\partial x_{i}}\right)^{2},

(1)

where $f$ represents the encoding function that maps the input $x$ to the hidden representation $h$ . The main idea of contractive AEs is to make the feature extraction more robust to small perturbations in the training data. In the overall loss function optimisation, the trade-off between the reconstruction and the L1 regularisation terms will retain the important variations in the latent space for the reconstruction of the input (Rifai et al., 2011).

We only kept the AEs with a learning rate equal to 0.0001, as we found that some of them with a higher learning rate were not able to converge properly, leading to a poor latent representation of the spectra. With this, we ended up with 26 final AE architectures and evaluated them on the test set, obtaining mean squared reconstruction errors $\sim 5\cdot 10^{-5}$ . Fig. 3 shows the reconstruction and the latent space of a PHOENIX-ACES synthetic spectrum for one of the AEs. Using the encoder networks of the AEs, we obtained 26 sets (one for each AE) of 32-dimensional compressed representations for the grid of synthetic spectra.

3.2 Deep transfer learning

The dependence of DL algorithms on massive training data is a crucial hurdle to overcome when a research scenario requires labelled data. In some fields, such as astrophysics, building a large, annotated data set can be incredibly complex and expensive. A straightforward and widely used solution to this problem is the use of synthetic data to train the DL models, but this may include a systematic error in the methodology if the synthetic gap (see Section 2) is significant, as is the case in this work.

Transfer learning (TL) plays a key role in solving the above problems, as it allows knowledge to be transferred from a rich source domain to a related but not identical target domain. The transition from TL to deep transfer learning (DTL), with incomplete DTL as an intermediate stage (deep neural networks are only used as feature extractors in TL models; Yu et al., 2022), came with the integration of DL techniques into the TL paradigm.

In the context of TL, a domain can be represented as $D=\left\{\mathcal{X},P(X)\right\}$ , where $\mathcal{X}$ denotes a feature space and $P(X)$ represents the marginal probability distribution for $X=\left\{x_{1},...,x_{n}\right\}\in\mathcal{X}$ . Also, a task can be represented as $T=\left\{Y,f(\cdot)\right\}$ , where $Y$ denotes a label space and $f(\cdot)$ is a predictive function. According to the definition provided by Pan & Yang (2010), given a source domain $D_{\rm S}$ and task $T_{\mathrm{S}}$ , and a target domain $D_{\mathrm{T}}$ and task $T_{\mathrm{T}}$ , TL aims to enhance the performance of a predictive function $f_{\mathrm{T}}(\cdot)$ in $D_{\mathrm{T}}$ , using the knowledge available in $D_{\mathrm{S}}$ and $T_{\mathrm{S}}$ , where $D_{\mathrm{S}}\neq D_{\mathrm{T}}$ and/or $T_{\mathrm{S}}\neq T_{\mathrm{T}}$ . In our work, the source domain is represented by the grid of synthetic PHOENIX-ACES spectra, while the target domain is built from the 286 CARMENES observed spectra. Moreover, the predictive function is defined as the encoder network of the AE architecture, responsible for compressing the input spectra into the low-dimensional latent representation.

The purpose of this step in the methodology is to adopt a DTL-based strategy, in particular the fine-tuning approach (Chu et al., 2016; Yosinski et al., 2014), using the AE architectures we already trained in the source domain to obtain a meaningful low-dimensional latent representation of our data-poor target domain. In this process, we kept the weights frozen in all encoder layers until the last one, leaving only the deepest encoder layer, the bottleneck (i.e. the latent space or compressed representation of the spectrum, as illustrated in Fig. 2), and the decoder network to be re-trained. The motivation for kee** the lower layers frozen is to prevent generic learning from being overwritten, thus preserving the knowledge acquired by the network to recognise relevant spectral features, while the more specific features are tailored to the target domain (Vafaei Sadr et al., 2020).

Pan et al. (2008) already explored the possibility of finding a low-dimensional latent space in which source and domain data are close to each other, and using it as a bridge to transfer the knowledge from the labelled source domain to the unlabelled target domain. In our case, the ultimate goal of this process is to find a low-dimensional representation of the observed spectra that is closer to the synthetic latent representation than in the initial high-dimensional space of the spectra (see Fig. 1). Furthermore, we want for these target representations to be as meaningful as possible, since we intend to use them later as a starting point for estimating the stellar parameters.

First, we divided the target set of 286 CARMENES spectra into a training set (80 %) and a test set (20 %), with the latter being used to assess the reconstruction error across the target domain. Then, we fine-tuned the 26 AE architectures, following the process explained above, obtaining mean squared reconstruction errors $\sim 4\cdot 10^{-4}$ on the test set, in contrast to the reconstruction errors ( $\sim 3\cdot 10^{-3}$ ) obtained on the CARMENES set using the AEs pre-trained on the PHOENIX-ACES spectra. It must be noted that no stellar parameters were used during this re-training.

Fig. 4 illustrates the importance of this step for the AE to effectively adapt to our specific target domain, ensuring that the compressed representations provided by the fine-tuned encoders will be more meaningful than those we would have obtained with the initial training. Using these fine-tuned encoder networks, we obtained the final 26 sets of 32-dimensional representations for the observed CARMENES spectra.

While our goal was to preserve the meaningfulness of the low-dimensional representations of the synthetic and observed spectra, we aimed, above all, to minimise the disparity between the observed and synthetic compressed representations. For instance, Fig. 5 illustrates a UMAP two-dimensional projection, using the same metric as in Fig. 1, for one of the 26 sets of PHOENIX-ACES and CARMENES representations. In contrast to Fig. 1, in this case, the CARMENES objects are integrated over the space occupied by the PHOENIX-ACES family of projections, leading to a significant reduction of the differences in feature distributions between the two domains. Consequently, we calculated the minimum Euclidean distance from each CARMENES instance to the synthetic grid in both the initial high-dimensional space and the new low-dimensional feature space. While the mean distance is $2.72$ when evaluated in the initial feature space (Fig. 1), it is reduced to a mean value of $0.086$ for the encoded representations (Fig. 5), averaged over the 26 sets. In this manner, a latent space that encodes the shared knowledge from both domains was learned, effectively bridging the gap between them.

3.3 Stellar parameter estimation

In the final step of our methodology, we employed CNNs, one of the oldest deep learning approaches (Lecun et al., 1998), to estimate the stellar parameters of the 286 CARMENES stars. As a starting point for this process, we used the 26 sets of encoded representations for the PHOENIX-ACES and CARMENES spectra obtained in the previous steps of our work.

Inspired by the hierarchical structure of the human visual nervous system (a precursor of CNNs; Fukushima, 1980), CNNs are therefore generally used to deal with image data. They are a specific class of multilayered feedforward neural networks, initially developed for image classification and visual pattern recognisition (Lecun et al., 1998; Krizhevsky et al., 2012; Simonyan & Zisserman, 2014). The distinctive factor of CNNs is the use of convolution operations, in the convolutional layers, to automatically extract features from data. After the convolutional structure, the set of features is flattened and passed to an artificial neural network (ANN) to perform the classification or regression task.

In each forward-propagation process, the input of each neuron of the convolutional layer is obtained with an element-wise dot product between a convolution kernel (or filter), with trainable coefficients, and the outputs of the previous layer. The resulting arrays and a tunable bias are added up and passed through an activation function to obtain the output feature map of the neuron. The set of kernels is tuned during the training process, as the weights of the deep ANN layers are adjusted, so that the different feature maps of the layer represent specific features detected in the input data. Li et al. (2021b) provided a detailed review of CNNs.

In one-dimensional (1D) CNNs (see Fig. 6), the convolution kernel slides along a sequence of non-independent values to extract relevant features, and they have proven to be highly performant in several applications during the recent years (Kiranyaz et al., 2021). Sharma et al. (2020) presented a semisupervised learning approach to handle the scarcity of labelled samples, using AE and 1D CNN architectures for stellar spectral classification. Zheng & Qiu (2020) explored how the generation of stellar spectra to balance the training data set can significantly improve the performance of a 1D CNN classifier.

Since we used 32-component vectors as input data for the stellar parameter estimation, we built a 1D CNN architecture. This architecture consists of two convolutional layers (Conv1D) with a variable number of filters (see Table 1), followed by four fully-connected (Dense) layers. A flattening step is incorporated between the convolutional and the ANN components to reshape the output of the final convolutional layer (number of outputs $\times$ number of filters) into a one-dimensional vector. This vector is then fed into the dense layers. We used a rectified linear unit (ReLU) activation function in all layers except the output layer, with a linear activation. We estimated $T_{\rm eff}$ , log $g$ , [M/H], and $v\sin{i}$ independently, searching for the optimal hyperparameters of the 1D CNN architecture (same procedure as in Section 3.1) in the estimation of each parameter. Table 1 describes in detail the CNN architectures used. We followed the same procedure in the independent estimation of the different stellar parameters. To have a significant number of final estimates and to assess the robustness of our methodology, we built five CNN models for each of the 26 sets of encoded representations, thus obtaining a total of 130 regressors for each of the parameters.

Table 1: CNN architectures used for the estimation of

T_{\rm eff}

, log

g

, [M/H], and

v\sin{i}

Layer	Output size				Number of filters				Number of parameters
	$T_{\rm eff}$	log $g$	[M/H]	$v\sin{i}$	$T_{\rm eff}$	log $g$	[M/H]	$v\sin{i}$	$T_{\rm eff}$	log $g$	[M/H]	$v\sin{i}$
Conv1D	32	32	32	32	64	16	32	64	192	48	96	192
Conv1D	32	32	32	32	32	64	8	8	4 128	2 112	520	1 032
Flatten	1 024	2 048	256	256	…	…	…	…	0	0	0	0
Dense	256	256	256	256	…	…	…	…	262 400	524 544	65 792	65 792
Dense	128	128	128	128	…	…	…	…	32 896	32 896	32 896	32 896
Dense	64	64	64	64	…	…	…	…	8 256	8 256	8 256	8 256
Dense	1	1	1	1	…	…	…	…	65	65	65	65

To train the CNN models, we use stratified sampling to create the indices of the traning (70 %) and test (30 %) sets from the PHOENIX-ACES low-dimensional representations, ensuring that the distribution of the target parameter is representative of the overall distribution in both sets. For this, we relied on the StratifiedShuffleSplit class of the scikit-learn python package, which automatically performs stratification based on a target variable and generates indices to split data into training and test set. We trained the CNN models using the synthetic compressed representations, with a mean squared error loss function, and evaluated them on the test set. As final regressors, we kept the 80 models with the lowest mean squared error in the test set, obtaining an upper value of 353 K, 0.0042 dex, 0.0016 dex, and 0.054 km s^-1 for $T_{\rm eff}$ , log $g$ , [M/H], and $v\sin{i}$ , respectively. Using these models, we obtained 80 final parameter estimates for each of the CARMENES stars.

We followed the same strategy used by Pass20 and Bello23 for the uncertainty estimation of the stellar parameters. For each star, we gathered the 80 estimations and computed the probability density function using the Kernel Density Estimate (KDE; Chen et al., 1997; Poggio et al., 2021) technique. We took the maximum of this probability density function as the confident estimation for the stellar parameter, together with the 1 $\sigma$ thresholds as the corresponding uncertainties. Here, the final stellar parameter is derived from a distribution of parameter estimates which come from 26 different sets of input features, together with the five CNN models built for each set. Therefore, the uncertainties provided should be understood as an intrinsic error of our methodology. Fig. 7 shows an example of the results for a single star.

4 Results and discussion

4.1 Stellar parameters analysis

Table 5 presents the stellar atmospheric parameters determined with our methodology. The top left panel in Fig. 8 shows a Kiel diagram that relates all our estimated parameters, along with isochrones based on the PAdova and TRieste Stellar Evolution Code (PARSEC release v1.2S; Bressan et al., 2012) for 5 Gyr and [M/H] $=-0.4,0.0,$ and $0.1$ dex. The results obtained with our methodology follow the trend set by the isochrones and the structure observed in the estimated metallicities is also consistent with them. The remaining three panels in Fig. 8 show a Hertzsprung-Russell diagram (HRD) of our results, with different features highlighted in each of them. We computed the bolometric luminosities, $L_{\mathrm{bol}}$ , as Cifuentes et al. (2020) using the latest astrometry and photometry from Gaia DR3 (Gaia Collaboration et al., 2023b). Theoretical isochrones, for solar metallicity, from PARSEC v1.2S and from evolutionary models presented by Baraffe et al. (2015) are overplotted in the top right panel for 0.1 and 5 Gyr. Both the Kiel diagram and the HRD reveal a clear outlier region at the lowest temperatures (mid M-dwarf regime; Cifuentes et al., 2020; Pecaut & Mamajek, 2013), populated mostly by the stars with a high estimated projected rotational velocity ( $v\sin{i}$ ). These fast rotators in our sample are located at the expected M-dwarf regime, following the relation between the spectral types from the CARMENES input catalogue (Carmencita; Alonso-Floriano et al., 2015; Caballero et al., 2016a) and the $v\sin{i}$ values calculated by Reiners et al. (2018) (see Fig. 2 in Mar21, ).

The bottom panels in Fig. 8 help to understand the outliers that deviate from the main sequence. The bottom left panel shows that almost all the overluminuous outliers in the HRD are identified as H $\alpha$ active stars by Schöfer et al. (2019), considered as such if the pseudo-EW of the H $\alpha$ line satisfies pEW^′(H $\alpha)<-0.3$ Å (H $\alpha$ flag from Table B.1 in Mar21). As found in previous works (e.g. Jeffers et al., 2018; Reiners et al., 2018), the fraction of H $\alpha$ active stars is higher at later spectral types. There are clear patterns in the HRD which arise from the kinematic membership of the targets. For instance, and in agreement with Jeffers et al. (2018), most H $\alpha$ active and rapidly rotating stars are kinematically young (dots marked with a + in the bottom right panel).

To study the possible membership of our sample to nearby young stellar associatons, we relied on BANYAN $\Sigma$ ⁹⁹9http://www.exoplanetes.umontreal.ca/banyan/ (Gagné et al., 2018), a Bayesian analysis tool to identify members of young associations. Modelled with multivariate Gaussians in six-dimensional $\rm XYZUVW$ space, BANYAN $\Sigma$ can derive membership probabilities for all known and well-characterised young associations within 150 pc. In our case, we used the python version of BANYAN $\Sigma$ ¹⁰¹⁰10https://github.com/jgagneastro/banyan_sigma, and included the Gaia DR3 sky coordinates, proper motion, radial velocity, and parallax of our target stars as input parameters to the algorithm. The classifier gave a high Bayesian probability (¿80 %) for 9 objects to belong to a young stellar association, in 7 of the cases with a probability greater than 95 %. Table 2 lists the details of these objects. All these stars with a possible membership in a young stellar associaton are represented with a thick open circle in the bottom right panel of Fig. 8. Here, we also considered four extra stars, namely J09133+688 (G 234-057), J12156+526 (StKM 2-809), J15218+209 (GJ 9520), and J18174+483 (TYC 3529-1437-1), which Schw19 mentioned as young age-based outliers.

Table 2: Stars in our sample classified by BANYAN

\Sigma

with a high Bayesian probability of belonging to a young stellar association.

Karmn	Name ^(a)	BANYAN $\Sigma$ Prob. ^(b)	Young association ^(c)	Association reference
J02088+494	G 173-039	99.94 %	AB Doradus	Zuckerman et al. (2004)
J02519+224	RBS 365	99.79 %	$\beta$ Pictoris	Zuckerman et al. (2001)
J03473-019	G 080-021	99.94 %	AB Doradus	Zuckerman et al. (2004)
J05019+011 ^(d)	1RXS J050156.7+010845	99.91 %	$\beta$ Pictoris	Zuckerman et al. (2001)
J05062+046 ^(d)	RX J0506.2+0439	99.79 %	$\beta$ Pictoris	Zuckerman et al. (2001)
J09163-186	LP 787-052	95.01 %	Argus	Zuckerman (2018)
J10289+008	BD+01 2447	99.97 %	AB Doradus	Zuckerman et al. (2004)
J19511+464	G 208-042	94.17 %	Argus	Zuckerman (2018)
J21164+025	LSPM J2116+0234	85.20 %	Argus	Zuckerman (2018)

¹¹¹¹11^(a) As in Cifuentes et al. (2020). ^(b) The Bayesian probability that this object belongs to the young stellar association. ^(c) Most probable Bayesian hypothesis (including the field). ^(d) Already mentioned in Schw19 as candidate members of the corresponding young stellar association.

The bottom left panel in Fig. 8 shows that outliers below the main sequence are typically members of the thick disc Galactic population (Cortés-Contreras et al., in prep.; Cortés-Contreras, 2017). Furthermore, four of these outliers are reported to have a behaviour akin to subdwarfs (empty squares in top and bottom left panels) both by Mar21 and Schw19. Table 3 details all the outliers we identified with low-metallicity behaviour, along with the metallicity estimations found in the literature. As discussed by Jao et al. (2008), with the decrease in the metallicity of these objects the TiO opacity also strongly decreases, and this less blanketing from the TiO bands causes more continuum flux to radiate from the deeper and hotter layer of the stellar atmosphere, so that these stars appear bluer than their solar metallicity counterparts (see Fig. 1 in Jao et al. (2008)). Our [M/H] determinations for these stars are, in general, in good agreement with the literature.

Table 3: Low-metallicity stars identified in Fig. 8.

Karmn	Name	[Fe/H] ${}_{\mathrm{AE}}^{\,(a)}$	[Fe/H] ${}_{\mathrm{DTL}}^{\,(b)}$	[Fe/H] ${}_{\mathrm{Mann15}}^{\,(c)}$	[Fe/H] ${}_{\mathrm{corr,Mar21}}^{\,(d)}$	[Fe/H] ${}_{\mathrm{Schw19}}^{\,(e)}$	Pop. ${}^{\,(f)}$
		[dex]	[dex]	[dex]	[dex]	[dex]
J00183+440	GX And	$-0.33_{-0.17}^{+0.06}$	$-0.26_{-\ldots}^{+\ldots}$	$-0.30\pm 0.08$	$-0.52\pm 0.11$	$-0.25\pm 0.16$	D
J02123+035	BD+02 348	$-0.35_{-0.10}^{+0.12}$	$-0.33_{-0.01}^{+0.01}$	$-0.36\pm 0.08$	$-0.49\pm 0.06$	$-0.05\pm 0.16$	TD
J06371+175	HD 260655	$-0.41_{-0.13}^{+0.11}$	$-0.37_{-0.02}^{+0.02}$	$-0.34\pm 0.08$	$-0.43\pm 0.04$	$-0.42\pm 0.16$	TD-D
J11033+359	Lalande 21185	$-0.34_{-0.13}^{+0.08}$	$-0.31_{-\ldots}^{+\ldots}$	$-0.38\pm 0.08$	$-0.49\pm 0.10$	$-0.09\pm 0.16$	TD
J11054+435	BD+44 2051A	$-0.40_{-0.18}^{+0.07}$	$-0.35_{-\ldots}^{+\ldots}$	$-0.37\pm 0.08$	$-0.56\pm 0.09$	$-0.3\pm 0.16$	TD-D
J12248-182 ^(g)	Ross 695	$-0.33_{-0.18}^{+0.06}$	$-0.40_{-0.04}^{+0.02}$	…	$-0.60\pm 0.09$	$-0.17\pm 0.16$	TD
J13450+176	BD+18 2776	$-0.53_{-0.27}^{+0.09}$	$-0.46_{-0.05}^{+0.06}$	$-0.54\pm 0.08$	$-0.54\pm 0.03$	$-0.43\pm 0.16$	TD
J16254+543 ^(g)	GJ 625	$-0.33_{-0.15}^{+0.05}$	$-0.32_{-0.03}^{+0.02}$	$-0.35\pm 0.08$	$-0.28\pm 0.07$	$-0.26\pm 0.16$	YD
J17378+185	BD+18 3421	$-0.33_{-0.08}^{+0.11}$	$-0.23_{-0.03}^{+0.01}$	$-0.25\pm 0.08$	$-0.40\pm 0.07$	$-0.23\pm 0.16$	D
J19070+208 ^(g)	Ross 730	$-0.34_{-0.18}^{+0.05}$	$-0.32_{-0.02}^{+0.01}$	$-0.33\pm 0.08$	$-0.46\pm 0.07$	$-0.20\pm 0.16$	D
J19072+208 ^(g)	HD 349726	$-0.32_{-0.17}^{+0.05}$	$-0.32_{-0.02}^{+0.02}$	$-0.35\pm 0.08$	$-0.46\pm 0.06$	$-0.23\pm 0.16$	D
J23492+024	BR Psc	$-0.43_{-0.12}^{+0.11}$	$-0.40_{-0.03}^{+0.02}$	$-0.45\pm 0.08$	$-0.55\pm 0.08$	$-0.13\pm 0.16$	TD

¹²¹²12As explained in Passegger et al. (2020, 2022), our [M/H] results directly translate into [Fe/H] values. ^(a) From this work. ^(b) From Bello23. ^(c) From Mann et al. (2015). ^(d) From Mar21, corrected from the

\alpha

enhancement. ^(e) From Schw19. ^(f) Galactic populations, including the thick disc (TD), the thick disc-thin disc transition (TD-D), the thin disc (D), and the young disc (YD), following Cortés-Contreras et al. in prep. ^(g) Reported to have a behaviour akin to subdwarfs in Mar21 or Schw19. In particular, J19070+208 and J19072+208 are both components of the wide binary system LDS 1017, and Houdebine (2008) already identified them as subdwarfs.

Fig. 9 shows the distribution of our predicted metallicities broken down by kinematic membership in the thick disc (TD), thick disc-thin disc transition (TD-D), thin disc (D), and young disc (YD) Galactic populations (Cortés-Contreras et al., in prep.; Cortés-Contreras, 2017). This breakdown reveals the distinction between metal-rich thin disc stars and metal-poor stars in the older thick disc (Bensby et al., 2005; Gaia Collaboration et al., 2023a), with the TD-D transition as an intermediate step. To prove this, we performed a two-sample Kolmogorov-Smirnov test (Kolmogorov, 1933; Smirnov, 1948) on the thin and thick disc samples, which returned a $p\,\rm{value}=0.0071$ , rejecting the hypothesis that both samples come from the same distribution.

Also, the 2MASS-Gaia G ${}_{\mathrm{BP}}-G_{\mathrm{RP}}$ versus $G-J$ colour-colour diagram in Fig. 10 shows how the evolution in our estimated effective temperatures is coherent with the colour-colour relationship (see Fig. 14 in Cifuentes et al. 2020). For this diagram, we only considered stars with reliable 2MASS $J$ -band and Gaia DR3 G_BP and G_RP photometry.

4.2 Comparison with the literature

We compared our results with different collections found in the literature. Whereas this section focuses on the latest studies using CARMENES data, namely Bello23, Mar21, Pass19, Pass20, and Schw19, a more extensive compilation of literature, together with the uncertainties of the estimations, is provided in Appendix B. For Pass19, we considered the parameters derived from VIS spectra. Table 4 lists the mean difference ( $\overline{\Delta}$ ; literature $-$ this work), root mean squared error (rmse), and Pearson correlation coefficient ( $r_{\rm p}$ ) for the comparison with each of the literature collections. An interactive version of the results presented in this section is available to the astronomical community ¹³¹³13https://cab.inta-csic.es/users/pmas/.

Table 4: Mean difference (

\overline{\Delta}

; literature

-

this work), root mean square error (rmse), and Pearson correlation coefficient (

r_{\rm p}

) for the comparison between our results and the literature.

Reference	$T_{\rm eff}$ [K]			log $g$ [dex]			[Fe/H] [dex]			$v\sin{i}$ [km s^-1]
	$\overline{\Delta}$	rmse	$r_{\rm p}$	$\overline{\Delta}$	rmse	$r_{\rm p}$	$\overline{\Delta}$	rmse	$r_{\rm p}$	$\overline{\Delta}$	rmse	$r_{\rm p}$
Bello23	-117	180	0.87	…	…	…	0.01	0.14	0.60	…	…	…
Mar21	-19	102	0.94	0.12	0.18	0.39	-0.11	0.16	0.65	…	…	…
Pass19	-80	117	0.96	0.00	0.05	0.86	0.06	0.15	0.52	…	…	…
Pass20	-35	51	0.99	-0.04	0.06	0.93	0.23	0.25	0.76	1.64	1.94	0.99
Rein18 ${}^{\,(a)}$	…	…	…	…	…	…	…	…	…	-0.86	1.51	0.98
Schw19	-40	93	0.96	0.13	0.14	0.89	0.00	0.10	0.63	…	…	…

¹⁴¹⁴14^(a) From Reiners et al. (2018).

Figure 11 depicts the comparison with literature values for $T_{\rm eff}$ . The left panels show a similar linear trend among Mar21, Pass19, and Schw19 with our values, all of them with a slope of less than one, for the region $T_{\rm eff}$ (this work) $\lesssim 3\,750$ K. From this value onwards, where the number of stars in our training set is smaller, the dispersion increases significantly and our $T_{\rm eff}$ estimations deviate towards hotter values, resulting in a mean difference of $\overline{\Delta}=-19$ K, $\overline{\Delta}=-80$ K, $\overline{\Delta}=-40$ K for Mar21, Pass19, and Schw19, respectively. The figures provided in Appendix B show that the uncertainties intrinsic to our methodology are also larger for estimations above 3 750 K. The right panels show how the agreement with the values obtained following the approach described by Pass20 is excellent, which is expected since their methodology is the closest to the one presented in this work. Moreover, the comparison with the results from Bello23 reveals the same structure, but inverted, as shown in Fig. 9 of their work, with a larger dispersion than that observed for the other literature collections. The black stars in the top right panel represent the 14 interferometrically derived $T_{\rm eff}$ values (see Table 1 in Bello23), which are on average cooler than the temperatures obtained with our methodology ( $\overline{\Delta}_{\rm interf}=-119$ K). The $r_{\rm p}$ values listed in Table 4 show a strong correlation with all the collections.

Figure 12 shows a similar literature comparison for log $g$ . For Schw19, we considered the values derived using their mass-radius relation and the Stefan-Boltzmann’s law. The log $g$ values from Mar21 show a large dispersion ( $r_{\rm p}=0.39$ ), as already mentioned in their work, and are generally spread towards higher values ( $\overline{\Delta}=0.12$ dex). While the results from Pass19 cover the same range and are similar on average to our obtained log $g$ ( $\overline{\Delta}=0.00$ dex), those from Schw19 extend to higher values and are on average higher than ours ( $\overline{\Delta}=0.13$ dex). It should be noted that, while Pass19 and Schw19 fix log $g$ using theoretical isochrones, Mar21 has log $g$ as a free parameter. Moreover, our results show a good correlation ( $r_{\rm p}=0.93$ ) with those obtained following the methodology described by Pass20, although the latter are deviated to lower values ( $\overline{\Delta}=-0.04$ dex).

As discussed in Passegger et al. (2022), several discrepancies can be found when comparing metallicities of M dwarfs obtained with different methodologies. Figure 13 shows the comparison with literature values for our [M/H] estimations, which directly translate into [Fe/H] values (Passegger et al., 2020, 2022). For Mar21, we considered the values corrected for alpha enhancement. Our results are similar on average to those from Schw19 ( $\overline{\Delta}=0.00$ dex), while Pass19 and Mar21 results tend to be higher and lower, with $\overline{\Delta}=0.06$ and $\overline{\Delta}=-0.11$ dex, respectively. As already mentioned in Passegger et al. (2022), the results from the DL methodology described by Pass20 are deviated towards more metal-rich values, with $\overline{\Delta}=0.23$ dex. We note that this deviation, which is attributed to the synthetic gap by Pass20, does not appear in the DTL methodologies presented by Bello23 and here. Bello23 metallicities cover more or less the same range as our results, and the spectroscopically determined [M/H] values from FGK+M systems (see Table 3 in Bello23) (black stars in the top right panel) are systematically lower ( $\overline{\Delta}=-0.13$ dex).

We also compared our $v\sin{i}$ determinations with the ones derived by Reiners et al. (2018) using the cross-correlation method and with those obtained following the DL methodology described by Pass20. Fig. 14 shows how our derived $v\sin{i}$ are mostly consistent with the literature within their errors. Both Pass20 and Reiners et al. (2018) results show a good correlation with our values ( $r_{\rm p}=0.99$ and 0.98, respectively). Since most of the objects are located at lower $v\sin{i}$ values, it is convenient to split the analysis provided in Table 4 at a cut-off value of $v\sin{i}\,{\rm(this\,work)}=12$ km s^-1. Below this value, Pass20 presents $\overline{\Delta}=1.83$ km s^-1 and ${\rm rmse}=1.97$ km s^-1, with $\overline{\Delta}=-1.22$ km s^-1 and ${\rm rmse}=1.45$ km s^-1 for faster rotators. Similarly, for Reiners et al. (2018), we obtained $\overline{\Delta}=-0.68$ km s^-1 and ${\rm rmse}=1.24$ km s^-1 for values below the threshold, and $\overline{\Delta}=-3.47$ km s^-1 and ${\rm rmse}=3.71$ km s^-1 for values above.

5 Conclusions

This work serves as an extension of a series of papers (Pass20; Bello23) dedicated to exploring the use of DL for stellar parameter estimation of CARMENES M dwarfs, based on synthetic spectra. Bello23 developed a model-based DTL technique to bridge the significant differences in flux features between the two spectral families, reported by Pass20. Here, we propose a parallel feature-based DTL strategy that addresses the limitations mentioned in their work regarding the need for high-quality stellar parameter estimations in the knowledge transfer process. All the resources, including the code developed to build the methodology described in Section 3 and the code to reproduce the figures displayed in Section 4 are publicly available at GitHub¹⁵¹⁵15https://github.com/pedromasb/autoencoders-CARMENES.

Using a methodology that combines the use of AEs and CNNs, we derived new estimations for the stellar parameters $T_{\rm eff}$ , log $g$ , [M/H], and $v\sin{i}$ of 286 M dwarfs observed with CARMENES. The AE models were trained on PHOENIX-ACES synthetic spectra and then fine-tuned using the CARMENES high-S/N, high-resolution spectra. In the fine-tuning process, no data other than the observed spectra are required, which gives our methodology great flexibility, as no measured stellar parameters are involved in the knowledge transfer. We used the low-dimensional representations of the synthetic and observed spectra, resulting from the initial training and the fine-tuning steps, respectively, as input to the CNNs for the estimation of the stellar parameters. In this way, parameter estimation is conducted using a dataset in which no significant differences in the feature distributions between the synthetic and observed data are evident.

We performed an in-depth analysis of our estimated stellar parameters, using the diagram shown in Fig. 8 to study the objects that deviate from the main sequence. We found that almost all the overlumimuous outliers are identified as H $\alpha$ active stars by Schöfer et al. (2019), while outliers located below the main sequence are typically metal-poor stars from the thick disc Galactic population. In particular, using the BANYAN $\Sigma$ tool, we found 9 objects with a high Bayesian probability of belonging to five different young stellar associations, in 7 of these cases with a probability of more than 95 %. Together with the low-metallicity objects already reported in Mar21 and Schw19, we identified eight more stars that exhibit the same behaviour.

We also conducted a comparative study between our results and the latest studies using CARMENES data, finding good consistency with the literature in most cases. Both our $T_{\rm eff}$ and log $g$ determinations are, in general, strongly correlated with the results from the literature, with a systematic deviation in our $T_{\rm eff}$ scale towards hotter values for estimations above 3 750 K. As expected, our parameter determinations are in very good agreement with Pass20, since their methodology is the most similar to the one presented in this paper. More importantly, the deviation in metallicity attributed to the synthetic gap in their work is not observed in ours thanks to the DTL approach. This, together with the work presented by Bello23, demonstrates the great potential of DTL-based strategies to bridge the synthetic gap in stellar parameter estimation from synthetic spectra.

Acknowledgements.

We thank the anonymous referee for the comments that helped to improve the quality of this paper. We acknowledge financial support from the Agencia Estatal de Investigación (AEI/10.13039/501100011033) of the Ministerio de Ciencia e Innovación and the ERDF ‘A way of making Europe’ through projects PID2022-137241NB-C4[2,4], PID2020-112949GB-I00 (Spanish Virtual Observatory https://svo.cab.inta-csic.es), PID2020-117493GB-I00, PID2019-109522GB-C5[1,4], and grant PR47/21 TAU-CM PRTR-CM, the Instituto Nacional de Técnica Aeroespacial through grant PRE-OVE, and the Gobierno de Canarias through project ProID2020010129. We made extensive use of Python throughout the entire process, including the packages pandas¹⁶¹⁶16https://github.com/pandas-dev/pandas, seaborn (Waskom, 2021), numpy (Harris et al., 2020), matplotlib (Hunter, 2007), scikit-learn (Pedregosa et al., 2011), tensorflow (Abadi et al., 2015), plotly¹⁷¹⁷17https://github.com/plotly/plotly.py, scipy (Jones et al., 2001), and umap-learn (McInnes et al., 2018).

References

Abadi et al. (2015) Abadi, M., Agarwal, A., Barham, P., et al. 2015, TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Abazajian et al. (2009) Abazajian, K. N., Adelman-McCarthy, J. K., Agüeros, M. A., et al. 2009, The Astrophysical Journal Supplement Series, 182, 543
Abolfathi et al. (2018) Abolfathi, B., Aguado, D. S., Aguilar, G., et al. 2018, The Astrophysical Journal Supplement Series, 235, 42
Allard et al. (2012) Allard, F., Homeier, D., & Freytag, B. 2012, Philosophical Transactions of the Royal Society of London Series A, 370, 2765
Allard et al. (2013) Allard, F., Homeier, D., Freytag, B., et al. 2013, Memorie della Societa Astronomica Italiana Supplementi, 24, 128
Alonso-Floriano et al. (2015) Alonso-Floriano, F. J., Morales, J. C., Caballero, J. A., et al. 2015, A&A, 577, A128
Antoniadis-Karnavas et al. (2020) Antoniadis-Karnavas, A., Sousa, S. G., Delgado-Mena, E., et al. 2020, A&A, 636, A9
Baraffe et al. (2015) Baraffe, I., Homeier, D., Allard, F., & Chabrier, G. 2015, A&A, 577, A42
Bello-García et al. (2023) Bello-García, A., Passegger, V. M., Ordieres-Meré, J., et al. 2023, A&A, 673, A105
Bengio et al. (2013) Bengio, Y., Courville, A., & Vincent, P. 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 1798
Bensby et al. (2005) Bensby, T., Feltzing, S., Lundström, I., & Ilyin, I. 2005, A&A, 433, 185
Birky et al. (2020) Birky, J., Hogg, D. W., Mann, A. W., & Burgasser, A. 2020, ApJ, 892, 31
Birky et al. (2017) Birky, J. L., Aganze, C., Burgasser, A. J., et al. 2017, in American Astronomical Society Meeting Abstracts, Vol. 229, American Astronomical Society Meeting Abstracts #229, 240.18
Blanco-Cuaresma et al. (2014) Blanco-Cuaresma, S., Soubiran, C., Heiter, U., & Jofré, P. 2014, iSpec: Stellar atmospheric parameters and chemical abundances
Bonfils et al. (2013) Bonfils, X., Delfosse, X., Udry, S., et al. 2013, A&A, 549, A109
Bonfils et al. (2005) Bonfils, X., Delfosse, X., Udry, S., et al. 2005, A&A, 442, 635
Boyajian et al. (2012) Boyajian, T. S., von Braun, K., van Belle, G., et al. 2012, ApJ, 757, 112
Bressan et al. (2012) Bressan, A., Marigo, P., Girardi, L., et al. 2012, MNRAS, 427, 127
Brewer et al. (2015) Brewer, J. M., Fischer, D. A., Basu, S., Valenti, J. A., & Piskunov, N. 2015, ApJ, 805, 126
Buzzoni et al. (2001) Buzzoni, A., Chavez, M., Malagnini, M. L., & Morossi, C. 2001, PASP, 113, 1365
Caballero et al. (2016a) Caballero, J. A., Cortés-Contreras, M., Alonso-Floriano, F. J., et al. 2016a, in 19th Cambridge Workshop on Cool Stars, Stellar Systems, and the Sun (CS19), Cambridge Workshop on Cool Stars, Stellar Systems, and the Sun, 148
Caballero et al. (2016b) Caballero, J. A., Guàrdia, J., López del Fresno, M., et al. 2016b, in Proc. SPIE, Vol. 9910, Observatory Operations: Strategies, Processes, and Systems VI, 99100E
Casagrande et al. (2008) Casagrande, L., Flynn, C., & Bessell, M. 2008, Monthly Notices of the Royal Astronomical Society, 389, 585
Casey et al. (2016) Casey, A. R., Hogg, D. W., Ness, M., et al. 2016, arXiv e-prints, arXiv:1603.03040
Chen et al. (1997) Chen, B., Asiain, R., Figueras, F., & Torra, J. 1997, A&A, 318, 29
Chen et al. (2015) Chen, Y., Bressan, A., Girardi, L., et al. 2015, MNRAS, 452, 1068
Chen et al. (2014) Chen, Y., Girardi, L., Bressan, A., et al. 2014, MNRAS, 444, 2525
Cheng et al. (2021) Cheng, T.-Y., Huertas-Company, M., Conselice, C. J., et al. 2021, Monthly Notices of the Royal Astronomical Society, 503, 4446
Chollet (2015) Chollet, F. 2015, KERAS
Chu et al. (2016) Chu, B., Madhavan, V., Beijbom, O., Hoffman, J., & Darrell, T. 2016, Best Practices for Fine-Tuning Visual Classifiers to New Domains, 435–442
Cifuentes et al. (2020) Cifuentes, C., Caballero, J. A., Cortés-Contreras, M., et al. 2020, A&A, 642, A115
Cortés-Contreras (2017) Cortés-Contreras, M. 2017, PhD thesis, Complutense University of Madrid, Spain
Czesla et al. (2019) Czesla, S., Schröter, S., Schneider, C. P., et al. 2019, PyA: Python astronomy-related packages
Duque-Arribas et al. (2024) Duque-Arribas, C., Tabernero, H. M., Montes, D., & Caballero, J. A. 2024, MNRAS, 528, 3028
Fabbro et al. (2018) Fabbro, S., Venn, K., O’Briain, T., et al. 2018, MNRAS, 475, 2978
Frontera-Pons et al. (2017) Frontera-Pons, J., Sureau, F., Bobin, J., & Le Floc’h, E. 2017, A&A, 603, A60
Frontera-Pons et al. (2019) Frontera-Pons, J., Sureau, F., Moraes, B., Bobin, J., & Abdalla, F. B. 2019, A&A, 625, A73
Fukushima (1980) Fukushima, K. 1980, Biological Cybernetics, 36, 193
Gagné et al. (2018) Gagné, J., Mamajek, E. E., Malo, L., et al. 2018, ApJ, 856, 23
Gaia Collaboration et al. (2018) Gaia Collaboration, Brown, A. G. A., Vallenari, A., et al. 2018, A&A, 616, A1
Gaia Collaboration et al. (2023a) Gaia Collaboration, Recio-Blanco, A., Kordopatis, G., et al. 2023a, A&A, 674, A38
Gaia Collaboration et al. (2023b) Gaia Collaboration, Vallenari, A., Brown, A. G. A., et al. 2023b, A&A, 674, A1
Gaidos & Mann (2014) Gaidos, E. & Mann, A. W. 2014, ApJ, 791, 54
Gaidos & Mann (2014) Gaidos, E. & Mann, A. W. 2014, The Astrophysical Journal, 791, 54
Gaidos et al. (2014) Gaidos, E., Mann, A. W., Lépine, S., et al. 2014, MNRAS, 443, 2561
Goodfellow et al. (2016) Goodfellow, I., Bengio, Y., & Courville, A. 2016, Deep Learning (MIT Press)
Gustafsson et al. (2008) Gustafsson, B., Edvardsson, B., Eriksson, K., et al. 2008, A&A, 486, 951
Harris et al. (2020) Harris, C. R., Millman, K. J., van der Walt, S. J., et al. 2020, Nature, 585, 357
Hejazi et al. (2020) Hejazi, N., Lépine, S., Homeier, D., Rich, R. M., & Shara, M. M. 2020, The Astronomical Journal, 159, 30
Henry et al. (1994) Henry, T. J., Kirkpatrick, J. D., & Simons, D. A. 1994, AJ, 108, 1437
Hinton & Salakhutdinov (2006) Hinton, G. E. & Salakhutdinov, R. R. 2006, Science, 313, 504
Houdebine (2008) Houdebine, E. R. 2008, MNRAS, 390, 1081
Hunter (2007) Hunter, J. D. 2007, Computing in Science and Engineering, 9, 90
Husser et al. (2013) Husser, T.-O., Wende-von Berg, S., Dreizler, S., et al. 2013, A&A, 553, A6
Jao et al. (2008) Jao, W.-C., Henry, T. J., Beaulieu, T. D., & Subasavage, J. P. 2008, AJ, 136, 840
Jeffers et al. (2018) Jeffers, S. V., Schöfer, P., Lamert, A., et al. 2018, A&A, 614, A76
Jones et al. (2001) Jones, E., Oliphant, T., & Peterson, P. 2001, SciPy: Open Source Scientific Tools for Python
Khata et al. (2020) Khata, D., Mondal, S., Das, R., Ghosh, S., & Ghosh, S. 2020, MNRAS, 493, 4533
Khramtsov et al. (2021) Khramtsov, V., Spiniello, C., Agnello, A., & Sergeyev, A. 2021, A&A, 651, A69
Kingma & Ba (2014) Kingma, D. & Ba, J. 2014, International Conference on Learning Representations
Kiranyaz et al. (2021) Kiranyaz, S., Avci, O., Abdeljaber, O., et al. 2021, Mechanical Systems and Signal Processing, 151, 107398
Kjærsgaard et al. (2023) Kjærsgaard, R. D., Bello-Arufe, A., Rathcke, A. D., Buchhave, L. A., & Clemmensen, L. K. H. 2023, A&A, 677, A120
Kolmogorov (1933) Kolmogorov, A. L. 1933, G. Ist. Ital. Attuari, 4, 83
Krizhevsky et al. (2012) Krizhevsky, A., Sutskever, I., & Hinton, G. 2012, Neural Information Processing Systems, 25
Kuznetsov et al. (2019) Kuznetsov, M. K., del Burgo, C., Pavlenko, Y. V., & Frith, J. 2019, ApJ, 878, 134
Lecun et al. (1998) Lecun, Y., Bottou, L., Bengio, Y., & Haffner, P. 1998, Proceedings of the IEEE, 86, 2278
Lee et al. (2008a) Lee, Y. S., Beers, T. C., Sivarani, T., et al. 2008a, The Astronomical Journal, 136, 2050
Lee et al. (2008b) Lee, Y. S., Beers, T. C., Sivarani, T., et al. 2008b, The Astronomical Journal, 136, 2022
Li et al. (2021a) Li, J., Liu, C., Zhang, B., et al. 2021a, The Astrophysical Journal Supplement Series, 253, 45
Li et al. (2017) Li, X.-R., Pan, R.-Y., & Duan, F.-Q. 2017, Research in Astronomy and Astrophysics, 17, 036
Li et al. (2021b) Li, Z., Liu, F., Yang, W., Peng, S., & Zhou, J. 2021b, IEEE Transactions on Neural Networks and Learning Systems, PP, 1
Maldonado et al. (2015) Maldonado, J., Affer, L., Micela, G., et al. 2015, A&A, 577, A132
Mann et al. (2013a) Mann, A. W., Brewer, J. M., Gaidos, E., Lépine, S., & Hilton, E. J. 2013a, AJ, 145, 52
Mann et al. (2014) Mann, A. W., Deacon, N. R., Gaidos, E., et al. 2014, AJ, 147, 160
Mann et al. (2015) Mann, A. W., Feiden, G. A., Gaidos, E., Boyajian, T., & von Braun, K. 2015, ApJ, 804, 64
Mann et al. (2013b) Mann, A. W., Gaidos, E., & Ansdell, M. 2013b, ApJ, 779, 188
Marfil et al. (2021) Marfil, E., Tabernero, H. M., Montes, D., et al. 2021, A&A, 656, A162
Masseron et al. (2016) Masseron, T., Merle, T., & Hawkins, K. 2016, BACCHUS: Brussels Automatic Code for Characterizing High accUracy Spectra
Mayor et al. (2003) Mayor, M., Pepe, F., Queloz, D., et al. 2003, The Messenger, 114, 20
McInnes et al. (2018) McInnes, L., Healy, J., Saul, N., & Großberger, L. 2018, Journal of Open Source Software, 3, 861
Milosevic et al. (2021) Milosevic, S., Frank, P., Leike, R. H., Müller, A., & Enßlin, T. A. 2021, A&A, 650, A100
Montes et al. (2018) Montes, D., González-Peinado, R., Tabernero, H. M., et al. 2018, Monthly Notices of the Royal Astronomical Society, 479, 1332
Nemravová et al. (2016) Nemravová, J. A., Harmanec, P., Brož, M., et al. 2016, A&A, 594, A55
Ness et al. (2015) Ness, M., Hogg, D. W., Rix, H. W., Ho, A. Y. Q., & Zasowski, G. 2015, ApJ, 808, 16
Neves et al. (2012) Neves, V., Bonfils, X., Santos, N. C., et al. 2012, A&A, 538, A25
Neves et al. (2014) Neves, V., Bonfils, X., Santos, N. C., et al. 2014, A&A, 568, A121
Newton et al. (2015) Newton, E. R., Charbonneau, D., Irwin, J., & Mann, A. W. 2015, ApJ, 800, 85
Pan et al. (2008) Pan, S. J., Kwok, J. T.-Y., & Yang, Q. 2008, in AAAI Conference on Artificial Intelligence
Pan & Yang (2010) Pan, S. J. & Yang, Q. 2010, IEEE Transactions on Knowledge and Data Engineering, 22, 1345
Passegger et al. (2022) Passegger, V. M., Bello-García, A., Ordieres-Meré, J., et al. 2022, A&A, 658, A194
Passegger et al. (2020) Passegger, V. M., Bello-García, A., Ordieres-Meré, J., et al. 2020, A&A, 642, A22
Passegger et al. (2018) Passegger, V. M., Reiners, A., Jeffers, S. V., et al. 2018, A&A, 615, A6
Passegger et al. (2019) Passegger, V. M., Schweitzer, A., Shulyak, D., et al. 2019, A&A, 627, A161
Pecaut & Mamajek (2013) Pecaut, M. J. & Mamajek, E. E. 2013, ApJS, 208, 9
Pedregosa et al. (2011) Pedregosa, F., Varoquaux, G., Gramfort, A., et al. 2011, Journal of Machine Learning Research, 12, 2825
Pepe et al. (2021) Pepe, F., Cristiani, S., Rebolo, R., et al. 2021, A&A, 645, A96
Plez (2012) Plez, B. 2012, Turbospectrum: Code for spectral synthesis
Poggio et al. (2021) Poggio, E., Drimmel, R., Cantat-Gaudin, T., et al. 2021, A&A, 651, A104
Prieto et al. (2008) Prieto, C. A., Sivarani, T., Beers, T. C., et al. 2008, The Astronomical Journal, 136, 2070
Quirrenbach et al. (2016) Quirrenbach, A., Amado, P. J., Caballero, J. A., et al. 2016, in Ground-based and Airborne Instrumentation for Astronomy VI, ed. C. J. Evans, L. Simard, & H. Takami, Vol. 9908, International Society for Optics and Photonics (SPIE), 990812
Quirrenbach et al. (2020) Quirrenbach, A., Amado, P. J., Ribas, I., et al. 2020, in Ground-based and Airborne Instrumentation for Astronomy VIII, ed. C. J. Evans, J. J. Bryant, & K. Motohara, Vol. 11447, International Society for Optics and Photonics (SPIE), 114473C
Rabus et al. (2019) Rabus, M., Lachaume, R., Jordán, A., et al. 2019, MNRAS, 484, 2674
Rajpurohit et al. (2018) Rajpurohit, A. S., Allard, F., Rajpurohit, S., et al. 2018, A&A, 620, A180
Reid et al. (1995) Reid, I. N., Hawley, S. L., & Gizis, J. E. 1995, AJ, 110, 1838
Reiners et al. (2018) Reiners, A., Zechmeister, M., Caballero, J. A., et al. 2018, A&A, 612, A49
Reylé et al. (2021) Reylé, C., Jardine, K., Fouqué, P., et al. 2021, A&A, 650, A201
Rifai et al. (2011) Rifai, S., Muller, X., Glorot, X., et al. 2011, Computing Research Repository - CORR
Rodríguez Martínez et al. (2019) Rodríguez Martínez, R., Ballard, S., Mayo, A., et al. 2019, AJ, 158, 135
Rojas-Ayala et al. (2010) Rojas-Ayala, B., Covey, K. R., Muirhead, P. S., & Lloyd, J. P. 2010, ApJ, 720, L113
Rojas-Ayala et al. (2012) Rojas-Ayala, B., Covey, K. R., Muirhead, P. S., & Lloyd, J. P. 2012, ApJ, 748, 93
Sarmento et al. (2021) Sarmento, P., Rojas-Ayala, B., Delgado Mena, E., & Blanco-Cuaresma, S. 2021, A&A, 649, A147
Sarro et al. (2018) Sarro, L. M., Ordieres-Meré, J., Bello-García, A., González-Marcos, A., & Solano, E. 2018, MNRAS, 476, 1120
Schöfer et al. (2019) Schöfer, P., Jeffers, S. V., Reiners, A., et al. 2019, A&A, 623, A44
Schweitzer et al. (2019) Schweitzer, A., Passegger, V. M., Cifuentes, C., et al. 2019, A&A, 625, A68
Sharma et al. (2020) Sharma, K., Kembhavi, A., Kembhavi, A., et al. 2020, MNRAS, 491, 2280
Simonyan & Zisserman (2014) Simonyan, K. & Zisserman, A. 2014, arXiv e-prints, arXiv:1409.1556
Smirnov (1948) Smirnov, N. V. 1948, Annals of Mathematical Statistics, 19, 279
Smolinski et al. (2011) Smolinski, J. P., Lee, Y. S., Beers, T. C., et al. 2011, The Astronomical Journal, 141, 89
Souto et al. (2020) Souto, D., Cunha, K., Smith, V. V., et al. 2020, ApJ, 890, 133
Tabernero et al. (2022) Tabernero, H. M., Marfil, E., Montes, D., & González Hernández, J. I. 2022, A&A, 657, A66
Tang et al. (2014) Tang, J., Bressan, A., Rosenfield, P., et al. 2014, MNRAS, 445, 4287
Vafaei Sadr et al. (2020) Vafaei Sadr, A., Bassett, B. A., Oozeer, N., Fantaye, Y., & Finlay, C. 2020, MNRAS, 499, 379
Vernet et al. (2011) Vernet, J., Dekker, H., D’Odorico, S., et al. 2011, A&A, 536, A105
Veyette et al. (2017) Veyette, M. J., Muirhead, P. S., Mann, A. W., et al. 2017, ApJ, 851, 26
von Braun et al. (2014) von Braun, K., Boyajian, T. S., van Belle, G. T., et al. 2014, MNRAS, 438, 2413
Waskom (2021) Waskom, M. 2021, The Journal of Open Source Software, 6, 3021
Wilson et al. (2010) Wilson, J. C., Hearty, F., Skrutskie, M. F., et al. 2010, in Ground-based and Airborne Instrumentation for Astronomy III, ed. I. S. McLean, S. K. Ramsay, & H. Takami, Vol. 7735, International Society for Optics and Photonics (SPIE), 77351C
Yang et al. (2020) Yang, Q., Zhang, Y., Dai, W., & Pan, S. J. 2020, Transfer Learning (Cambridge University Press)
Yang & Li (2015) Yang, T. & Li, X. 2015, Monthly Notices of the Royal Astronomical Society, 452, 158
Yosinski et al. (2014) Yosinski, J., Clune, J., Bengio, Y., & Lipson, H. 2014, Advances in Neural Information Processing Systems (NIPS), 27
Yu et al. (2022) Yu, F., Xiu, X., & Li, Y. 2022, Mathematics, 10
Zechmeister et al. (2014) Zechmeister, M., Anglada-Escudé, G., & Reiners, A. 2014, A&A, 561, A59
Zechmeister et al. (2018) Zechmeister, M., Reiners, A., Amado, P. J., et al. 2018, A&A, 609, A12
Zhang et al. (2020) Zhang, B., Liu, C., & Deng, L.-C. 2020, The Astrophysical Journal Supplement Series, 246, 9
Zheng & Qiu (2020) Zheng, Z. & Qiu, B. 2020, Journal of Physics: Conference Series, 1626, 012017
Zuckerman (2018) Zuckerman, B. 2018, The Astrophysical Journal, 870, 27
Zuckerman et al. (2004) Zuckerman, B., Song, I., & Bessell, M. S. 2004, ApJ, 613, L65
Zuckerman et al. (2001) Zuckerman, B., Song, I., Bessell, M. S., & Webb, R. A. 2001, The Astrophysical Journal, 562, L87

Appendix A Additional tables

Table 5 is available in its entirety in electronic form at the CDS. This appendix only shows an extract of the table to facilitate its understanding.

Table 5: Stellar atmospheric parameters, together with their uncertainties, determined with our methodology. Only the first five rows of the table are shown.

Karmn	Name	$\alpha\,^{(a)}$	$\delta\,^{(a)}$	$T_{\rm eff}$	log $g$	[Fe/H]	$v\sin{i}$
		[J2016.0]	[J2016.0]	[K]	[dex]	[dex]	[km s^-1]
J00051+457	GJ 2	00:05:12.22	03:03:08.6	$3780_{-34}^{+41}$	$4.70_{-0.04}^{+0.01}$	$0.03_{-0.04}^{+0.06}$	$3.19_{-0.16}^{+0.48}$
J00067-075	GJ 1002	00:06:42.32	23:29:48.8	$3073_{-27}^{+18}$	$5.10_{-0.09}^{+0.04}$	$0.06_{-0.15}^{+0.08}$	$3.02_{-0.66}^{+0.33}$
J00162+198E	LP 404-062	00:16:16.96	01:19:26.6	$3362_{-25}^{+34}$	$4.90_{-0.06}^{+0.02}$	$0.07_{-0.17}^{+0.02}$	$2.13_{-0.22}^{+0.30}$
J00183+440	GX And	00:18:27.17	02:56:05.9	$3709_{-43}^{+15}$	$4.80_{-0.07}^{+0.03}$	$-0.33_{-0.17}^{+0.06}$	$2.02_{-0.30}^{+0.20}$
J00184+440	GQ And	00:18:30.07	02:56:06.9	$3251_{-13}^{+36}$	$4.96_{-0.03}^{+0.05}$	$-0.20_{-0.10}^{+0.09}$	$2.82_{-0.24}^{+0.22}$

¹⁸¹⁸18^(a) From Gaia DR3.

Appendix B Additional comparison with the literature

In this appendix, we provide an extensive comparison of this work with different results from the literature, as discussed in Section 4.2. Also, we repeat the comparison shown in Figs. 11, 12 and 13, but including the error bars. Table 6 replicates Table 4 for the additional literature collections. Figures 15, 16, 17, 18, 19, 20, 21, 22, and 23 show the comparison with Bello23, Mar21, Pass20, Pass19, Schw19, Passegger et al. (2018), Mann et al. (2015), Gaidos et al. (2014), and Gaidos & Mann (2014), respectively.

Table 6: Comparison between our results and the additional literature collections. The structure is the same as in Table 4.

Reference	$T_{\rm eff}$ [K]			log $g$ [dex]			[Fe/H] [dex]
	$\overline{\Delta}$	rmse	$r_{\rm p}$	$\overline{\Delta}$	rmse	$r_{\rm p}$	$\overline{\Delta}$	rmse	$r_{\rm p}$
Pass18 ${}^{\,(a)}$	-59	98	0.96	0.12	0.14	0.89	0.01	0.09	0.73
Mann15 ${}^{\,(b)}$	-109	136	0.96	…	…	…	0.04	0.11	0.89
Gaid14 ${}^{\,(c)}$	-69	151	0.87	…	…	…	0.05	0.14	0.75
GM14 ${}^{\,(d)}$	-42	102	0.93	…	…	…	0.04	0.10	0.88