(1) Centre for ExoLife Sciences, Niels Bohr Institute, Øster Voldgade 5, 1350 Copenhagen, Denmark
(2) Institute of Astronomy, KU Leuven, Celestijnenlaan 200D, 3001, Leuven, Belgium
(3) Max-Planck-Institut für Astronomie, Königstuhl 17, 69117 Heidelberg, Germany
(4) Montefiore Institute, University of Liège, Liège, Belgium
(5) Space Research Institute, Austrian Academy of Sciences, Schmiedlstrasse 6, A-8042 Graz, Austria

To understand high precision observations of exoplanets and brown dwarfs, we need detailed and complex general circulation models (GCMs) that incorporate hydrodynamics, chemistry, and radiation. For this study, we specifically examined the coupling between chemistry and radiation in GCMs and compared different methods for the mixing of opacities of different chemical species in the correlated-k assumption, when equilibrium chemistry cannot be assumed. We propose a fast machine learning method based on DeepSets (DS), which effectively combines individual correlated-k opacities (k-tables). We evaluated the DS method alongside other published methods such as adaptive equivalent extinction (AEE) and random overlap with rebinning and resorting (RORR). We integrated these mixing methods into our GCM (expeRT/MITgcm) and assessed their accuracy and performance for the example of the hot Jupiter HD 209458 b. Our findings indicate that the DS method is both accurate and efficient for GCM usage, whereas RORR is too slow. Additionally, we observed that the accuracy of AEE depends on its specific implementation and may introduce numerical issues in achieving radiative transfer solution convergence. We then applied the DS mixing method in a simplified chemical disequilibrium situation, where we modeled the rainout of TiO and VO, and confirmed that the rainout of TiO and VO would hinder the formation of a stratosphere. To further expedite the development of consistent disequilibrium chemistry calculations in GCMs, we provide documentation and code for coupling the DS mixing method with correlated-k radiative transfer solvers. The DS method has been extensively tested to be accurate enough for GCMs; however, other methods might be needed for accelerating atmospheric retrievals.

Harnessing machine learning for accurate treatment of overlapping opacity species in general circulation models

Aaron David Schneider (1,2), Paul Mollière (3), Gilles Louppe (4), Ludmila Carone (5), Uffe Gråe Jørgensen (1), Leen Decin (2), and Christiane Helling (5)


(December 6, 2023)

Key words: Radiation: dynamics – Radiative transfer – Planets and satellites: atmospheres – Planets and satellites: gaseous planets – Methods: numerical

Offprints: A. Schneider

1 Introduction

General circulation models (GCMs) have been applied with a lot of success to understand the 3D nature of exoplanetary atmospheres (for a review see Showman et al., 2020). These models usually consist of a dynamical core that solves the equations of hydrodynamics, coupled to physical parameterizations with differing complexity that describe the forcing on the temperature and velocity field. A very common physical parameterization in GCMs is that of heating and cooling by (gas) radiative transfer.

With the advent of detailed spectra from medium-resolution space-based telescopes such as JWST, we will soon have the ability to map the spatial distribution of chemical species in the atmospheres of hot gas giants. Ground-based high-resolution spectroscopy already allows us to measure chemical variations between morning and evening terminators (e.g., Ehrenreich et al., 2020; Kesseli & Snellen, 2021; Kesseli et al., 2022). To comprehend these spatial distribution maps, we need 3D numerical models that can couple hydrodynamics, chemistry, and radiative transport. Only one GCM, the unified model (UM), currently possesses this capability (Drummond et al., 2020; Zamyatina et al., 2023). Lee et al. (2023) employed a faster chemical network, but did not incorporate a chemically consistent radiative transfer solver. However, these studies consistently indicate long runtimes.

One of the challenges of such models is the consistent coupling between chemical abundances and line opacities as used in the radiative transfer (Amundsen et al., 2017). Line opacities of molecular species in the pressure-temperature range of warm and hot gas giants often are a collection of millions of lines, which must also be accurately accounted for in low resolution (fast) radiative transfer. It was therefore realized early on that rapid calculations would need some simplifications. One of these is the correlated-k method (Goody et al., 1989), which is similar to the method of opacity distribution functions (ODFs) introduced for stellar atmospheres (Gustafsson et al., 1975). The correlated-k method as well as the ODF method converts the wavelength-dependent opacity into distribution functions (called k-tables or ODFs) by sorting opacity values within spectral bins (see Sect. 2). This approach captures the dynamic range of the opacity and allows radiative transfer calculations with a small set of spectral bins (usually five to 30 for GCMs and a few hundred for spectra) and an accuracy of a few percent on bolometric fluxes (e.g., Amundsen et al., 2014; Leconte, 2021; Schneider et al., 2022b).

The main drawback of the correlated-k method is the loss of all wavelength information within a spectral bin when the opacity is converted to k-tables. Since the wavelength distribution of the opacity is depth-dependent, this loss causes, to first order, erroneous optical depths through the atmosphere, and hence erroneous radiative transfer and energy balance. This can be corrected for in statistical ways, as described in detail below using the RORR method (see Sect. 2.1). The corresponding ODF scheme traditionally used in older stellar atmosphere computations has the same problem, which was discussed for example in Saxner & Gustafsson (1984), who concluded that the increase in computing time as a function of the number of individual opacity species made it unfeasible to continue the ODF scheme for cooler stars. Newer stellar models are therefore usually computed with the opacity sampling scheme, as discussed for example in Jorgensen (1992), Gustafsson & Jorgensen (1994), and Helling & Jorgensen (1998). It is, however, not obvious how strong atomic lines should be treated correctly in the opacity sampling scheme. If these are important in exoplanetary atmospheres alongside the multitude of molecular lines, the correlated-k method may turn out to be superior, or a new hybrid method may be needed.

In order to still use the correlated-k method, GCMs often use premixed k-tables, which tabulate the k-tables for a given chemical mixture assuming that the gas is in chemical equilibrium and abundances can be constrained as a function of pressure and temperature alone (e.g., Showman et al., 2009; Lee et al., 2021; Schneider et al., 2022b; Deitrick et al., 2022). The k-table at a certain grid point is then recovered by interpolating on the pressure-temperature dependent premixed grid of k-tables. This approach, however, is not chemically correct when the gas is not in chemical equilibrium, which is the case if processes such as chemical kinetics or photochemistry are taken into account in the model.

The UM (Amundsen et al., 2016) is, to our knowledge, currently the only hot Jupiter GCM that can handle k-table mixing during runtime without the need for premixed tables. Using a 1D radiative transfer code, Amundsen et al. (2017) benchmarked several numerical schemes that can approximate k-table mixing. Treating the k-table mixing during runtime rather than using premixed tables introduces more freedom in the radiative transfer computation, but for gases with many opacity sources this is only practical if the mixing can be performed sufficiently fast. We therefore expand upon Amundsen et al. (2017) in this paper by introducing a new machine-learning-accelerated technique and by coupling these methods to our GCM (expeRT/MITgcm). This paper starts by introducing the correlated-k method and several approximate methods for k-table mixing in Sect. 2. We then discuss the setup of our GCM in Sect. 3, show the benchmarking results in Sect. 4, and present a simple disequilibrium application in Sect. 5, where we apply our model to a simple rainout situation in which the condensation of TiO and VO is approximated. We finally conclude and discuss the implications of this work in Sect. 6.

2 Mixing species

According to the Lambert-Beer law, the total transmission $\mathcal{T}$ of light passing through a homogeneous slab of gas with density $\rho$ [$\mathrm{kg\,m^{-3}}$], opacity $\kappa$ [$\mathrm{m^{2}\,kg^{-1}}$], and thickness $d$ [$\mathrm{m}$] in the spectral window from frequency $\nu_{0}$ [$\mathrm{Hz}$] to $\nu_{1}$ [$\mathrm{Hz}$] is given by

\mathcal{T}=\frac{1}{\nu_{1}-\nu_{0}}\int_{\nu_{0}}^{\nu_{1}}\exp\left(-\kappa(\nu)\rho d\right)\mathrm{d}\nu.

(1)

In the correlated-k method, the integral of Eq. 1 is solved by substituting $\nu$ ,

\mathcal{T}=\int_{0}^{1}\exp\left(-\kappa(g)\rho d\right)\mathrm{d}g,

(2)

such that the integration over frequency is substituted by an integration over a new independent variable $g$ . This independent variable $g$ represents the cumulative opacity distribution function and is then given by the opacity distribution function $f$ , such that

g(\kappa)=\int_{0}^{\kappa}f(\kappa^{\prime})\mathrm{d}\kappa^{\prime}.

(3)

The cumulative opacity distribution function $g$ can be understood as the probability of finding an opacity value of less than $\kappa$ at a specific frequency $\nu$. While Eqs. 1 and 2 are formally identical, since they only differ in the order in which the sum is evaluated, they might differ significantly in the discrete limit, where individual summation points need to represent the value of the opacity for a certain nonzero width $\Delta\nu$ or $\Delta g$. In practice, the correlated-k method divides the total computed frequency range into coarse frequency bins, in which all integrals over frequency are replaced by integrals over $g$. The radiative transfer equation for the intensity $I(\nu,g)$ [$\mathrm{W\,m^{-2}\,sr^{-1}\,Hz^{-1}}$] of the coarse frequency bin between $\nu_{0}$ and $\nu_{1}$ at the discrete sub-bin $g$ in $g$-space then becomes

\mathbf{n}\cdot\nabla I(\nu,g)=\rho\kappa(\nu,g)\left[S(\nu,g)-I(\nu,g)\right],

(4)

where $S(\nu,g)$ [$\mathrm{W\,m^{-2}\,sr^{-1}\,Hz^{-1}}$] is the source function and $\mathbf{n}$ is the unit vector of the direction in which the intensity is measured. Thus, in each of these coarse frequency bins, the radiative transfer is solved individually for each $g$ grid value and afterward integrated over $g$, similarly to Eq. 2, to obtain the intensity $I$ of the coarse frequency bin from $\nu_{0}$ to $\nu_{1}$. In this way, the correlated-k method allows for rapid calculations by requiring fewer radiative transfer computations for the same level of accuracy.
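The equivalence of Eqs. 1 and 2 can be illustrated with a minimal sketch. It assumes a synthetic log-normal opacity spectrum within one coarse bin, a Gauss-Legendre grid of 16 $g$ values, and interpolation of the sorted opacities in log space; the function names are hypothetical and none of these choices are taken from expeRT/MITgcm.

```python
import numpy as np

def transmission_lbl(kappa_nu, rho, d):
    """Eq. 1 on a uniform frequency grid: mean of exp(-kappa * rho * d)."""
    return np.mean(np.exp(-kappa_nu * rho * d))

def k_table(kappa_nu, n_g=16):
    """Sort the opacities of one bin and sample them at Gauss-Legendre g points."""
    nodes, weights = np.polynomial.legendre.leggauss(n_g)
    g = 0.5 * (nodes + 1.0)          # map nodes from [-1, 1] to [0, 1]
    w = 0.5 * weights                # weights now sum to one
    kappa_sorted = np.sort(kappa_nu)
    # cumulative probability assigned to each sorted opacity value
    g_fine = (np.arange(kappa_sorted.size) + 0.5) / kappa_sorted.size
    kappa_g = np.exp(np.interp(g, g_fine, np.log(kappa_sorted)))
    return g, w, kappa_g

def transmission_ck(kappa_g, w, rho, d):
    """Eq. 2 as a quadrature sum over the g grid."""
    return np.sum(w * np.exp(-kappa_g * rho * d))

# Synthetic opacity spectrum (an assumption made purely for illustration).
rng = np.random.default_rng(0)
kappa_nu = np.exp(rng.normal(0.0, 1.0, 20000))
g, w, kappa_g = k_table(kappa_nu)
print(transmission_lbl(kappa_nu, 1.0, 1.0), transmission_ck(kappa_g, w, 1.0, 1.0))
```

In this sketch, 16 quadrature points in $g$ stand in for 20000 frequency points, which is the source of the speed-up described above.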

To obtain the correct k-tables for a mixture of individual species, one would need to sum up the contributions of the individual species, weighted with their abundances, and construct the opacity distribution function of the total opacity. Several methods have been put forward to solve this issue, and we explain how we tested some of these in this work. It is important to note that the opacity distribution functions vary as a function of the pressure, temperature, and composition of the gas. Thus, in order to use the correlated-k method, one requires methods to construct these $\kappa(\nu,g)$ (k-table) values accurately.

2.1 Random overlap with rebinning and resorting (RORR)

Both Saxner & Gustafsson (1984) and Lacis & Oinas (1991) independently introduced a similar method, for ODFs and k-tables respectively, that treats the mixing of multiple opacity species under the assumption that the distributions of the individual opacity species are not correlated:

f_{\mathrm{tot}}(\kappa_{1},\dots,\kappa_{N_{s}})=f(\kappa_{1})\cdot\ldots\cdot f(\kappa_{N_{s}}).

(5)

In simple terms, this means that, for example, the line cores are randomly distributed and do not systematically occur at the same frequencies. In the case of correlated-k, this method is called the random overlap with rebinning and resorting (RORR) method. An in-depth introduction to the RORR method can be found in Amundsen et al. (2017) and in Mollière et al. (2015), so we only briefly outline its basic function here.

The core of the RORR method is the assumption that the opacities of two species are uncorrelated. This then implies that their transmissions are also uncorrelated (Mollière et al., 2015). From Eq. 1, we can then see that the transmissions of both species can be multiplied to get the total transmission. It is then possible to find the k-table of a mixture by convolving the probability distributions of the two species (e.g., Mollière et al., 2015; Amundsen et al., 2017). The final result of the convolution can be resorted and binned back to the original $g$ grid for further computations. Repeating this procedure with the combined k-table of the two species and a third species yields the next step. This procedure is then repeated until all opacity species are included in the total k-table (Amundsen et al. (2017) provide a useful visualization of the procedure in their Fig. 1). Of the methods outlined in this work, RORR is the slowest but most accurate method. Furthermore, RORR is well benchmarked against line-by-line calculations (e.g., Amundsen et al., 2014; Mollière et al., 2015). It is therefore the method of choice for most correlated-k 1D atmosphere models, such as ATMO (Tremblin et al., 2015), petitRADTRANS (Mollière et al., 2019), or PICASO (Mukherjee et al., 2023).
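The convolve, resort, and rebin steps can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the implementation used in expeRT/MITgcm or petitRADTRANS: the k-tables are assumed to be already weighted by abundance and given on a shared Gauss-Legendre $g$ grid, the rebinning is done by interpolating in log opacity, and the function names are hypothetical.

```python
import numpy as np

def gauss_g_grid(n_g=16):
    """Gauss-Legendre nodes and weights mapped to the interval [0, 1]."""
    nodes, weights = np.polynomial.legendre.leggauss(n_g)
    return 0.5 * (nodes + 1.0), 0.5 * weights

def rorr_pair(kappa_a, kappa_b, g, w):
    """Mix two abundance-weighted k-tables of one bin, assuming random overlap."""
    # Convolution: every pairing (i, j) occurs with probability w_i * w_j.
    kappa_sum = (kappa_a[:, None] + kappa_b[None, :]).ravel()
    w_sum = (w[:, None] * w[None, :]).ravel()
    # Resorting: order the n_g**2 combined opacities by value.
    order = np.argsort(kappa_sum, kind="stable")
    kappa_sorted = kappa_sum[order]
    w_sorted = w_sum[order]
    # Cumulative probability at the centre of each sorted element.
    g_sorted = np.cumsum(w_sorted) - 0.5 * w_sorted
    # Rebinning: interpolate back onto the original g grid (in log opacity).
    return np.exp(np.interp(g, g_sorted, np.log(kappa_sorted)))

def rorr(k_tables, g, w):
    """Mix N_s k-tables by applying the pairwise step N_s - 1 times."""
    mixed = k_tables[0]
    for kappa_i in k_tables[1:]:
        mixed = rorr_pair(mixed, kappa_i, g, w)
    return mixed
```

Each pairwise step handles $N_g^2$ combined values, including a sort, which is why the cost grows with both the number of species and the number of $g$ points.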

2.2 Premixed k-tables

Assuming equilibrium chemistry, one can create k-tables as lookup tables of pressure and temperature, which can be interpolated on during the radiative transfer calculations. It is important to note that these premixed tables are subject to the exact input to the equilibrium chemistry calculations (e.g., metallicity or C/O ratio) and need to be recomputed if the atmosphere is expected to deviate from these. Premixed tables can be computed in multiple ways. Showman et al. (2009) calculated premixed tables by calculating the distribution functions of the mixture. In Schneider et al. (2022b), we have calculated these k-tables using RORR on the individual k-tables. As already pointed out in Amundsen et al. (2017), the accuracy of this approach is subject to the pressure and temperature resolution of the lookup table, since the equilibrium abundances are expected to vary by many orders of magnitude, much more than the individual opacities themselves. In Schneider et al. (2022b), we have therefore computed the lookup tables, such that they match the pressure grid used in the GCM, removing the need to interpolate in pressure and allowing for a fine grid in temperature (1000 temperature points). While these lookup tables can be very precise, if resolved sufficiently, they come at the cost of flexibility, since they require assumptions on the abundances as a function of temperature and pressure.

2.3 Summation

In fact, the easiest way to calculate the mixed k-table from a mixture of different opacity species is to approximate the convolution by a sum:

\kappa_{\mathrm{tot}}(\nu,g)=\sum_{i=1}^{N_{s}}\kappa_{i}(\nu,g),

(6)

where $N_{s}$ is the number of species and $\kappa_{i}$ is the individual k-table of species $i$, weighted with its mass mixing ratio. While there is no logical justification for this approach, it is certainly the fastest method, as it only requires the evaluation of the sum of k-tables. Although this makes the approach attractive, it will naturally underestimate $\kappa_{\mathrm{tot}}$ at small $g$ and overestimate $\kappa_{\mathrm{tot}}$ at large $g$. This can be best seen in RORR (for a visualization, see Fig. 1 of Amundsen et al., 2017), where the evaluation of the convolution would add $\kappa_{i}$ values from larger $g$ as well as those from smaller $g$ to the $\kappa_{\mathrm{tot}}$ values at small $g$ (see Appendix A for a quantitative comparison). This is particularly important, since the small $g$ values, which encode the small $\kappa_{\mathrm{tot}}$ values, determine the depth down to which stellar irradiation can be absorbed.
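The direction of this bias can be checked with a small numerical experiment. The sketch below assumes two synthetic, statistically independent log-normal opacity spectra within one bin (an illustrative choice, not data from this work) and compares the sum of the individual k-distributions (Eq. 6) with the k-distribution of the summed opacity.

```python
import numpy as np

def k_distribution(kappa_nu, g):
    """Opacity at cumulative probability g, from the sorted opacities of one bin."""
    kappa_sorted = np.sort(kappa_nu)
    g_fine = (np.arange(kappa_sorted.size) + 0.5) / kappa_sorted.size
    return np.interp(g, g_fine, kappa_sorted)

# Two uncorrelated synthetic species (assumption for illustration only).
rng = np.random.default_rng(1)
n_nu = 50_000
kappa_1 = np.exp(rng.normal(0.0, 1.0, n_nu))
kappa_2 = np.exp(rng.normal(0.0, 1.0, n_nu))
g = np.linspace(0.01, 0.99, 16)

# k-distribution of the total opacity, constructed on the frequency grid.
k_exact = k_distribution(kappa_1 + kappa_2, g)
# Eq. 6: sum of the individual k-distributions at equal g.
k_add = k_distribution(kappa_1, g) + k_distribution(kappa_2, g)

ratio = k_add / k_exact
print(ratio[0], ratio[-1])  # below one at small g, above one at large g
```

The sum pairs the smallest values of both species at small $g$ and the largest values at large $g$, which is the opposite extreme of random overlap.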

2.4 Adaptive equivalent extinction

Adaptive equivalent extinction (AEE) is a variation of equivalent extinction (Edwards & Slingo, 1996) introduced by Amundsen et al. (2017). The idea is to determine the most important species and then treat all other species as gray within the spectral band. To obtain the major absorber, one first calculates semi-gray opacities for each species. The semi-gray opacity of species $i$ is calculated as an average of the k-table in a given spectral bin and is weighted with a function $w$ that depends on the value of $g$:

\kappa_{\mathrm{av},i}(\nu)=\frac{\int_{0}^{1}\kappa_{i}(\nu,g)w(\nu,g)\mathrm{d}g}{\int_{0}^{1}w(\nu,g)\mathrm{d}g}.

(7)

We show below that the choice of the weighting function $w$ has a significant impact on the accuracy of the AEE method. A good measure for the importance of the opacity at a given $g$ value for the accuracy of the radiative transfer calculation is the magnitude of the flux at that value of $g$ (Amundsen et al., 2017). We thus chose to use the absolute values of the stellar and planetary fluxes through a $g$ value as a weighting function. However, since we can only know the fluxes once we have already mixed the k-tables and calculated the radiative transfer, we need to rely on the planetary and stellar fluxes from a previous radiative transfer calculation, which in the case of GCMs is the previous radiative time-step.

Using these $\kappa_{\mathrm{av},i}(\nu)$ values, a major absorber is found by vertically integrating the transmission (Eq. 1) from the top of the atmosphere down to an optical depth of one for each species. The first species to reach an optical depth of one is then used as the major absorber in the vertical column. The final total opacity in each spectral bin is then given as

\kappa_{\mathrm{tot}}(\nu,g)=\kappa_{m}(\nu,g)+\sum_{i\neq m}^{N_{s}}\kappa_{\mathrm{av},i}(\nu),

(8)

where $m$ is the species identified as the major absorber. We note that the UM uses a slightly different and less sophisticated method (called equivalent extinction or EE), in which the major absorber is determined locally, without integrating over the atmospheric column. A detailed introduction to the AEE method can be found in Amundsen et al. (2017).
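The three steps of AEE (Eq. 7, the search for the major absorber, and Eq. 8) can be sketched as follows. This is a simplified illustration with hypothetical function names. It assumes a discrete $g$ grid with widths `dg`, layers ordered from the top of the atmosphere downward, and two conventions that are not specified in the text: ties are resolved in favor of the lowest species index, and if no species reaches an optical depth of one, the species with the largest column optical depth is chosen.

```python
import numpy as np

def weighted_gray_opacity(kappa, weight, dg):
    """Eq. 7: weighted average over g. kappa has shape (n_species, n_g)."""
    return np.sum(kappa * weight * dg, axis=-1) / np.sum(weight * dg)

def major_absorber(kappa_av, rho, dz):
    """Index of the species that first reaches an optical depth of one.

    kappa_av: (n_layers, n_species) gray opacities, ordered top to bottom.
    rho, dz:  (n_layers,) densities and layer thicknesses.
    """
    tau = np.cumsum(kappa_av * (rho * dz)[:, None], axis=0)
    reached = tau >= 1.0
    n_layers = tau.shape[0]
    # First layer in which each species reaches tau = 1 (n_layers if never).
    first_layer = np.where(reached.any(axis=0), reached.argmax(axis=0), n_layers)
    if np.all(first_layer == n_layers):
        return int(np.argmax(tau[-1]))   # fallback convention (assumption)
    return int(np.argmin(first_layer))   # ties go to the lowest index (assumption)

def aee_mix(kappa, kappa_av, m):
    """Eq. 8 for one layer and one bin: major absorber plus gray minor absorbers."""
    gray = np.sum(kappa_av) - kappa_av[m]
    return kappa[m] + gray
```

Passing the flux magnitudes of the previous radiative time-step as `weight` would correspond to the weighted variant, while a constant `weight` would correspond to the unweighted one.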

2.5 DeepSet approach

The RORR mixing approach has three important attributes. Firstly, the method stays the same no matter how many species are mixed with each other and in what order. RORR is thus invariant to permutations in the set of opacities that are to be mixed. Secondly, to a first approximation, RORR can be approximated by a simple sum, as mentioned in Sect. 2.3 and verified below. Lastly, although RORR is computationally expensive compared to the other outlined approaches, at its core it consists, for each frequency bin, only of a convolution, a sorting step, and an interpolation step, which are repeated $N_{s}-1$ times. These three attributes greatly constrain a possible emulation of RORR by a machine learning algorithm, since only algorithms with fast inference, flexible input size, and permutation invariance can be used.

We tested several architectures, such as a convolutional network similar to the U-net (Ronneberger et al., 2015) and gradient-boosted regression trees that use the abundances as input and the mixed opacities as output, implemented with XGBoost (Chen & Guestrin, 2016). With the U-net, we ended up needing too many convolution blocks, and with XGBoost, we needed structures that were too deep and therefore too memory-consuming to reach reasonable accuracy. One of the reasons for the poor performance of these methods in our context is that they are too different from a simple sum (see Sect. 2.3). We have therefore settled on a DeepSet approach (Zaheer et al., 2017). A DeepSet for our case of k-table mixing can be written as

y=\aleph\left(\bigoplus_{i=1}^{N_{s}}\beth\left(X_{i}\right)\right),

(9)

where $y$ is the response of the DeepSet (the mixed k-tables; see Eq. 13 below), $X=\{X_{1},\dots,X_{N_{s}}\}$ is the set of input vectors (the individual k-tables; see Eq. 12 below), and $\aleph$ and $\beth$ are functions.

Figure 1: Neural network used in the DeepSet (DS) approach.

Simply put, the idea is to use a function $\beth$ to encode each vector $X_{i}$ of a set into a hidden representation $\beth(X_{i})$. These representations are then summed up (we note that any permutation-invariant operation could be used in this step) and subsequently decoded by the function $\aleph$ to get the output. We illustrate the concept of DeepSets (Zaheer et al., 2017) for the problem of k-table mixing in Fig. 1. We used

\beth(X_{i})=\max(a_{1}\cdot X_{i},0)

(10)

and

\aleph(z)=a_{2}\cdot z,

(11)

with weights $a_{1}$ and $a_{2}$, which are matrices of size $N_{g}\times N_{g}$. Whereas $\aleph$ is a linear function, $\beth$ is nonlinear due to the inclusion of a rectified linear activation (ReLU). Different, more complex functions could also be used, but we found that the accuracy reached by using these simple functions is already sufficient. More complex functions would thus only result in slower performance. The weights $a_{1}$ and $a_{2}$ of the functions $\beth$ and $\aleph$ are then learned by a neural network.

To train the neural network, we chose to implement the network in Keras (Chollet et al., 2015). The weights are learned by minimizing the mean squared error using the Adam optimizer (Kingma & Ba, 2014). We then performed a Bayesian hyperparameter search using hyperopt (Bergstra et al., 2012) to find the best number of features for the hidden representation and to determine the optimal learning rate of the optimizer. The loss did not improve significantly with the use of more than $N_{g}$ features in the hidden representation, and we therefore chose to use $N_{g}$ features. Additionally, we found that a learning rate of $\alpha=1\times10^{-3}$ seems to perform best.

The neural network acts on each frequency bin individually and is therefore independent of the frequency resolution. To generate the training and test data, we computed the mixed k-tables with RORR from $\approx 8\times10^{5}$ sets of 14 k-tables each. These 14 k-tables were taken from the 11-bin resolution (S1) of the 14 opacity species taken into account in expeRT/MITgcm (Schneider et al., 2022b; see Sect. 3 for details on the species) and were uniformly randomly weighted with reasonable abundance ranges. We note that the network can, by construction, deal with any number of species without retraining. The input for the network is not the plain individual k-tables; instead, we scale them with the sum of the k-tables as

X_{i}(\nu,g)=\log\left(\frac{\kappa_{i}(\nu,g)}{\sum_{j=1}^{N_{s}}\kappa_{j}(\nu,g)}\right).

(12)

Similarly, we scale the targets (i.e., the quantities predicted by the network) by

y(\nu,g)=\log\left(\frac{\kappa_{\mathrm{tot}}(\nu,g)}{\sum_{j=1}^{N_{s}}\kappa_{j}(\nu,g)}\right).

(13)

The advantage of this input and output scaling is threefold. Firstly, we are interested in minimizing the error of the small values in the k-table, as those are the ones that generate windows in the spectrum, which are very important for the temperature structure. Taking the logarithm is therefore useful because it emphasizes relatively large deviations of small k-table values in the loss. Secondly, we find that the sum of the k-tables is already a reasonable approximation for the mixed k-tables (see Sect. 4). Thirdly, the output scaling naturally captures the positivity of the problem, since reversing the output scaling cannot produce negative predictions. It is also important to stress that all species $\kappa_{i}(g)$ are fed individually through the same function $\beth$ (with the same weights) to create a unique nonlinear representation for each species.
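Equations 9 to 13 amount to a short forward pass, sketched here in plain NumPy for a single frequency bin. The weights below are random placeholders rather than the trained weights of this work, and the function name `deepset_mix` is hypothetical; the sketch only shows how the scaling, the shared encoder, the permutation-invariant sum, and the decoder fit together.

```python
import numpy as np

def deepset_mix(kappa, a1, a2):
    """DeepSet forward pass for one frequency bin.

    kappa: (n_species, n_g) abundance-weighted k-tables.
    a1:    (n_hidden, n_g) encoder weights, shared by all species.
    a2:    (n_g, n_hidden) decoder weights.
    """
    kappa_sum = kappa.sum(axis=0)           # simple sum used for the scaling
    x = np.log(kappa / kappa_sum)           # Eq. 12: scaled inputs
    hidden = np.maximum(x @ a1.T, 0.0)      # Eq. 10: ReLU(a1 . X_i) per species
    z = hidden.sum(axis=0)                  # permutation-invariant pooling
    y = a2 @ z                              # Eq. 11: linear decoder
    return kappa_sum * np.exp(y)            # invert the scaling of Eq. 13

# Placeholder inputs and weights (assumptions, for illustration only).
rng = np.random.default_rng(2)
n_species, n_g = 14, 16
kappa = np.sort(np.exp(rng.normal(0.0, 2.0, (n_species, n_g))), axis=1)
a1 = 0.05 * rng.normal(size=(n_g, n_g))
a2 = 0.05 * rng.normal(size=(n_g, n_g))
kappa_tot = deepset_mix(kappa, a1, a2)
```

Because the decoder output multiplies the simple sum through an exponential, the prediction is positive for any weights, and a decoder output of zero reduces the method to the summation of Sect. 2.3.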

The main advantage of the DeepSet method is its flexibility: because it is trained on the individual frequency bins of k-tables, it operates independently of chemical composition or opacity species. By construction of the training set, it can operate on any composition and metallicity. Furthermore, we think that it can in principle work on any set of k-tables with shapes that are similar enough to those of the training set. We tested this by applying the network to a frequency resolution different from the one it was trained on and found that the accuracy was not significantly affected. We therefore think that it would only be necessary to retrain the network if the discretization of $g$ values changes. We discuss further numerical considerations of this mixing method in Appendix B.

3 Methods

Table 1: Simulations

label	mixing method	reference
RORR	random overlap with rebinning and resorting (RORR)	Sect. 2.1
PRE	premixed k-tables	Sect. 2.2
DS	DeepSet approach	Sect. 2.5
AEE_we	adaptive equivalent extinction with flux weighting	Sect. 2.4
AEE	adaptive equivalent extinction without flux weighting	Sect. 2.4
ADD	sum of all k-tables	Sect. 2.3


Notes: Explanation of the mixing methods used in the individual simulations, as labeled in the figures of this work.

To test the individual mixing methods, we use the 3D GCM expeRT/MITgcm (Carone et al., 2020; Schneider et al., 2022b). expeRT/MITgcm builds on the dynamical core of the MITgcm (Adcroft et al., 1997, 2004), which solves the hydrostatic primitive equations of hydrodynamics on a cubed-sphere grid. In order to accurately account for radiative heating and cooling, expeRT/MITgcm solves the radiative transfer using the Feautrier method (Feautrier, 1964) with approximate Lambda iteration (Olson et al., 1986) and Ng-acceleration (Ng, 1974). The routine that solves the radiative transfer is an adapted version of the flux routine in petitRADTRANS (Mollière et al., 2019, 2020). We have incorporated the radiative transfer solver and benchmarked it in expeRT/MITgcm. We found in Schneider et al. (2022b) that the combination of five frequency bins and 16 $g$ values achieves sufficient accuracy for long-term convergence studies such as those in Schneider et al. (2022a) and Sainsbury-Martinez et al. (2023). In this work, we use 11 frequency bins, each with 16 $g$ values, which is sufficient for comparisons of GCMs to observations. We note that other hot Jupiter GCMs typically use eight $g$ values, sometimes with a higher frequency resolution of $\approx 30$ frequency bins (Showman et al., 2009; Amundsen et al., 2016; Lee et al., 2021). Future studies should investigate whether using fewer frequency points and more $g$ values is more accurate than using fewer $g$ values and more frequency points. The line opacity species used in this work are identical to the ones used in Schneider et al. (2022b) and include H$_{2}$O (Polyansky et al., 2018), CO$_{2}$ (Yurchenko et al., 2020), CH$_{4}$ (Yurchenko et al., 2017), NH$_{3}$ (Coles et al., 2019), CO (Li et al., 2015), H$_{2}$S (Azzam et al., 2016), HCN (Barber et al., 2014), PH$_{3}$ (Sousa-Silva et al., 2015), TiO (McKemmish et al., 2019), VO (McKemmish et al., 2016), FeH (Wende et al., 2010), Na (Piskunov et al., 1995), and K (Piskunov et al., 1995).

We chose HD 209458 b as the test planet. The setup is identical to that in Schneider et al. (2022b), and the models in this work differ only in the method with which opacities are mixed. The different mixing methods and their corresponding labels are laid out in Table 1. To this end, we have implemented each of the abovementioned mixing methods. expeRT/MITgcm can now run in two modes, either mixing k-tables during runtime (utilizing one of the aforementioned methods) or using premixed k-tables. To incorporate mixing during runtime in the GCM, we updated our preprocessing toolkit to additionally output a pressure-temperature grid of equilibrium abundances (taken from the easyCHEM (Mollière et al., 2017) interface to petitRADTRANS (Mollière et al., 2019)), along with a pressure-temperature grid of k-tables for the individual absorbers. In the on-the-fly mixing mode, the abundances and k-tables of each of the considered absorbers are linearly interpolated to the pressure and temperature in the GCM; the k-tables are then weighted by the abundances and mixed by one of the abovementioned methods.
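The interpolation and weighting step that precedes the mixing can be sketched as follows. The array layout, the function names, and the choice to interpolate linearly in $\log_{10}$ of pressure and linearly in temperature are assumptions made for this illustration; the text above only states that the interpolation is linear.

```python
import numpy as np

def bilinear_weights(grid, value):
    """Index of the lower grid neighbour and fractional distance, clamped to the grid."""
    i = np.clip(np.searchsorted(grid, value) - 1, 0, len(grid) - 2)
    t = np.clip((value - grid[i]) / (grid[i + 1] - grid[i]), 0.0, 1.0)
    return i, t

def interpolate_table(table, p_grid, t_grid, p, temp):
    """Bilinear interpolation of a table with shape (n_p, n_t, ...) to (p, temp)."""
    i, u = bilinear_weights(np.log10(p_grid), np.log10(p))  # log-pressure axis (assumption)
    j, v = bilinear_weights(t_grid, temp)
    return ((1 - u) * (1 - v) * table[i, j] + u * (1 - v) * table[i + 1, j]
            + (1 - u) * v * table[i, j + 1] + u * v * table[i + 1, j + 1])

def weighted_k_tables(k_grid, x_grid, p_grid, t_grid, p, temp):
    """Abundance-weighted k-tables of all species at one grid cell and one bin.

    k_grid: (n_p, n_t, n_species, n_g) k-tables of the individual absorbers.
    x_grid: (n_p, n_t, n_species) mass mixing ratios.
    """
    kappa = interpolate_table(k_grid, p_grid, t_grid, p, temp)
    mmr = interpolate_table(x_grid, p_grid, t_grid, p, temp)
    return kappa * mmr[:, None]   # input for any of the mixing methods
```

The returned array has shape `(n_species, n_g)` and can be passed to any of the mixing methods of Sect. 2.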

The weighting in the adaptive equivalent extinction method induces the need for more scattering iterations. The k-table becomes dependent on the bolometric flux from the previous time-step, which effectively inserts a time-dependent perturbation into the opacities. The scattering source function is subject to these opacity perturbations as well, which renders its guess from the previous time-step less accurate. For performance reasons, we thus found that the weighted adaptive equivalent extinction method required us to limit the maximum number of scattering iterations per radiative time-step to two instead of 500. This is generally enough for planets similar to HD 209458 b with only Rayleigh scattering, since the source function is reused as the initial guess in the next radiative time-step (see Schneider et al., 2022b, for an explanation of scattering in expeRT/MITgcm) and is thereby naturally iterated on during model convergence. To be consistent across all models, we have thus chosen to limit the number of scattering iterations per radiative time-step to two, unless otherwise stated.

All models have been integrated for 2000 days with a radiative time-step of $100\,\mathrm{s}$ and a dynamical time-step of $25\,\mathrm{s}$, which are typical values for hot Jupiter GCMs (e.g., Showman et al., 2009; Lee et al., 2021; Schneider et al., 2022b). All models use equilibrium chemistry to constrain the abundances. In practice, the code interpolates the abundances on a grid of pressure and temperature. In Sect. 5, we show a model in which we use the DeepSet mixing of k-tables but remove all TiO and VO wherever they would reappear in the gas phase in equilibrium chemistry despite having condensed out deeper in the atmosphere. This method is similar to the methods of rainout described elsewhere (e.g., Lodders & Fegley, 2002; Marley et al., 2021), but less sophisticated compared to 3D models that include proper chemical transport schemes (e.g., Parmentier et al., 2013; Lee et al., 2023; Drummond et al., 2018). A more detailed description of the algorithm for the detection of rainout is outlined in Appendix D.

4 Results

In this work, we compare the mixing methods introduced in Sect. 2 to the slow RORR method. An additional comparison between the premixed method (PRE) used in our GCM, as introduced and used in Schneider et al. (2022b), and the RORR method can be found in Appendix C. Mixing opacities during runtime in a GCM requires a tradeoff between accuracy, performance, and flexibility. When comparing the accuracy of mixing methods, it is important to keep in mind that the low spectral resolution of the k-tables used in GCMs induces an error of a few percent on bolometric fluxes (e.g., Amundsen et al., 2014; Leconte, 2021; Schneider et al., 2022b). It is therefore pointless to aim for accuracies of less than one percent, since the overall error will be governed by the chosen spectral resolution. Comparing the individual mixing methods with each other thus needs to consider all of these perspectives. In order to have a fair comparison between the individual methods, we chose to compare all simulations in two aspects: The accuracy on the resulting atmospheric state and the computational time. We provide additional accuracy diagnostics in Appendix A, where we compare the $\kappa_{\mathrm{tot}}$ values obtained by different methods. We also diagnose fluxes and heating rates in Appendix E.

In order to qualitatively compare the mixing methods, we show the atmospheric state at two pressure layers ( $0.01\text{\,}\mathrm{bar}$ and $0.1\text{\,}\mathrm{bar}$ ) together with the zonal wind speed in Fig. 2. Aside from the adaptive equivalent extinction (AEE) model without flux weighting, all models look similar at first glance. The jet strength and day-night temperature contrasts are not significantly affected by these different methods. The unweighted AEE method, however, seems to produce a significantly higher day-night temperature contrast and a faster jet. These differences are more pronounced at lower pressure, but are still notable at the higher pressure of $0.1\text{\,}\mathrm{bar}$ .

Since most methods produce the same qualitative trend in winds and temperatures, it can be useful to calculate spatial temperature averages and compare those to the RORR models. For visual reasons, we have split these comparisons into two figures, where Fig. 3 compares the DeepSet (DS) and summation (ADD) method to the RORR method and Fig. 4 compares both the weighted and non-weighted adaptive equivalent extinction methods to the RORR method. It was surprising to see that the ADD method performs well, given its simplicity and methodological flaws. However, the temperature is often slightly cooler at higher pressures, which might be related to the underestimation of $\kappa_{\mathrm{tot}}$ at small $g$ and the overestimation at large $g$ (see Appendix A). The overestimation of $\kappa_{\mathrm{tot}}$ at large $g$ can lead to an enhancement of the absorption of the stellar flux in the upper layers, which cannot penetrate deep enough to cause heating in the deeper layers. These flaws of the ADD method do not seem to persist in the DS mixing, which uses the ADD method in its preprocessing (see Sect. 2.5). Instead, we find that the DS mixing performs very well.

Looking at the adaptive equivalent extinction method with and without weighting, we find that the AEE_we method is almost as accurate as the DS mixing. The weighting certainly helps to find a good estimate of the major absorber and drastically increases the accuracy of this approach. Looking at the residuals, we see a strong correlation between the error of the weighted and non-weighted method, which points to a general issue of the method instead of an issue with the major absorber. Unlike in the case of simply summing up k-tables, the AEE method tends not to overestimate $\kappa_{\mathrm{tot}}$ at large $g$ but instead to underestimate it. This might be explained by the minor absorbers, which flatten a k-table by offsetting $\kappa_{\mathrm{tot}}$ at small $g$ and decreasing the impact of the high $g$ values (see Appendix A). Similar to the overestimation of $\kappa_{\mathrm{tot}}$ at small $g$ values, the underestimation of $\kappa_{\mathrm{tot}}$ at large $g$ values also shifts the location at which irradiation is absorbed, in this case in the opposite direction, by leading to less absorption in the uppermost layers.

We thus conclude that both the ADD method and the AEE method, with and without weighting, introduce systematic errors in $\kappa_{\mathrm{tot}}$ at both small and large $g$ values, which can amplify the overall errors of these methods. In contrast, the DS method exhibits no systematic error (see Appendix A) but instead uniformly distributed random noise, which does not significantly affect the overall error in the temperature. Therefore, we do not recommend using the ADD method or the unweighted AEE method and instead recommend the use of the DS method.

Table 2: Runtime

Label	Time [h]	Relative to PRE
PRE	3.91	1.00
ADD	7.20	1.84
AEE	7.44	1.90
AEE_we	13.37	3.42
DS	10.88	2.78
RORR	25.12	6.43

Notes: Runtime of the GCM needed for the first 100 days of the simulation. The node used to run the model includes 2x Intel(R) Xeon(R) Gold 6248R CPU @ 3.00GHz, and we utilized all 48 cores.

In terms of computational cost, one needs to consider two general overheads during runtime compared to using a premixed grid. The first overhead comes from the handling of the individual k-tables, such as their interpolation, compared to the handling of just one premixed k-table. The second is the computation of the mixing itself. We show the computation time needed to run the initial 100 days of the simulation in Table 2. The performance of the summation (ADD) method, in which individual k-tables are simply added up, is mainly constrained by the handling of the individual k-tables, since the cost of the summation itself is negligible, whereas all the other methods are also subject to the computational cost of the mixing. Therefore, a significant fraction of the computational cost in the AEE, AEE_we, and DS simulations can be explained by the overhead of handling individual k-tables.

The high computational cost of RORR mixing makes it impractical to use during model runtime. Even with an optimized sorting algorithm, which is now the standard in petitRADTRANS and which significantly speeds up RORR, the RORR technique is still more than six times slower than the premixed case (see Table 2). One of the main reasons for the poor performance of RORR in our specific setup is the quadratic dependence of its computational cost on the number of $g$ values. GCMs that use eight $g$ values instead of 16 could therefore (at the cost of accuracy) achieve a faster RORR performance (for a discussion see Amundsen et al., 2017).
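To illustrate where the quadratic dependence comes from, the following is a minimal sketch of random overlap with resorting and rebinning for two species in a single frequency bin. It is not the petitRADTRANS or expeRT/MITgcm implementation: the function names, the Gauss-Legendre $g$ grid, the synthetic opacities, and the rebinning by log-linear interpolation are all assumptions made for this example.

```python
import numpy as np

def gauss_legendre_01(n_g):
    """Gauss-Legendre nodes and weights mapped from [-1, 1] to [0, 1].
    (Assumed g grid; a GCM may use a different quadrature.)"""
    x, w = np.polynomial.legendre.leggauss(n_g)
    return 0.5 * (x + 1.0), 0.5 * w

def rorr_mix_two(kappa_a, kappa_b, g, w):
    """Mix two abundance-weighted k-distributions in one frequency bin,
    assuming that their lines overlap randomly (hypothetical helper)."""
    # All pairwise combinations: N_g**2 opacities and weights.
    kappa_pairs = (kappa_a[:, None] + kappa_b[None, :]).ravel()
    w_pairs = (w[:, None] * w[None, :]).ravel()
    # Resort by opacity and build the cumulative distribution g(kappa),
    # evaluated at the midpoint of each pair's weight interval.
    order = np.argsort(kappa_pairs)
    kappa_sorted = kappa_pairs[order]
    g_sorted = np.cumsum(w_pairs[order]) - 0.5 * w_pairs[order]
    # Rebin onto the original g grid by interpolating log(kappa).
    return np.exp(np.interp(g, g_sorted, np.log(kappa_sorted)))

n_g = 16
g, w = gauss_legendre_01(n_g)
# Synthetic, monotonically increasing k-distributions (illustrative values).
kappa_a = 10.0 ** np.linspace(-4.0, 0.0, n_g)
kappa_b = 10.0 ** np.linspace(-3.0, -1.0, n_g)
kappa_mix = rorr_mix_two(kappa_a, kappa_b, g, w)
```

In this sketch, every additional species requires another pass over $N_g^2$ pairs including a sort, for every frequency bin and every grid cell, which is why halving the number of $g$ values reduces the cost of the pairwise step by roughly a factor of four.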

Due to the increased number of scattering iterations, the adaptive equivalent extinction with weighting (AEE_we) is the slowest of the approximate approaches (see Table 2). The poor convergence behavior of the AEE_we method makes it less reliable and less performant. This will become an even bigger issue when scattering is non-negligible, which could be the case at lower temperatures or if clouds and hazes are included. However, when limiting the maximum number of scattering iterations to two, as we have done here, it is fast enough to compete with the other methods, and if the weighting were neglected (AEE), it would be about as fast as the ADD method and faster than the DS mixing. Performance-wise, we thus think that any of the ADD, AEE (with and without flux weighting), or DS mixing approaches could be used in a setup that requires mixing during runtime.

Based on the discussions and findings above and in Appendices A and E, we conclude that the AEE_we or DS method should be used in a GCM for accurate results with good computational performance. However, in our multi-stream setup where we iterate over the source function, we do not recommend using the AEE_we method due to numerical issues with its convergence.

5 Rainout

Mixing during runtime is only relevant in GCMs if the chemical abundances are to deviate from chemical equilibrium. In most cases of chemical equilibrium, a sufficiently well resolved premixed table will be the fastest and most reliable method of choice. However, in the case of chemical disequilibrium, which arises if photochemistry or chemical kinetics are considered, we cannot simply premix k-tables, because the abundances of the relevant opacity species need to be changed during runtime. Another very simple scenario for such a situation could be the rainout of heavy refractory species such as TiO and VO. Due to their strong absorption in the UV, they absorb a significant fraction of the stellar flux at high altitude, thus heating the upper atmosphere significantly. Such strong heating in the upper atmosphere can lead to a thermal inversion, where the atmosphere becomes hotter toward the top (e.g., Showman et al., 2009). However, due to vertical mixing and advection, condensed TiO and VO could gravitationally settle and therefore not be available in the gas phase at higher altitudes (e.g., Parmentier et al., 2013).

In Fig. 5 we show a model in which TiO and VO are removed by a simple rainout prescription (see Appendix D for details), which removes all TiO and VO from the gas phase if they are condensed further down in the atmosphere. Using a premixed table, we would not be able to calculate the radiative effect of the rainout on the atmospheric structure. By mixing during runtime, however, we can trace the effect of the change in chemical abundances on the temperature. The lack of the strong UV absorbers TiO and VO in the upper layers means that the upper layers get cooler, because less of the stellar flux is absorbed in those layers, whereas the intermediate pressure layers, where the bulk of the stellar flux is absorbed instead, get warmer. As expected, we find that the thermal inversion caused by TiO and VO, as seen for example in Fig. 3, is self-consistently removed. The absence of the stratosphere on the day side affects the night side as well. This effect is noticeable at $0.1\text{\,}\mathrm{bar}$ . In the simulation with rainout, the absence of the stratosphere leads to warmer gas compared to the simulation without rainout. As a result, the superrotating jet transports the warmer gas to the night side, causing it to heat up.

6 Discussion and conclusion

The correlated-k method is a useful approximation for rapid radiative transfer calculations with accuracies of a few percent (e.g., Amundsen et al., 2014; Leconte, 2021; Schneider et al., 2022b) when used with resolutions typical for GCMs. We have demonstrated the performance and accuracy of several methods that could be used in GCMs to calculate the total opacity in the correlated-k assumption. We extended the work of Amundsen et al. (2017), who performed a similar analysis for the adaptive equivalent extinction (AEE) method. Furthermore, we have introduced two additional methods: the DeepSet (DS) mixing and a simple method in which k-tables are summed up (ADD). Whereas the work of Amundsen et al. (2017) only considered the accuracy of heating rates and fluxes, we incorporated the RORR, ADD, DS, and AEE mixing methods into our GCM to demonstrate the performance in a real application. The DeepSet method, which leverages machine learning to calculate k-tables of gas mixtures, turns out to be fairly accurate and flexible. Overall, we find that

1. The random overlap with resorting and rebinning (RORR) method is too slow to be used in GCMs for mixing during model runtime.

2. The AEE method requires a proper weighting to be accurate. Such a weighting, however, affects the numerical stability of the radiative transfer calculation, which will be especially important if scattering is non-negligible.

3. The ADD method and the AEE method are prone to systematic errors. This is especially problematic for the unweighted AEE and the ADD method, rendering a use of these methods questionable.

4. The DS method has minor statistical errors on fluxes and heating rates that do not seem to enhance the overall error, which seems to be an advantage of the DS method.

The DS mixing method provided by this work is accurate and open source (https://github.com/AaronDavidSchneider/opacmixer), and can be easily implemented in any radiative transfer package with no need to use complex libraries, as it only requires two matrix multiplications. Once trained, the network can perform on any composition and any set of opacity species. Although not strictly needed, we recommend training the network for a specific frequency resolution to maximize the accuracy. The amount of training data needed is small, and training can be performed within minutes on a standard personal computer. The provided open source package currently works with binned-down petitRADTRANS-format k-tables, but can be easily extended to load any k-table format, and documentation is provided for how to achieve this.

The methods in this work have been tested in terms of their accuracy on the atmospheric structure, and we think that they will be key when moving toward self-consistent transport of chemical species in the atmospheres of planets. We note, however, that these methods do not translate to models that predict spectra. Future work is thus needed to test whether similar methods could also be used for atmospheric retrievals.

By implementing a simple chemical rainout procedure, we mimic the gravitational settling of TiO and VO to demonstrate the DS method in a use case of disequilibrium chemistry. By accounting for rainout in this way, we find that TiO and VO can be trapped in the deeper atmosphere, thus hindering the formation of a stratosphere. We note that this approach is fairly simplified, and hope that our work enables future models to treat cold trapping self-consistently.

Acknowledgements.

A.D.S., L.D., U.G.J. and C.H. acknowledge funding from the European Union H2020-MSCA-ITN-2019 under Grant no. 860470 (CHAMELEON). U.G.J. acknowledges funding from the Novo Nordisk Foundation Interdisciplinary Synergy Program grant no. NNF19OC0057374. The bibliography of this publication has been typeset using bibmanager (Cubillos, 2020; https://bibmanager.readthedocs.io/en/latest/). The post-processing of GCM data has been performed with gcm-toolkit (Schneider et al., 2022; https://gcm-toolkit.readthedocs.io/en/latest/). We would like to express our gratitude to the referee, Joanna Barstow, for her valuable comments that enhanced the quality of the manuscript.

References

Adcroft et al. (2004) Adcroft, A., Campin, J.-M., Hill, C., & Marshall, J. 2004, Monthly Weather Review, 132, 2845
Adcroft et al. (1997) Adcroft, A., Hill, C., & Marshall, J. 1997, Monthly Weather Review, 125, 2293
Amundsen et al. (2014) Amundsen, D. S., Baraffe, I., Tremblin, P., et al. 2014, A&A, 564, A59
Amundsen et al. (2016) Amundsen, D. S., Mayne, N. J., Baraffe, I., et al. 2016, A&A, 595, A36
Amundsen et al. (2017) Amundsen, D. S., Tremblin, P., Manners, J., Baraffe, I., & Mayne, N. J. 2017, A&A, 598, A97
Azzam et al. (2016) Azzam, A. A. A., Tennyson, J., Yurchenko, S. N., & Naumenko, O. V. 2016, MNRAS, 460, 4063
Barber et al. (2014) Barber, R. J., Strange, J. K., Hill, C., et al. 2014, MNRAS, 437, 1828
Bergstra et al. (2012) Bergstra, J., Yamins, D., & Cox, D. D. 2012, arXiv e-prints, arXiv:1209.5111
Carone et al. (2020) Carone, L., Baeyens, R., Mollière, P., et al. 2020, MNRAS, 496, 3582
Chen & Guestrin (2016) Chen, T. & Guestrin, C. 2016, arXiv e-prints, arXiv:1603.02754
Chollet et al. (2015) Chollet, F. et al. 2015, Keras, https://keras.io
Coles et al. (2019) Coles, P. A., Yurchenko, S. N., & Tennyson, J. 2019, MNRAS, 490, 4638
Cubillos (2020) Cubillos, P. E. 2020, bibmanager: A BibTeX manager for LaTeX projects, Zenodo
Deitrick et al. (2022) Deitrick, R., Heng, K., Schroffenegger, U., et al. 2022, MNRAS, 512, 3759
Drummond et al. (2020) Drummond, B., Hébrard, E., Mayne, N. J., et al. 2020, A&A, 636, A68
Drummond et al. (2018) Drummond, B., Mayne, N. J., Manners, J., et al. 2018, ApJ, 855, L31
Edwards & Slingo (1996) Edwards, J. M. & Slingo, A. 1996, Quarterly Journal of the Royal Meteorological Society, 122, 689
Ehrenreich et al. (2020) Ehrenreich, D., Lovis, C., Allart, R., et al. 2020, Nature, 580, 597
Feautrier (1964) Feautrier, P. 1964, Comptes Rendus Academie des Sciences (serie non specifiee), 258, 3189
Goody et al. (1989) Goody, R., West, R., Chen, L., & Crisp, D. 1989, J. Quant. Spec. Radiat. Transf., 42, 539
Gustafsson et al. (1975) Gustafsson, B., Bell, R. A., Eriksson, K., & Nordlund, A. 1975, A&A, 42, 407
Gustafsson & Jorgensen (1994) Gustafsson, B. & Jorgensen, U. G. 1994, A&A Rev., 6, 19
Helling & Jorgensen (1998) Helling, C. & Jorgensen, U. G. 1998, A&A, 337, 477
Jorgensen (1992) Jorgensen, U. G. 1992, Rev. Mexicana Astron. Astrofis., 23, 195
Kesseli & Snellen (2021) Kesseli, A. Y. & Snellen, I. A. G. 2021, ApJ, 908, L17
Kesseli et al. (2022) Kesseli, A. Y., Snellen, I. A. G., Casasayas-Barris, N., Mollière, P., & Sánchez-López, A. 2022, AJ, 163, 107
Kingma & Ba (2014) Kingma, D. P. & Ba, J. 2014, arXiv e-prints, arXiv:1412.6980
Lacis & Oinas (1991) Lacis, A. A. & Oinas, V. 1991, J. Geophys. Res., 96, 9027
Leconte (2021) Leconte, J. 2021, A&A, 645, A20
Lee et al. (2021) Lee, E. K. H., Parmentier, V., Hammond, M., et al. 2021, MNRAS, 506, 2695
Lee et al. (2023) Lee, E. K. H., Tsai, S.-M., Hammond, M., & Tan, X. 2023, A&A, 672, A110
Li et al. (2015) Li, G., Gordon, I. E., Rothman, L. S., et al. 2015, ApJS, 216, 15
Lodders & Fegley (2002) Lodders, K. & Fegley, B. 2002, Icarus, 155, 393
Marley et al. (2021) Marley, M. S., Saumon, D., Visscher, C., et al. 2021, ApJ, 920, 85
McKemmish et al. (2019) McKemmish, L. K., Masseron, T., Hoeijmakers, H. J., et al. 2019, MNRAS, 488, 2836
McKemmish et al. (2016) McKemmish, L. K., Yurchenko, S. N., & Tennyson, J. 2016, MNRAS, 463, 771
Mollière et al. (2020) Mollière, P., Stolker, T., Lacour, S., et al. 2020, A&A, 640, A131
Mollière et al. (2017) Mollière, P., van Boekel, R., Bouwman, J., et al. 2017, A&A, 600, A10
Mollière et al. (2015) Mollière, P., van Boekel, R., Dullemond, C., Henning, T., & Mordasini, C. 2015, ApJ, 813, 47
Mollière et al. (2019) Mollière, P., Wardenier, J. P., van Boekel, R., et al. 2019, A&A, 627, A67
Mukherjee et al. (2023) Mukherjee, S., Batalha, N. E., Fortney, J. J., & Marley, M. S. 2023, ApJ, 942, 71
Ng (1974) Ng, K. C. 1974, J. Chem. Phys., 61, 2680
Olson et al. (1986) Olson, G. L., Auer, L. H., & Buchler, J. R. 1986, J. Quant. Spec. Radiat. Transf., 35, 431
Parmentier et al. (2013) Parmentier, V., Showman, A. P., & Lian, Y. 2013, A&A, 558, A91
Piskunov et al. (1995) Piskunov, N. E., Kupka, F., Ryabchikova, T. A., Weiss, W. W., & Jeffery, C. S. 1995, A&AS, 112, 525
Polyansky et al. (2018) Polyansky, O. L., Kyuberis, A. A., Zobov, N. F., et al. 2018, MNRAS, 480, 2597
Ronneberger et al. (2015) Ronneberger, O., Fischer, P., & Brox, T. 2015, arXiv e-prints, arXiv:1505.04597
Sainsbury-Martinez et al. (2023) Sainsbury-Martinez, F., Tremblin, P., Schneider, A. D., et al. 2023, MNRAS, 524, 1316
Saxner & Gustafsson (1984) Saxner, M. & Gustafsson, B. 1984, A&A, 140, 334
Schneider et al. (2022) Schneider, A. D., Baeyens, R., & Kiefer, S. 2022, gcm_toolkit, tools for GCM postprocessing
Schneider et al. (2022a) Schneider, A. D., Carone, L., Decin, L., Jørgensen, U. G., & Helling, C. 2022a, A&A, 666, L11
Schneider et al. (2022b) Schneider, A. D., Carone, L., Decin, L., et al. 2022b, A&A, 664, A56
Showman et al. (2009) Showman, A. P., Fortney, J. J., Lian, Y., et al. 2009, ApJ, 699, 564
Showman et al. (2020) Showman, A. P., Tan, X., & Parmentier, V. 2020, Space Sci. Rev., 216, 139
Sousa-Silva et al. (2015) Sousa-Silva, C., Al-Refaie, A. F., Tennyson, J., & Yurchenko, S. N. 2015, MNRAS, 446, 2337
Tremblin et al. (2015) Tremblin, P., Amundsen, D. S., Mourier, P., et al. 2015, ApJ, 804, L17
Wende et al. (2010) Wende, S., Reiners, A., Seifahrt, A., & Bernath, P. F. 2010, A&A, 523, A58
Yurchenko et al. (2017) Yurchenko, S. N., Amundsen, D. S., Tennyson, J., & Waldmann, I. P. 2017, A&A, 605, A95
Yurchenko et al. (2020) Yurchenko, S. N., Mellor, T. M., Freedman, R. S., & Tennyson, J. 2020, MNRAS, 496, 5282
Zaheer et al. (2017) Zaheer, M., Kottur, S., Ravanbakhsh, S., et al. 2017, arXiv e-prints, arXiv:1703.06114
Zamyatina et al. (2023) Zamyatina, M., Hébrard, E., Drummond, B., et al. 2023, MNRAS, 519, 3129

Appendix A Accuracy on individual $g$ -values

In Sect. 2.4, we mentioned that summing up averaged k-tables to the k-table of the major absorber in the AEE method leads to underestimation of high $g$ values and overestimation of small $g$ values. In contrast, as discussed in Sect. 2.3, the ADD method underestimates small $g$ values and may overestimate large $g$ values. On the other hand, the DS method fits the RORR method using mean squared loss, without favoring over- or underestimation of any $g$ value.

To test these methods for systematic errors, we generate mixed k-tables using the counterpart of the training set from Sect. 2.5. These k-tables are then compared to the results from the RORR method. The comparison is shown in Fig. 6. We note that the AEE method (both weighted and non-weighted) necessitates a 1D column to identify the maximum absorber (see Sect. 2.4). However, as our test is 0D, we were unable to implement this here. Instead, we determine the primary absorber in each set by selecting the highest value from the 14 k-tables.

In Fig. 6, we observe that the resulting $\kappa_{\mathrm{tot}}$ values from the DS method are symmetrically distributed around the values predicted from the RORR method, indicating a statistically random error. However, this is not the case for the AEE and ADD methods. More specifically, we find that the results for the AEE method indeed lead to a significant overestimation of $\kappa_{\mathrm{tot}}$ at small $g$ values and a clear underestimation of $\kappa_{\mathrm{tot}}$ at large $g$ values. However, we note again, that this trend would be less dramatic if the major absorber is picked wisely and if the averaged kappa values are determined from a proper weighting. It is thus vital to use the AEE method with a proper weighting, even though this might result in numerical challenges. Similarly, we see that indeed the ADD method leads to a significant underestimation of $\kappa_{\mathrm{tot}}$ at small $g$ values. The overestimation at large $g$ values, on the other hand, is very subtle and can only be seen by looking closely at the bottom left panel.
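The opposite trends of the ADD and AEE methods can be made explicit with a small numerical sketch. It assumes an unweighted AEE variant in which the major absorber is the species with the largest $g$-averaged opacity, a Gauss-Legendre $g$ grid, and synthetic k-tables; the function names and all numbers are hypothetical and serve only as an illustration.

```python
import numpy as np

# Quadrature weights for g in [0, 1]; they sum to one (assumed g grid).
_, w = np.polynomial.legendre.leggauss(16)
w = 0.5 * w

def add_mix(kappas):
    """ADD: sum the k-tables g value by g value; kappas has shape (N_s, N_g)."""
    return kappas.sum(axis=0)

def aee_mix(kappas, w):
    """Unweighted AEE sketch: keep the g dependence of the major absorber
    and add the g-averaged (grey) opacity of every minor absorber."""
    kappa_mean = kappas @ w           # grey average of each species
    major = np.argmax(kappa_mean)     # assumption: largest mean opacity
    minor_grey = kappa_mean.sum() - kappa_mean[major]
    return kappas[major] + minor_grey

# Three synthetic, monotonically increasing k-distributions.
kappas = np.array([
    10.0 ** np.linspace(-4.0, 0.0, 16),
    10.0 ** np.linspace(-3.0, -1.0, 16),
    10.0 ** np.linspace(-5.0, -2.0, 16),
])
kappa_add = add_mix(kappas)
kappa_aee = aee_mix(kappas, w)
```

Because each minor absorber is replaced by its mean, `kappa_aee` lies above `kappa_add` at the smallest $g$ value and below it at the largest, while both conserve the $g$-averaged opacity. This is consistent with the flattening of the k-table by minor absorbers described above; the comparison with RORR itself is the one shown in Fig. 6.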

Appendix B Numerical considerations for the DeepSet implementation

One of the advantages of the DeepSet is its simplicity, which allows an easy naive implementation, since it only requires two matrix multiplication operations and one summation. There are several considerations regarding the performance of this approach. The first consideration is the use of the logarithm and, to reverse back to k-tables, the use of the exponential function. Both of these operations are unfortunately quite slow, although they are needed, as described in Sect. 2.5. Profiling the FORTRAN code used in the GCM shows that approximately half of the CPU time is spent on calculating the logarithm for the input scaling (the input scaling needs to be calculated $N_{s}$ times more often than the reverse output scaling, see Fig. 1). The other half of the CPU time is spent on the matrix multiplication, which can be optimized using hardware-dependent compiler optimizations. For our architecture (using ifort on an Intel CPU), we found that the built-in MATMUL delivers the best performance compared to MKL DGEMM, OpenBLAS DGEMM, and a manual implementation. The reason for the missing performance increase from the highly optimized DGEMM can most likely be found in the overhead of calling DGEMM compared to the small size of the matrix that is to be multiplied ( $16\times 16$ ). We note that faster and more efficient implementations could also be achieved by stacking the computations and deploying them to a GPU.
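The operation count described above (logarithm, two matrix multiplications, one summation, exponential) can be sketched as follows. This is only a schematic of a DeepSet-style forward pass and not the opacmixer network: the weight matrices are random placeholders rather than trained values, and any activation functions, biases, or preprocessing steps of the actual method (Sect. 2.5) are omitted because they are not specified here. The function name and array shapes are assumptions.

```python
import numpy as np

def deepset_mix(kappas, w_enc, w_dec):
    """Permutation-invariant mixing of per-species k-tables in one bin.

    kappas: (N_s, N_g) abundance-weighted opacities, all positive.
    w_enc, w_dec: (N_g, N_g) weight matrices (hypothetical placeholders).
    """
    x = np.log(kappas)            # input scaling: N_s * N_g logarithms
    encoded = x @ w_enc           # first matrix multiplication
    pooled = encoded.sum(axis=0)  # summation over the set of species
    y = pooled @ w_dec            # second matrix multiplication
    return np.exp(y)              # reverse output scaling: N_g exponentials

rng = np.random.default_rng(0)
n_s, n_g = 14, 16
kappas = np.sort(rng.lognormal(mean=-5.0, sigma=2.0, size=(n_s, n_g)), axis=1)
w_enc = rng.normal(scale=0.1, size=(n_g, n_g))  # untrained, illustrative
w_dec = rng.normal(scale=0.1, size=(n_g, n_g))  # untrained, illustrative

kappa_tot = deepset_mix(kappas, w_enc, w_dec)
# Reordering the species must not change the result.
kappa_perm = deepset_mix(kappas[::-1], w_enc, w_dec)
```

The sketch makes the cost structure visible: the logarithm is evaluated for every species, whereas the exponential is evaluated only once per $g$ value, and the summation over species makes the result independent of the order in which the k-tables are supplied.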

Appendix C Benchmarking RORR against PRE

In this work, we benchmarked several approximate k-table mixing methods to the RORR method (see Sect. 2.1). As discussed in Sect. 2.2, if the atmosphere is in chemical equilibrium, the abundances can be directly constrained as a function of pressure and temperature. It is then possible to use these abundances to create a grid of k-tables using the RORR method. This step can be done as a preprocessing step, so that the grid can be used as a lookup table to interpolate on during runtime. As mentioned in Amundsen et al. (2017), the accuracy of this approach will be greatly dependent on the resolution of the grid. We match the pressure coordinates with our premixed tables in expeRT/MITgcm, removing the need for interpolation in pressure and thereby resulting in a higher accuracy and faster runtime. We then use 1000 temperature grid points in this work and in Schneider et al. (2022b) to further maximize the accuracy.

In Fig. 7 we show the temperature-pressure profiles of the RORR simulation compared to the premixed (PRE) simulation. We find that both methods result in temperature profiles that overlap very well, with an error of less than 0.5%. Premixed tables will therefore remain the method of choice for setups in which equilibrium chemistry can be assumed, given their much faster performance (see Table 2).

Appendix D A simple algorithm to calculate rainout

Algorithm 1 Procedure that takes in the pressure layers $p$ (sorted from bottom to top) and mass mixing ratios $\eta$ of an opacity species and returns the altered mass mixing ratios, under consideration of rainout.

procedure do_rainout($p,\eta$)
    $\tilde{\eta}\leftarrow\text{copy}(\eta)$
    $\text{species\_appearing}\leftarrow\text{False}$
    $\text{species\_cond}\leftarrow\text{False}$
    for $i\leftarrow 2$ to $\text{length}(p)-1$ do  $\triangleright$ Loop from bottom to top
        $\text{grad}\leftarrow-0.5\times\left(\frac{\eta_{i-1}-\eta_{i}}{p_{i-1}-p_{i}}+\frac{\eta_{i}-\eta_{i+1}}{p_{i}-p_{i+1}}\right)$
        if $(\text{grad}>0)$ then  $\triangleright$ abundance increases
            $\text{species\_appearing}\leftarrow\text{True}$
        end if
        if $(\text{species\_appearing})$ and $(\text{grad}<0)$ then  $\triangleright$ abundance decreases again
            $\text{species\_cond}\leftarrow\text{True}$
        end if
        if $(\text{species\_cond})$ and $(\tilde{\eta}_{i-1}<\tilde{\eta}_{i})$ then  $\triangleright$ Do the rainout
            $\tilde{\eta}_{i}\leftarrow\tilde{\eta}_{i-1}$
        end if
    end for
    if $(\text{species\_cond})$ and $(\tilde{\eta}_{-2}<\tilde{\eta}_{-1})$ then  $\triangleright$ Treat the boundary
        $\tilde{\eta}_{-1}\leftarrow\tilde{\eta}_{-2}$
    end if
    return $\tilde{\eta}$
end procedure

In this work, we consider the rainout of TiO and VO in order to demonstrate a possible use case of the DS method (Sect. 5). We consider a species rained out if, going upward from the bottom of the computational domain, the species has, in local chemical equilibrium, first become available in the gas phase and has then disappeared further up in the atmosphere. If the species were to become more abundant in the gas phase again in local chemical equilibrium above that point, we do not allow this and instead keep the abundance fixed to that of the layer below. These calculations are in practice performed by looking at the gradient of the mass mixing ratios, and we outline the algorithm used in this work in Algorithm 1. We would like to note that the algorithm presented here is likely oversimplified. Nevertheless, it provides mass mixing ratios that deviate from chemical equilibrium, and thus serves for demonstration purposes.
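For illustration, Algorithm 1 can be translated into Python as sketched below. This is not the FORTRAN routine used in the GCM: the zero-based indexing, the NumPy types, and the synthetic example profile are assumptions of this sketch.

```python
import numpy as np

def do_rainout(p, eta):
    """Return mass mixing ratios altered by rainout (sketch of Algorithm 1).

    p:   pressure layers, sorted from bottom (high p) to top (low p)
    eta: equilibrium mass mixing ratios of one opacity species
    """
    eta_new = np.array(eta, dtype=float, copy=True)
    species_appearing = False
    species_cond = False
    for i in range(1, len(p) - 1):  # interior layers, bottom to top
        # Mean of the two one-sided gradients; the minus sign makes
        # grad > 0 mean "abundance increases with height".
        grad = -0.5 * ((eta[i - 1] - eta[i]) / (p[i - 1] - p[i])
                       + (eta[i] - eta[i + 1]) / (p[i] - p[i + 1]))
        if grad > 0:                        # abundance increases
            species_appearing = True
        if species_appearing and grad < 0:  # abundance decreases again
            species_cond = True
        if species_cond and eta_new[i - 1] < eta_new[i]:
            eta_new[i] = eta_new[i - 1]     # do the rainout
    if species_cond and eta_new[-2] < eta_new[-1]:
        eta_new[-1] = eta_new[-2]           # treat the top boundary
    return eta_new

# Synthetic profile (illustrative values): the species appears, condenses,
# and would reappear near the top in local chemical equilibrium.
p = np.array([100.0, 10.0, 1.0, 0.1, 0.01, 0.001])  # bar, bottom to top
eta = np.array([1e-12, 1e-7, 1e-7, 1e-12, 1e-7, 1e-7])
eta_rain = do_rainout(p, eta)
# eta_rain is [1e-12, 1e-7, 1e-7, 1e-12, 1e-12, 1e-12]
```

In this example, the two uppermost layers are capped at the depleted value of the condensation layer, so the species does not reappear above its cold trap. A profile that never decreases with height is returned unchanged.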

Appendix E Accuracy of fluxes and heating rates

The thermal forcing in the GCM is given by the thermodynamic heating rate $H$ , which is calculated as (e.g., Amundsen et al. 2014; Showman et al. 2009)

$$H=-\frac{\mathrm{d}F}{\mathrm{d}z}=\frac{gp}{R_{s}T}\frac{\mathrm{d}F}{\mathrm{d}p}, \tag{14}$$

where $F$ is the net flux integrated over frequency, consisting of the bolometric stellar and planetary fluxes, and $g$ , $p$ , $z$ , $T$ , and $R_{s}$ are the surface gravity, pressure, geometric height, temperature, and specific gas constant, respectively. For a deeper understanding of the accuracy of the different approaches, we compare the fluxes and heating rates of the individual methods to those of the RORR method. Since all the simulations lead to different atmospheric states, we opted to compare the fluxes and heating rates in a separate set of models. These models start from the final output of the RORR models at $2000\text{\,}\mathrm{d}$ and run for 10 radiative time-steps (a total of 1000 seconds). Unlike in the 2000-day-long simulations mentioned above, we did not limit the number of scattering iterations in these extra runs. Using a single radiative time-step for the comparison is insufficient, since the AEE_we method needs the flux from the previous time-step. The advantage of this approach is that the temperature profiles are identical in these models. The comparison of fluxes and heating rates is thus fairer. We show the fluxes and heating rates for the substellar point in Figs. 8 and 9, respectively, and those of the antistellar point in Figs. 10 and 11, respectively.
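As a minimal sketch of how the right-hand side of the heating rate equation can be evaluated on a discrete pressure grid, the example below uses a finite-difference derivative. The function name, the isothermal test profile, and all numerical values are illustrative assumptions and are not taken from the GCM.

```python
import numpy as np

def heating_rate(flux, p, temperature, gravity, r_specific):
    """Evaluate H = g p / (R_s T) dF/dp with finite differences.

    With SI inputs (flux in W m^-2, p in Pa, T in K, g in m s^-2,
    R_s in J kg^-1 K^-1), H is a heating rate per unit volume in W m^-3.
    """
    dflux_dp = np.gradient(flux, p)  # handles non-uniform pressure grids
    return gravity * p / (r_specific * temperature) * dflux_dp

# Illustrative test case: isothermal column with a net flux linear in p,
# so that dF/dp is constant and H is known analytically.
p = np.logspace(2.0, 6.0, 40)          # Pa
temperature = np.full_like(p, 1500.0)  # K
slope = 1.0e-2                         # W m^-2 Pa^-1
flux = slope * p
H = heating_rate(flux, p, temperature, gravity=10.0, r_specific=3600.0)
H_exact = 10.0 * p * slope / (3600.0 * 1500.0)
```

The grid may be ordered from top to bottom or from bottom to top, as long as it is monotonic. Because the derivative of the flux enters directly, small errors in the flux profile that vary with pressure translate into comparatively large errors in the heating rate.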

Looking at the day side (substellar point), we can see that AEE_we and DS result in equally accurate heating rates and fluxes, followed by the ADD simulation, which also results in reasonable fluxes and heating rates. On the other hand, the non-weighted AEE simulations result in completely wrong fluxes and thus heating rates. The total flux at the substellar point is dominated by the stellar flux. We can see that the AEE method results in larger negative values of the stellar flux, again hinting at too little absorption in the upper parts of the atmosphere, as discussed in Sect. 4. Conversely, due to the overestimation of $\kappa_{\mathrm{tot}}$ at small $g$ , almost all radiation is absorbed within a narrow pressure range. While these trends can also be seen in the weighted AEE method, they seem to cause much less of an error if the major absorber is picked wisely.

Overall, considering the substellar and antistellar fluxes and heating rates, we find that the DS and AEE_we methods yield similarly accurate results. One reason why the DS method nevertheless yields a better overall accuracy on the final atmospheric state might be the systematic error of the AEE method, which enhances $\kappa_{\mathrm{tot}}$ at small $g$ and decreases $\kappa_{\mathrm{tot}}$ at large $g$ . This systematic error is not found in the DS method, which instead exhibits random noise on the different $g$ values in each frequency bin (see Appendix A for a discussion). These diagnostics reinforce that adaptive equivalent extinction should only be used together with a properly weighted average opacity.

Harnessing machine learning for accurate treatment of overlapping opacity species in general circulation models
