1.4pt
Discovering Nuclear Models from Symbolic Machine Learning
Abstract
Numerous phenomenological nuclear models have been proposed to describe specific observables within different regions of the nuclear chart. However, develo** a unified model that describes the complex behavior of all nuclei remains an open challenge. Here, we explore whether novel symbolic Machine Learning (ML) can rediscover traditional nuclear physics models or identify alternatives with improved simplicity, fidelity, and predictive power. To address this challenge, we developed a Multi-objective Iterated Symbolic Regression approach that handles symbolic regressions over multiple target observables, accounts for experimental uncertainties and is robust against high-dimensional problems. As a proof of principle, we applied this method to describe the nuclear binding energies and charge radii of light and medium mass nuclei. Our approach identified simple analytical relationships based on the number of protons and neutrons, providing interpretable models with precision comparable to state-of-the-art nuclear models. Additionally, we integrated this ML-discovered model with an existing complementary model to estimate the limits of nuclear stability. These results highlight the potential of symbolic ML to develop accurate nuclear models and guide our description of complex many-body problems.
Introduction
The atomic nucleus, a strongly correlated quantum many-body system, exhibits a vast array of phenomena that cannot be explained by a single, unified theory. This is mainly because the underlying theory of the nuclear force, quantum chromodynamics (QCD), is non-perturbative at low energies [1]. Despite the exciting progress that has been made in develo** a comprehensive characterization of the nuclear force and powerful many-body methods, a theoretical description of all the nuclei, with a direct link to QCD, remains a major unsolved challenge [2]. Consequently, physicists have developed a myriad of models, each designed to address specific aspects of nuclear structure and reactions. These models range from the simple liquid drop model to sophisticated many-body calculations, which are continuously refined to reproduce available nuclear data, such as binding energies, nuclear radii, electromagnetic moments, excitation energies, and decay properties [3, 4, 5, 6, 7, 8, 9]. Thus, one hopes that improved models can provide an interpretation of the observed phenomena and predict the behavior of nuclei at the extremes of the nuclear chart, where data is not yet available.
Recent advancements in Machine Learning (ML) have emerged as a pragmatic tool to tackle this challenge, with the expectation of fitting models that could reproduce the vast amount of available nuclear properties [10, 11, 12, 13, 14, 7, 15]. Nonetheless, this approach results in black boxes with obscure physical interpretability [16, 17, 18], and often its use is limited to specialized users who can have access to and run specific software. Moreover, conventional ML models rely on extensive experimental data or high-fidelity emulators, which may not always be available and are very prone to over-fitting, drastically limiting their extrapolation power [19, 20].
An alternative approach is given by symbolic ML, which aims to provide analytical expressions to describe a given observable [21]. Intriguingly, this methodology is remarkably similar to how nuclear physicists have traditionally tackled the problem of describing complex nuclear phenomena. Unlike traditional regression techniques that fit data to a pre-specified model, such as linear or polynomial functions, modern symbolic regression approaches do not assume any particular prior model. In contrast, it changes the existing functional forms in an evolutive manner [22, 23, 24]. Because of this, discovering complex equations is now conceivable, something that could not be accomplished by evaluating every possible mathematical expression by brute force [25, 26]. These methods have been widely successfully applied in scientific research, including Cosmology [27, 28], Astronomy [29, 30], Materials science [31], Chemistry [32], Dynamics [33], and Particle physics [34, 35]. However, applying existing methods to describe nuclear physics observables poses several challenges, as most of them have not been developed to solve multi-objective problems and/or do not consider experimental uncertainties.
In this work, we explore the natural question of whether symbolic ML approaches, with minimal human bias, can rediscover traditional nuclear physics models or propose alternative ones with similar or better simplicity, fidelity, and predictive power. To achieve these goals, we developed a Symbolic Regression technique that can systematically handle regressions over a set of targets, considers experimental uncertainties, and is robust to high-dimensional problems.
Our approach achieves three key results. First, we provide an interpretable model that discovers relatively simple analytical relationships between the number of nucleons and nuclear properties such as the charge radius and binding energy. Second, we use ML techniques to estimate these nuclear observables beyond our current experimental knowledge. Finally, an improved model, with quantifiable uncertainties, is used to provide predictions of the limits of stability of both neutron-deficient and neutron-rich nuclei.
Symbolic Regression for Nuclear Observables
We developed a Multi-objective Iterated Symbolic Regression (MISR) to iteratively search for analytical models that best describe a set of target variables up to arbitrary accuracy, leading to an expansion-like expression. We build on top of symbolic regression methods [36] to adopt multi-objective capabilities, which allows us to fit the description of a set of diverse target variables, considering the associated experimental uncertainty for each variable. A diagrammatic representation of the MISR approach is shown in Fig. 1. It is worth noting the MISR is in principle agnostic to the SR tool to be employed, as it only constitutes a block of the pipeline. As illustrated in the figure, the main variable, regressed via the standard symbolic regression, is then used to trace over the Pareto-optimal expressions for minimizing the error over the auxiliary variables in the several random sub-samples of the data (known as -folds). This process is then repeated using boosting to fit the residuals of the current analytical model until we reach a stop** criteria [37, 38]. Part of these advances were motivated by the application of multi-objective approaches for dynamic modeling [39].
Unlike traditional regression techniques that offer a static view of feature relevance, MISR continuously re-evaluates in each boosting step the feature importance via both Boosted Decision trees and Mutual Information Regression [37, 40]. This allows each one of the obtained terms to represent a particular behavior within the modeled phenomena. This process not only enhances the predictive power but also streamlines the model by eliminating redundant or less significant features.
One of the critical challenges in ML approaches is the quantification of uncertainty in predictive modeling, especially given the often limited and low-precision experimental data that exists for certain nuclear properties. To achieve an estimation of uncertainty for the MISR model, we resample different training datasets using jackknife bootstrap** [41] to link each one of the model’s predictions to a probability distribution, from which we sample the posteriors. In addition, we include a truncation uncertainty, for a specific cutoff, given by the absolute value of the subsequent expansion term. This methodology was inspired by an analogy to how effective theories are developed [42].
This iterative refinement allows us to express the target data as a sum of the resultant analytical forms , and estimate the uncertainty of the model. The MISR can thus be expressed as
(1) |
where and are the input variables, representing the number of neutrons and protons in the nuclei, respectively. is a Gaussian distribution numerically fitted after the training process using the discriminative jackknife technique [41].
We define the nuclear mass as , isospin asymmetry, , and the Casten factor , where and represent the valence protons and neutrons filling pre-defined shell-model orbits, i.e. the difference between the number of nucleons and the closest magic number [43]. The model was trained using the input variables . More details of the methodology can be found in the section Methods I.1.
As a proof-of-principle of our approach, we focus on the study of nuclear binding energies and charge radii. However, our proposed method can be generally applied to describe other nuclear observables, with an arbitrary set of input variables. Binding energy, the energy required to disassemble a nucleus into its constituent protons and neutrons, provides important information to guide our understanding of atomic nuclei. This observable is arguably one of the most studied nuclear observables, with a relatively large collection of experimental data available. To train our algorithm, we used a subset of the experimental data of binding energies reported in the AME2020 mass evaluation dataset [44, 45, 46], and experimental charge radii reported in [47]. For the charge radii —characterizing the charge distributions inside the nuclei— there is a more scarce set of experimental measurements [48]. This is particularly challenging for light isotopes, which, combined with the large variations in their radii, motivated us to focus on nuclei with , for which reliable experimental data exists. From these data sets, we sampled uniformly of the nuclei for testing purposes. The cutoff on light and medium mass nuclei allows us to study the extrapolation capabilities of our algorithm and provide a test of the model’s ability to deduce complex nuclear properties from relatively small data sets. This demonstrates the broader applicability of our MIRS method, which can describe physical observables where data scarcity is common.
Results for nuclear binding energies
The study of nuclear binding energies has led to the proposal of several analytical formulas. One of the simplest, the Liquid Drop Model (LDM) [49], represents the nucleus as a liquid drop to explain its global properties as a function of and :
(2) |
where , , , and are the volume, surface, Coulomb, and asymmetry coefficients, respectively, and represents the pairing term between neutrons and proton computed as
(3) |
where is another fitted constant [49]. More complex empirical models such as the Finite Range Droplet Model (FRDM) [50], and the Duflo-Zuker (DZ) [51], attempt to incorporate microscopic corrections to these macroscopic trends, refining the reproduction of nuclear binding energies across the chart of nuclides. The DZ model integrates both macroscopic and microscopic aspects of nuclear structure, featuring a wide array of terms to capture various models of nuclear interactions and nucleon configurations. The DZ model is not represented by a concise formula, but rather as a computational tool. This model is relatively complex and includes several terms to numerically account for nuclear shell effects, monopole Hamiltonian contributions, deformation energies, symmetry, and Coulomb energies.
State-of-the-art nuclear models, such as Density Functional Theory (DFT) [52] and ab initio calculations [53], follow a very different approach. These models aim to provide a microscopic description of nuclei, starting from inter-nucleon interactions and solving numerically the nuclear quantum many-body problem. DFT, in particular, has been very successful in describing the binding energy and radii across the nuclear chart [4, 54]. For ab initio methods, the simultaneous reproduction of both binding energy and nuclear charge radii remains an open challenge [6, 55].
To explore the multi-objective capabilities of our model, we apply the MISR algorithm to predict the binding energy, using as auxiliary observables the binding energy per nucleon, and the one and two nucleon separation energies across the same region, defined as , , , .
At leading order, in the first iteration of MISR, we obtained a relatively simple expression:
(4) |
with , , , and MeV. Additional MISR iterations, up to , are presented in the section Methods, Appendix 4.
The differences between the experimental values and model predictions for various observables are shown in Fig. 2. These differences decrease as one includes more orders of the MISR expansion, illustrating how the regression converges without a sign of overfitting to the trained region. In principle, one can keep an arbitrary number of terms in the expansion. However, the Pareto Frontier hints it is optimal to keep up to 10 terms.
As illustrated in Fig. 2, our MISR results for exhibit a significant improvement in the prediction of nuclear binding energies with respect to the LDM. This is also true for the neutron and proton separation energies.
The results for the binding energy per nucleon, , together with the distribution of their predicted uncertainties are shown in Fig. 3. The ability to provide uncertainty estimates allows for a more robust analysis of the predictive power of the model. This feature of MISR assesses where additional experimental data or refinement of the model might be necessary. The standard deviation of the binding energy predictions, as indicated by the color gradient, tends to correlate with the residual distribution. We remark that the uncertainty is not uniform across the nuclear chart. Instead, the highest uncertainties coincide with the transitional regions between different nuclear shells. These regions are known for increased structural changes that impact nuclear stability and are more challenging to predict accurately. This shell structure effect is illustrated in Fig. 3 c). The MISR expansion is well-behaved around closed-shell structures but its accuracy decreases for open-shell nuclei. The contrary is true for the Duflo-Zuker model, with the discrepancy with respect to the experiment larger around neutron magic numbers.
In Table 1 we compare the performance of different nuclear models in terms of Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). Note that these results represent only a subset of DFT calculations and do not include the latest DFT developments. MAE is calculated as the average absolute difference between the predicted and experimental values, while RMSE is derived from the square root of the average of the squared differences between the predicted and experimental binding energy values. For comparison, the table includes the results from DFT calculations using different density functionals such as DD-ME2 [56], NL3* [57], HFB24 [58], and UNEDF1 [59]. Remarkably, the MISR10 model shows competitive performance, particularly when considering the fact that it is built on a small number of input variables.
Model | Ref. | MAE [MeV] | RMSE [MeV] |
---|---|---|---|
MISR1 | - | 5.11 | 6.17 |
MISR10 | - | 0.78 | 0.99 |
DD-ME2 | [56] | 2.48 | 2.83 |
NL3* | [57] | 1.95 | 2.45 |
Duflo-Zuker | [51] | 0.62 | 0.87 |
FRDM | [50] | 0.67 | 0.90 |
HFB24 | [58] | 0.55 | 0.73 |
UNEDF1 | [59] | 1.88 | 2.25 |
LDM | [49] | 3.56 | 5.08 |
Results for the nuclear charge radii
Now we shift our focus to the study of the nuclear charge radius, . This nuclear property is highly sensitive to the details of the inter-nucleon interactions, not yet fully understood. The reproduction of the magnitude of is an unresolved challenge for ab initio nuclear theory [60].
At leading order, we obtain the first MISR expression to be:
(5) |
with fm, fm, fm. Additional terms obtained for higher order corrections, up to , can be found in the section Methods, Appendix 5. The first term is similar to the well-known droplet model for the isospin symmetric case, . Interestingly, the second term in this equation shows an exponential dependence on the isospin asymmetry, , which, to our knowledge, does not appear in other phenomenological models for the charge radii. Similar exponential terms have been proposed in nuclear mass models [61].
The difference between the experiment and our MISR results is shown in Fig. 4. The MISR expansion captures the overall trend of the charge radii. We do find a higher residual outlier in the distribution for (74Se), which is mainly due to its relatively large experimental uncertainty. Similarly, as with the results obtained for binding energies, the residuals are larger between magic numbers.
The performance obtained on the training and testing sets are shown in Table 2. Our MISR results are contrasted with different DFT calculations in the same table. For comparison, we include an analytical model proposed by Nerlo-Pomorska with Casten modifications [62]:
(6) |
where the constants are fitted to the experimental data.
Model | Ref. | Train [fm] | Test[fm] |
---|---|---|---|
MISR1 | - | 0.017 | 0.014 |
MISR10 | - | 0.009 | 0.009 |
DD-ME2 | [56] | 0.019 | 0.016 |
NL3* | [57] | 0.028 | 0.026 |
SKMS | [63] | 0.019 | 0.016 |
UNEDF1 | [59] | 0.026 | 0.020 |
NP | [62] | 0.023 | 0.018 |
Our MISR for the nuclear charge radii exhibits an overall good agreement with the experimental data, which is better than all models presented in Table 2. This is the case even at leading order, Eq. 5, with an expression of remarkable simplicity.
While Equation 5 is obtained by training only on light nuclei, it extrapolates well to heavy nuclei. This is illustrated in Fig. 5. This implies that our multi-objective optimization yields physical information valid across the entire nuclear chart. A similar extrapolation is observed for the binding energy predicted by Equation 4, which shows a smooth linear increase with the proton number. This suggests that adding a simple linear term in can provide a major improvement for heavy nuclei. Such a term would likely appear if large nuclei were included in the training process. We choose not to add this term manually, as our goal is to allow the ML algorithm to discover analytical expressions with minimal human intervention.
In contrast to nuclear binding energies, the nuclear charge radii are known to exhibit more complex and distinct behavior across the nuclear chart. In what follows, we explore the performance of our MISR result in describing complex charge radii trends for selected isotopic chains with . Our results are compared with the experimental values in Fig. 6.
Overall, the MISR prediction, including uncertainty estimation, provides a good description of the experimental trends. The model captures the expected kinks at magic numbers and part of the odd-even staggering effects, with large discrepancies found for Ca (). These isotopes exhibit a unique trend, which is known to be challenging to describe by nuclear theory [65, 66].
A different, distinct charge radii trend is observed for Krypton (Kr) isotopes (). The MISR results are compared with the experiment in Fig. 7. Here, the MISR model performs remarkably well, capturing all the main features: i. the kink at ; ii. a parabolic trend for neutron-deficient isotopes; iii. staggering between isotopes with odd and even neutron numbers.
Limits of the nuclear landscape
Inspired by the fact that our MISR model appears to be highly complementary to the DZ model, i.e. MISR exhibits better performance around closed shells, while the DZ performs better for open-shell nuclei (see Fig. 3), we combined them as an ensemble model. Both models can be combined using a Bayesian Automatic Relevance Determination (ARD) regression [68], which can determine the weight of each model and provide a combined prediction with uncertainty. The obtained assembled model, labeled as ARD provides an MAE of 0.389 (0.411) MeV on the train (test) samples, yielding a relative improvement of around with respect to the best-performing HFB24 model [58], a comparison of different models is presented in Table 3.
Model | Ref. | [MeV] | [MeV] | [MeV] |
---|---|---|---|---|
MISR10 | - | 0.79 | 0.46 | 0.96 |
ARD | - | 0.39 | 0.28 | 0.77 |
Duflo-Zuker | [51] | 0.62 | 0.30 | 0.81 |
FRDM | [50] | 0.67 | 0.34 | 0.84 |
HFB24 | [58] | 0.55 | 0.48 | 0.91 |
UNEDF1 | [59] | 1.88 | 0.41 | 0.82 |
This motivates us to use the ARD model to study open questions in nuclear science, such as the limits of the existence of nuclear matter. The ARD model provides a robust estimation of nucleon separation energies, which is critical for predicting the limits of stability of nuclei. Using our model uncertainties, the probability of having a bound nucleus can be estimated for any number of protons and neutrons. Proceeding with this approach, we compute the one and two nucleon separation energies and obtain the probability of having a positive central value via the Cumulative Density Probability of a Gaussian distribution [53]. The results for the limits of stability predicted by the ARD model for nuclei with are presented in Fig. 8. Where data are available, overall good agreement with the experiment was found.
Conclusions and outlooks
We developed a Multi-objective Iterative Symbolic Regression framework, labeled as MISR, to enable the discovery of analytical models of nuclear properties. As a proof of principle, the MISR method was employed to describe the binding energies and charge radii of light and medium mass nuclei. Remarkably, simple expressions were found as functions of the neutron and proton numbers. The models found can provide a relatively good description of the available data, with precision comparable to that of state-of-the-art nuclear models.
The MISR model was combined with the well-known DZ model to enable a powerful, complementary description of binding energies and nucleon separation energies. The combined model, ARD, resulted in an overall agreement with the experiment. The uncertainty estimation provided by the ARD model was used to estimate the limits of stability of nuclei in the region . These results highlight the potential of integrating physics-informed ML approaches with established complementary theoretical models to improve predictions of yet-to-be-explored regions of the nuclear chart.
By combining the predictive power of machine learning with the interpretability of symbolic expressions, MISR offers a promising avenue for advancing our understanding of complex nuclear phenomena and, hopefully, guiding the development of more accurate and insightful nuclear models.
Future work will focus on extending the application of MISR to describe complementary nuclear observables using different sets of input variables. Of particular interest would be the development of the MISR method to discover the analytical forms of nuclear interactions and, more generally, interaction Hamiltonians of quantum many-body systems. This could be combined with powerful many-body methods to establish a direct link between microscopic interactions and observables. Work is ongoing to generalize the MISR approach for this purpose, including vectors and tensor operators.
Acknowledgements
This work was supported by the Office of Nuclear Physics, U.S. Department of Energy, under grants DE-SC0021176 and DE-SC0021179. We are grateful for useful discussions and suggestions from S. Wilkins, J. Holt, A. Belley, A. Galindo, and K. Matchev for their helpful comments and insights.
References
- Tews et al. [2020a] I. Tews, Z. Davoudi, A. Ekström, J. D. Holt, and J. E. Lynn, Journal of Physics G: Nuclear and Particle Physics 47, 103001 (2020a).
- Epelbaum et al. [2009] E. Epelbaum, H.-W. Hammer, and U.-G. Meißner, Rev. Mod. Phys. 81, 1773 (2009).
- Navratil et al. [2016] P. Navratil, S. Quaglioni, G. Hupin, C. Romero-Redondo, and A. Calci, Physica Scripta 91, 053002 (2016).
- Reinhard and Nazarewicz [2017] P.-G. Reinhard and W. Nazarewicz, Phys. Rev. C 95, 064328 (2017).
- Sassarini et al. [2022] P. L. Sassarini, J. Dobaczewski, J. Bonnard, and R. F. Garcia Ruiz, J. Phys. G 49, 11LT01 (2022), arXiv:2111.04675 [nucl-th] .
- Tews et al. [2020b] I. Tews, Z. Davoudi, A. Ekström, J. D. Holt, and J. E. Lynn, J. Phys. G 47, 103001 (2020b), arXiv:2001.03334 [nucl-th] .
- Lovato et al. [2022] A. Lovato, C. Adams, G. Carleo, and N. Rocco, Physical Review Research 4, 043178 (2022).
- Vernon et al. [2022] A. R. Vernon et al., Nature 607, 260 (2022).
- Karthein et al. [2023] J. Karthein et al., arXiv , arXiv:2310.15093 (2023).
- Gao et al. [2021] Z.-P. Gao, Y.-J. Wang, H.-L. Lü, Q.-F. Li, C.-W. Shen, and L. Liu, Nuclear Science and Techniques 32, 109 (2021).
- Su et al. [2023] P. Su, W.-B. He, and D.-Q. Fang, Symmetry 15, 1040 (2023).
- Neufcourt et al. [2018a] L. Neufcourt, Y. Cao, W. Nazarewicz, F. Viens, et al., Physical Review C 98, 034318 (2018a).
- Akkoyun and Yakhelef [2022] S. Akkoyun and A. Yakhelef, Phys. Rev. C 105, 044309 (2022), arXiv:2112.12562 [nucl-th] .
- Mumpower et al. [2023] M. Mumpower, M. Li, T. M. Sprouse, B. S. Meyer, A. E. Lovell, and A. T. Mohan, Frontiers in Physics 11, 1198572 (2023).
- Neufcourt et al. [2018b] L. Neufcourt, Y. Cao, W. Nazarewicz, and F. Viens, Phys. Rev. C 98, 034318 (2018b), arXiv:1806.00552 [nucl-th] .
- Munoz et al. [2023] J. M. Munoz, S. Akkoyun, Z. P. Reyes, and L. A. Pachon, Phys. Rev. C 107, 034308 (2023).
- Nolte et al. [2023] N. Nolte, O. Kitouni, S. Trifinopoulos, S. Kantamneni, and M. Williams, in 1st Workshop on the Synergy of Scientific and Machine Learning Modeling @ ICML (2023).
- Mumpower et al. [2022] M. R. Mumpower, T. M. Sprouse, A. E. Lovell, and A. T. Mohan, Phys. Rev. C 106, L021301 (2022).
- Belkin et al. [2018] M. Belkin, D. J. Hsu, and P. Mitra, Advances in neural information processing systems 31 (2018).
- Jiang et al. [2019] W. G. Jiang, G. Hagen, and T. Papenbrock, Phys. Rev. C 100, 054326 (2019).
- Gerwin [1974] D. Gerwin, Behavioral Science 19, 314 (1974).
- Schmidt and Lipson [2009] M. Schmidt and H. Lipson, Science 324, 81 (2009), https://www.science.org/doi/pdf/10.1126/science.1165893 .
- Keren et al. [2023] L. S. Keren, A. Liberzon, and T. Lazebnik, Scientific Reports 13, 1249 (2023).
- Keijzer [2004] M. Keijzer, Genetic Programming and Evolvable Machines 5, 259 (2004).
- Udrescu and Tegmark [2020] S.-M. Udrescu and M. Tegmark, Science Advances 6 (2020), 10.1126/sciadv.aay2631.
- Udrescu et al. [2020] S.-M. Udrescu, A. Tan, J. Feng, O. Neto, T. Wu, and M. Tegmark, Advances in Neural Information Processing Systems 33, 4860 (2020).
- Villaescusa-Navarro et al. [2021] F. Villaescusa-Navarro et al. (CAMELS), Astrophys. J. 915, 71 (2021), arXiv:2010.00619 [astro-ph.CO] .
- Tenachi et al. [2023] W. Tenachi, R. Ibata, and F. I. Diakogiannis, The Astrophysical Journal 959, 99 (2023).
- Matchev et al. [2022] K. T. Matchev, K. Matcheva, and A. Roman, The Astrophysical Journal 930, 33 (2022).
- Lemos et al. [2023] P. Lemos, N. Jeffrey, M. Cranmer, S. Ho, and P. Battaglia, Machine Learning: Science and Technology 4, 045002 (2023).
- Wang et al. [2019] Y. Wang, N. Wagner, and J. M. Rondinelli, MRS Communications 9, 793 (2019).
- Weng et al. [2020] B. Weng, Z. Song, R. Zhu, Q. Yan, Q. Sun, C. G. Grice, Y. Yan, and W.-J. Yin, Nature Communications 11, 3513 (2020).
- Derner et al. [2020] E. Derner, J. Kubalík, N. Ancona, and R. Babuška, Applied Soft Computing 94, 106432 (2020).
- Dong et al. [2023] Z. Dong, K. Kong, K. T. Matchev, and K. Matcheva, Physical Review D 107, 055018 (2023).
- Tsoi et al. [2024] H. F. Tsoi, V. Loncar, S. Dasu, and P. Harris, (2024), arXiv:2401.09949 [cs.LG] .
- Cranmer et al. [2020] M. Cranmer, A. Sanchez Gonzalez, P. Battaglia, R. Xu, K. Cranmer, D. Spergel, and S. Ho, Advances in Neural Information Processing Systems 33, 17429 (2020).
- Schapire [2003] R. E. Schapire, Nonlinear estimation and classification , 149 (2003).
- Ngatchou et al. [2005] P. Ngatchou, A. Zarei, and A. El-Sharkawi, in Proceedings of the 13th international conference on, intelligent systems application to power systems (IEEE, 2005) pp. 84–91.
- Kubalík et al. [2021] J. Kubalík, E. Derner, and R. Babuška, Expert Systems with Applications 182, 115210 (2021).
- Kozachenko and Leonenko [1987] L. F. Kozachenko and N. N. Leonenko, Problemy Peredachi Informatsii 23, 9 (1987).
- Alaa and Van Der Schaar [2020] A. Alaa and M. Van Der Schaar, in International Conference on Machine Learning (PMLR, 2020) pp. 165–174.
- Furnstahl et al. [2015] R. Furnstahl, N. Klco, D. Phillips, and S. Wesolowski, Physical Review C 92, 024005 (2015).
- Casten et al. [1987] R. F. Casten, D. S. Brenner, and P. E. Haustein, Phys. Rev. Lett. 58, 658 (1987).
- Wang et al. [2021] M. Wang, W. Huang, F. G. Kondev, G. Audi, and S. Naimi, Chinese Physics C 45, 030003 (2021).
- Buskirk et al. [2023] L. Buskirk, K. Godbey, P. Giuliani, W. Nazarewicz, and Y. Yamauchi, in APS Meeting Abstracts, APS Meeting Abstracts (2023) p. M06.002.
- [46] F. L. for Nuclear Sciences and N. S. C. Laboratory, “Dft mass tables,” https://massexplorer.frib.msu.edu/content/DFTMassTables.html, accessed: 2024-06-27.
- Marinova and Angeli [2013] K. Marinova and I. Angeli, “Nuclear charge radii,” International Atomic Energy Agency (IAEA) (2013).
- Yang et al. [2023] X. Yang, S. Wang, S. Wilkins, and R. G. Ruiz, Progress in Particle and Nuclear Physics 129, 104005 (2023).
- Benzaid et al. [2020] D. Benzaid, S. Bentridi, A. Kerraci, and N. Amrani, Nuclear Science and Techniques 31, 9 (2020).
- Möller et al. [2012] P. Möller, W. D. Myers, H. Sagawa, and S. Yoshida, Physical Review Letters 108, 052501 (2012).
- Mendoza-Temis et al. [2010] J. Mendoza-Temis, J. G. Hirsch, and A. P. Zuker, Nucl. Phys. A 843, 14 (2010), arXiv:0912.0882 [nucl-th] .
- Erler et al. [2012] J. Erler, N. Birge, M. Kortelainen, W. Nazarewicz, E. Olsen, A. M. Perhac, and M. Stoitsov, Nature (London) 486, 509 (2012).
- Stroberg et al. [2021] S. R. Stroberg, J. D. Holt, A. Schwenk, and J. Simonis, Phys. Rev. Lett. 126, 022501 (2021).
- Perera et al. [2021] U. C. Perera, A. V. Afanasjev, and P. Ring, Phys. Rev. C 104, 064313 (2021), arXiv:2108.02245 [nucl-th] .
- Malbrunot-Ettenauer and Kaufmann [2022] S. Malbrunot-Ettenauer and e. a. Kaufmann, Phys. Rev. Lett. 128, 022502 (2022).
- Day Goodacre et al. [2021] T. Day Goodacre, A. V. Afanasjev, and et. al., Phys. Rev. Lett. 126, 032502 (2021).
- Agbemava et al. [2014] S. E. Agbemava, A. V. Afanasjev, D. Ray, and P. Ring, Phys. Rev. C 89, 054320 (2014).
- Goriely et al. [2016] S. Goriely, N. Chamel, and J. Pearson, in Journal of Physics: Conference Series, Vol. 665 (2016) p. 012038.
- Kortelainen et al. [2012] M. Kortelainen, J. McDonnell, W. Nazarewicz, P.-G. Reinhard, J. Sarich, N. Schunck, M. V. Stoitsov, and S. M. Wild, Physical Review C 85 (2012), 10.1103/physrevc.85.024304.
- Rossi [2021] M. Rossi, The Journal of Chemical Physics 154 (2021).
- Seeger [1961] P. A. Seeger, Nuclear Physics 25, 1 (1961).
- Sheng et al. [2015] Z. Sheng, G. Fan, J. Qian, and J. Hu, The European Physical Journal A 51, 40 (2015).
- Buskirk et al. [2023] L. Buskirk, K. Godbey, W. Nazarewicz, and W. Satula, “Nucleonic shells and nuclear masses,” (2023), arXiv:2309.16871 [nucl-th] .
- Angeli and Marinova [2013] I. Angeli and K. Marinova, Atomic Data and Nuclear Data Tables 99, 69 (2013).
- Garcia Ruiz et al. [2016] R. F. Garcia Ruiz, M. Bissell, K. Blaum, A. Ekström, N. Frömmgen, G. Hagen, M. Hammen, K. Hebeler, J. Holt, G. Jansen, et al., Nature Physics 12, 594 (2016).
- Koszorus et al. [2021] A. Koszorus, X. F. Yang, W. G. Jiang, S. J. Novario, S. W. Bai, J. Billowes, C. L. Binnersley, M. L. Bissell, T. E. Cocolios, B. S. Cooper, R. P. de Groote, A. Ekstrom, K. T. Flanagan, C. Forssen, S. Franchoo, R. F. G. Ruiz, F. P. Gustafsson, G. Hagen, G. R. Jansen, A. Kanellakopoulos, M. Kortelainen, W. Nazarewicz, G. Neyens, T. Papenbrock, P.-G. Reinhard, C. M. Ricketts, B. K. Sahoo, A. R. Vernon, and S. G. Wilkins, Nature Physics 17, 439 (2021).
- Heylen et al. [2016] H. Heylen, C. Babcock, R. Beerwerth, J. Billowes, M. L. Bissell, K. Blaum, J. Bonnard, P. Campbell, B. Cheal, T. Day Goodacre, D. Fedorov, S. Fritzsche, R. F. Garcia Ruiz, W. Geithner, C. Geppert, W. Gins, L. K. Grob, M. Kowalska, K. Kreim, S. M. Lenzi, I. D. Moore, B. Maass, S. Malbrunot-Ettenauer, B. Marsh, R. Neugart, G. Neyens, W. Nörtershäuser, T. Otsuka, J. Papuga, R. Rossel, S. Rothe, R. Sánchez, Y. Tsunoda, C. Wraith, L. Xie, X. F. Yang, and D. T. Yordanov, Phys. Rev. C 94, 054321 (2016).
- MacKay et al. [1994] D. J. MacKay et al., ASHRAE transactions 100, 1053 (1994).
- Wilstrup and Kasak [2021] C. Wilstrup and J. Kasak, arXiv preprint arXiv:2103.15147 (2021).
- Gregorutti et al. [2017] B. Gregorutti, B. Michel, and P. Saint-Pierre, Statistics and Computing 27, 659 (2017).
- Peng et al. [2005] H. Peng, F. Long, and C. Ding, IEEE Transactions on pattern analysis and machine intelligence 27, 1226 (2005).
I METHODS
I.1 Multi-objective Iterated Symbolic Regression
Symbolic Regression (SR) methods optimize both the parameters and structure of analytical models and have been increasingly applied across various fields such as physics, biology, and finance due to their interpretability and ability to produce simple mathematical expressions [36]. SR evolves a population of candidate solutions by crossover and elimination, based on fitness criteria to better fit the data. Moreover, it differs from traditional regression methods in that it does not assume a predefined model structure. Instead, it evolves the structure of the model itself, allowing for discoveries of novel relationships between variables, while requiring significantly less data to train [69]. In the following, we present Multi-objective Iterated Symbolic Regression (MISR), a novel framework for performing SR.
Overview of the Core Algorithm
The core algorithm of MISR is rooted in the principles of SR but extends beyond traditional methodologies to address complex challenges in physics. It does so by leveraging a framework that iteratively refines feature selection, equation generation, and model optimization. This iterative process aims to construct a comprehensive model by aggregating simpler sub-models, each capturing unique aspects of the underlying physical processes.
A key aspect of our methodology is the dynamic assessment of feature importance in each step. For this, we fit a Boosted Decision Tree’s feature importances averaged with the Mutual Information score of each input feature [70, 71]. Each iteration involves k-fold cross-validation, where we downsample the feature space based on its importance.
SR framework
Within each fold and iteration, we perform a symbolic regression constrained by the number of terms—a significant hyperparameter influencing equation simplicity. Each expression acts as an individual within the evolutionary algorithm, its fitness gauged via the mean error description length (MEDL) based on optimized parameters [25, 26]. These regressions are executed in parallel across folds for computational efficiency. The top-performing equations are identified based on their performance score on various objective variables which are related by our physical models, with experimental uncertainties inversely weighing the evaluation for robustness.
Iterative refinement and residual modeling
Our iterative approach refines the model’s target variable to be the residuals from the best-performing model of the preceding iteration by repeating the fitting process in the residuals. The iterative process continues until reaching a predefined maximum number of steps or when the improvement ratio diminishes below a certain threshold.
Estimation of weighting distribution
After we have modeled all of the expanding terms in our expression, we estimate the errors via the technique of Discriminative Jackknife [41] on the test set. Then the inference uncertainty is obtained via resampling from the distribution of the weights modeled as Gaussian distributions and adding the subsequent term value in the expansion cutoff. Fig. 9 illustrates the distribution of sampled weights for charge radii’s regressor. The figure demonstrates how the Jackknife resampling allows the assignment of characteristic Gaussian distributions to each one of the terms in the MISR expansion.
The total related uncertainty is computed by approximating all of the sampling weights to a Gaussian distribution and bootstrap** to one since we assume the independence in each distribution. Then, we orthonormally add the uncertainties with the approximation coming from the cutoff in the expansion (i.e. the next expansion term).
Nucleon separation energies
Fig. 10 presents the trend of neutron separation energies for different neutron-rich isotopes of Argonne (Ar), Calcium (Ca), Titanium (Ti), and Manganese (Mn). Experimental results are compared with the ARD model
As a complementary example, the proton separation energies of Krypton isotopes are shown in Fig. 11.
I.2 MISR for Binding Energy
In table 4 we present the obtained models term by term. We label the functions and respectively. In addition, we denote as is done traditionally in the literature.
Model Order | Obtained Function |
---|---|
1 | |
2 | |
3 | |
4 | |
5 | |
6 | |
7 | |
8 | |
9 | |
10 |
I.3 MISR for Charge Radii
Table 5 presents the obtained models term by term. We label the functions and respectively. In addition, we denote as is done traditionally in the literature.
Model Order | Obtained Function |
---|---|
1 | |
2 | |
3 | |
4 | |
5 | |
6 | |
7 | |
8 | |
9 | |
10 |