Showing 1–2 of 2 results for author: Reichardt, I

Search v0.5.6 released 2020-02-24

arXiv:2204.02704 [pdf, other]

cs.LG cond-mat.dis-nn cond-mat.stat-mech physics.comp-ph physics.data-an

doi 10.1038/s41467-023-36657-z

Fundamental limits to learning closed-form mathematical models from data

Authors: Oscar Fajardo-Fontiveros, Ignasi Reichardt, Harry R. De Los Rios, Jordi Duch, Marta Sales-Pardo, Roger Guimera

Abstract: Given a finite and noisy dataset generated with a closed-form mathematical model, when is it possible to learn the true generating model from the data alone? This is the question we investigate here. We show that this model-learning problem displays a transition from a low-noise phase in which the true model can be learned, to a phase in which the observation noise is too high for the true model t… ▽ More Given a finite and noisy dataset generated with a closed-form mathematical model, when is it possible to learn the true generating model from the data alone? This is the question we investigate here. We show that this model-learning problem displays a transition from a low-noise phase in which the true model can be learned, to a phase in which the observation noise is too high for the true model to be learned by any method. Both in the low-noise phase and in the high-noise phase, probabilistic model selection leads to optimal generalization to unseen data. This is in contrast to standard machine learning approaches, including artificial neural networks, which in this particular problem are limited, in the low-noise phase, by their ability to interpolate. In the transition region between the learnable and unlearnable phases, generalization is hard for all approaches including probabilistic model selection. △ Less

Submitted 16 December, 2022; v1 submitted 6 April, 2022; originally announced April 2022.
arXiv:2004.12157 [pdf]

cs.LG physics.data-an stat.ML

doi 10.1126/sciadv.aav6971

A Bayesian machine scientist to aid in the solution of challenging scientific problems

Authors: Roger Guimera, Ignasi Reichardt, Antoni Aguilar-Mogas, Francesco A Massucci, Manuel Miranda, Jordi Pallares, Marta Sales-Pardo

Abstract: Closed-form, interpretable mathematical models have been instrumental for advancing our understanding of the world; with the data revolution, we may now be in a position to uncover new such models for many systems from physics to the social sciences. However, to deal with increasing amounts of data, we need "machine scientists" that are able to extract these models automatically from data. Here, w… ▽ More Closed-form, interpretable mathematical models have been instrumental for advancing our understanding of the world; with the data revolution, we may now be in a position to uncover new such models for many systems from physics to the social sciences. However, to deal with increasing amounts of data, we need "machine scientists" that are able to extract these models automatically from data. Here, we introduce a Bayesian machine scientist, which establishes the plausibility of models using explicit approximations to the exact marginal posterior over models and establishes its prior expectations about models by learning from a large empirical corpus of mathematical expressions. It explores the space of models using Markov chain Monte Carlo. We show that this approach uncovers accurate models for synthetic and real data and provides out-of-sample predictions that are more accurate than those of existing approaches and of other nonparametric methods. △ Less

Submitted 25 April, 2020; originally announced April 2020.

Journal ref: Sci. Adv. 6 (5) , eaav6971 (2020)

Search v0.5.6 released 2020-02-24