Showing 1–13 of 13 results for author: Wandelt, B

  1. arXiv:2405.13867  [pdf, other]

    cs.LG cs.AI

    Scaling-laws for Large Time-series Models

    Authors: Thomas D. P. Edwards, James Alvey, Justin Alsing, Nam H. Nguyen, Benjamin D. Wandelt

    Abstract: Scaling laws for large language models (LLMs) have provided useful guidance on how to train ever larger models for predictable performance gains. Time series forecasting shares a similar sequential structure to language, and is amenable to large-scale transformer architectures. Here we show that foundational decoder-only time series transformer models exhibit analogous scaling-behavior to LLMs, wh…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 8 pages, 3 figures

  2. arXiv:2402.17492  [pdf, other]

    astro-ph.CO astro-ph.IM cs.LG cs.NE

    syren-halofit: A fast, interpretable, high-precision formula for the $\Lambda$CDM nonlinear matter power spectrum

    Authors: Deaglan J. Bartlett, Benjamin D. Wandelt, Matteo Zennaro, Pedro G. Ferreira, Harry Desmond

    Abstract: Rapid and accurate evaluation of the nonlinear matter power spectrum, $P(k)$, as a function of cosmological parameters and redshift is of fundamental importance in cosmology. Analytic approximations provide an interpretable solution, yet current approximations are neither fast nor accurate relative to numerical emulators. We use symbolic regression to obtain simple analytic approximations to the n…

    Submitted 15 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 figures. Accepted for publication in A&A

    Journal ref: A&A 686, A150 (2024)

  3. arXiv:2402.05137  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.GA cs.LG

    LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology

    Authors: Matthew Ho, Deaglan J. Bartlett, Nicolas Chartier, Carolina Cuesta-Lazaro, Simon Ding, Axel Lapel, Pablo Lemos, Christopher C. Lovell, T. Lucas Makinen, Chirag Modi, Viraj Pandya, Shivam Pandey, Lucia A. Perez, Benjamin Wandelt, Greg L. Bryan

    Abstract: This paper presents the Learning the Universe Implicit Likelihood Inference (LtU-ILI) pipeline, a codebase for rapid, user-friendly, and cutting-edge machine learning (ML) inference in astrophysics and cosmology. The pipeline includes software for implementing various neural architectures, training schemata, priors, and density estimators in a manner easily adaptable to any research workflow. It i…

    Submitted 2 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 22 pages, 10 figures, accepted in the Open Journal of Astrophysics. Code available at https://github.com/maho3/ltu-ili

    Journal ref: 2024 OJA, Vol. 7

  4. arXiv:2311.15865  [pdf, other]

    astro-ph.CO astro-ph.IM cs.LG cs.NE

    A precise symbolic emulator of the linear matter power spectrum

    Authors: Deaglan J. Bartlett, Lukas Kammerer, Gabriel Kronberger, Harry Desmond, Pedro G. Ferreira, Benjamin D. Wandelt, Bogdan Burlacu, David Alonso, Matteo Zennaro

    Abstract: Computing the matter power spectrum, $P(k)$, as a function of cosmological parameters can be prohibitively slow in cosmological analyses, hence emulating this calculation is desirable. Previous analytic approximations are insufficiently accurate for modern applications, so black-box, uninterpretable emulators are often used. We utilise an efficient genetic programming based symbolic regression fra…

    Submitted 15 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures. Accepted for publication in A&A

    Journal ref: A&A 686, A209 (2024)

  5. arXiv:2311.05742  [pdf, other]

    stat.ML astro-ph.IM cs.AI cs.GT cs.LG

    Optimal simulation-based Bayesian decisions

    Authors: Justin Alsing, Thomas D. P. Edwards, Benjamin Wandelt

    Abstract: We present a framework for the efficient computation of optimal Bayesian decisions under intractable likelihoods, by learning a surrogate model for the expected utility (or its distribution) as a function of the action and data spaces. We leverage recent advances in simulation-based inference and Bayesian optimization to develop active learning schemes to choose where in parameter and action space…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 12 pages, 4 figures

  6. arXiv:2310.03812  [pdf, other]

    cs.LG stat.ML

    Fishnets: Information-Optimal, Scalable Aggregation for Sets and Graphs

    Authors: T. Lucas Makinen, Justin Alsing, Benjamin D. Wandelt

    Abstract: Set-based learning is an essential component of modern deep learning and network science. Graph Neural Networks (GNNs) and their edge-free counterparts Deepsets have proven remarkably useful on ragged and topologically challenging datasets. The key to learning informative embeddings for set members is a specified aggregation function, usually a sum, max, or mean. We propose Fishnets, an aggregatio…

    Submitted 28 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 15 pages, 6 figures, 2 tables. Submitted to JMLR

  7. arXiv:2305.11241  [pdf, other]

    cs.LG astro-ph.CO astro-ph.IM stat.ML

    Evidence Networks: simple losses for fast, amortized, neural Bayesian model comparison

    Authors: Niall Jeffrey, Benjamin D. Wandelt

    Abstract: Evidence Networks can enable Bayesian model comparison when state-of-the-art methods (e.g. nested sampling) fail and even when likelihoods or priors are intractable or unknown. Bayesian model comparison, i.e. the computation of Bayes factors or evidence ratios, can be cast as an optimization problem. Though the Bayesian interpretation of optimal classification is well-known, here we change perspec…

    Submitted 10 January, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, accepted by Machine Learning: Science and Technology

    Journal ref: http://iopscience.iop.org/article/10.1088/2632-2153/ad1a4d, 2024, Machine Learning: Science and Technology, 2632-2153

  8. arXiv:2305.11213  [pdf, other]

    cs.LG

    Information-Ordered Bottlenecks for Adaptive Semantic Compression

    Authors: Matthew Ho, Xiaosheng Zhao, Benjamin Wandelt

    Abstract: We present the information-ordered bottleneck (IOB), a neural layer designed to adaptively compress data into latent variables ordered by likelihood maximization. Without retraining, IOB nodes can be truncated at any bottleneck width, capturing the most crucial information in the first latent variables. Unifying several previous approaches, we show that IOBs achieve near-optimal compression for a…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 14 pages, 6 figures, 1 table, Submitted to NeurIPS 2023

  9. arXiv:2201.01300  [pdf, other]

    astro-ph.CO astro-ph.GA astro-ph.IM cs.AI cs.LG

    The CAMELS project: public data release

    Authors: Francisco Villaescusa-Navarro, Shy Genel, Daniel Anglés-Alcázar, Lucia A. Perez, Pablo Villanueva-Domingo, Digvijay Wadekar, Helen Shao, Faizan G. Mohammad, Sultan Hassan, Emily Moser, Erwin T. Lau, Luis Fernando Machado Poletti Valle, Andrina Nicola, Leander Thiele, Yongseok Jo, Oliver H. E. Philcox, Benjamin D. Oppenheimer, Megan Tillman, ChangHoon Hahn, Neerav Kaushal, Alice Pisani, Matthew Gebhardt, Ana Maria Delgado, Joyce Caliendo, Christina Kreisch, et al. (22 additional authors not shown)

    Abstract: The Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project was developed to combine cosmology with astrophysics through thousands of cosmological hydrodynamic simulations and machine learning. CAMELS contains 4,233 cosmological simulations, 2,049 N-body and 2,184 state-of-the-art hydrodynamic simulations that sample a vast volume in parameter space. In this paper we present…

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: 18 pages, 3 figures. More than 350 Tb of data from thousands of simulations publicly available at https://www.camel-simulations.org

  10. arXiv:2109.10915  [pdf, other]

    cs.LG astro-ph.CO astro-ph.GA astro-ph.IM cs.CV

    The CAMELS Multifield Dataset: Learning the Universe's Fundamental Parameters with Artificial Intelligence

    Authors: Francisco Villaescusa-Navarro, Shy Genel, Daniel Angles-Alcazar, Leander Thiele, Romeel Dave, Desika Narayanan, Andrina Nicola, Yin Li, Pablo Villanueva-Domingo, Benjamin Wandelt, David N. Spergel, Rachel S. Somerville, Jose Manuel Zorrilla Matilla, Faizan G. Mohammad, Sultan Hassan, Helen Shao, Digvijay Wadekar, Michael Eickenberg, Kaze W. K. Wong, Gabriella Contardo, Yongseok Jo, Emily Moser, Erwin T. Lau, Luis Fernando Machado Poletti Valle, Lucia A. Perez, et al. (3 additional authors not shown)

    Abstract: We present the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) Multifield Dataset, CMD, a collection of hundreds of thousands of 2D maps and 3D grids containing many different properties of cosmic gas, dark matter, and stars from 2,000 distinct simulated universes at several cosmic times. The 2D maps and 3D grids represent cosmic regions that span $\sim$100 million light year…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: 17 pages, 1 figure. Third paper of a series of four. Hundreds of thousands of labeled 2D maps and 3D grids from thousands of simulated universes publicly available at https://camels-multifield-dataset.readthedocs.io

  11. arXiv:2109.10360  [pdf, other]

    astro-ph.CO astro-ph.GA astro-ph.IM cs.CV cs.LG

    Robust marginalization of baryonic effects for cosmological inference at the field level

    Authors: Francisco Villaescusa-Navarro, Shy Genel, Daniel Angles-Alcazar, David N. Spergel, Yin Li, Benjamin Wandelt, Leander Thiele, Andrina Nicola, Jose Manuel Zorrilla Matilla, Helen Shao, Sultan Hassan, Desika Narayanan, Romeel Dave, Mark Vogelsberger

    Abstract: We train neural networks to perform likelihood-free inference from $(25\,h^{-1}{\rm Mpc})^2$ 2D maps containing the total mass surface density from thousands of hydrodynamic simulations of the CAMELS project. We show that the networks can extract information beyond one-point functions and power spectra from all resolved scales ($\gtrsim 100\,h^{-1}{\rm kpc}$) while performing a robust marginalizat…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 7 pages, 4 figures. Second paper of a series of four. The 2D maps, codes, and network weights used in this paper are publicly available at https://camels-multifield-dataset.readthedocs.io

  12. arXiv:2109.09747  [pdf, other]

    astro-ph.CO astro-ph.GA astro-ph.IM cs.CV cs.LG

    Multifield Cosmology with Artificial Intelligence

    Authors: Francisco Villaescusa-Navarro, Daniel Anglés-Alcázar, Shy Genel, David N. Spergel, Yin Li, Benjamin Wandelt, Andrina Nicola, Leander Thiele, Sultan Hassan, Jose Manuel Zorrilla Matilla, Desika Narayanan, Romeel Dave, Mark Vogelsberger

    Abstract: Astrophysical processes such as feedback from supernovae and active galactic nuclei modify the properties and spatial distribution of dark matter, gas, and galaxies in a poorly understood way. This uncertainty is one of the main theoretical obstacles to extract information from cosmological surveys. We use 2,000 state-of-the-art hydrodynamic simulations from the CAMELS project spanning a wide vari…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 11 pages, 7 figures. First paper of a series of four. All 2D maps, codes, and networks weights publicly available at https://camels-multifield-dataset.readthedocs.io

  13. arXiv:2011.05991  [pdf, other]

    stat.ML astro-ph.CO cs.LG

    Solving high-dimensional parameter inference: marginal posterior densities & Moment Networks

    Authors: Niall Jeffrey, Benjamin D. Wandelt

    Abstract: High-dimensional probability density estimation for inference suffers from the "curse of dimensionality". For many physical inference problems, the full posterior distribution is unwieldy and seldom used in practice. Instead, we propose direct estimation of lower-dimensional marginal distributions, bypassing high-dimensional density estimation or high-dimensional Markov chain Monte Carlo (MCMC) sa…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted in the Third Workshop on Machine Learning and the Physical Sciences, NeurIPS 2020, Vancouver, Canada