Showing 1–43 of 43 results for author: Eickenberg, M

  1. arXiv:2406.02585

    cs.LG cs.AI stat.ML

    Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Task

    Authors: Siavash Golkar, Alberto Bietti, Mariel Pettee, Michael Eickenberg, Miles Cranmer, Keiya Hirashima, Geraud Krawezik, Nicholas Lourie, Michael McCabe, Rudy Morel, Ruben Ohana, Liam Holden Parker, Bruno Régaldo-Saint Blancard, Kyunghyun Cho, Shirley Ho

    Abstract: Transformers have revolutionized machine learning across diverse domains, yet understanding their behavior remains crucial, particularly in high-stakes applications. This paper introduces the contextual counting task, a novel toy problem aimed at enhancing our understanding of Transformers in quantitative and scientific contexts. This task requires precise localization and computation within datas…

    Submitted 30 May, 2024; originally announced June 2024.

  2. arXiv:2406.02052

    cs.LG stat.ML

    PETRA: Parallel End-to-end Training with Reversible Architectures

    Authors: Stéphane Rivaud, Louis Fournier, Thomas Pumir, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon

    Abstract: Reversible architectures have been shown to be capable of performing on par with their non-reversible architectures, being applied in deep learning for memory savings and generative modeling. In this work, we show how reversible architectures can solve challenges in parallelizing deep model training. We introduce PETRA, a novel alternative to backpropagation for parallelizing gradient computations…

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2406.01365

    cs.CV cs.CR cs.LG

    From Feature Visualization to Visual Circuits: Effect of Adversarial Model Manipulation

    Authors: Geraldin Nanfack, Michael Eickenberg, Eugene Belilovsky

    Abstract: Understanding the inner working functionality of large-scale deep neural networks is challenging yet crucial in several high-stakes applications. Mechanistic interpretability is an emergent field that tackles this challenge, often by identifying human-understandable subgraphs in deep neural networks known as circuits. In vision-pretrained models, these subgraphs are usually interpreted by visual…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2404.04228

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: Cosmological Constraints using Simulation-Based Inference of Galaxy Clustering with Marked Power Spectra

    Authors: Elena Massara, ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Jiamin Hou, Pablo Lemos, Chirag Modi, Azadeh Moradinezhad Dizgah, Liam Parker, Bruno Régaldo-Saint Blancard

    Abstract: We present the first $Λ$CDM cosmological analysis performed on a galaxy survey using marked power spectra. The marked power spectrum is the two-point function of a marked field, where galaxies are weighted by a function that depends on their local density. The presence of the mark leads these statistics to contain higher-order information of the original galaxy field, making them a good candidate…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 15 pages, 6 figures

  5. arXiv:2402.04958

    cs.CV

    Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation

    Authors: Pedro Vianna, Muawiz Chaudhary, Paria Mehrbod, An Tang, Guy Cloutier, Guy Wolf, Michael Eickenberg, Eugene Belilovsky

    Abstract: Deep neural networks have useful applications in many different tasks; however, their performance can be severely affected by changes in the data distribution. For example, in the biomedical field, their performance can be affected by changes in the data (different machines, populations) between training and test datasets. To ensure robustness and generalization to real-world scenarios, test-time a…

    Submitted 29 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted at the Conference on Lifelong Learning Agents (CoLLAs) 2024

  6. arXiv:2401.15074

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: Cosmological Constraints from the Redshift-Space Galaxy Skew Spectra

    Authors: Jiamin Hou, Azadeh Moradinezhad Dizgah, ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Pablo Lemos, Elena Massara, Chirag Modi, Liam Parker, Bruno Régaldo-Saint Blancard

    Abstract: Extracting the non-Gaussian information of the cosmic large-scale structure (LSS) is vital in unlocking the full potential of the rich datasets from the upcoming stage-IV galaxy surveys. Galaxy skew spectra serve as efficient beyond-two-point statistics, encapsulating essential bispectrum information with computational efficiency akin to power spectrum analysis. This paper presents the first cosmo…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 23 pages, 12 figures, 2 tables

  7. arXiv:2310.15256

    astro-ph.CO cs.LG

    SimBIG: Field-level Simulation-Based Inference of Galaxy Clustering

    Authors: Pablo Lemos, Liam Parker, ChangHoon Hahn, Shirley Ho, Michael Eickenberg, Jiamin Hou, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Bruno Regaldo-Saint Blancard, David Spergel

    Abstract: We present the first simulation-based inference (SBI) of cosmological parameters from field-level analysis of galaxy clustering. Standard galaxy clustering analyses rely on analyzing summary statistics, such as the power spectrum, $P_\ell$, with analytic models based on perturbation theory. Consequently, they do not fully exploit the non-linear and non-Gaussian features of the galaxy distribution.…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 14 pages, 4 figures. A previous version of the paper was published in the ICML 2023 Workshop on Machine Learning for Astrophysics

  8. arXiv:2310.15250

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: Galaxy Clustering Analysis with the Wavelet Scattering Transform

    Authors: Bruno Régaldo-Saint Blancard, ChangHoon Hahn, Shirley Ho, Jiamin Hou, Pablo Lemos, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Liam Parker, Yuling Yao, Michael Eickenberg

    Abstract: The non-Gaussian spatial distribution of galaxies traces the large-scale structure of the Universe and therefore constitutes a prime observable to constrain cosmological parameters. We conduct Bayesian inference of the $Λ$CDM parameters $Ω_m$, $Ω_b$, $h$, $n_s$, and $σ_8$ from the BOSS CMASS galaxy sample by combining the wavelet scattering transform (WST) with a simulation-based inference approa…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 11+5 pages, 8+2 figures

  9. arXiv:2310.15246

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: The First Cosmological Constraints from Non-Gaussian and Non-Linear Galaxy Clustering

    Authors: ChangHoon Hahn, Pablo Lemos, Liam Parker, Bruno Régaldo-Saint Blancard, Michael Eickenberg, Shirley Ho, Jiamin Hou, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, David Spergel

    Abstract: The 3D distribution of galaxies encodes detailed cosmological information on the expansion and growth history of the Universe. We present the first cosmological constraints that exploit non-Gaussian cosmological information on non-linear scales from galaxy clustering, inaccessible with current standard analyses. We analyze a subset of the BOSS galaxy survey using ${\rm S{\scriptsize IM}BIG}$, a ne…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 13 pages, 5 figures, submitted to Nature Astronomy, comments welcome

  10. arXiv:2310.15243

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: The First Cosmological Constraints from the Non-Linear Galaxy Bispectrum

    Authors: ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Jiamin Hou, Pablo Lemos, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Liam Parker, Bruno Régaldo-Saint Blancard

    Abstract: We present the first cosmological constraints from analyzing higher-order galaxy clustering on non-linear scales. We use ${\rm S{\scriptsize IM}BIG}$, a forward modeling framework for galaxy clustering analyses that employs simulation-based inference to perform highly efficient cosmological inference using normalizing flows. It leverages the predictive power of high-fidelity simulations and robust…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 13 pages, 7 figures, submitted to PRD, comments welcome

  11. arXiv:2310.03024

    astro-ph.IM cs.AI cs.LG

    AstroCLIP: A Cross-Modal Foundation Model for Galaxies

    Authors: Liam Parker, Francois Lanusse, Siavash Golkar, Leopoldo Sarra, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe, Ruben Ohana, Mariel Pettee, Bruno Regaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation fro…

    Submitted 14 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted in Monthly Notices of the Royal Astronomical Society, Presented at the NeurIPS 2023 AI4Science Workshop

  12. arXiv:2310.02994

    cs.LG cs.AI stat.ML

    Multiple Physics Pretraining for Physical Surrogate Models

    Authors: Michael McCabe, Bruno Régaldo-Saint Blancard, Liam Holden Parker, Ruben Ohana, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Siavash Golkar, Geraud Krawezik, Francois Lanusse, Mariel Pettee, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We introduce multiple physics pretraining (MPP), an autoregressive task-agnostic pretraining approach for physical surrogate modeling. MPP involves training large surrogate models to predict the dynamics of multiple heterogeneous physical systems simultaneously by learning features that are broadly useful across diverse physical tasks. In order to learn effectively in this setting, we introduce a…

    Submitted 4 October, 2023; originally announced October 2023.

  13. arXiv:2310.02989

    stat.ML cs.AI cs.CL cs.LG

    xVal: A Continuous Number Encoding for Large Language Models

    Authors: Siavash Golkar, Mariel Pettee, Michael Eickenberg, Alberto Bietti, Miles Cranmer, Geraud Krawezik, Francois Lanusse, Michael McCabe, Ruben Ohana, Liam Parker, Bruno Régaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: Large Language Models have not yet been broadly adapted for the analysis of scientific datasets due in part to the unique difficulties of tokenizing numbers. We propose xVal, a numerical encoding scheme that represents any real number using just a single token. xVal represents a given real number by scaling a dedicated embedding vector by the number value. Combined with a modified number-inference…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 10 pages, 7 figures. Supplementary: 5 pages, 2 figures

  14. arXiv:2307.14362

    astro-ph.IM astro-ph.CO cs.LG

    Learnable wavelet neural networks for cosmological inference

    Authors: Christian Pedersen, Michael Eickenberg, Shirley Ho

    Abstract: Convolutional neural networks (CNNs) have been shown to both extract more information than the traditional two-point statistics from cosmological fields, and marginalise over astrophysical effects extremely well. However, CNNs require large amounts of training data, which is potentially problematic in the domain of expensive cosmological simulations, and it is difficult to interpret the network. I…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at ICML 2022 Workshop on Machine Learning for Astrophysics, Baltimore, Maryland, USA, 2022

  15. arXiv:2306.15012

    stat.ML astro-ph.IM cs.LG eess.SP

    Statistical Component Separation for Targeted Signal Recovery in Noisy Mixtures

    Authors: Bruno Régaldo-Saint Blancard, Michael Eickenberg

    Abstract: Separating signals from an additive mixture may be an unnecessarily hard problem when one is only interested in specific properties of a given signal. In this work, we tackle simpler "statistical component separation" problems that focus on recovering a predefined set of statistical descriptors of a target signal from a noisy mixture. Assuming access to samples of the noise process, we investigate…

    Submitted 28 February, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 13+17 pages, 6+8 figures, published in TMLR, code: https://github.com/bregaldo/stat_comp_sep

  16. arXiv:2306.07397

    cs.LG cs.CV

    Adversarial Attacks on the Interpretation of Neuron Activation Maximization

    Authors: Geraldin Nanfack, Alexander Fulleringer, Jonathan Marty, Michael Eickenberg, Eugene Belilovsky

    Abstract: The internal functional behavior of trained Deep Neural Networks is notoriously difficult to interpret. Activation-maximization approaches are one set of techniques used to interpret and analyze trained deep-learning models. These consist in finding inputs that maximally activate a given neuron or feature map. These inputs can be selected from a data set or obtained by optimization. However, inter…

    Submitted 12 June, 2023; originally announced June 2023.

  17. arXiv:2306.06968

    cs.LG cs.CV cs.NE stat.ML

    Can Forward Gradient Match Backpropagation?

    Authors: Louis Fournier, Stéphane Rivaud, Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon

    Abstract: Forward Gradients - the idea of using directional derivatives in forward differentiation mode - have recently been shown to be utilizable for neural network training while avoiding problems generally associated with backpropagation gradient computation, such as locking and memorization requirements. The cost is the requirement to guess the step direction, which is hard in high dimensions. While c…

    Submitted 12 June, 2023; originally announced June 2023.

    Journal ref: Fortieth International Conference on Machine Learning, Jul 2023, Honolulu, Hawaii, USA

  18. arXiv:2305.07583

    cs.LG math.OC

    MoMo: Momentum Models for Adaptive Learning Rates

    Authors: Fabian Schaipp, Ruben Ohana, Michael Eickenberg, Aaron Defazio, Robert M. Gower

    Abstract: Training a modern machine learning architecture on a new task requires extensive learning-rate tuning, which comes at a high computational cost. Here we develop new Polyak-type adaptive learning rates that can be used on top of any momentum method, and require less tuning to perform well. We first develop MoMo, a Momentum Model based adaptive learning rate for SGD-M (stochastic gradient descent wi…

    Submitted 5 June, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    MSC Class: 90C53; 74S60; 90C06; 62L20; 68W20; 15B52; 65Y20; 68W40 ACM Class: G.1.6

  19. arXiv:2301.07635

    cs.LG cs.NE

    Local Learning with Neuron Groups

    Authors: Adeetya Patel, Michael Eickenberg, Eugene Belilovsky

    Abstract: Traditional deep network training methods optimize a monolithic objective function jointly for all the components. This can lead to various inefficiencies in terms of potential parallelization. Local learning is an approach to model-parallelism that removes the standard end-to-end learning setup and utilizes local objective functions to permit parallel learning amongst model components in a deep n…

    Submitted 18 January, 2023; originally announced January 2023.

  20. arXiv:2211.00723

    astro-ph.CO

    ${\rm S{\scriptsize IM}BIG}$: A Forward Modeling Approach To Analyzing Galaxy Clustering

    Authors: ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Jiamin Hou, Pablo Lemos, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Bruno Régaldo-Saint Blancard, Muntazir M. Abidi

    Abstract: We present the first-ever cosmological constraints from a simulation-based inference (SBI) analysis of galaxy clustering from the new ${\rm S{\scriptsize IM}BIG}$ forward modeling framework. ${\rm S{\scriptsize IM}BIG}$ leverages the predictive power of high-fidelity simulations and provides an inference framework that can extract cosmological information on small non-linear scales, inaccessible w…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 9 pages, 5 figures

  21. ${\rm S{\scriptsize IM}BIG}$: Mock Challenge for a Forward Modeling Approach to Galaxy Clustering

    Authors: ChangHoon Hahn, Michael Eickenberg, Shirley Ho, Jiamin Hou, Pablo Lemos, Elena Massara, Chirag Modi, Azadeh Moradinezhad Dizgah, Bruno Régaldo-Saint Blancard, Muntazir M. Abidi

    Abstract: Simulation-Based Inference of Galaxies (${\rm S{\scriptsize IM}BIG}$) is a forward modeling framework for analyzing galaxy clustering using simulation-based inference. In this work, we present the ${\rm S{\scriptsize IM}BIG}$ forward model, which is designed to match the observed SDSS-III BOSS CMASS galaxy sample. The forward model is based on high-resolution ${\rm Q{\scriptsize UIJOTE}}$ $N$-body…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 28 pages, 6 figures

  22. arXiv:2210.14273

    astro-ph.CO

    Towards a non-Gaussian Generative Model of large-scale Reionization Maps

    Authors: Yu-Heng Lin, Sultan Hassan, Bruno Régaldo-Saint Blancard, Michael Eickenberg, Chirag Modi

    Abstract: High-dimensional data sets are expected from the next generation of large-scale surveys. These data sets will carry a wealth of information about the early stages of galaxy formation and cosmic reionization. Extracting the maximum amount of information from these data sets remains a key challenge. Current simulations of cosmic reionization are computationally too expensive to provide enough re…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: 7 pages, 3 figures, accepted at the Machine Learning and the Physical Sciences workshop at NeurIPS 2022

  23. arXiv:2208.03538

    astro-ph.CO astro-ph.GA astro-ph.IM

    Generative Models of Multi-channel Data from a Single Example -- Application to Dust Emission

    Authors: Bruno Régaldo-Saint Blancard, Erwan Allys, Constant Auclair, François Boulanger, Michael Eickenberg, François Levrier, Léo Vacher, Sixin Zhang

    Abstract: The quest for primordial $B$-modes in the cosmic microwave background has emphasized the need for refined models of the Galactic dust foreground. Here, we aim at building a realistic statistical model of the multi-frequency dust emission from a single example. We introduce a generic methodology relying on microcanonical gradient descent models conditioned by an extended family of wavelet phase har…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 18 pages, 7 figures, submitted to ApJ, code: https://github.com/bregaldo/dust_genmodels

  24. arXiv:2207.08435

    astro-ph.CO cs.LG

    Robust Simulation-Based Inference in Cosmology with Bayesian Neural Networks

    Authors: Pablo Lemos, Miles Cranmer, Muntazir Abidi, ChangHoon Hahn, Michael Eickenberg, Elena Massara, David Yallup, Shirley Ho

    Abstract: Simulation-based inference (SBI) is rapidly establishing itself as a standard machine learning technique for analyzing data in cosmological surveys. Despite continual improvements to the quality of density estimation by learned models, applications of such techniques to real data are entirely reliant on the generalization power of neural networks far outside the training distribution, which is mos…

    Submitted 2 March, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: 5 pages, 3 figures. Preliminary version accepted at the ML4Astro Machine Learning for Astrophysics Workshop at the Thirty-ninth International Conference on Machine Learning (ICML 2022). Final version published at Machine Learning: Science and Technology

    Journal ref: Mach. Learn.: Sci. Technol. 4 01LT01 (2023)

  25. arXiv:2207.04616

    physics.flu-dyn

    TNT: Vision Transformer for Turbulence Simulations

    Authors: Yuchen Dang, Zheyuan Hu, Miles Cranmer, Michael Eickenberg, Shirley Ho

    Abstract: Turbulence is notoriously difficult to model due to its multi-scale nature and sensitivity to small perturbations. Classical solvers of turbulence simulation generally operate on finer grids and are computationally inefficient. In this paper, we propose the Turbulence Neural Transformer (TNT), which is a learned simulator based on the transformer architecture, to predict turbulent dynamics on coar…

    Submitted 11 July, 2022; originally announced July 2022.

  26. Cosmological Information in the Marked Power Spectrum of the Galaxy Field

    Authors: Elena Massara, Francisco Villaescusa-Navarro, ChangHoon Hahn, Muntazir M. Abidi, Michael Eickenberg, Shirley Ho, Pablo Lemos, Azadeh Moradinezhad Dizgah, Bruno Régaldo-Saint Blancard

    Abstract: Marked power spectra are two-point statistics of a marked field obtained by weighting each location with a function that depends on the local density around that point. We consider marked power spectra of the galaxy field in redshift space that up-weight low density regions, and perform a Fisher matrix analysis to assess the information content of this type of statistics using the Molino mock cata…

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: 19 pages, 12 figures

  27. arXiv:2204.07646

    astro-ph.CO stat.AP

    Wavelet Moments for Cosmological Parameter Estimation

    Authors: Michael Eickenberg, Erwan Allys, Azadeh Moradinezhad Dizgah, Pablo Lemos, Elena Massara, Muntazir Abidi, ChangHoon Hahn, Sultan Hassan, Bruno Regaldo-Saint Blancard, Shirley Ho, Stephane Mallat, Joakim Andén, Francisco Villaescusa-Navarro

    Abstract: Extracting non-Gaussian information from the non-linear regime of structure formation is key to fully exploiting the rich data from upcoming cosmological surveys probing the large-scale structure of the universe. However, due to theoretical and computational complexities, this remains one of the main challenges in analyzing observational data. We present a set of summary statistics for cosmologica…

    Submitted 15 April, 2022; originally announced April 2022.

  28. arXiv:2201.01300

    astro-ph.CO astro-ph.GA astro-ph.IM cs.AI cs.LG

    The CAMELS project: public data release

    Authors: Francisco Villaescusa-Navarro, Shy Genel, Daniel Anglés-Alcázar, Lucia A. Perez, Pablo Villanueva-Domingo, Digvijay Wadekar, Helen Shao, Faizan G. Mohammad, Sultan Hassan, Emily Moser, Erwin T. Lau, Luis Fernando Machado Poletti Valle, Andrina Nicola, Leander Thiele, Yongseok Jo, Oliver H. E. Philcox, Benjamin D. Oppenheimer, Megan Tillman, ChangHoon Hahn, Neerav Kaushal, Alice Pisani, Matthew Gebhardt, Ana Maria Delgado, Joyce Caliendo, Christina Kreisch, et al. (22 additional authors not shown)

    Abstract: The Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) project was developed to combine cosmology with astrophysics through thousands of cosmological hydrodynamic simulations and machine learning. CAMELS contains 4,233 cosmological simulations, 2,049 N-body and 2,184 state-of-the-art hydrodynamic simulations that sample a vast volume in parameter space. In this paper we present…

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: 18 pages, 3 figures. More than 350 Tb of data from thousands of simulations publicly available at https://www.camel-simulations.org

  29. arXiv:2110.02983

    astro-ph.CO astro-ph.GA astro-ph.IM

    HIFlow: Generating Diverse HI Maps and Inferring Cosmology while Marginalizing over Astrophysics using Normalizing Flows

    Authors: Sultan Hassan, Francisco Villaescusa-Navarro, Benjamin Wandelt, David N. Spergel, Daniel Anglés-Alcázar, Shy Genel, Miles Cranmer, Greg L. Bryan, Romeel Davé, Rachel S. Somerville, Michael Eickenberg, Desika Narayanan, Shirley Ho, Sambatra Andrianomena

    Abstract: A wealth of cosmological and astrophysical information is expected from many ongoing and upcoming large-scale surveys. It is crucial to prepare for these surveys now and develop tools that can efficiently extract most information. We present HIFlow: a fast generative model of the neutral hydrogen (HI) maps that is conditioned only on cosmology ($Ω_{m}$ and $σ_{8}$) and designed using a class of no…

    Submitted 18 August, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: updated: 14 pages, 10 figures, a new section on inference has been added during revision. Accepted for publication in ApJ

  30. arXiv:2109.10915

    cs.LG astro-ph.CO astro-ph.GA astro-ph.IM cs.CV

    The CAMELS Multifield Dataset: Learning the Universe's Fundamental Parameters with Artificial Intelligence

    Authors: Francisco Villaescusa-Navarro, Shy Genel, Daniel Angles-Alcazar, Leander Thiele, Romeel Dave, Desika Narayanan, Andrina Nicola, Yin Li, Pablo Villanueva-Domingo, Benjamin Wandelt, David N. Spergel, Rachel S. Somerville, Jose Manuel Zorrilla Matilla, Faizan G. Mohammad, Sultan Hassan, Helen Shao, Digvijay Wadekar, Michael Eickenberg, Kaze W. K. Wong, Gabriella Contardo, Yongseok Jo, Emily Moser, Erwin T. Lau, Luis Fernando Machado Poletti Valle, Lucia A. Perez, et al. (3 additional authors not shown)

    Abstract: We present the Cosmology and Astrophysics with MachinE Learning Simulations (CAMELS) Multifield Dataset, CMD, a collection of hundreds of thousands of 2D maps and 3D grids containing many different properties of cosmic gas, dark matter, and stars from 2,000 distinct simulated universes at several cosmic times. The 2D maps and 3D grids represent cosmic regions that span $\sim$100 million light year…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: 17 pages, 1 figure. Third paper of a series of four. Hundreds of thousands of labeled 2D maps and 3D grids from thousands of simulated universes publicly available at https://camels-multifield-dataset.readthedocs.io

  31. arXiv:2107.09539

    cs.LG eess.SP

    Parametric Scattering Networks

    Authors: Shanel Gauthier, Benjamin Thérien, Laurent Alsène-Racicot, Muawiz Chaudhary, Irina Rish, Eugene Belilovsky, Michael Eickenberg, Guy Wolf

    Abstract: The wavelet scattering transform creates geometric invariants and deformation stability. In multiple signal domains, it has been shown to yield more discriminative representations compared to other non-learned representations and to outperform learned representations in certain tasks, particularly on limited labeled data and highly structured signals. The wavelet filters used in the scattering tra…

    Submitted 15 August, 2022; v1 submitted 20 July, 2021; originally announced July 2021.

    ACM Class: F.2.2; I.2.7

  32. arXiv:2106.06401

    cs.LG cs.DC

    Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning

    Authors: Eugene Belilovsky, Louis Leconte, Lucas Caccia, Michael Eickenberg, Edouard Oyallon

    Abstract: A commonly cited inefficiency of neural network training using back-propagation is the update locking problem: each layer must wait for the signal to propagate through the full network before updating. Several alternatives that can alleviate this issue have been proposed. In this context, we consider a simple alternative based on minimal feedback, which we call Decoupled Greedy Learning (DGL). It…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.08164

  33. arXiv:2012.07386

    cs.LG cs.CV physics.optics stat.ML

    Phase Retrieval with Holography and Untrained Priors: Tackling the Challenges of Low-Photon Nanoscale Imaging

    Authors: Hannah Lawrence, David A. Barmherzig, Henry Li, Michael Eickenberg, Marylou Gabrié

    Abstract: Phase retrieval is the inverse problem of recovering a signal from magnitude-only Fourier measurements, and underlies numerous imaging modalities, such as Coherent Diffraction Imaging (CDI). A variant of this setup, known as holography, includes a reference object that is placed adjacent to the specimen of interest before measurements are collected. The resulting inverse problem, known as holograp…

    Submitted 20 April, 2021; v1 submitted 14 December, 2020; originally announced December 2020.

  34. arXiv:1901.08164

    cs.LG stat.ML

    Decoupled Greedy Learning of CNNs

    Authors: Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon

    Abstract: A commonly cited inefficiency of neural network training by back-propagation is the update locking problem: each layer must wait for the signal to propagate through the full network before updating. Several alternatives that can alleviate this issue have been proposed. In this context, we consider a simpler, but more effective, substitute that uses minimal feedback, which we call Decoupled Greedy…

    Submitted 19 June, 2020; v1 submitted 23 January, 2019; originally announced January 2019.

  35. arXiv:1812.11446

    cs.LG stat.ML

    Greedy Layerwise Learning Can Scale to ImageNet

    Authors: Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon

    Abstract: Shallow supervised 1-hidden layer neural networks have a number of favorable properties that make them easier to interpret, analyze, and optimize than their deep counterparts, but lack their representational power. Here we use 1-hidden layer learning problems to sequentially build deep networks layer by layer, which can inherit properties from shallow networks. Contrary to previous approaches usin…

    Submitted 23 April, 2019; v1 submitted 29 December, 2018; originally announced December 2018.

  36. arXiv:1812.11214

    cs.LG cs.CV cs.SD eess.AS stat.ML

    Kymatio: Scattering Transforms in Python

    Authors: Mathieu Andreux, Tomás Angles, Georgios Exarchakis, Roberto Leonarduzzi, Gaspar Rochette, Louis Thiry, John Zarka, Stéphane Mallat, Joakim Andén, Eugene Belilovsky, Joan Bruna, Vincent Lostanlen, Muawiz Chaudhary, Matthew J. Hirn, Edouard Oyallon, Sixin Zhang, Carmine Cella, Michael Eickenberg

    Abstract: The wavelet scattering transform is an invariant signal representation suitable for many signal processing and machine learning applications. We present the Kymatio software package, an easy-to-use, high-performance Python implementation of the scattering transform in 1D, 2D, and 3D that is compatible with modern deep learning frameworks. All transforms may be executed on a GPU (in addition to CPU…

    Submitted 31 May, 2022; v1 submitted 28 December, 2018; originally announced December 2018.

  37. arXiv:1805.00571

    physics.chem-ph cs.CE cs.LG stat.ML

    Solid Harmonic Wavelet Scattering for Predictions of Molecule Properties

    Authors: Michael Eickenberg, Georgios Exarchakis, Matthew Hirn, Stéphane Mallat, Louis Thiry

    Abstract: We present a machine learning algorithm for the prediction of molecule properties inspired by ideas from density functional theory. Using Gaussian-type orbital functions, we create surrogate electronic densities of the molecule from which we compute invariant "solid harmonic scattering coefficients" that account for different types of interactions at different scales. Multi-linear regressions of v…

    Submitted 1 May, 2018; originally announced May 2018.

    Comments: Keywords: wavelets, electronic structure calculations, solid harmonics, invariants, multilinear regression

    Journal ref: J. Chem. Phys. 148, 241732 (2018)

  38. arXiv:1708.09762  [pdf, other]

    stat.AP

    Gaussian Processes for HRF estimation for BOLD fMRI

    Authors: Michael Eickenberg, Aina Frau-Pascual, Andrés Hoyos-Idrobo

    Abstract: We present a non-parametric joint estimation method for fMRI task activation values and the hemodynamic response function (HRF). The HRF is modeled as a Gaussian process, making continuous evaluation possible for jittered paradigms and providing a variance estimate at each point.

    Submitted 31 August, 2017; originally announced August 2017.

  39. arXiv:1512.06999  [pdf, ps, other]

    q-bio.NC cs.LG stat.CO stat.ML

    FAASTA: A fast solver for total-variation regularization of ill-conditioned problems with application to brain imaging

    Authors: Gaël Varoquaux, Michael Eickenberg, Elvis Dohmatob, Bertrand Thirion

    Abstract: The total variation (TV) penalty, as many other analysis-sparsity problems, does not lead to separable factors or a proximal operator with a closed-form expression, such as soft thresholding for the $\ell_1$ penalty. As a result, in a variational formulation of an inverse problem or statistical learning estimation, it leads to challenging non-smooth optimization problems that are often solved with e…

    Submitted 22 December, 2015; originally announced December 2015.

    Journal ref: Colloque GRETSI, Sep 2015, Lyon, France. Gretsi, 2015, http://www.gretsi.fr/colloque2015/myGretsi/programme.php

  40. arXiv:1412.3919  [pdf, other]

    cs.LG cs.CV stat.ML

    Machine Learning for Neuroimaging with Scikit-Learn

    Authors: Alexandre Abraham, Fabian Pedregosa, Michael Eickenberg, Philippe Gervais, Andreas Muller, Jean Kossaifi, Alexandre Gramfort, Bertrand Thirion, Gaël Varoquaux

    Abstract: Statistical machine learning methods are increasingly used for neuroimaging data analysis. Their main virtue is their ability to model high-dimensional datasets, e.g. multivariate analysis of activation images or resting-state time series. Supervised learning is typically used in decoding or encoding settings to relate brain images to behavioral or clinical observations, while unsupervised learnin…

    Submitted 12 December, 2014; originally announced December 2014.

    Comments: Frontiers in neuroscience, Frontiers Research Foundation, 2013, pp.15

  41. Data-driven HRF estimation for encoding and decoding models

    Authors: Fabian Pedregosa, Michael Eickenberg, Philippe Ciuciu, Bertrand Thirion, Alexandre Gramfort

    Abstract: Despite the common usage of a canonical, data-independent, hemodynamic response function (HRF), it is known that the shape of the HRF varies across brain regions and subjects. This suggests that a data-driven estimation of this function could lead to more statistical power when modeling BOLD fMRI data. However, unconstrained estimation of the HRF can yield highly unstable results when the number o…

    Submitted 7 November, 2014; v1 submitted 27 February, 2014; originally announced February 2014.

    Comments: appears in NeuroImage (2015)

  42. arXiv:1310.1257  [pdf, other]

    cs.CV

    Second order scattering descriptors predict fMRI activity due to visual textures

    Authors: Michael Eickenberg, Fabian Pedregosa, Senoussi Mehdi, Alexandre Gramfort, Bertrand Thirion

    Abstract: Second layer scattering descriptors are known to provide good classification performance on natural quasi-stationary processes such as visual textures due to their sensitivity to higher order moments and continuity with respect to small deformations. In a functional Magnetic Resonance Imaging (fMRI) experiment we present visual textures to subjects and evaluate the predictive power of these descri…

    Submitted 10 August, 2013; originally announced October 2013.

    Comments: 3rd International Workshop on Pattern Recognition in NeuroImaging (2013)

  43. arXiv:1305.2788  [pdf, other]

    cs.LG stat.AP

    HRF estimation improves sensitivity of fMRI encoding and decoding models

    Authors: Fabian Pedregosa, Michael Eickenberg, Bertrand Thirion, Alexandre Gramfort

    Abstract: Extracting activation patterns from functional Magnetic Resonance Images (fMRI) datasets remains challenging in rapid-event designs due to the inherent delay of blood oxygen level-dependent (BOLD) signal. The general linear model (GLM) allows the activation to be estimated from a design matrix and a fixed hemodynamic response function (HRF). However, the HRF is known to vary substantially between subj…

    Submitted 13 May, 2013; originally announced May 2013.

    Comments: 3rd International Workshop on Pattern Recognition in NeuroImaging (2013)