Search | arXiv e-print repository

El Gordo needs El Anzuelo: Probing the structure of cluster members with multi-band extended arcs in JWST data

Authors: A. Galan, G. B. Caminha, J. Knollmüller, J. Roth, S. H. Suyu

Abstract: Gravitational lensing by galaxy clusters involves hundreds of galaxies over a large redshift range and increases the likelihood of rare phenomena (supernovae, microlensing, dark substructures, etc.). Characterizing the mass and light distributions of foreground and background objects often requires a combination of high-resolution data and advanced modeling techniques. We present the detailed anal… ▽ More Gravitational lensing by galaxy clusters involves hundreds of galaxies over a large redshift range and increases the likelihood of rare phenomena (supernovae, microlensing, dark substructures, etc.). Characterizing the mass and light distributions of foreground and background objects often requires a combination of high-resolution data and advanced modeling techniques. We present the detailed analysis of El Anzuelo, a prominent quintuply imaged dusty star forming galaxy ($z_{\rm s}=2.29$), mainly lensed by three members of the massive galaxy cluster ACT-CL$\,$J0102$-$4915, also known as El Gordo ($z_{\rm d}=0.87$). We leverage JWST/NIRCam data containing previously unseen lensing features using a Bayesian, multi-wavelength, differentiable and GPU-accelerated modeling framework that combines Herculens (lens modeling) and NIFTy (field model and inference) software packages. For one of the deflectors, we complement lensing constraints with stellar kinematics measured from VLT/MUSE data. In our lens model, we explicitly include the mass distribution of the cluster, locally corrected by a constant shear field. We find that the two main deflectors (L1 and L2) have logarithmic mass density slopes steeper than isothermal, with $γ_{\rm L1} = 2.23\pm0.05$ and $γ_{\rm L2} = 2.21\pm0.04$. We argue that such steep density profiles can arise due to tidally truncated mass distributions, which we probe thanks to the cluster lensing boost and the strong asymmetry of the lensing configuration. Moreover, our three-dimensional source model captures most of the surface brightness of the lensed galaxy, revealing a clump of at most $400$ parsecs at the source redshift, visible at wavelengths $λ_{\rm rest}\gtrsim0.6$ $μ$m. Finally, we caution on using point-like features within extended arcs to constrain galaxy-scale lens models before securing them with extended arc modeling. △ Less

Submitted 28 February, 2024; originally announced February 2024.

Comments: 26 pages

arXiv:2311.03042 [pdf, other]

Progress in the Partial-Wave Analysis Methods at COMPASS

Authors: Florian Markus Kaspar, Julien Beckers, Jakob Knollmüller

Abstract: We study the excitation spectrum of light and strange mesons in diffractive scattering. We identify different hadron resonances through partial wave analysis, which inherently relies on analysis models. Besides statistical uncertainties, the model dependence of the analysis introduces dominant systematic uncertainties. We discuss several of their sources for the $π^-π^-π^+$ and $K^0_S K^-$ final s… ▽ More We study the excitation spectrum of light and strange mesons in diffractive scattering. We identify different hadron resonances through partial wave analysis, which inherently relies on analysis models. Besides statistical uncertainties, the model dependence of the analysis introduces dominant systematic uncertainties. We discuss several of their sources for the $π^-π^-π^+$ and $K^0_S K^-$ final states and present methods to reduce them. We have developed a new approach exploiting a-priori knowledge of signal continuity over adjacent final-state-mass bins to stably fit a large pool of partial-waves to our data, allowing a clean identification of very small signals in our large data sets. For two-body final states of scalar particles, such as $K^0_S K^-$, mathematical ambiguities in the partial-wave decomposition lead to the same intensity distribution for different combinations of amplitude values. We will discuss these ambiguities and present solutions to resolve or at least reduce the number of possible solutions. Resolving these issues will allow for a complementary analysis of the $a_J$-like resonance sector in these two final states. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 4 pages, 6 figures, 17th International Workshop on Meson Physics

arXiv:2311.00449 [pdf, other]

Progress in the partial-wave analysis methods at COMPASS

Authors: Julien Beckers, Florian Kaspar, Jakob Knollmüller

Abstract: We study the excitation spectrum of light and strange mesons in diffractive scattering. We identify different hadron resonances through partial wave analysis, which inherently relies on analysis models. Besides statistical uncertainties, the model dependence of the analysis introduces dominant systematic uncertainties. We discuss several of their sources for the $π^-π^-π^+$ and $K^0_S K^-$ final s… ▽ More We study the excitation spectrum of light and strange mesons in diffractive scattering. We identify different hadron resonances through partial wave analysis, which inherently relies on analysis models. Besides statistical uncertainties, the model dependence of the analysis introduces dominant systematic uncertainties. We discuss several of their sources for the $π^-π^-π^+$ and $K^0_S K^-$ final states and present methods to reduce them. We have developed a new approach exploiting a-priori knowledge of signal continuity over adjacent final-state-mass bins to stably fit a large pool of partial-waves to our data, allowing a clean identification of very small signals in our large data sets. For two-body final states of scalar particles, such as $K^0_S K^-$, mathematical ambiguities in the partial-wave decomposition lead to the same intensity distribution for different combinations of amplitude values. We will discuss these ambiguities and present solutions to resolve or at least reduce the number of possible solutions. Resolving these issues will allow for a complementary analysis of the $a_J$-like resonance sector in these two final states. △ Less

Submitted 1 November, 2023; originally announced November 2023.

Comments: 5 pages, 6 figures. Proceedings of the 20th International Conference on Hadron Spectroscopy and Structure (HADRON 2023), to be published in Il Nuovo Cimento C, Colloquia and Communications in Physics

arXiv:2310.16889 [pdf, other]

Resolving Horizon-Scale Dynamics of Sagittarius A*

Authors: Jakob Knollmüller, Philipp Arras, Torsten Enßlin

Abstract: Sagittarius A* (Sgr A*), the supermassive black hole at the heart of our galaxy, provides unique opportunities to study black hole accretion, jet formation, and gravitational physics. The rapid structural changes in Sgr A*'s emission pose a significant challenge for traditional imaging techniques. We present dynamic reconstructions of Sgr A* using Event Horizon Telescope (EHT) data from April 6th… ▽ More Sagittarius A* (Sgr A*), the supermassive black hole at the heart of our galaxy, provides unique opportunities to study black hole accretion, jet formation, and gravitational physics. The rapid structural changes in Sgr A*'s emission pose a significant challenge for traditional imaging techniques. We present dynamic reconstructions of Sgr A* using Event Horizon Telescope (EHT) data from April 6th and 7th, 2017, analyzed with a one-minute temporal resolution with the Resolve framework. This Bayesian approach employs adaptive Gaussian Processes and Variational Inference for data-driven self-regularization. Our results not only fully confirm the initial findings by the EHT Collaboration for a time-averaged source but also reveal intricate details about the temporal dynamics within the black hole environment. We find an intriguing dynamic feature on April 6th that propagates in a clock-wise direction. Geometric modelling with ray-tracing, although not fully conclusive, indicates compatibility with high-inclination configurations of about $θ_o = 160^\circ$, as seen in other studies. △ Less

Submitted 25 October, 2023; originally announced October 2023.

arXiv:2308.09176 [pdf, other]

First spatio-spectral Bayesian imaging of SN1006 in X-ray

Authors: Margret Westerkamp, Vincent Eberle, Matteo Guardiani, Philipp Frank, Lukas Platz, Philipp Arras, Jakob Knollmüller, Julia Stadler, Torsten Enßlin

Abstract: Supernovae are an important source of energy in the interstellar medium. Young remnants of supernovae have a peak emission in the X-ray region, making them interesting objects for X-ray observations. In particular, the supernova remnant SN1006 is of great interest due to its historical record, proximity and brightness. It has therefore been studied by several X-ray telescopes. Improving the X-ray… ▽ More Supernovae are an important source of energy in the interstellar medium. Young remnants of supernovae have a peak emission in the X-ray region, making them interesting objects for X-ray observations. In particular, the supernova remnant SN1006 is of great interest due to its historical record, proximity and brightness. It has therefore been studied by several X-ray telescopes. Improving the X-ray imaging of this and other remnants is important but challenging as it requires to address a spatially varying instrument response in order to achieve a high signal-to-noise ratio. Here, we use Chandra observations to demonstrate the capabilities of Bayesian image reconstruction using information field theory. Our objective is to reconstruct denoised, deconvolved and spatio-spectral resolved images from X-ray observations and to decompose the emission into different morphologies, namely diffuse and point-like. Further, we aim to fuse data from different detectors and pointings into a mosaic and quantify the uncertainty of our result. Utilizing prior knowledge on the spatial and spectral correlation structure of the two components, diffuse emission and point sources, the presented method allows the effective decomposition of the signal into these. In order to accelerate the imaging process, we introduce a multi-step approach, in which the spatial reconstruction obtained for a single energy range is used to derive an informed starting point for the full spatio-spectral reconstruction. The method is applied to 11 Chandra observations of SN1006 from 2008 and 2012, providing a detailed, denoised and decomposed view of the remnant. In particular, the separated view of the diffuse emission should provide new insights into its complex small-scale structures in the center of the remnant and at the shock front profiles. △ Less

Submitted 18 December, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

arXiv:2306.17667 [pdf, other]

doi 10.1016/j.nima.2024.169259

Bias-Free Estimation of Signals on Top of Unknown Backgrounds

Authors: Johannes Diehl, Jakob Knollmüller, Oliver Schulz

Abstract: We present a method for obtaining unbiased signal estimates in the presence of a significant unknown background, eliminating the need for a parametric model for the background itself. Our approach is based on a minimal set of conditions for observation and background estimators, which are typically satisfied in practical scenarios. To showcase the effectiveness of our method, we apply it to simula… ▽ More We present a method for obtaining unbiased signal estimates in the presence of a significant unknown background, eliminating the need for a parametric model for the background itself. Our approach is based on a minimal set of conditions for observation and background estimators, which are typically satisfied in practical scenarios. To showcase the effectiveness of our method, we apply it to simulated data from the planned dielectric axion haloscope MADMAX. △ Less

Submitted 25 March, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

Comments: 11 pages, 7 figures, 2 tables (v2 corresponds to the published version)

Report number: MPP-2023-138

Journal ref: NIM A 1063 (2024) 169259

arXiv:2212.11355 [pdf, other]

The ngEHT Analysis Challenges

Authors: Freek Roelofs, Lindy Blackburn, Greg Lindahl, Sheperd S. Doeleman, Michael D. Johnson, Philipp Arras, Koushik Chatterjee, Razieh Emami, Christian Fromm, Antonio Fuentes, Jakob Knollmueller, Nikita Kosogorov, Hendrik Mueller, Nimesh Patel, Alexander Raymond, Paul Tiede, Thalia Traianou, Justin Vega

Abstract: The next-generation Event Horizon Telescope (ngEHT) will be a significant enhancement of the Event Horizon Telescope (EHT) array, with $\sim 10$ new antennas and instrumental upgrades of existing antennas. The increased $uv$-coverage, sensitivity, and frequency coverage allow a wide range of new science opportunities to be explored. The ngEHT Analysis Challenges have been launched to inform develo… ▽ More The next-generation Event Horizon Telescope (ngEHT) will be a significant enhancement of the Event Horizon Telescope (EHT) array, with $\sim 10$ new antennas and instrumental upgrades of existing antennas. The increased $uv$-coverage, sensitivity, and frequency coverage allow a wide range of new science opportunities to be explored. The ngEHT Analysis Challenges have been launched to inform development of the ngEHT array design, science objectives, and analysis pathways. For each challenge, synthetic EHT and ngEHT datasets are generated from theoretical source models and released to the challenge participants, who analyze the datasets using image reconstruction and other methods. The submitted analysis results are evaluated with quantitative metrics. In this work, we report on the first two ngEHT Analysis Challenges. These have focused on static and dynamical models of M87* and Sgr A*, and shown that high-quality movies of the extended jet structure of M87* and near-horizon hourly timescale variability of Sgr A* can be reconstructed by the reference ngEHT array in realistic observing conditions, using current analysis algorithms. We identify areas where there is still room for improvement of these algorithms and analysis strategies. Other science cases and arrays will be explored in future challenges. △ Less

Submitted 21 December, 2022; originally announced December 2022.

Comments: 32 pages, 14 figures, accepted for publication in Galaxies

arXiv:2212.01804 [pdf, other]

doi 10.3390/galaxies11020038

Accretion Flow Morphology in Numerical Simulations of Black Holes from the ngEHT Model Library: The Impact of Radiation Physics

Authors: Koushik Chatterjee, Andrew Chael, Paul Tiede, Yosuke Mizuno, Razieh Emami, Christian Fromm, Angelo Ricarte, Lindy Blackburn, Freek Roelofs, Michael D. Johnson, Sheperd S. Doeleman, Philipp Arras, Antonio Fuentes, Jakob Knollmüller, Nikita Kosogorov, Greg Lindahl, Hendrik Müller, Nimesh Patel, Alexander Raymond, Efthalia Traianou, Justin Vega

Abstract: In the past few years, the Event Horizon Telescope (EHT) has provided the first-ever event horizon-scale images of the supermassive black holes (BHs) (M87*) and Sagittarius A$^*$ (Sgr A*). The next-generation EHT project is an extension of the EHT array that promises larger angular resolution and higher sensitivity to the dim, extended flux around the central ring-like structure, possibly connecti… ▽ More In the past few years, the Event Horizon Telescope (EHT) has provided the first-ever event horizon-scale images of the supermassive black holes (BHs) (M87*) and Sagittarius A$^*$ (Sgr A*). The next-generation EHT project is an extension of the EHT array that promises larger angular resolution and higher sensitivity to the dim, extended flux around the central ring-like structure, possibly connecting the accretion flow and the jet. The ngEHT Analysis Challenges aim to understand the science extractability from synthetic images and movies to inform the ngEHT array design and analysis algorithm development. In this work, we compare the accretion flow structure and dynamics in numerical fluid simulations that specifically target M87* and Sgr A*, and were used to construct the source models in the challenge set. We consider (1) a steady-state axisymmetric radiatively inefficient accretion flow model with a time-dependent shearing hotspot, (2) two time-dependent single fluid general relativistic magnetohydrodynamic (GRMHD) simulations from the H-AMR code, (3) a two-temperature GRMHD simulation from the BHAC code, and (4) a two-temperature radiative GRMHD simulation from the KORAL code. We find that the different models exhibit remarkably similar temporal and spatial properties, except for the electron temperature, since radiative losses substantially cool down electrons near the BH and the jet sheath, signaling the importance of radiative cooling even for slowly accreting BHs such as M87*. We restrict ourselves to standard torus accretion flows, and leave larger explorations of alternate accretion models to future work. △ Less

Submitted 7 March, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

Comments: Accepted in Galaxies; 23 pages, 7 figures

arXiv:2204.11715 [pdf, other]

The Galactic 3D large-scale dust distribution via Gaussian process regression on spherical coordinates

Authors: R. H. Leike, G. Edenhofer, J. Knollmüller, C. Alig, P. Frank, T. A. Enßlin

Abstract: Knowing the Galactic 3D dust distribution is relevant for understanding many processes in the interstellar medium and for correcting many astronomical observations for dust absorption and emission. Here, we aim for a 3D reconstruction of the Galactic dust distribution with an increase in the number of meaningful resolution elements by orders of magnitude with respect to previous reconstructions, w… ▽ More Knowing the Galactic 3D dust distribution is relevant for understanding many processes in the interstellar medium and for correcting many astronomical observations for dust absorption and emission. Here, we aim for a 3D reconstruction of the Galactic dust distribution with an increase in the number of meaningful resolution elements by orders of magnitude with respect to previous reconstructions, while taking advantage of the dust's spatial correlations to inform the dust map. We use iterative grid refinement to define a log-normal process in spherical coordinates. This log-normal process assumes a fixed correlation structure, which was inferred in an earlier reconstruction of Galactic dust. Our map is informed through 111 Million data points, combining data of PANSTARRS, 2MASS, Gaia DR2 and ALLWISE. The log-normal process is discretized to 122 Billion degrees of freedom, a factor of 400 more than our previous map. We derive the most probable posterior map and an uncertainty estimate using natural gradient descent and the Fisher-Laplace approximation. The dust reconstruction covers a quarter of the volume of our Galaxy, with a maximum coordinate distance of $16\,\text{kpc}$, and meaningful information can be found up to at distances of $4\,$kpc, still improving upon our earlier map by a factor of 5 in maximal distance, of $900$ in volume, and of about eighteen in angular grid resolution. Unfortunately, the maximum posterior approach chosen to make the reconstruction computational affordable introduces artifacts and reduces the accuracy of our uncertainty estimate. Despite of the apparent limitations of the presented 3D dust map, a good part of the reconstructed structures are confirmed by independent maser observations. Thus, the map is a step towards reliable 3D Galactic cartography and already can serve for a number of tasks, if used with care. △ Less

Submitted 25 April, 2022; originally announced April 2022.

arXiv:2204.09360 [pdf, other]

doi 10.1051/0004-6361/202243819

Multi-Component Imaging of the Fermi Gamma-ray Sky in the Spatio-spectral Domain

Authors: Lukas I. Platz, Jakob Knollmüller, Philipp Arras, Philipp Frank, Martin Reinecke, Dominik Jüstel, Torsten A. Enßlin

Abstract: We perform two distinct spatio-spectral reconstructions of the gamma-ray sky in the range of 0.56-316 GeV based on Fermi Large Area Telescope (LAT) data. Both describe the sky brightness to be composed of a diffuse-emission and a point-source component. The first model requires minimal assumptions and provides a template-free reconstruction as a reference. It makes use of spatial and spectral corr… ▽ More We perform two distinct spatio-spectral reconstructions of the gamma-ray sky in the range of 0.56-316 GeV based on Fermi Large Area Telescope (LAT) data. Both describe the sky brightness to be composed of a diffuse-emission and a point-source component. The first model requires minimal assumptions and provides a template-free reconstruction as a reference. It makes use of spatial and spectral correlations to distinguish between the different components. The second model is physics-informed and further differentiates between diffuse emission of hadronic and leptonic origin. For this, we assume parametric, but spatially varying energy spectra to distinguish between the processes and use thermal Galactic dust observations to indicate the preferred sites of hadronic interactions. To account for instrumental effects we model the point-spread, the energy dispersion, and the exposure of the telescope throughout the observation. The reconstruction problem is formulated as a Bayesian inference task, that is solved by variational inference. We show decompositions of the Gamma-ray flux into diffuse and point-like emissions, and of the diffuse emissions into multiple physically motivated components. The diffuse decomposition provides an unprecedented view of the Galactic leptonic diffuse emission. It shows the Fermi bubbles and their spectral variations in high fidelity and other areas exhibiting strong cosmic ray electron contents, such as a thick disk in the inner Galaxy and outflow regions. Furthermore, we report a hard spectrum gamma ray arc in the northern outer bubble co-spatial with the reported X-ray arc by the eROSITA collaboration. All our spatio-spectral sky reconstructions and their uncertainty quantification are publicly available. △ Less

Submitted 6 December, 2023; v1 submitted 20 April, 2022; originally announced April 2022.

Comments: Please see the journal version for a language-edited release of this manuscript (open-access): https://doi.org/10.1051/0004-6361/202243819

Journal ref: A&A 680, A2 (2023)

arXiv:2105.13393 [pdf, other]

doi 10.3390/psf2022005012

Classification and Uncertainty Quantification of Corrupted Data using Semi-Supervised Autoencoders

Authors: Philipp Joppich, Sebastian Dorn, Oliver De Candido, Wolfgang Utschick, Jakob Knollmüller

Abstract: Parametric and non-parametric classifiers often have to deal with real-world data, where corruptions like noise, occlusions, and blur are unavoidable - posing significant challenges. We present a probabilistic approach to classify strongly corrupted data and quantify uncertainty, despite the model only having been trained with uncorrupted data. A semi-supervised autoencoder trained on uncorrupted… ▽ More Parametric and non-parametric classifiers often have to deal with real-world data, where corruptions like noise, occlusions, and blur are unavoidable - posing significant challenges. We present a probabilistic approach to classify strongly corrupted data and quantify uncertainty, despite the model only having been trained with uncorrupted data. A semi-supervised autoencoder trained on uncorrupted data is the underlying architecture. We use the decoding part as a generative model for realistic data and extend it by convolutions, masking, and additive Gaussian noise to describe imperfections. This constitutes a statistical inference task in terms of the optimal latent space activations of the underlying uncorrupted datum. We solve this problem approximately with Metric Gaussian Variational Inference (MGVI). The supervision of the autoencoder's latent space allows us to classify corrupted data directly under uncertainty with the statistically inferred latent space activations. Furthermore, we demonstrate that the model uncertainty strongly depends on whether the classification is correct or wrong, setting a basis for a statistical "lie detector" of the classification. Independent of that, we show that the generative model can optimally restore the uncorrupted datum by decoding the inferred latent space activations. △ Less

Submitted 20 April, 2023; v1 submitted 27 May, 2021; originally announced May 2021.

Journal ref: hysical Sciences Forum. 2022; 5(1):12

arXiv:2002.05218 [pdf, other]

doi 10.1038/s41550-021-01548-0

Variable structures in M87* from space, time and frequency resolved interferometry

Authors: Philipp Arras, Philipp Frank, Philipp Haim, Jakob Knollmüller, Reimar Leike, Martin Reinecke, Torsten Enßlin

Abstract: Observing the dynamics of compact astrophysical objects provides insights into their inner workings, thereby probing physics under extreme conditions. The immediate vicinity of an active supermassive black hole with its event horizon, photon ring, accretion disk, and relativistic jets is a perfect place to study general relativity and magneto-hydrodynamics. The observations of M87* with Very Long… ▽ More Observing the dynamics of compact astrophysical objects provides insights into their inner workings, thereby probing physics under extreme conditions. The immediate vicinity of an active supermassive black hole with its event horizon, photon ring, accretion disk, and relativistic jets is a perfect place to study general relativity and magneto-hydrodynamics. The observations of M87* with Very Long Baseline Interferometry (VLBI) by the Event Horizon Telescope (EHT) allows to investigate its dynamical processes on time scales of days. Compared to regular radio interferometers, VLBI networks typically have fewer antennas and low signal to noise ratios (SNRs). Furthermore, the source is variable, prohibiting integration over time to improve SNR. Here, we present an imaging algorithm that copes with the data scarcity and temporal evolution, while providing uncertainty quantification. Our algorithm views the imaging task as a Bayesian inference problem of a time-varying brightness, exploits the correlation structure in time, and reconstructs a ${2+1+1}$ dimensional time-variable and spectrally resolved image at once. We apply this method to the EHT observation of M87* and validate our approach on synthetic data. The time- and frequency-resolved reconstruction of M87* confirms variable structures on the emission ring. The reconstruction indicates extended and time-variable emission structures outside the ring itself. △ Less

Submitted 5 June, 2022; v1 submitted 12 February, 2020; originally announced February 2020.

Comments: 32 pages, 17 figures, 6 tables

arXiv:2001.11031 [pdf, other]

doi 10.3390/e23060693

Bayesian Reasoning with Trained Neural Networks

Authors: Jakob Knollmüller, Torsten Enßlin

Abstract: We showed how to use trained neural networks to perform Bayesian reasoning in order to solve tasks outside their initial scope. Deep generative models provide prior knowledge, and classification/regression networks impose constraints. The tasks at hand were formulated as Bayesian inference problems, which we approximately solved through variational or sampling techniques. The approach built on top… ▽ More We showed how to use trained neural networks to perform Bayesian reasoning in order to solve tasks outside their initial scope. Deep generative models provide prior knowledge, and classification/regression networks impose constraints. The tasks at hand were formulated as Bayesian inference problems, which we approximately solved through variational or sampling techniques. The approach built on top of already trained networks, and the addressable questions grew super-exponentially with the number of available networks. In its simplest form, the approach yielded conditional generative models. However, multiple simultaneous constraints constitute elaborate questions. We compared the approach to specifically trained generators, showed how to solve riddles, and demonstrated its compatibility with state-of-the-art architectures. △ Less

Submitted 1 June, 2021; v1 submitted 29 January, 2020; originally announced January 2020.

Journal ref: Entropy 2021, 23(6), 693

arXiv:1901.11033 [pdf, other]

Metric Gaussian Variational Inference

Authors: Jakob Knollmüller, Torsten A. Enßlin

Abstract: Solving Bayesian inference problems approximately with variational approaches can provide fast and accurate results. Capturing correlation within the approximation requires an explicit parametrization. This intrinsically limits this approach to either moderately dimensional problems, or requiring the strongly simplifying mean-field approach. We propose Metric Gaussian Variational Inference (MGVI)… ▽ More Solving Bayesian inference problems approximately with variational approaches can provide fast and accurate results. Capturing correlation within the approximation requires an explicit parametrization. This intrinsically limits this approach to either moderately dimensional problems, or requiring the strongly simplifying mean-field approach. We propose Metric Gaussian Variational Inference (MGVI) as a method that goes beyond mean-field. Here correlations between all model parameters are taken into account, while still scaling linearly in computational time and memory. With this method we achieve higher accuracy and in many cases a significant speedup compared to traditional methods. MGVI is an iterative method that performs a series of Gaussian approximations to the posterior. We alternate between approximating the covariance with the inverse Fisher information metric evaluated at an intermediate mean estimate and optimizing the KL-divergence for the given covariance with respect to the mean. This procedure is iterated until the uncertainty estimate is self-consistent with the mean parameter. We achieve linear scaling by avoiding to store the covariance explicitly at any time. Instead we draw samples from the approximating distribution relying on an implicit representation and numerical schemes to approximately solve linear equations. Those samples are used to approximate the KL-divergence and its gradient. The usage of natural gradient descent allows for rapid convergence. Formulating the Bayesian model in standardized coordinates makes MGVI applicable to any inference problem with continuous parameters. We demonstrate the high accuracy of MGVI by comparing it to HMC and its fast convergence relative to other established methods in several examples. We investigate real-data applications, as well as synthetic examples of varying size and complexity and up to a million model parameters. △ Less

Submitted 30 January, 2020; v1 submitted 30 January, 2019; originally announced January 2019.

Comments: Code is part of NIFTy5 release at https://gitlab.mpcdf.mpg.de/ift/NIFTy

arXiv:1812.04403 [pdf, other]

Encoding prior knowledge in the structure of the likelihood

Authors: Jakob Knollmüller, Torsten A. Enßlin

Abstract: The inference of deep hierarchical models is problematic due to strong dependencies between the hierarchies. We investigate a specific transformation of the model parameters based on the multivariate distributional transform. This transformation is a special form of the reparametrization trick, flattens the hierarchy and leads to a standard Gaussian prior on all resulting parameters. The transform… ▽ More The inference of deep hierarchical models is problematic due to strong dependencies between the hierarchies. We investigate a specific transformation of the model parameters based on the multivariate distributional transform. This transformation is a special form of the reparametrization trick, flattens the hierarchy and leads to a standard Gaussian prior on all resulting parameters. The transformation also transfers all the prior information into the structure of the likelihood, hereby decoupling the transformed parameters a priori from each other. A variational Gaussian approximation in this standardized space will be excellent in situations of relatively uninformative data. Additionally, the curvature of the log-posterior is well-conditioned in directions that are weakly constrained by the data, allowing for fast inference in such a scenario. In an example we perform the transformation explicitly for Gaussian process regression with a priori unknown correlation structure. Deep models are inferred rapidly in highly and slowly in poorly informed situations. The flat model show exactly the opposite performance pattern. A synthesis of both, the deep and the flat perspective, provides their combined advantages and overcomes the individual limitations, leading to a faster inference. △ Less

Submitted 11 December, 2018; originally announced December 2018.

arXiv:1804.05591 [pdf, other]

Separating diffuse from point-like sources - a Bayesian approach

Authors: Jakob Knollmüller, Philipp Frank, Torsten A. Enßlin

Abstract: We present the starblade algorithm, a method to separate superimposed point sources from auto-correlated, diffuse flux using a Bayesian model. Point sources are assumed to be independent from each other and to follow a power-law brightness distribution. The diffuse emission is described as a non-parametric log-normal model with a priori unknown correlation structure. This model enforces positivity… ▽ More We present the starblade algorithm, a method to separate superimposed point sources from auto-correlated, diffuse flux using a Bayesian model. Point sources are assumed to be independent from each other and to follow a power-law brightness distribution. The diffuse emission is described as a non-parametric log-normal model with a priori unknown correlation structure. This model enforces positivity of the underlying emission and allows for variation in the order of magnitudes. The correlation structure is recovered non-parametrically in addition to the diffuse flux and is used for the separation of the point sources. Additionally many measurement artifacts appear as point-like or quasi-point-like effects, not compatible with superimposed diffuse emission. An estimate of the separation uncertainty can be provided as well. We demonstrate the capabilities of the derived method on synthetic data and data obtained by the Hubble Space Telescope, emphasizing its effect on instrumental artifacts as well as physical sources. The performance of this method is compared to the background estimation of the SExtractor method, as well as to a denoising auto-encoder. △ Less

Submitted 6 August, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

arXiv:1803.02174 [pdf, other]

doi 10.23919/EUSIPCO.2018.8553533

Radio Imaging With Information Field Theory

Authors: Philipp Arras, Jakob Knollmüller, Henrik Junklewitz, Torsten A. Enßlin

Abstract: Data from radio interferometers provide a substantial challenge for statisticians. It is incomplete, noise-dominated and originates from a non-trivial measurement process. The signal is not only corrupted by imperfect measurement devices but also from effects like fluctuations in the ionosphere that act as a distortion screen. In this paper we focus on the imaging part of data reduction in radio a… ▽ More Data from radio interferometers provide a substantial challenge for statisticians. It is incomplete, noise-dominated and originates from a non-trivial measurement process. The signal is not only corrupted by imperfect measurement devices but also from effects like fluctuations in the ionosphere that act as a distortion screen. In this paper we focus on the imaging part of data reduction in radio astronomy and present RESOLVE, a Bayesian imaging algorithm for radio interferometry in its new incarnation. It is formulated in the language of information field theory. Solely by algorithmic advances the inference could be sped up significantly and behaves noticeably more stable now. This is one more step towards a fully user-friendly version of RESOLVE which can be applied routinely by astronomers. △ Less

Submitted 6 March, 2018; originally announced March 2018.

Comments: 5 pages, 3 figures

Journal ref: 2018 26th European Signal Processing Conference (EUSIPCO)

arXiv:1711.02955 [pdf, other]

Inference of signals with unknown correlation structure from nonlinear measurements

Authors: Jakob Knollmüller, Theo Steininger, Torsten A. Enßlin

Abstract: We present a method to reconstruct autocorrelated signals together with their autocorrelation structure from nonlinear, noisy measurements for arbitrary monotonous nonlinear instrument response. In the presented formulation the algorithm provides a significant speedup compared to prior implementations, allowing for a wider range of application. The nonlinearity can be used to model instrument char… ▽ More We present a method to reconstruct autocorrelated signals together with their autocorrelation structure from nonlinear, noisy measurements for arbitrary monotonous nonlinear instrument response. In the presented formulation the algorithm provides a significant speedup compared to prior implementations, allowing for a wider range of application. The nonlinearity can be used to model instrument characteristics or to enforce properties on the underlying signal, such as positivity. Uncertainties on any posterior quantities can be provided due to independent samples from an approximate posterior distribution. We demonstrate the methods applicability via simulated and real measurements, using different measurement instruments, nonlinearities and dimensionality. △ Less

Submitted 13 February, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

arXiv:1708.01073 [pdf, other]

NIFTy 3 - Numerical Information Field Theory - A Python framework for multicomponent signal inference on HPC clusters

Authors: Theo Steininger, Jait Dixit, Philipp Frank, Maksim Greiner, Sebastian Hutschenreuter, Jakob Knollmüller, Reimar Leike, Natalia Porqueres, Daniel Pumpe, Martin Reinecke, Matevž Šraml, Csongor Varady, Torsten Enßlin

Abstract: NIFTy, "Numerical Information Field Theory", is a software framework designed to ease the development and implementation of field inference algorithms. Field equations are formulated independently of the underlying spatial geometry allowing the user to focus on the algorithmic design. Under the hood, NIFTy ensures that the discretization of the implemented equations is consistent. This enables the… ▽ More NIFTy, "Numerical Information Field Theory", is a software framework designed to ease the development and implementation of field inference algorithms. Field equations are formulated independently of the underlying spatial geometry allowing the user to focus on the algorithmic design. Under the hood, NIFTy ensures that the discretization of the implemented equations is consistent. This enables the user to prototype an algorithm rapidly in 1D and then apply it to high-dimensional real-world problems. This paper introduces NIFTy 3, a major upgrade to the original NIFTy framework. NIFTy 3 allows the user to run inference algorithms on massively parallel high performance computing clusters without changing the implementation of the field equations. It supports n-dimensional Cartesian spaces, spherical spaces, power spaces, and product spaces as well as transforms to their harmonic counterparts. Furthermore, NIFTy 3 is able to treat non-scalar fields. The functionality and performance of the software package is demonstrated with example code, which implements a real inference algorithm from the realm of information field theory. NIFTy 3 is open-source software available under the GNU General Public License v3 (GPL-3) at https://gitlab.mpcdf.mpg.de/ift/NIFTy/ △ Less

Submitted 3 August, 2017; originally announced August 2017.

Comments: 18 pages, 7 figures, 1 table, available at https://gitlab.mpcdf.mpg.de/ift/NIFTy/

arXiv:1705.02344 [pdf, other]

doi 10.1103/PhysRevE.96.042114

Noisy independent component analysis of auto-correlated components

Authors: Jakob Knollmüller, Torsten A. Enßlin

Abstract: We present a new method for the separation of superimposed, independent, auto-correlated components from noisy multi-channel measurement. The presented method simultaneously reconstructs and separates the components, taking all channels into account and thereby increases the effective signal-to-noise ratio considerably, allowing separations even in the high noise regime. Characteristics of the mea… ▽ More We present a new method for the separation of superimposed, independent, auto-correlated components from noisy multi-channel measurement. The presented method simultaneously reconstructs and separates the components, taking all channels into account and thereby increases the effective signal-to-noise ratio considerably, allowing separations even in the high noise regime. Characteristics of the measurement instruments can be included, allowing for application in complex measurement situations. Independent posterior samples can be provided, permitting error estimates on all desired quantities. Using the concept of information field theory, the algorithm is not restricted to any dimensionality of the underlying space or discretization scheme thereof. △ Less

Submitted 4 August, 2017; v1 submitted 5 May, 2017; originally announced May 2017.

Journal ref: Phys. Rev. E 96, 042114 (2017)

arXiv:1612.08406 [pdf, other]

Correlated signal inference by free energy exploration

Authors: Torsten A. Enßlin, Jakob Knollmüller

Abstract: The inference of correlated signal fields with unknown correlation structures is of high scientific and technological relevance, but poses significant conceptual and numerical challenges. To address these, we develop the correlated signal inference (CSI) algorithm within information field theory (IFT) and discuss its numerical implementation. To this end, we introduce the free energy exploration (… ▽ More The inference of correlated signal fields with unknown correlation structures is of high scientific and technological relevance, but poses significant conceptual and numerical challenges. To address these, we develop the correlated signal inference (CSI) algorithm within information field theory (IFT) and discuss its numerical implementation. To this end, we introduce the free energy exploration (FrEE) strategy for numerical information field theory (NIFTy) applications. The FrEE strategy is to let the mathematical structure of the inference problem determine the dynamics of the numerical solver. FrEE uses the Gibbs free energy formalism for all involved unknown fields and correlation structures without marginalization of nuisance quantities. It thereby avoids the complexity marginalization often impose to IFT equations. FrEE simultaneously solves for the mean and the uncertainties of signal, nuisance, and auxiliary fields, while exploiting any analytically calculable quantity. Finally, FrEE uses a problem specific and self-tuning exploration strategy to swiftly identify the optimal field estimates as well as their uncertainty maps. For all estimated fields, properly weighted posterior samples drawn from their exact, fully non-Gaussian distributions can be generated. Here, we develop the FrEE strategies for the CSI of a normal, a log-normal, and a Poisson log-normal IFT signal inference problem and demonstrate their performances via their NIFTy implementations. △ Less

Submitted 13 February, 2017; v1 submitted 26 December, 2016; originally announced December 2016.

Comments: 19 pages, 5 figures, submitted, updated acknowledgements

MSC Class: 62F15

Showing 1–21 of 21 results for author: Knollmueller, J