-
Provably Efficient Posterior Sampling for Sparse Linear Regression via Measure Decomposition
Authors:
Andrea Montanari,
Yuchen Wu
Abstract:
We consider the problem of sampling from the posterior distribution of a $d$-dimensional coefficient vector $\boldsymbol{\theta}$, given linear observations $\boldsymbol{y} = \boldsymbol{X}\boldsymbol{\theta}+\boldsymbol{\varepsilon}$. In general, such posteriors are multimodal, and therefore challenging to sample from. This observation has prompted the exploration of various heuristics that aim at approximating the posterior distribution.
In this paper, we study a different approach based on decomposing the posterior distribution into a log-concave mixture of simple product measures. This decomposition allows us to reduce sampling from a multimodal distribution of interest to sampling from a log-concave one, which is tractable and has been investigated in detail. We prove that, under mild conditions on the prior, for random designs, such measure decomposition is generally feasible when the number of samples per parameter $n/d$ exceeds a constant threshold. We thus obtain a provably efficient (polynomial time) sampling algorithm in a regime where this was previously not known. Numerical simulations confirm that the algorithm is practical, and reveal that it has attractive statistical properties compared to state-of-the-art methods.
Submitted 27 June, 2024;
originally announced June 2024.
-
H.E.S.S. observations of the 2021 periastron passage of PSR B1259-63/LS 2883
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff,
S. Casanova
, et al. (119 additional authors not shown)
Abstract:
PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ days to $t_p+127$ days around the system's 2021 periastron passage. We also present the timing and spectral analyses of the source. The VHE light curve in 2021 is consistent with the stacked light curve of all previous observations. Within the light curve, we report a VHE maximum at times coincident with the third X-ray peak first detected in the 2021 X-ray light curve. In the light curve -- although sparsely sampled in this time period -- we see no VHE enhancement during the second disc crossing. In addition, we see no correspondence to the 2021 GeV flare in the VHE light curve. The VHE spectrum obtained from the analysis of the 2021 dataset is best described by a power law of spectral index $Γ= 2.65 \pm 0.04_{\text{stat}}$ $\pm 0.04_{\text{sys}}$, a value consistent with the previous H.E.S.S. observations of the source. We report spectral variability with a difference of $ΔΓ= 0.56 ~\pm~ 0.18_{\text{stat}}$ $~\pm~0.10_{\text{sys}}$ at 95% c.l., between sub-periods of the 2021 dataset. We also find a linear correlation between contemporaneous flux values of X-ray and TeV datasets, detected mainly after $t_p+25$ days, suggesting a change in the available energy for non-thermal radiation processes. We detect no significant correlation between GeV and TeV flux points, within the uncertainties of the measurements, from $\sim t_p-23$ days to $\sim t_p+126$ days. This suggests that the GeV and TeV emission originate from different electron populations.
Submitted 26 June, 2024;
originally announced June 2024.
-
Which exceptional low-dimensional projections of a Gaussian point cloud can be found in polynomial time?
Authors:
Andrea Montanari,
Kangjie Zhou
Abstract:
Given $d$-dimensional standard Gaussian vectors $\boldsymbol{x}_1,\dots, \boldsymbol{x}_n$, we consider the set of all empirical distributions of its $m$-dimensional projections, for $m$ a fixed constant. Diaconis and Freedman (1984) proved that, if $n/d\to \infty$, all such distributions converge to the standard Gaussian distribution. In contrast, we study the proportional asymptotics, whereby $n,d\to \infty$ with $n/d\to α\in (0, \infty)$. In this case, the projection of the data points along a typical random subspace is again Gaussian, but the set $\mathscr{F}_{m,α}$ of all probability distributions that are asymptotically feasible as $m$-dimensional projections contains non-Gaussian distributions corresponding to exceptional subspaces.
Non-rigorous methods from statistical physics yield an indirect characterization of $\mathscr{F}_{m,α}$ in terms of a generalized Parisi formula. Motivated by the goal of putting this formula on a rigorous basis, and to understand whether these projections can be found efficiently, we study the subset $\mathscr{F}^{\rm alg}_{m,α}\subseteq \mathscr{F}_{m,α}$ of distributions that can be realized by a class of iterative algorithms. We prove that this set is characterized by a certain stochastic optimal control problem, and obtain a dual characterization of this problem in terms of a variational principle that extends Parisi's formula.
As a byproduct, we obtain computationally achievable values for a class of random optimization problems including `generalized spherical perceptron' models.
Submitted 5 June, 2024;
originally announced June 2024.
-
Identifiability of Differential-Algebraic Systems
Authors:
Arthur N. Montanari,
François Lamoline,
Robert Bereza,
Jorge Gonçalves
Abstract:
Data-driven modeling of dynamical systems often faces numerous data-related challenges. A fundamental requirement is the existence of a unique set of parameters for a chosen model structure, an issue commonly referred to as identifiability. Although this problem is well studied for ordinary differential equations (ODEs), few studies have focused on the more general class of systems described by differential-algebraic equations (DAEs). Examples of DAEs include dynamical systems with algebraic equations representing conservation laws or approximating fast dynamics. This work introduces a novel identifiability test for models characterized by nonlinear DAEs. Unlike previous approaches, our test only requires prior knowledge of the system equations and does not need nonlinear transformation, index reduction, or numerical integration of the DAEs. We employed our identifiability analysis across a diverse range of DAE models, illustrating how system identifiability depends on the choices of sensors, experimental conditions, and model structures. Given the added challenges involved in identifying DAEs when compared to ODEs, we anticipate that our findings will have broad applicability and contribute significantly to the development and validation of data-driven methods for DAEs and other structure-preserving models.
Submitted 22 May, 2024;
originally announced May 2024.
-
On Smale's 17th problem over the reals
Authors:
Andrea Montanari,
Eliran Subag
Abstract:
We consider the problem of efficiently solving a system of $n$ non-linear equations in ${\mathbb R}^d$. Addressing Smale's 17th problem stated in 1998, we consider a setting whereby the $n$ equations are random homogeneous polynomials of arbitrary degrees. In the complex case and for $n= d-1$, Beltrán and Pardo proved the existence of an efficient randomized algorithm and Lairez recently showed it can be de-randomized to produce a deterministic efficient algorithm. Here we consider the real setting, to which previously developed methods do not apply. We describe an algorithm that efficiently finds solutions (with high probability) for $n= d -O(\sqrt{d\log d})$. If the maximal degree is very large, we also give an algorithm that works up to $n=d-1$.
Submitted 2 May, 2024;
originally announced May 2024.
-
Sampling from Spherical Spin Glasses in Total Variation via Algorithmic Stochastic Localization
Authors:
Brice Huang,
Andrea Montanari,
Huy Tuan Pham
Abstract:
We consider the problem of algorithmically sampling from the Gibbs measure of a mixed $p$-spin spherical spin glass. We give a polynomial-time algorithm that samples from the Gibbs measure up to vanishing total variation error, for any model whose mixture satisfies $$ξ''(s) < \frac{1}{(1-s)^2}, \qquad \forall s\in [0,1).$$ This includes the pure $p$-spin glasses above a critical temperature that is within an absolute ($p$-independent) constant of the so-called shattering phase transition. Our algorithm follows the algorithmic stochastic localization approach introduced in (Alaoui, Montanari, Sellke, 2022). A key step of this approach is to estimate the mean of a sequence of tilted measures. We produce an improved estimator for this task by identifying a suitable correction to the TAP fixed point selected by approximate message passing (AMP). As a consequence, we improve the algorithm's guarantee over previous work, from normalized Wasserstein to total variation error. In particular, the new algorithm and analysis open the way to perform inference about one-dimensional projections of the measure.
Submitted 24 April, 2024;
originally announced April 2024.
-
Unveiling extended gamma-ray emission around HESS J1813-178
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
J. Barnard,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (126 additional authors not shown)
Abstract:
HESS J1813$-$178 is a very-high-energy $γ$-ray source spatially coincident with the young and energetic pulsar PSR J1813$-$1749 and thought to be associated with its pulsar wind nebula (PWN). Recently, evidence for extended high-energy emission in the vicinity of the pulsar has been revealed in the Fermi Large Area Telescope (LAT) data. This motivates revisiting the HESS J1813$-$178 region, taking advantage of improved analysis methods and an extended data set. Using data taken by the High Energy Stereoscopic System (H.E.S.S.) experiment and the Fermi-LAT, we aim to describe the $γ$-ray emission in the region with a consistent model, to provide insights into its origin. We performed a likelihood-based analysis on 32 hours of H.E.S.S. data and 12 years of Fermi-LAT data and fit a spectro-morphological model to the combined datasets. These results allowed us to develop a physical model for the origin of the observed $γ$-ray emission in the region. In addition to the compact very-high-energy $γ$-ray emission centered on the pulsar, we find a significant yet previously undetected component along the Galactic plane. With Fermi-LAT data, we confirm extended high-energy emission consistent with the position and elongation of the extended emission observed with H.E.S.S. These results establish a consistent description of the emission in the region from GeV energies to several tens of TeV. This study suggests that HESS J1813$-$178 is associated with a $γ$-ray PWN powered by PSR J1813$-$1749. A possible origin of the extended emission component is inverse Compton emission from electrons and positrons that have escaped the confines of the pulsar and form a halo around the PWN.
Submitted 25 March, 2024;
originally announced March 2024.
-
Spectrum and extension of the inverse-Compton emission of the Crab Nebula from a combined Fermi-LAT and H.E.S.S. analysis
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (137 additional authors not shown)
Abstract:
The Crab Nebula is a unique laboratory for studying the acceleration of electrons and positrons through their non-thermal radiation. Observations of very-high-energy $γ$ rays from the Crab Nebula have provided important constraints for modelling its broadband emission. We present the first fully self-consistent analysis of the Crab Nebula's $γ$-ray emission between 1 GeV and $\sim$100 TeV, that is, over five orders of magnitude in energy. Using the open-source software package Gammapy, we combined 11.4 yr of data from the Fermi Large Area Telescope and 80 h of High Energy Stereoscopic System (H.E.S.S.) data at the event level and provide a measurement of the spatial extension of the nebula and its energy spectrum. We find evidence for a shrinking of the nebula with increasing $γ$-ray energy. Furthermore, we fitted several phenomenological models to the measured data, finding that none of them can fully describe the spatial extension and the spectral energy distribution at the same time. Especially the extension measured at TeV energies appears too large when compared to the X-ray emission. Our measurements probe the structure of the magnetic field between the pulsar wind termination shock and the dust torus, and we conclude that the magnetic field strength decreases with increasing distance from the pulsar. We complement our study with a careful assessment of systematic uncertainties.
Submitted 21 March, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.
-
Performance of a modular ton-scale pixel-readout liquid argon time projection chamber
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1340 additional authors not shown)
Abstract:
The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations.
Submitted 5 March, 2024;
originally announced March 2024.
-
Curvature in the very-high energy gamma-ray spectrum of M87
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
F. Bradascio,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
T. Bylund,
S. Casanova,
R. Cecil,
J. Celic,
M. Cerruti
, et al. (110 additional authors not shown)
Abstract:
The radio galaxy M87 is a variable very-high energy (VHE) gamma-ray source, exhibiting three major flares reported in 2005, 2008, and 2010. Despite extensive studies, the origin of the VHE gamma-ray emission is yet to be understood. In this study, we investigate the VHE gamma-ray spectrum of M87 during states of high gamma-ray activity, utilizing 20.2$\,$hours of H.E.S.S. observations. Our findings indicate a preference for a curved spectrum, characterized by a log-parabola model with extra-galactic background light (EBL) model above 0.3$\,$TeV at the 4$σ$ level, compared to a power-law spectrum with EBL. We investigate the degeneracy between the absorption feature and the EBL normalization and derive upper limits on EBL models mainly sensitive in the wavelength range 12.4$\,$$μ$m - 40$\,$$μ$m.
Submitted 25 April, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Scaling laws for learning with real and surrogate data
Authors:
Ayush Jain,
Andrea Montanari,
Eren Sasoglu
Abstract:
Collecting large quantities of high-quality data can be prohibitively expensive or impractical, and a bottleneck in machine learning. One may instead augment a small set of $n$ data points from the target distribution with data from more accessible sources, e.g. data collected under different circumstances or synthesized by generative models. We refer to such data as `surrogate data.' We introduce a weighted empirical risk minimization (ERM) approach for integrating surrogate data into training. We analyze mathematically this method under several classical statistical models, and validate our findings empirically on datasets from different domains. Our main findings are: $(i)$ Integrating surrogate data can significantly reduce the test error on the original distribution. Surprisingly, this can happen even when the surrogate data is unrelated to the original ones. We trace back this behavior to the classical Stein's paradox. $(ii)$ In order to reap the benefit of surrogate data, it is crucial to use optimally weighted ERM. $(iii)$ The test error of models trained on mixtures of real and surrogate data is approximately described by a scaling law. This scaling law can be used to predict the optimal weighting scheme, and to choose the amount of surrogate data to add.
Submitted 28 June, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
On the Popov-Belevitch-Hautus tests for functional observability and output controllability
Authors:
Arthur N. Montanari,
Chao Duan,
Adilson E. Motter
Abstract:
Functional observability and output controllability are properties that establish the conditions respectively for the partial estimation and partial control of the system state. In the special case of full-state observability and controllability, the Popov-Belevitch-Hautus (PBH) tests provide conditions for the properties to hold based on the system eigenspace. Generalizations of the Popov-Belevitch-Hautus (PBH) test have been recently proposed for functional observability and output controllability but were proved to be valid only for diagonalizable systems thus far. Here, we rigorously establish a more general class of systems based on their Jordan decomposition under which a generalized PBH test for functional observability is valid. Likewise, we determine the class of systems under which the generalized PBH test is sufficient and necessary for output controllability. These results have immediate implications for observer and controller design, pole assignment, and optimal placement of sensors and drivers.
Submitted 5 February, 2024;
originally announced February 2024.
-
Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
H. Amar Es-sghir,
P. Amedo,
J. Anderson,
D. A. Andrade,
C. Andreopoulos
, et al. (1300 additional authors not shown)
Abstract:
Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon doping can substantially recover light losses due to contamination of the liquid argon by nitrogen.
Submitted 9 February, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Duality between controllability and observability for target control and estimation in networks
Authors:
Arthur N. Montanari,
Chao Duan,
Adilson E. Motter
Abstract:
Controllability and observability are properties that establish the existence of full-state controllers and observers, respectively. The notions of output controllability and functional observability are generalizations that enable respectively the control and estimation of part of the state vector. These generalizations are of utmost importance in applications to high-dimensional systems, such as large-scale networks, in which only a target subset of variables (nodes) are sought to be controlled or estimated. Although the duality between controllability and observability is well established, the characterization of the duality between their generalized counterparts remains an outstanding problem. Here, we establish both the weak and the strong duality between output controllability and functional observability. Specifically, we show that functional observability of a system implies output controllability of a dual system (weak duality), and that under a certain condition the converse also holds (strong duality). As an application of the strong duality principle, we derive a necessary and sufficient condition for target control via static feedback. This allows us to establish a separation principle between the design of a feedback target controller and the design of a functional observer in closed-loop systems. These results generalize the well-known duality and separation principles in modern control theory.
Submitted 29 January, 2024;
originally announced January 2024.
-
Acceleration and transport of relativistic electrons in the jets of the microquasar SS 433
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
M. Breuhaus,
R. Brose,
A. M. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff
, et al. (140 additional authors not shown)
Abstract:
SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton scattering. Modelling of the energy-dependent gamma-ray morphology constrains the location of particle acceleration and requires an abrupt deceleration of the jet flow. We infer the presence of shocks on either side of the binary system at distances of 25 to 30 parsecs and conclude that self-collimation of the precessing jets forms the shocks, which then efficiently accelerate electrons.
Submitted 29 January, 2024;
originally announced January 2024.
-
Optimization of random cost functions and statistical physics
Authors:
Andrea Montanari
Abstract:
This is the text of my report presented at the 29th Solvay Conference on Physics on `The Structure and Dynamics of Disordered Systems' held in Bruxelles from October 19 to 21, 2023. I consider the problem of minimizing a random energy function $H(σ)$, where $σ$ is an $N$-dimensional vector, in the high-dimensional regime $N\gg 1$. Using as a reference point a 1986 paper by Fu and Anderson, I take stock of the progress on this question over the last 40 years. In particular, I focus on the influence and ramifications of ideas originating from statistical physics. My own conclusion is that several of the most fundamental questions in this area (which in 1986 were barely formulated) have now received mathematically rigorous answers, at least in simple -- yet highly nontrivial -- settings. Instrumental to this spectacular progress was the dialogue between different research communities: physics, computer science, mathematics.
Submitted 20 January, 2024;
originally announced January 2024.
-
Succinctness of Cosafety Fragments of LTL via Combinatorial Proof Systems (extended version)
Authors:
Luca Geatti,
Alessio Mansutti,
Angelo Montanari
Abstract:
This paper focuses on succinctness results for fragments of Linear Temporal Logic with Past (LTL) devoid of binary temporal operators like until, and provides methods to establish them. We prove that there is a family of cosafety languages (Ln)_{n>=1} such that Ln can be expressed with a pure future formula of size O(n), but it requires formulae of size 2^Ω(n) to be captured with past formulae. As a by-product, such a succinctness result shows the optimality of the pastification algorithm proposed in [Artale et al., KR, 2023]. We show that, in the considered case, succinctness cannot be proven by relying on the classical automata-based method introduced in [Markey, Bull. EATCS, 2003]. In place of this method, we devise and apply a combinatorial proof system whose deduction trees represent LTL formulae. The system can be seen as a proof-centric (one-player) view on the games used by Adler and Immerman to study the succinctness of CTL.
Submitted 17 June, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
-
TeV flaring activity of the AGN PKS 0625-354 in November 2018
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
J. Barnard,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
A. Brown,
F. Brun,
B. Bruno
, et al. (117 additional authors not shown)
Abstract:
Most $γ$-ray detected active galactic nuclei are blazars with one of their relativistic jets pointing towards the Earth. Only a few objects belong to the class of radio galaxies or misaligned blazars. Here, we investigate the nature of the object PKS 0625-354, its $γ$-ray flux and spectral variability and its broad-band spectral emission with observations from H.E.S.S., Fermi-LAT, Swift-XRT, and UVOT taken in November 2018. The H.E.S.S. light curve above 200 GeV shows an outburst in the first night of observations followed by a declining flux with a halving time scale of 5.9 h. The $γγ$-opacity constrains the upper limit of the angle between the jet and the line of sight to $\sim10^\circ$. The broad-band spectral energy distribution shows two humps and can be well fitted with a single-zone synchrotron self-Compton emission model. We conclude that PKS 0625-354, as an object showing clear features of both blazars and radio galaxies, can be classified as an intermediate active galactic nucleus. Multi-wavelength studies of such intermediate objects exhibiting features of both blazars and radio galaxies are sparse but crucial for the understanding of the broad-band emission of $γ$-ray detected active galactic nuclei in general.
Submitted 13 January, 2024;
originally announced January 2024.
-
SubRiemannian cut time and cut locus in Reiter-Heisenberg groups
Authors:
Annamaria Montanari,
Daniele Morbidelli
Abstract:
We study the subRiemannian cut time and cut locus of a given point in a class of step-2 Carnot groups of Reiter-Heisenberg type. Following the Hamiltonian point of view, we write and analyze extremal curves, getting the cut time of any of them, and a precise description of the set of cut points.
Submitted 12 February, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
The DUNE Far Detector Vertical Drift Technology, Technical Design Report
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade,
C. Andreopoulos
, et al. (1304 additional authors not shown)
Abstract:
DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model.
The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise.
In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered.
This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals.
Submitted 5 December, 2023;
originally announced December 2023.
-
Sampling from Mean-Field Gibbs Measures via Diffusion Processes
Authors:
Ahmed El Alaoui,
Andrea Montanari,
Mark Sellke
Abstract:
We consider Ising mixed $p$-spin glasses at high-temperature and without external field, and study the problem of sampling from the Gibbs distribution $μ$ in polynomial time. We develop a new sampling algorithm with complexity of the same order as evaluating the gradient of the Hamiltonian and, in particular, at most linear in the input size. We prove that, at sufficiently high-temperature, it produces samples from a distribution $μ^{alg}$ which is close in normalized Wasserstein distance to $μ$. Namely, there exists a coupling of $μ$ and $μ^{alg}$ such that if $({\boldsymbol x},{\boldsymbol x}^{alg})\in\{-1,+1\}^n\times \{-1,+1\}^n$ is a pair drawn from this coupling, then $n^{-1}{\mathbb E}\{\|{\boldsymbol x}-{\boldsymbol x}^{alg}\|_2^2\}=o_n(1)$. For the case of the Sherrington-Kirkpatrick model, our algorithm succeeds in the full replica-symmetric phase.
We complement this result with a negative one for sampling algorithms satisfying a certain `stability' property, which is verified by many standard techniques.
No stable algorithm can approximately sample at temperatures below the onset of shattering, even under the normalized Wasserstein metric. Further, no algorithm can sample at temperatures below the onset of replica symmetry breaking.
Our sampling method implements a discretized version of a diffusion process that has recently become popular in machine learning under the name of `denoising diffusion.' We derive the same process from the general construction of stochastic localization. Implementing the diffusion process requires efficiently approximating the mean of the tilted measure. To this end, we use an approximate message passing algorithm that, as we prove, achieves sufficiently accurate mean estimation.
Submitted 13 October, 2023;
originally announced October 2023.
-
Discovery of a Radiation Component from the Vela Pulsar Reaching 20 Teraelectronvolts
Authors:
The H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (157 additional authors not shown)
Abstract:
Gamma-ray observations have established energetic isolated pulsars as outstanding particle accelerators and antimatter factories in the Galaxy. There is, however, no consensus regarding the acceleration mechanisms and the radiative processes at play, nor the locations where these take place. The spectra of all gamma-ray pulsars observed to date show strong cutoffs or a break above energies of a few gigaelectronvolts (GeV). Using the H.E.S.S. array of Cherenkov telescopes, we discovered a novel radiation component emerging beyond this generic GeV cutoff in the Vela pulsar's broadband spectrum. The extension of gamma-ray pulsation energies up to at least 20 teraelectronvolts (TeV) shows that the Vela pulsar can accelerate particles to Lorentz factors higher than $4\times10^7$. This is an order of magnitude larger than in the case of the Crab pulsar, the only other pulsar detected in the TeV energy range. Our results challenge the state-of-the-art models for high-energy emission of pulsars while providing a new probe, i.e. the energetic multi-TeV component, for constraining the acceleration and emission processes in their extreme energy limit.
Submitted 9 October, 2023;
originally announced October 2023.
-
Universality of max-margin classifiers
Authors:
Andrea Montanari,
Feng Ruan,
Basil Saeed,
Youngtak Sohn
Abstract:
Maximum margin binary classification is one of the most fundamental algorithms in machine learning, yet the role of featurization maps and the high-dimensional asymptotics of the misclassification error for non-Gaussian features are still poorly understood. We consider settings in which we observe binary labels $y_i$ and either $d$-dimensional covariates ${\boldsymbol z}_i$ that are mapped to a $p$-dimensional space via a randomized featurization map ${\boldsymbol φ}:\mathbb{R}^d \to\mathbb{R}^p$, or $p$-dimensional features of non-Gaussian independent entries. In this context, we study two fundamental questions: $(i)$ At what overparametrization ratio $p/n$ do the data become linearly separable? $(ii)$ What is the generalization error of the max-margin classifier?
Working in the high-dimensional regime in which the number of features $p$, the number of samples $n$ and the input dimension $d$ (in the nonlinear featurization setting) diverge, with ratios of order one, we prove a universality result establishing that the asymptotic behavior is completely determined by the expected covariance of feature vectors and by the covariance between features and labels. In particular, the overparametrization threshold and generalization error can be computed within a simpler Gaussian model.
The main technical challenge lies in the fact that max-margin is not the maximizer (or minimizer) of an empirical average, but the maximizer of a minimum over the samples. We address this by representing the classifier as an average over support vectors. Crucially, we find that in high dimensions, the support vector count is proportional to the number of samples, which ultimately yields universality.
Submitted 29 September, 2023;
originally announced October 2023.
-
Towards a statistical theory of data selection under weak supervision
Authors:
Germain Kolossov,
Andrea Montanari,
Pulkit Tandon
Abstract:
Given a sample of size $N$, it is often useful to select a subsample of smaller size $n<N$ to be used for statistical estimation or learning. Such a data selection step is useful to reduce the requirements of data labeling and the computational complexity of learning. We assume that we are given $N$ unlabeled samples $\{{\boldsymbol x}_i\}_{i\le N}$, along with access to a `surrogate model' that can predict labels $y_i$ better than random guessing. Our goal is to select a subset of the samples, to be denoted by $\{{\boldsymbol x}_i\}_{i\in G}$, of size $|G|=n<N$. We then acquire labels for this set and we use them to train a model via regularized empirical risk minimization.
By using a mixture of numerical experiments on real and synthetic data, and mathematical derivations under low- and high- dimensional asymptotics, we show that: $(i)$~Data selection can be very effective, in particular beating training on the full sample in some cases; $(ii)$~Certain popular choices in data selection methods (e.g. unbiased reweighted subsampling, or influence function-based subsampling) can be substantially suboptimal.
Submitted 4 October, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Target Controllability and Target Observability of Structured Network Systems
Authors:
Arthur N. Montanari,
Chao Duan,
Adilson E. Motter
Abstract:
The duality between controllability and observability enables methods developed for full-state control to be applied to full-state estimation, and vice versa. In applications in which control or estimation of all state variables is unfeasible, the generalized notions of output controllability and functional observability establish the minimal conditions for the control and estimation of a target subset of state variables, respectively. Given the seemingly unrelated nature of these properties, thus far methods for target control and target estimation have been developed independently in the literature. Here, we characterize the graph-theoretic conditions for target controllability and target observability (which are, respectively, special cases of output controllability and functional observability for structured systems). This allows us to rigorously establish a weak and strong duality between these generalized properties. When both properties are equivalent (strongly dual), we show that efficient algorithms developed for target controllability can be used for target observability, and vice versa, for the optimal placement of sensors and drivers. These results are applicable to large-scale networks, in which control and monitoring are often sought for small subsets of nodes.
Submitted 25 September, 2023;
originally announced September 2023.
-
TeV gamma-ray sensitivity to velocity-dependent dark matter models in the Galactic Center
Authors:
Alessandro Montanari,
Oscar Macias,
Emmanuel Moulin
Abstract:
The center of the Milky Way is a prime site to search for signals of dark matter (DM) annihilation due to its proximity and expected high concentration of DM. The amplification of the dispersion velocity of DM particles in the Galactic center (GC), caused by baryonic contraction and feedback, makes this particular region of the sky an even more promising target for exploring velocity-dependent DM models. Here we demonstrate that current GC observations with the H.E.S.S. telescope, presently the most sensitive TeV-scale gamma-ray telescope in operation in this region of the sky, set the strongest constraints on velocity-dependent annihilating DM particles with masses above 200 GeV. For p-wave annihilations, they improve the current constraints by a factor of $\sim 4$ for a DM mass of 1 TeV. For the spatial distribution of DM, we use the results of the latest FIRE-2 zoom cosmological simulation of Milky Way-size halos. In addition, we utilize the newest version of the GALPROP cosmic-ray propagation framework to simulate the Galactic diffuse gamma-ray emission in the GC. We have found that p-wave (d-wave) DM particles with a mass of approximately 1.7 TeV and annihilating into the $W^+W^-$ channel exhibit a velocity-weighted annihilation cross-section upper limit of $4.6\times10^{-22}$ cm$^3$ s$^{-1}$ ($9.2\times10^{-17}$ cm$^3$ s$^{-1}$) at a 95\% confidence level. This is about 460 ($2\times10^{6}$) times greater than the thermal relic cross-section for p-wave (d-wave) DM models.
Submitted 18 September, 2023;
originally announced September 2023.
-
Six Lectures on Linearized Neural Networks
Authors:
Theodor Misiakiewicz,
Andrea Montanari
Abstract:
In these six lectures, we examine what can be learnt about the behavior of multi-layer neural networks from the analysis of linear models. We first recall the correspondence between neural networks and linear models via the so-called lazy regime. We then review four models for linearized neural networks: linear regression with concentrated features, kernel ridge regression, random feature model and neural tangent model. Finally, we highlight the limitations of the linear theory and discuss how other approaches can overcome them.
Submitted 25 August, 2023;
originally announced August 2023.
-
Controller Synthesis for Timeline-based Games
Authors:
Renato Acampora,
Luca Geatti,
Nicola Gigante,
Angelo Montanari,
Valentino Picotti
Abstract:
In the timeline-based approach to planning, the evolution over time of a set of state variables (the timelines) is governed by a set of temporal constraints. Traditional timeline-based planning systems excel at the integration of planning with execution by handling temporal uncertainty. In order to handle general nondeterminism as well, the concept of timeline-based games has been recently introduced. It has been proved that finding whether a winning strategy exists for such games is 2EXPTIME-complete. However, a concrete approach to synthesize controllers implementing such strategies is missing. This paper fills this gap, by providing an effective and computationally optimal approach to controller synthesis for timeline-based games.
Submitted 9 April, 2024; v1 submitted 23 July, 2023;
originally announced July 2023.
-
Shattering in Pure Spherical Spin Glasses
Authors:
Ahmed El Alaoui,
Andrea Montanari,
Mark Sellke
Abstract:
We prove the existence of a shattered phase within the replica-symmetric phase of the pure spherical $p$-spin models for $p$ sufficiently large.
In this phase, we construct a decomposition of the sphere into well-separated small clusters, each of which has exponentially small Gibbs mass, yet which together carry all but an exponentially small fraction of the Gibbs mass.
We achieve this via quantitative estimates on the derivative of the Franz--Parisi potential, which measures the Gibbs mass profile around a typical sample. Corollaries on dynamics are derived; in particular, we show that the two-times correlation function of stationary Langevin dynamics must have an exponentially long plateau. We further show that shattering implies disorder chaos for the Gibbs measure in the optimal transport sense; this is known to imply failure of sampling algorithms which are stable under perturbation in the same metric.
Submitted 15 February, 2024; v1 submitted 10 July, 2023;
originally announced July 2023.
-
The vanishing of the primary emission region in PKS 1510-089
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
J. Barnard,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernloehr,
B. Bi,
M. de Bony de Lavergne,
M. Boettcher,
C. Boisson,
J. Bolmont,
J. Borowska,
M. Bouyahiaoui,
F. Bradascio,
M. Breuhaus,
R. Brose,
A. M. Brown,
F. Brun,
B. Bruno,
T. Bulik
, et al. (130 additional authors not shown)
Abstract:
In July 2021, PKS 1510-089 exhibited a significant flux drop in the high-energy gamma-ray (by a factor of 10) and optical (by a factor of 5) bands and remained in this low state throughout 2022. Similarly, the optical polarization in the source vanished, resulting in the optical spectrum being fully explained through the steady flux of the accretion disk and the broad-line region. Unlike the aforementioned bands, the very-high-energy gamma-ray and X-ray fluxes did not exhibit a significant flux drop from year to year. This suggests that the steady-state very-high-energy gamma-ray and X-ray fluxes originate from a different emission region than the vanished parts of the high-energy gamma-ray and optical jet fluxes. The latter component has disappeared through either a swing of the jet away from the line of sight or a significant drop in the photon production efficiency of the jet close to the black hole. Either change could become visible in high-resolution radio images.
Submitted 4 July, 2023;
originally announced July 2023.
-
Multiwavelength Observations of the Blazar PKS 0735+178 in Spatial and Temporal Coincidence with an Astrophysical Neutrino Candidate IceCube-211208A
Authors:
A. Acharyya,
C. B. Adams,
A. Archer,
P. Bangale,
J. T. Bartkoske,
P. Batista,
W. Benbow,
A. Brill,
J. H. Buckley,
J. L. Christiansen,
A. J. Chromey,
M. Errando,
A. Falcone,
Q. Feng,
G. M. Foote,
L. Fortson,
A. Furniss,
G. Gallagher,
W. Hanlon,
D. Hanna,
O. Hervet,
C. E. Hinrichs,
J. Hoang,
J. Holder,
T. B. Humensky
, et al. (185 additional authors not shown)
Abstract:
We report on multiwavelength target-of-opportunity observations of the blazar PKS 0735+178, located 2.2$^\circ$ away from the best-fit position of the IceCube neutrino event IceCube-211208A detected on December 8, 2021. The source was in a high-flux state in the optical, ultraviolet, X-ray, and GeV gamma-ray bands around the time of the neutrino event, exhibiting daily variability in the soft X-ray flux. The X-ray data from Swift-XRT and NuSTAR characterize the transition between the low-energy and high-energy components of the broadband spectral energy distribution (SED), and the gamma-ray data from Fermi-LAT, VERITAS, and H.E.S.S. require a spectral cut-off near 100 GeV. Both X-ray and gamma-ray measurements provide strong constraints on the leptonic and hadronic models. We analytically explore a synchrotron self-Compton model, an external Compton model, and a lepto-hadronic model. Models that are entirely based on internal photon fields face serious difficulties in matching the observed SED. The existence of an external photon field in the source would instead explain the observed gamma-ray spectral cut-off in both leptonic and lepto-hadronic models and allow a proton jet power that marginally agrees with the Eddington limit in the lepto-hadronic model. We show a numerical lepto-hadronic model with external target photons that reproduces the observed SED and is reasonably consistent with the neutrino event despite requiring a high jet power.
Submitted 30 June, 2023;
originally announced June 2023.
-
Solving overparametrized systems of random equations: I. Model and algorithms for approximate solutions
Authors:
Andrea Montanari,
Eliran Subag
Abstract:
Consider the problem of solving a system of equations ${\boldsymbol F}({\boldsymbol x})= {\boldsymbol 0}$, subject to $\|{\boldsymbol x}\|_2=1$, whereby ${\boldsymbol F}:{\mathbb R}^d\to{\mathbb R}^n$ is a random nonlinear map. More precisely, $ {\boldsymbol F}({\boldsymbol x}) = (F_1({\boldsymbol x}),\dots,F_n({\boldsymbol x})) $ where the $ F_i(\,\cdot\, ) $'s are i.i.d. rotationally invariant Gaussian processes. We study this problem under the proportional asymptotics $n,d\to\infty$, $n/d\toα\in [0,1)$ and establish results about the existence of solutions and polynomial-time algorithms to find them.
First, we establish upper and lower bounds $α_{UB}$, $α_{LB}$ on the threshold for existence of solutions. Namely, if the number of equations per variable satisfies $α<α_{LB}$, then the system admits exact solutions with high probability, while for $α>α_{UB}$, no solutions exist, even in an approximate sense.
We then analyze several algorithms to find solutions: gradient descent, Hessian descent, and a two-phase algorithm. In particular, for Hessian descent and the two-phase algorithm, we characterize their thresholds $α_{HD}$, $α_{TP}$. Namely, for $α<α_{HD}$ (or $α<α_{TP}$) the algorithm finds an approximate solution with high probability, while for $α>α_{HD}$ (respectively $α>α_{TP}$), it does not.
Finally, we compare the theoretical predictions within this model to empirical results obtained with structured systems of nonlinear equations.
Submitted 23 June, 2023;
originally announced June 2023.
-
Constraints on the intergalactic magnetic field using Fermi-LAT and H.E.S.S. blazar observations
Authors:
H. E. S. S.,
Fermi-LAT Collaborations,
F. Aharonian,
J. Aschersleben,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
B. Bi,
M. Bouyahiaoui,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
T. Bylund,
S. Caroff,
S. Casanova,
J. Celic,
M. Cerruti,
T. Chand,
S. Chandra
, et al. (113 additional authors not shown)
Abstract:
Magnetic fields in galaxies and galaxy clusters are believed to be the result of the amplification of intergalactic seed fields during the formation of large-scale structures in the universe. However, the origin, strength, and morphology of this intergalactic magnetic field (IGMF) remain unknown. Lower limits on (or indirect detection of) the IGMF can be obtained from observations of high-energy gamma rays from distant blazars. Gamma rays interact with the extragalactic background light to produce electron-positron pairs, which can subsequently initiate electromagnetic cascades. The $γ$-ray signature of the cascade depends on the IGMF since it deflects the pairs. Here we report on a new search for this cascade emission using a combined data set from the Fermi Large Area Telescope and the High Energy Stereoscopic System. Using state-of-the-art Monte Carlo predictions for the cascade signal, our results place a lower limit on the IGMF of $B > 7.1\times10^{-16}$ G for a coherence length of 1 Mpc even when blazar duty cycles as short as 10 yr are assumed. This improves on previous lower limits by a factor of 2. For longer duty cycles of $10^4$ ($10^7$) yr, IGMF strengths below $1.8\times10^{-14}$ G ($3.9\times10^{-14}$ G) are excluded, which rules out specific models for IGMF generation in the early universe.
Submitted 8 June, 2023;
originally announced June 2023.
-
Sampling, Diffusions, and Stochastic Localization
Authors:
Andrea Montanari
Abstract:
Diffusions are a successful technique to sample from high-dimensional distributions that can be either explicitly given or learnt from a collection of samples. They implement a diffusion process whose endpoint is a sample from the target distribution and whose drift is typically represented as a neural network. Stochastic localization is a successful technique to prove mixing of Markov Chains and other functional inequalities in high dimension. An algorithmic version of stochastic localization was introduced in [EAMS2022], to obtain an algorithm that samples from certain statistical mechanics models.
These notes have three objectives: (i) Generalize the construction of [EAMS2022] to other stochastic localization processes; (ii) Clarify the connection between diffusions and stochastic localization. In particular we show that standard denoising diffusions are stochastic localizations, but that other examples are naturally suggested by the proposed viewpoint; (iii) Describe some insights that follow from this viewpoint.
Submitted 18 May, 2023;
originally announced May 2023.
-
Constraining the cosmic-ray pressure in the inner Virgo Cluster using H.E.S.S. observations of M 87
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
C. Arcaro,
J. Aschersleben,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
T. Bylund
, et al. (139 additional authors not shown)
Abstract:
The origin of the gamma-ray emission from M87 is currently a matter of debate. This work aims to localize the VHE (100 GeV-100 TeV) gamma-ray emission from M87 and probe a potential extended hadronic emission component in the inner Virgo Cluster. The search for a steady and extended gamma-ray signal around M87 can constrain the cosmic-ray energy density and the pressure exerted by the cosmic rays onto the intra-cluster medium, and allow us to investigate the role of the cosmic rays in the active galactic nucleus feedback as a heating mechanism in the Virgo Cluster. H.E.S.S. telescopes are sensitive to VHE gamma rays and have been utilized to observe M87 since 2004. We utilized a Bayesian block analysis to identify M87 emission states with H.E.S.S. observations from 2004 until 2021, dividing them into low, intermediate, and high states. Because of the causality argument, an extended ($\gtrsim$kpc) signal is allowed only in steady emission states. Hence, we fitted the morphology of the 120 h low state data and found no significant gamma-ray extension. Therefore, we derived for the low state an upper limit of 58" (corresponding to $\approx$4.6 kpc) in the extension of a single-component morphological model described by a rotationally symmetric 2D Gaussian model at 99.7% confidence level. Our results exclude the radio lobes ($\approx$30 kpc) as the principal component of the VHE gamma-ray emission from the low state of M87. The gamma-ray emission is compatible with a single emission region at the radio core of M87. These results, with the help of two multiple-component models, constrain the maximum cosmic-ray to thermal pressure ratio $X_{CR,max.} \lesssim 0.32$ and the total energy in cosmic-ray protons (CRp) to $U_{CR} \lesssim 5\times10^{58}$ erg in the inner 20 kpc of the Virgo Cluster for an assumed CRp power-law distribution in momentum with spectral index $α_{p} = 2.1$.
Submitted 16 May, 2023;
originally announced May 2023.
-
The Logic of Prefixes and Suffixes is Elementary under Homogeneity
Authors:
Dario Della Monica,
Angelo Montanari,
Gabriele Puppis,
Pietro Sala
Abstract:
In this paper, we study the finite satisfiability problem for the logic BE under the homogeneity assumption. BE is the cornerstone of Halpern and Shoham's interval temporal logic, and features modal operators corresponding to the prefix (a.k.a. "Begins") and suffix (a.k.a. "Ends") relations on intervals. In terms of complexity, BE lies in between the "Chop logic C", whose satisfiability problem is known to be non-elementary, and the PSPACE-complete interval logic D of the sub-interval (a.k.a. "During") relation. BE was shown to be EXPSPACE-hard, and the only known satisfiability procedure is primitive recursive, but not elementary. Our contribution consists of tightening the complexity bounds of the satisfiability problem for BE, by proving it to be EXPSPACE-complete. We do so by devising an equi-satisfiable normal form with boundedly many nested modalities. The normalization technique resembles Scott's quantifier elimination, but it turns out to be much more involved due to the limitations enforced by the homogeneity assumption.
Submitted 22 April, 2023;
originally announced April 2023.
-
Posterior Sampling from the Spiked Models via Diffusion Processes
Authors:
Andrea Montanari,
Yuchen Wu
Abstract:
Sampling from the posterior is a key technical problem in Bayesian statistics. Rigorous guarantees are difficult to obtain for Markov Chain Monte Carlo algorithms of common use. In this paper, we study an alternative class of algorithms based on diffusion processes. The diffusion is constructed in such a way that, at its final time, it approximates the target posterior distribution. The stochastic differential equation that defines this process is discretized (using an Euler scheme) to provide an efficient sampling algorithm. Our construction of the diffusion is based on the notion of observation process and the related idea of stochastic localization. Namely, the diffusion process describes a sample that is conditioned on increasing information. An overlapping family of processes was derived in the machine learning literature via time-reversal.
We apply this method to posterior sampling in the high-dimensional symmetric spiked model. We observe a rank-one matrix ${\boldsymbol θ}{\boldsymbol θ}^{\sf T}$ corrupted by Gaussian noise, and want to sample ${\boldsymbol θ}$ from the posterior. Our sampling algorithm makes use of an oracle that computes the posterior expectation of ${\boldsymbol θ}$ given the data and the additional observation process. We provide an efficient implementation of this oracle using approximate message passing. We thus develop the first sampling algorithm for this problem with approximation guarantees.
Submitted 22 April, 2023;
originally announced April 2023.
-
Detection of extended gamma-ray emission around the Geminga pulsar with H.E.S.S
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
J. Borowska,
M. Bouyahiaoui,
F. Bradascio,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger Scheidlin,
F. Cangemi
, et al. (143 additional authors not shown)
Abstract:
Geminga is an enigmatic radio-quiet gamma-ray pulsar located at a mere 250 pc distance from Earth. Extended very-high-energy gamma-ray emission around the pulsar was discovered by Milagro and later confirmed by HAWC, which are both water Cherenkov detector-based experiments. However, evidence for the Geminga pulsar wind nebula in gamma rays has long evaded detection by imaging atmospheric Cherenkov telescopes (IACTs) despite targeted observations. The detection of gamma-ray emission on angular scales > 2 deg poses a considerable challenge for the background estimation in IACT data analysis. With recent developments in understanding the complementary background estimation techniques of water Cherenkov and atmospheric Cherenkov instruments, the H.E.S.S. IACT array can now confirm the detection of highly extended gamma-ray emission around the Geminga pulsar with a radius of at least 3 deg in the energy range 0.5-40 TeV. We find no indications for statistically significant asymmetries or energy-dependent morphology. A flux normalisation of $(2.8\pm0.7)\times10^{-12}$ cm$^{-2}$s$^{-1}$TeV$^{-1}$ at 1 TeV is obtained within a 1 deg radius region around the pulsar. To investigate the particle transport within the halo of energetic leptons around the pulsar, we fitted an electron diffusion model to the data. The obtained normalisation of the diffusion coefficient, $D_0 = 7.6^{+1.5}_{-1.2} \times 10^{27}$ cm$^2$s$^{-1}$ at an electron energy of 100 TeV, is compatible with values previously reported for the pulsar halo around Geminga, which are considerably below the Galactic average.
Submitted 5 April, 2023;
originally announced April 2023.
-
Impact of cross-section uncertainties on supernova neutrino spectral parameter fitting in the Deep Underground Neutrino Experiment
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1294 additional authors not shown)
Abstract:
A primary goal of the upcoming Deep Underground Neutrino Experiment (DUNE) is to measure the $\mathcal{O}(10)$ MeV neutrinos produced by a Galactic core-collapse supernova if one should occur during the lifetime of the experiment. The liquid-argon-based detectors planned for DUNE are expected to be uniquely sensitive to the $ν_e$ component of the supernova flux, enabling a wide variety of physics and astrophysics measurements. A key requirement for a correct interpretation of these measurements is a good understanding of the energy-dependent total cross section $σ(E_ν)$ for charged-current $ν_e$ absorption on argon. In the context of a simulated extraction of supernova $ν_e$ spectral parameters from a toy analysis, we investigate the impact of $σ(E_ν)$ modeling uncertainties on DUNE's supernova neutrino physics sensitivity for the first time. We find that the currently large theoretical uncertainties on $σ(E_ν)$ must be substantially reduced before the $ν_e$ flux parameters can be extracted reliably: in the absence of external constraints, a measurement of the integrated neutrino luminosity with less than 10% bias with DUNE requires $σ(E_ν)$ to be known to about 5%. The neutrino spectral shape parameters can be known to better than 10% for a 20% uncertainty on the cross-section scale, although they will be sensitive to uncertainties on the shape of $σ(E_ν)$. A direct measurement of low-energy $ν_e$-argon scattering would be invaluable for improving the theoretical precision to the needed level.
Submitted 7 July, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Search for the evaporation of primordial black holes with H.E.S.S
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
M. Boettcher,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
B. Bi,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff,
S. Casanova,
J. Celic
, et al. (124 additional authors not shown)
Abstract:
Primordial Black Holes (PBHs) are hypothetical black holes predicted to have been formed from density fluctuations in the early Universe. PBHs with an initial mass around $10^{14}-10^{15}$g are expected to end their evaporation at present times in a burst of particles and very-high-energy (VHE) gamma rays. Those gamma rays may be detectable by the High Energy Stereoscopic System (H.E.S.S.), an array of imaging atmospheric Cherenkov telescopes. This paper reports on the search for evaporation bursts of VHE gamma rays with H.E.S.S., ranging from 10 to 120 seconds, as expected from the final stage of PBH evaporation and using a total of 4816 hours of observations. The most constraining upper limit on the burst rate of local PBHs is $2000$ pc$^{-3}$ yr$^{-1}$ for a burst interval of 120 seconds, at the 95% confidence level. The implications of these measurements for PBH dark matter are also discussed.
Submitted 22 March, 2023;
originally announced March 2023.
-
H.E.S.S. follow-up observations of GRB221009A
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno
, et al. (138 additional authors not shown)
Abstract:
GRB221009A is the brightest gamma-ray burst ever detected. To probe the very-high-energy (VHE, $>100$ GeV) emission, the High Energy Stereoscopic System (H.E.S.S.) began observations 53 hours after the triggering event, when the brightness of the moonlight no longer precluded observations. We derive differential and integral upper limits using H.E.S.S. data from the third, fourth, and ninth nights after the initial GRB detection, after applying atmospheric corrections. The combined observations yield an integral energy flux upper limit of $Φ_\mathrm{UL}^{95\%} = 9.7 \times 10^{-12}~\mathrm{erg\,cm^{-2}\,s^{-1}}$ above $E_\mathrm{thr} = 650$ GeV. The constraints derived from the H.E.S.S. observations complement the available multiwavelength data. The radio to X-ray data are consistent with synchrotron emission from a single electron population, with the peak in the SED occurring above the X-ray band. Compared to the VHE-bright GRB190829A, the upper limits for GRB221009A imply a smaller gamma-ray to X-ray flux ratio in the afterglow. Even in the absence of a detection, the H.E.S.S. upper limits thus contribute to the multiwavelength picture of GRB221009A, effectively ruling out an IC dominated scenario.
Submitted 18 March, 2023;
originally announced March 2023.
-
Learning time-scales in two-layers neural networks
Authors:
Raphaël Berthier,
Andrea Montanari,
Kangjie Zhou
Abstract:
Gradient-based learning in multi-layer neural networks displays a number of striking features. In particular, the decrease rate of empirical risk is non-monotone even after averaging over large batches. Long plateaus in which one observes barely any progress alternate with intervals of rapid decrease. These successive phases of learning often take place on very different time scales. Finally, models learnt in an early phase are typically `simpler' or `easier to learn' although in a way that is difficult to formalize.
Although theoretical explanations of these phenomena have been put forward, each of them captures at best certain specific regimes. In this paper, we study the gradient flow dynamics of a wide two-layer neural network in high-dimension, when data are distributed according to a single-index model (i.e., the target function depends on a one-dimensional projection of the covariates). Based on a mixture of new rigorous results, non-rigorous mathematical derivations, and numerical simulations, we propose a scenario for the learning dynamics in this setting. In particular, the proposed evolution exhibits separation of timescales and intermittency. These behaviors arise naturally because the population gradient flow can be recast as a singularly perturbed dynamical system.
Submitted 17 April, 2024; v1 submitted 28 February, 2023;
originally announced March 2023.
-
HESS J1809$-$193: a halo of escaped electrons around a pulsar wind nebula?
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
M. Böttcher,
C. Boisson,
J. Bolmont,
J. Borowska,
M. Bouyahiaoui,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
T. Bylund,
S. Caroff
, et al. (130 additional authors not shown)
Abstract:
Context. HESS J1809$-$193 is an unassociated very-high-energy $γ$-ray source located on the Galactic plane. While it has been connected to the nebula of the energetic pulsar PSR J1809$-$1917, supernova remnants and molecular clouds present in the vicinity also constitute possible associations. Recently, the detection of $γ$-ray emission up to energies of $\sim$100 TeV with the HAWC observatory has led to renewed interest in HESS J1809$-$193.
Aims. We aim to understand the origin of the $γ$-ray emission of HESS J1809$-$193.
Methods. We analysed 93.2 h of data taken on HESS J1809$-$193 above 0.27 TeV with the High Energy Stereoscopic System (H.E.S.S.), using a multi-component, three-dimensional likelihood analysis. In addition, we provide a new analysis of 12.5 yr of Fermi-LAT data above 1 GeV within the region of HESS J1809$-$193. The obtained results are interpreted in a time-dependent modelling framework.
Results. For the first time, we were able to resolve the emission detected with H.E.S.S. into two components: an extended component that exhibits a spectral cut-off at $\sim$13 TeV, and a compact component that is located close to PSR J1809$-$1917 and shows no clear spectral cut-off. The Fermi-LAT analysis also revealed extended $γ$-ray emission, on scales similar to that of the extended H.E.S.S. component.
Conclusions. Our modelling indicates that based on its spectrum and spatial extent, the extended H.E.S.S. component is likely caused by inverse Compton emission from old electrons that form a halo around the pulsar wind nebula. The compact component could be connected to either the pulsar wind nebula or the supernova remnant and molecular clouds. Due to its comparatively steep spectrum, modelling the Fermi-LAT emission together with the H.E.S.S. components is not straightforward. (abridged)
Submitted 27 February, 2023;
originally announced February 2023.
-
Compressing Tabular Data via Latent Variable Estimation
Authors:
Andrea Montanari,
Eric Weiner
Abstract:
Data used for analytics and machine learning often take the form of tables with categorical entries. We introduce a family of lossless compression algorithms for such data that proceed in four steps: $(i)$ Estimate latent variables associated to rows and columns; $(ii)$ Partition the table in blocks according to the row/column latents; $(iii)$ Apply a sequential (e.g. Lempel-Ziv) coder to each of the blocks; $(iv)$ Append a compressed encoding of the latents.
We evaluate it on several benchmark datasets, and study optimal compression in a probabilistic model for that tabular data, whereby latent values are independent and table entries are conditionally independent given the latent values. We prove that the model has a well defined entropy rate and satisfies an asymptotic equipartition property. We also prove that classical compression schemes such as Lempel-Ziv and finite-state encoders do not achieve this rate. On the other hand, the latent estimation strategy outlined above achieves the optimal rate.
Submitted 20 February, 2023;
originally announced February 2023.
-
AIOSA: An approach to the automatic identification of obstructive sleep apnea events based on deep learning
Authors:
Andrea Bernardini,
Andrea Brunello,
Gian Luigi Gigli,
Angelo Montanari,
Nicola Saccomanno
Abstract:
Obstructive Sleep Apnea Syndrome (OSAS) is the most common sleep-related breathing disorder. It is caused by an increased upper airway resistance during sleep, which determines episodes of partial or complete interruption of airflow. The detection and treatment of OSAS is particularly important in stroke patients, because the presence of severe OSAS is associated with higher mortality, worse neurological deficits, worse functional outcome after rehabilitation, and a higher likelihood of uncontrolled hypertension. The gold standard test for diagnosing OSAS is polysomnography (PSG). Unfortunately, performing a PSG in an electrically hostile environment, like a stroke unit, on neurologically impaired patients is a difficult task; also, the number of strokes per day outnumbers the availability of polysomnographs and dedicated healthcare professionals. Thus, a simple and automated recognition system to identify OSAS among acute stroke patients, relying on routinely recorded vital signs, is desirable. The majority of the work done so far focuses on data recorded in ideal conditions and highly selected patients, and thus it is hardly exploitable in real-life settings, where it would be of actual use. In this paper, we propose a convolutional deep learning architecture able to reduce the temporal resolution of raw waveform data, like physiological signals, extracting key features that can be used for further processing. We exploit models based on such an architecture to detect OSAS events in stroke unit recordings obtained from the monitoring of unselected patients. Unlike existing approaches, annotations are performed at one-second granularity, allowing physicians to better interpret the model outcome. Results are considered to be satisfactory by the domain experts. Moreover, based on a widely-used benchmark, we show that the proposed approach outperforms current state-of-the-art solutions.
Submitted 10 February, 2023;
originally announced February 2023.
-
Functional observability and subspace reconstruction in nonlinear systems
Authors:
Arthur N. Montanari,
Leandro Freitas,
Daniele Proverbio,
Jorge Gonçalves
Abstract:
Time-series analysis is fundamental for modeling and predicting dynamical behaviors from time-ordered data, with applications in many disciplines such as physics, biology, finance, and engineering. Measured time-series data, however, are often low dimensional or even univariate, thus requiring embedding methods to reconstruct the original system's state space. The observability of a system establishes fundamental conditions under which such reconstruction is possible. However, complete observability is too restrictive in applications where reconstructing the entire state space is not necessary and only a specific subspace is relevant. Here, we establish the theoretic condition to reconstruct a nonlinear functional of state variables from measurement processes, generalizing the concept of functional observability to nonlinear systems. When the functional observability condition holds, we show how to construct a map from the embedding space to the desired functional of state variables, characterizing the quality of such reconstruction. The theoretical results are then illustrated numerically using chaotic systems with contrasting observability properties. By exploring the presence of functionally unobservable regions in embedded attractors, we also apply our theory for the early warning of seizure-like events in simulated and empirical data. The studies demonstrate that the proposed functional observability condition can be assessed a priori to guide time-series analysis and experimental design for the dynamical characterization of complex systems.
Submitted 10 January, 2023;
originally announced January 2023.
-
Highly-parallelized simulation of a pixelated LArTPC on a GPU
Authors:
DUNE Collaboration,
A. Abed Abud,
B. Abi,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
Z. Ahmad,
J. Ahmed,
B. Aimard,
F. Akbar,
K. Allison,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
A. Alton,
R. Alvarez,
P. Amedo,
J. Anderson
, et al. (1282 additional authors not shown)
Abstract:
The rapid development of general-purpose computing on graphics processing units (GPGPU) is allowing the implementation of highly-parallelized Monte Carlo simulation chains for particle physics experiments. This technique is particularly suitable for the simulation of a pixelated charge readout for time projection chambers, given the large number of channels that this technology employs. Here we present the first implementation of a full microphysical simulator of a liquid argon time projection chamber (LArTPC) equipped with light readout and pixelated charge readout, developed for the DUNE Near Detector. The software is implemented with an end-to-end set of GPU-optimized algorithms. The algorithms have been written in Python and translated into CUDA kernels using Numba, a just-in-time compiler for a subset of Python and NumPy instructions. The GPU implementation achieves a speed up of four orders of magnitude compared with the equivalent CPU version. The simulation of the current induced on $10^3$ pixels takes around 1 ms on the GPU, compared with approximately 10 s on the CPU. The results of the simulation are compared against data from a pixel-readout LArTPC prototype.
Submitted 28 February, 2023; v1 submitted 19 December, 2022;
originally announced December 2022.
-
Buffering variability in cell regulation motifs close to criticality
Authors:
Daniele Proverbio,
Arthur N. Montanari,
Alexander Skupin,
Jorge Gonçalves
Abstract:
Bistable biological regulatory systems need to cope with stochastic noise to fine-tune their function close to bifurcation points. Here, we study stability properties of this regime in generic systems to demonstrate that cooperative interactions buffer system variability, hampering noise-induced regime shifts. Our analysis also shows that, in the considered cooperativity range, impending regime shifts can be generically detected by statistical early warning signals from distributional data. Our generic framework, based on minimal models, can be used to extract robustness and variability properties of more complex models and empirical data close to criticality.
Submitted 16 December, 2022;
originally announced December 2022.
-
Equivalence of Approximate Message Passing and Low-Degree Polynomials in Rank-One Matrix Estimation
Authors:
Andrea Montanari,
Alexander S. Wein
Abstract:
We consider the problem of estimating an unknown parameter vector ${\boldsymbol θ}\in{\mathbb R}^n$, given noisy observations ${\boldsymbol Y} = {\boldsymbol θ}{\boldsymbol θ}^{\top}/\sqrt{n}+{\boldsymbol Z}$ of the rank-one matrix ${\boldsymbol θ}{\boldsymbol θ}^{\top}$, where ${\boldsymbol Z}$ has independent Gaussian entries. When information is available about the distribution of the entries of ${\boldsymbol θ}$, spectral methods are known to be strictly sub-optimal. Past work characterized the asymptotics of the accuracy achieved by the optimal estimator. However, no polynomial-time estimator is known that achieves this accuracy.
It has been conjectured that this statistical-computation gap is fundamental, and moreover that the optimal accuracy achievable by polynomial-time estimators coincides with the accuracy achieved by certain approximate message passing (AMP) algorithms. We provide evidence towards this conjecture by proving that no estimator in the (broader) class of constant-degree polynomials can surpass AMP.
Submitted 13 December, 2022;
originally announced December 2022.
-
Complexity of Safety and coSafety Fragments of Linear Temporal Logic
Authors:
Alessandro Artale,
Luca Geatti,
Nicola Gigante,
Andrea Mazzullo,
Angelo Montanari
Abstract:
Linear Temporal Logic (LTL) is the de-facto standard temporal logic for system specification, whose foundational properties have been studied for over five decades. Safety and cosafety properties define notable fragments of LTL, where a prefix of a trace suffices to establish whether a formula is true or not over that trace. In this paper, we study the complexity of the problems of satisfiability, validity, and realizability over infinite and finite traces for the safety and cosafety fragments of LTL. As for satisfiability and validity over infinite traces, we prove that the majority of the fragments have the same complexity as full LTL, that is, they are PSPACE-complete. The picture is radically different for realizability: we find fragments with the same expressive power whose complexity varies from 2EXPTIME-complete (as full LTL) to EXPTIME-complete. Notably, for all cosafety fragments, the complexity of the three problems does not change passing from infinite to finite traces, while for all safety fragments the complexity of satisfiability (resp., realizability) over finite traces drops to NP-complete (resp., $Π^P_2$-complete).
Submitted 27 November, 2022;
originally announced November 2022.