ZTF SN Ia DR2: Exploring SN Ia properties in the vicinity of under-dense environments

Authors: M. Aubert, P. Rosnet, B. Popovic, F. Ruppin, M. Smith, M. Rigault, G. Dimitriadis, A. Goobar, J. Johansson, C. Barjou-Delayre, U. Burgaz, B. Carreres, F. Feinstein, D. Fouchez, L. Galbany, M. Ginolin, T. de Jaeger, M. M. Kasliwal, Y. -L. Kim, L. Lacroix, F. J. Masci, T. E. Müller-Bravo, B. Racine, C. Ravoux, N. Regnault , et al. (7 additional authors not shown)

Abstract: The unprecedented statistics of detected Type Ia supernovae (SNe Ia) brought by the Zwicky Transient Facility enable us to probe the impact of the Large-Scale Structure on the properties of these objects. The goal of this paper is to explore the possible impact of the under-dense part of the large-scale structure on the intrinsic SALT2 light curve properties of SNe Ia and uncover possible biases in SN Ia analyses. With a volume-limited selection of ZTF-Cosmo-DR2 Type Ia supernovae overlapping with the SDSS-DR7 survey footprint, we investigate the distribution of their properties with regard to voids detected in the SDSS-DR7 galaxy sample. We further use Voronoi volumes as proxy for local density environments within the large-scale structure. We find a moderate dependency of the stretch toward the localisation around the void centre and none when considering colour. The local Voronoi volumes mostly affect the fraction of low/high stretch supernovae. With the current statistics available, we consider that the impact of high or low local density environment can be considered as a proxy for the colour of the host galaxy. Under-dense environments should not cause any biases in supernova analyses.

Submitted 17 June, 2024; originally announced June 2024.

Comments: 10 pages, 8 figures. Submitted to A&A

arXiv:2406.02073 [pdf, other]

ZTF SN Ia DR2: Study of Type Ia Supernova lightcurve fits

Authors: M. Rigault, M. Smith, N. Regnault, D. W. Kenworthy, K. Maguire, A. Goobar, G. Dimitriadis, M. Amenouche, M. Aubert, C. Barjou-Delayre, C. E. Bellm, U. Burgaz, B. Carreres, Y. Copin, M. Deckers, T. de Jaeger, S. Dhawan, F. Feinstein, D. Fouchez, L. Galbany, M. Ginolin, J. M. Graham, Y. -L. Kim, M. Kowalski, D. Kuhn , et al. (12 additional authors not shown)

Abstract: Type Ia supernova (SN Ia) cosmology relies on the estimation of lightcurve parameters to derive precision distances that lead to the estimation of cosmological parameters. The empirical SALT2 lightcurve modeling that relies on only two parameters, a stretch x1, and a color c, has been used by the community for almost two decades. In this paper we study the ability of the SALT2 model to fit the nearly 3000 cosmology-grade SN Ia lightcurves from the second release of the Zwicky Transient Facility (ZTF) cosmology science working group. While the ZTF data was not used to train SALT2, the algorithm is modeling the ZTF SN Ia optical lightcurves remarkably well, except for lightcurve points prior to -10 d from maximum, where the training critically lacks statistics. We find that the lightcurve fitting is robust against the considered choice of phase-range, but we show the [-10; +40] d range to be optimal in terms of statistics and accuracy. We do not detect any significant features in the lightcurve fit residuals that could be connected to the host environment. Potential systematic population differences related to the SN Ia host properties might thus not be accountable for by the addition of extra lightcurve parameters. However, a small but significant inconsistency between residuals of blue- and red-SN Ia strongly suggests the existence of a phase-dependent color term, with potential implications for the use of SNe Ia in precision cosmology. We thus encourage modellers to explore this avenue and we emphasize the importance that SN Ia cosmology must include a SALT2 retraining to accurately model the lightcurves and avoid biasing the derivation of cosmological parameters.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 10 pages, 9 figures. Submitted to Astronomy and Astrophysics

arXiv:2406.02072 [pdf, other]

ZTF SN Ia DR2: Colour standardisation of Type Ia Supernovae and its dependence on environment

Authors: M. Ginolin, M. Rigault, Y. Copin, B. Popovic, G. Dimitriadis, A. Goobar, J. Johansson, K. Maguire, J. Nordin, M. Smith, M. Aubert, C. Barjou-Delayre, U. Burgaz, B. Carreres, S. Dhawan, M. Deckers, F. Feinstein, D. Fouchez, L. Galbany, C. Ganot, T. de Jaeger, Y. -L. Kim, D. Kuhn, L. Lacroix, T. E. Müller-Bravo , et al. (15 additional authors not shown)

Abstract: As Type Ia supernova cosmology transitions from a statistics dominated to a systematics dominated era, it is crucial to understand leftover unexplained uncertainties affecting their luminosity, such as the ones stemming from astrophysical biases. Indeed, SNe Ia are standardisable candles, whose absolute magnitudes reach a 0.15 mag scatter once empirical correlations with their lightcurve stretch and colour and with their environment are accounted for. In this paper, we investigate how the standardisation process of SNe Ia depends on environment, to ultimately reduce their scatter in magnitude, focusing on colour standardisation. We use the volume-limited ZTF SN Ia DR2 sample, which offers unprecedented statistics for the low redshift ($z<0.06$) range. We first study the colour distribution, focusing on the effects of dust, to then select a dustless subsample of objects from low stellar mass environments and from the outskirts of their host galaxies. We then look at the colour-residuals relation and its associated parameter $β$. Finally, we investigate the colour dependency of the environment-dependent magnitude offsets (steps), to try to disentangle intrinsic and extrinsic colour origin. Our sample probes well the red tail of the colour distribution, up to $c=0.8$. The dustless sample exhibits a significantly lower red tail ($4.6σ$) in comparison to the whole sample. This suggests that reddening above $c\geq0.2$ is dominated by host interstellar dust absorption. Looking at the colour-residuals relation, we find it to be linear with lightcurve colour. We show hints of a potential evolution of $β$ with host stellar mass at a $2.5σ$ level. Finally, unlike recent claims from the literature, we see no evolution of steps as a function of lightcurve colour, suggesting that dust may not be the dominating mechanism responsible for the environmental dependency of SNe Ia magnitude.

Submitted 4 June, 2024; originally announced June 2024.

Comments: 10 pages, 7 figures, submitted to Astronomy and Astrophysics

arXiv:2406.00052 [pdf, other]

Detectability and Characterisation of Strongly Lensed Supernova Lightcurves in the Zwicky Transient Facility

Authors: A. Sagués Carracedo, A. Goobar, E. Mörtsell, N. Arendse, J. Johansson, A. Townsend, S. Dhawan, J. Nordin, J. Sollerman, S. Schulze

Abstract: The Zwicky Transient Facility (ZTF) was expected to detect more than one strong gravitationally-lensed supernova (glSN) per year, but only one event was identified in the first four years of the survey. This work investigates selection biases in the search strategy that could explain the discrepancy and revise discovery predictions. We present simulations of realistic lightcurves for lensed thermonuclear (glSNIa) and core-collapse supernova (glCCSN) explosions over a span of 5.33 years of the survey, utilizing the actual observation logs of ZTF. We find that the magnitude limit in spectroscopic screening significantly biases the selection towards highly magnified glSNe, for which the detection rates are consistent with the identification of a single object by ZTF. Reaching the higher predicted rate of detections requires an optimization of the identification criteria for fainter objects. We find that around 1.36 (3.08) Type Ia SNe (CCSNe) are identifiable with the magnification method per year in ZTF, but when applying the magnitude cut of m < 19 mag, the detection rates decrease to 0.17 (0.32) per year. We compare our simulations with the previously found lensed Type Ia SNe, iPTF16geu and SN Zwicky, and conclude that considering the bias towards highly magnified events, the findings are within expectations in terms of detection rates and lensing properties of the systems. In addition, we provide a set of selection cuts based on simple observables to distinguish glSNe from regular, unlensed, supernovae to select potential candidates for spectroscopic and high-spatial resolution follow-up campaigns. We find optimal cuts in observed colours $g-r$, $g-i$, and $r-i$ as well as in the colour SALT2 fit parameter. The developed pipeline and the simulated lightcurves employed in this analysis can be found in the LENSIT GitHub repository.

Submitted 28 May, 2024; originally announced June 2024.

Comments: 12 pages, 14 figures

arXiv:2405.20965 [pdf, other]

ZTF SN Ia DR2: Environmental dependencies of stretch and luminosity of a volume limited sample of 1,000 Type Ia Supernovae

Authors: M. Ginolin, M. Rigault, M. Smith, Y. Copin, F. Ruppin, G. Dimitriadis, A. Goobar, J. Johansson, K. Maguire, J. Nordin, M. Amenouche, M. Aubert, C. Barjou-Delayre, M. Betoule, U. Burgaz, B. Carreres, M. Deckers, S. Dhawan, F. Feinstein, D. Fouchez, L. Galbany, C. Ganot, L. Harvey, T. de Jaeger, W. D. Kenworthy , et al. (21 additional authors not shown)

Abstract: To get distances, Type Ia Supernovae magnitudes are corrected for their correlation with lightcurve width and colour. Here we investigate how this standardisation is affected by the SN environment, with the aim to reduce scatter and improve standardisation. We first study the SN Ia stretch distribution, as well as its dependence on environment, as characterised by local and global (g-z) colour and stellar mass. We then look at the standardisation parameter $α$, which accounts for the correlation between residuals and stretch, along with its environment dependence and linearity. We finally compute magnitude offsets between SNe in different astrophysical environments after colour and stretch standardisation, aka steps. This analysis is made possible due to the unprecedented statistics of the ZTF SN Ia DR2 volume-limited sample. The stretch distribution exhibits a bimodal behaviour, as previously found in literature. However, we find the distribution means to decrease with host stellar mass at a 9.0$σ$ significance. We demonstrate, at the 14.3$σ$ level, that the stretch-magnitude relation is non-linear, challenging the usual linear stretch-residuals relation. Fitting for a broken-$α$ model, we indeed find two different slopes between stretch regimes ($x_1<-0.49\pm0.06$): $α_{low}=0.28\pm0.01$ and $α_{high}=0.09\pm0.01$, a $Δ_α=-0.19\pm0.01$ difference. As the relative proportion of SNe Ia in the high-/low-stretch modes evolves with redshift and environment, this implies that a linear $α$ also evolves with redshift and environment. Concerning the environmental magnitude offset $γ$, we find it to be greater than 0.14 mag regardless of the considered environmental tracer used (local or global colour and stellar mass), all measured at the $\geq 6σ$ level, increased to $\sim0.18\pm0.01$ mag when accounting for the stretch-non linearity.

Submitted 31 May, 2024; originally announced May 2024.

Comments: 15 pages, 9 figures, submitted to Astronomy and Astrophysics
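
Illustration of the broken-$α$ idea in the abstract above: a minimal Python sketch of a two-slope stretch term, using the quoted central values ($α_{low}=0.28$, $α_{high}=0.09$, break at $x_1=-0.49$). The function name, the choice to make the term continuous and zero at the break, and the sign convention are assumptions made here for illustration; the paper's exact parameterisation may differ.

```python
def broken_alpha_term(x1, alpha_low=0.28, alpha_high=0.09, x1_break=-0.49):
    """Piecewise-linear stretch term with two slopes (hypothetical form).

    Assumption: the term is continuous and equal to zero at x1_break,
    with slope alpha_low below the break and alpha_high above it.
    """
    slope = alpha_low if x1 < x1_break else alpha_high
    return slope * (x1 - x1_break)


# One unit of stretch below and above the break, respectively.
low = broken_alpha_term(-1.49)
high = broken_alpha_term(0.51)

# Difference between the two slopes, matching the sign of the quoted Delta_alpha.
delta_alpha = 0.09 - 0.28
```

A single linear $α$ fitted to a sample mixing both regimes would land between the two slopes, weighted by the share of low- and high-stretch objects, which is why the abstract argues that a linear $α$ shifts as that share changes.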

arXiv:2405.20409 [pdf, other]

ZTF SN Ia DR2: Peculiar velocities impact on the Hubble diagram

Authors: B. Carreres, D. Rosselli, J. E. Bautista, F. Feinstein, D. Fouchez, B. Racine, C. Ravoux, B. Sanchez, G. Dimitriadis, A. Goobar, J. Johansson, J. Nordin, M. Rigault, M. Smith, M. Amenouche, M. Aubert, C. Barjou-Delayre, U. Burgaz, W. D'Arcy Kenworthy, T. De Jaeger, S. Dhawan, L. Galbany, M. Ginolin, D. Kuhn, M. Kowalski , et al. (13 additional authors not shown)

Abstract: SNe Ia are used to determine the distance-redshift relation and build the Hubble diagram. Neglecting their host-galaxy peculiar velocities (PVs) may bias the measurement of cosmological parameters. The smaller the redshift, the larger the effect is. We use realistic simulations of SNe Ia observed by the Zwicky Transient Facility (ZTF) to investigate the effect of different methods to take into account PVs. We study the impact of neglecting galaxy PVs and their correlations in an analysis of the SNe Ia Hubble diagram. We find that it is necessary to use the PV full covariance matrix computed from the velocity power spectrum to take into account the sample variance. Considering the results we have obtained using simulations, we determine the PV systematic effects in the context of the ZTF DR2 SNe Ia sample. We determine the PV impact on the intercept of the Hubble diagram, $a_B$, which is directly linked to the measurement of $H_0$. We show that not taking into account PVs and their correlations results in a shift of the $H_0$ value of about $1.0$ km s$^{-1}$ Mpc$^{-1}$ and a slight underestimation of the $H_0$ error bar.

Submitted 30 May, 2024; originally announced May 2024.

Comments: 12 pages, 4 figures

arXiv:2405.18589 [pdf, other]

Candidate strongly-lensed Type Ia supernovae in the Zwicky Transient Facility archive

Authors: A. Townsend, J. Nordin, A. Sagués Carracedo, M. Kowalski, N. Arendse, S. Dhawan, A. Goobar, J. Johansson, E. Mörtsell, S. Schulze, I. Andreoni, E. Fernández, A. G. Kim, P. E. Nugent, F. Prada, M. Rigault, N. Sarin, D. Sharma, E. C. Bellm, M. W. Coughlin, R. Dekany, S. L. Groom, L. Lacroix, R. R. Laher, R. Riddle , et al. (39 additional authors not shown)

Abstract: Gravitationally lensed Type Ia supernovae (glSNe Ia) are unique astronomical tools for studying cosmological parameters, distributions of dark matter, the astrophysics of the supernovae and the intervening lensing galaxies themselves. Only a few highly magnified glSNe Ia have been discovered by ground-based telescopes, such as the Zwicky Transient Facility (ZTF), but simulations predict the existence of a fainter, undetected population. We present a systematic search in the ZTF archive of alerts from 1 June 2019 to 1 September 2022. Using the AMPEL platform, we developed a pipeline that distinguishes candidate glSNe Ia from other variable sources. Initial cuts were applied to the ZTF alert photometry before forced photometry was obtained for the remaining candidates. Additional cuts were applied to refine the candidates based on their light curve colours, lens galaxy colours, and the resulting parameters from fits to the SALT2 SN Ia template. Candidates were also cross-matched with the DESI spectroscopic catalogue. Seven transients passed all the cuts and had an associated galaxy DESI redshift, which we present as glSN Ia candidates. While superluminous supernovae (SLSNe) cannot be fully rejected, two events, ZTF19abpjicm and ZTF22aahmovu, are significantly different from typical SLSNe and their light curves can be modelled as two-image glSN Ia systems. From this two-image modelling, we estimate time delays of 22 $\pm$ 3 and 34 $\pm$ 1 days for the two events, respectively, which suggests that we have uncovered a population with longer time delays. The pipeline is efficient and sensitive enough to parse full alert streams. It is currently being applied to the live ZTF alert stream to identify and follow-up future candidates while active. This pipeline could be the foundation for glSNe Ia searches in future surveys, like the Vera C. Rubin Observatory's Legacy Survey of Space and Time.

Submitted 28 May, 2024; originally announced May 2024.

Comments: 21 pages, 15 figures

arXiv:2405.04330 [pdf, other]

How to reveal the rank of a matrix?

Authors: Anil Damle, Silke Glas, Alex Townsend, Annan Yu

Abstract: We study algorithms called rank-revealers that reveal a matrix's rank structure. Such algorithms form a fundamental component in matrix compression, singular value estimation, and column subset selection problems. While column-pivoted QR has been widely adopted due to its practicality, it is not always a rank-revealer. Conversely, Gaussian elimination (GE) with a pivoting strategy known as global maximum volume pivoting is guaranteed to estimate a matrix's singular values but its exponential complexity limits its interest to theory. We show that the concept of local maximum volume pivoting is a crucial and practical pivoting strategy for rank-revealers based on GE and QR. In particular, we prove that it is both necessary and sufficient; highlighting that all local solutions are nearly as good as the global one. This insight elevates Gu and Eisenstat's rank-revealing QR as an archetypal rank-revealer, and we implement a version that is observed to be at most $2\times$ more computationally expensive than CPQR. We unify the landscape of rank-revealers by considering GE and QR together and prove that the success of any pivoting strategy can be assessed by benchmarking it against a local maximum volume pivot.

Submitted 3 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

arXiv:2403.13009 [pdf, other]

Undulatory swimming in suspensions and networks of flexible filaments

Authors: Adam K. Townsend, Eric E. Keaveny

Abstract: Many biological fluids are composed of suspended polymers immersed in a viscous fluid. A prime example is mucus, where the polymers are also known to form a network. While the presence of this microstructure is linked with an overall non-Newtonian response of the fluid, swimming cells and microorganisms similar in size to the network pores and polymer filaments instead experience the heterogeneous nature of the environment, interacting directly with the polymers as obstacles as they swim. To characterise and understand locomotion in these heterogeneous environments, we simulate the motion of an undulatory swimmer through suspensions and networks of elastic filaments, exploring the effects of filament and link compliance and filament concentration up to 20% volume fraction. For compliant environments, the swimming speed increases with filament concentration to values about 10% higher than in a viscous fluid. In stiffer environments, a non-monotonic dependence is observed, with an initial increase in speed to values 5% greater than in a viscous fluid, followed by a dramatic reduction to speeds just a fraction of its value in a viscous fluid. Velocity fluctuations are also more pronounced in stiffer environments. We demonstrate that speed enhancements are linked to hydrodynamic interactions with the microstructure, while reductions are due to the filaments restricting the amplitude of the swimmer's propulsive wave. Unlike previous studies where interactions with obstacles allowed for significant enhancements in swimming speeds, the modest enhancements seen here are more comparable to those given by models where the environment is treated as a continuous viscoelastic fluid.

Submitted 16 March, 2024; originally announced March 2024.

Comments: 24 pages, 11 figures

arXiv:2401.17739 [pdf, other]

Operator learning without the adjoint

Authors: Nicolas Boullé, Diana Halikias, Samuel E. Otto, Alex Townsend

Abstract: There is a mystery at the heart of operator learning: how can one recover a non-self-adjoint operator from data without probing the adjoint? Current practical approaches suggest that one can accurately recover an operator while only using data generated by the forward action of the operator without access to the adjoint. However, naively, it seems essential to sample the action of the adjoint. In this paper, we partially explain this mystery by proving that without querying the adjoint, one can approximate a family of non-self-adjoint infinite-dimensional compact operators via projection onto a Fourier basis. We then apply the result to recovering Green's functions of elliptic partial differential operators and derive an adjoint-free sample complexity bound. While existing theory justifies low sample complexity in operator learning, ours is the first adjoint-free analysis that attempts to close the gap between theory and practice.

Submitted 31 January, 2024; originally announced January 2024.

Comments: 49 pages, 5 figures

arXiv:2401.00309 [pdf, other]

High-statistics measurement of Collins and Sivers asymmetries for transversely polarised deuterons

Authors: G. D. Alexeev, M. G. Alexeev, C. Alice, A. Amoroso, V. Andrieux, V. Anosov, S. Asatryan, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, J. Barth, R. Beck, J. Beckers, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin , et al. (162 additional authors not shown)

Abstract: New results are presented on a high-statistics measurement of Collins and Sivers asymmetries of charged hadrons produced in deep inelastic scattering of muons on a transversely polarised $^6$LiD target. The data were taken in 2022 with the COMPASS spectrometer using the 160 GeV/c muon beam at CERN, balancing the existing data on transversely polarised proton targets. The first results from about two-thirds of the new data have total uncertainties smaller by up to a factor of three compared to the previous deuteron measurements. Using all the COMPASS proton and deuteron results, both the transversity and the Sivers distribution functions of the $u$ and $d$ quark, as well as the tensor charge in the measured $x$-range are extracted. In particular, the accuracy of the $d$ quark results is significantly improved.

Submitted 30 December, 2023; originally announced January 2024.

Report number: CERN-EP-2023-308

arXiv:2312.17489 [pdf, ps, other]

Operator learning for hyperbolic partial differential equations

Authors: Christopher Wang, Alex Townsend

Abstract: We construct the first rigorously justified probabilistic algorithm for recovering the solution operator of a hyperbolic partial differential equation (PDE) in two variables from input-output training pairs. The primary challenge of recovering the solution operator of hyperbolic PDEs is the presence of characteristics, along which the associated Green's function is discontinuous. Therefore, a central component of our algorithm is a rank detection scheme that identifies the approximate location of the characteristics. By combining the randomized singular value decomposition with an adaptive hierarchical partition of the domain, we construct an approximant to the solution operator using $O(Ψ_ε^{-1}ε^{-7}\log(Ξ_ε^{-1}ε^{-1}))$ input-output pairs with relative error $O(Ξ_ε^{-1}ε)$ in the operator norm as $ε\to0$, with high probability. Here, $Ψ_ε$ represents the existence of degenerate singular values of the solution operator, and $Ξ_ε$ measures the quality of the training data. Our assumptions on the regularity of the coefficients of the hyperbolic PDE are relatively weak given that hyperbolic PDEs do not have the "instantaneous smoothing effect" of elliptic and parabolic PDEs, and our recovery rate improves as the regularity of the coefficients increases.

Submitted 29 December, 2023; originally announced December 2023.

Comments: 35 pages, 3 figures

MSC Class: 65M80; 65F55; 35L20; 47A58

arXiv:2312.17379 [pdf, other]

Final COMPASS results on the transverse-spin-dependent azimuthal asymmetries in the pion-induced Drell-Yan process

Authors: G. D. Alexeev, M. G. Alexeev, C. Alice, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, J. Barth, R. Beck, J. Beckers, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin, P. M. M. Correia , et al. (159 additional authors not shown)

Abstract: The COMPASS Collaboration performed measurements of the Drell-Yan process in 2015 and 2018 using a 190 GeV/c $π^{-}$ beam impinging on a transversely polarised ammonia target. Combining the data of both years, we present final results on the amplitudes of the five azimuthal modulations in the dimuon production cross section. Three of these transverse-spin-dependent azimuthal asymmetries (TSAs) probe the nucleon leading-twist Sivers, transversity, and pretzelosity transverse-momentum dependent (TMD) parton distribution functions (PDFs). The other two are induced by subleading effects. These TSAs provide unique new inputs for the study of the nucleon TMD PDFs and their universality properties. In particular, the Sivers TSA observed in this measurement is consistent with the fundamental QCD prediction of a sign change of naive time-reversal-odd TMD PDFs when comparing the Drell-Yan process with semi-inclusive measurements of deep inelastic scattering. Also, within the context of model predictions, the observed transversity TSA is consistent with the expectation of a sign change for the Boer-Mulders function.

Submitted 28 December, 2023; originally announced December 2023.

Report number: CERN-EP-2023-307

arXiv:2312.14688 [pdf, other]

A Mathematical Guide to Operator Learning

Authors: Nicolas Boullé, Alex Townsend

Abstract: Operator learning aims to discover properties of an underlying dynamical system or partial differential equation (PDE) from data. Here, we present a step-by-step guide to operator learning. We explain the types of problems and PDEs amenable to operator learning, discuss various neural network architectures, and explain how to employ numerical PDE solvers effectively. We also give advice on how to create and manage training data and conduct optimization. We offer intuition behind the various neural network architectures employed in operator learning by motivating them from the point-of-view of numerical linear algebra.

Submitted 22 December, 2023; originally announced December 2023.

Comments: 45 pages, 11 figures

arXiv:2311.07035 [pdf, other]

ContHutch++: Stochastic trace estimation for implicit integral operators

Authors: Jennifer Zvonek, Andrew Horning, Alex Townsend

Abstract: Hutchinson's estimator is a randomized algorithm that computes an $ε$-approximation to the trace of any positive semidefinite matrix using $\mathcal{O}(1/ε^2)$ matrix-vector products. An improvement of Hutchinson's estimator, known as Hutch++, only requires $\mathcal{O}(1/ε)$ matrix-vector products. In this paper, we propose a generalization of Hutch++, which we call ContHutch++, that uses operator-function products to efficiently estimate the trace of any trace-class integral operator. Our ContHutch++ estimates avoid spectral artifacts introduced by discretization and are accompanied by rigorous high-probability error bounds. We use ContHutch++ to derive a new high-order accurate algorithm for quantum density-of-states and also show how it can estimate electromagnetic fields induced by incoherent sources.

Submitted 12 November, 2023; originally announced November 2023.
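
Illustrative note on the entry above: the classical Hutchinson estimator that the abstract starts from can be sketched in a few lines. This is only the basic matrix version with random sign (Rademacher) probe vectors, not Hutch++ or ContHutch++. The function name `hutchinson_trace`, the 3 x 3 test matrix, and the sample count are assumptions made for illustration.

```python
import random


def hutchinson_trace(matvec, n, num_samples, seed=0):
    """Estimate trace(A) using only products v -> A v."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_samples):
        # Rademacher probe: each entry is +1 or -1 with equal probability.
        v = [rng.choice((-1.0, 1.0)) for _ in range(n)]
        av = matvec(v)
        # v^T A v is an unbiased estimate of trace(A).
        total += sum(vi * avi for vi, avi in zip(v, av))
    return total / num_samples


# Hypothetical test matrix: symmetric, diagonally dominant, trace 9.
A = [[2.0, 1.0, 0.0],
     [1.0, 3.0, 1.0],
     [0.0, 1.0, 4.0]]


def matvec(v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


estimate = hutchinson_trace(matvec, n=3, num_samples=2000)
print(estimate)  # fluctuates around the exact trace, 9
```

The estimate is random, so its accuracy improves only slowly with the number of samples, consistent with the $\mathcal{O}(1/ε^2)$ cost quoted in the abstract.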

arXiv:2308.10697 [pdf, other]

Beyond expectations: Residual Dynamic Mode Decomposition and Variance for Stochastic Dynamical Systems

Authors: Matthew J. Colbrook, Qin Li, Ryan V. Raut, Alex Townsend

Abstract: Koopman operators linearize nonlinear dynamical systems, making their spectral information of crucial interest. Numerous algorithms have been developed to approximate these spectral properties, and Dynamic Mode Decomposition (DMD) stands out as the poster child of projection-based methods. Although the Koopman operator itself is linear, the fact that it acts in an infinite-dimensional space of observables poses challenges. These include spurious modes, essential spectra, and the verification of Koopman mode decompositions. While recent work has addressed these challenges for deterministic systems, there remains a notable gap in verified DMD methods for stochastic systems, where the Koopman operator measures the expectation of observables. We show that it is necessary to go beyond expectations to address these issues. By incorporating variance into the Koopman framework, we address these challenges. Through an additional DMD-type matrix, we approximate the sum of a squared residual and a variance term, each of which can be approximated individually using batched snapshot data. This allows verified computation of the spectral properties of stochastic Koopman operators, controlling the projection error. We also introduce the concept of variance-pseudospectra to gauge statistical coherency. Finally, we present a suite of convergence results for the spectral information of stochastic Koopman operators. Our study concludes with practical applications using both simulated and experimental data. In neural recordings from awake mice, we demonstrate how variance-pseudospectra can reveal physiologically significant information unavailable to standard expectation-based dynamical models.

Submitted 10 November, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

MSC Class: 37M10; 37H99; 37N25; 47A10; 47B33; 65P99

arXiv:2308.01245 [pdf, other]

doi 10.1007/s11538-023-01218-4

VisualPDE: rapid interactive simulations of partial differential equations

Authors: Benjamin J. Walker, Adam K. Townsend, Alexander K. Chudasama, Andrew L. Krause

Abstract: Computing has revolutionised the study of complex nonlinear systems, both by allowing us to solve previously intractable models and through the ability to visualise solutions in different ways. Using ubiquitous computing infrastructure, we provide a means to go one step further in using computers to understand complex models through instantaneous and interactive exploration. This ubiquitous infrastructure has enormous potential in education, outreach and research. Here, we present VisualPDE, an online, interactive solver for a broad class of 1D and 2D partial differential equation (PDE) systems. Abstract dynamical systems concepts such as symmetry-breaking instabilities, subcritical bifurcations and the role of initial data in multistable nonlinear models become much more intuitive when you can play with these models yourself, and immediately answer questions about how the system responds to changes in parameters, initial conditions, boundary conditions or even spatiotemporal forcing. Importantly, VisualPDE is freely available, open source and highly customisable. We give several examples in teaching, research and knowledge exchange, providing high-level discussions of how it may be employed in different settings. This includes designing web-based course materials structured around interactive simulations, or easily crafting specific simulations that can be shared with students or collaborators via a simple URL. We envisage VisualPDE becoming an invaluable resource for teaching and research in mathematical biology and beyond. We also hope that it inspires other efforts to make mathematics more interactive and accessible.

Submitted 16 October, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

Comments: 19 pages, 7 figures. This is a companion paper to the website https://visualpde.com/

arXiv:2305.01691 [pdf, other]

Avoiding discretization issues for nonlinear eigenvalue problems

Authors: Matthew J. Colbrook, Alex Townsend

Abstract: The first step when solving an infinite-dimensional eigenvalue problem is often to discretize it. We show that one must be extremely careful when discretizing nonlinear eigenvalue problems. Using examples, we show that discretization can: (1) introduce spurious eigenvalues, (2) entirely miss spectra, and (3) bring in severe ill-conditioning. While there are many eigensolvers for solving matrix nonlinear eigenvalue problems, we propose a solver for general holomorphic infinite-dimensional nonlinear eigenvalue problems that avoids discretization issues, which we prove is stable and converges. Moreover, we provide an algorithm that computes the problem's pseudospectra with explicit error control, allowing verification of computed spectra. The algorithm and numerical examples are publicly available in $\texttt{infNEP}$, which is a software package written in MATLAB.

Submitted 2 May, 2023; originally announced May 2023.

MSC Class: 35P30; 65N25; 65N30; 47A10

arXiv:2304.14482 [pdf, other]

ULTRASAT: A wide-field time-domain UV space telescope

Authors: Y. Shvartzvald, E. Waxman, A. Gal-Yam, E. O. Ofek, S. Ben-Ami, D. Berge, M. Kowalski, R. Bühler, S. Worm, J. E. Rhoads, I. Arcavi, D. Maoz, D. Polishook, N. Stone, B. Trakhtenbrot, M. Ackermann, O. Aharonson, O. Birnholtz, D. Chelouche, D. Guetta, N. Hallakoun, A. Horesh, D. Kushnir, T. Mazeh, J. Nordin , et al. (19 additional authors not shown)

Abstract: The Ultraviolet Transient Astronomy Satellite (ULTRASAT) is scheduled to be launched to geostationary orbit in 2026. It will carry a telescope with an unprecedentedly large field of view (204 deg$^2$) and NUV (230-290nm) sensitivity (22.5 mag, 5$σ$, at 900s). ULTRASAT will conduct the first wide-field survey of transient and variable NUV sources and will revolutionize our ability to study the hot transient universe: It will explore a new parameter space in energy and time-scale (months long light-curves with minutes cadence), with an extra-Galactic volume accessible for the discovery of transient sources that is $>$300 times larger than that of GALEX and comparable to that of LSST. ULTRASAT data will be transmitted to the ground in real-time, and transient alerts will be distributed to the community in $<$15 min, enabling a vigorous ground-based follow-up of ULTRASAT sources. ULTRASAT will also provide an all-sky NUV image to $>$23.5 AB mag, over 10 times deeper than the GALEX map. Two key science goals of ULTRASAT are the study of mergers of binaries involving neutron stars, and supernovae: With a large fraction ($>$50%) of the sky instantaneously accessible, fast (minutes) slewing capability and a field-of-view that covers the error ellipses expected from GW detectors beyond 2025, ULTRASAT will rapidly detect the electromagnetic emission following BNS/NS-BH mergers identified by GW detectors, and will provide continuous NUV light-curves of the events; ULTRASAT will provide early (hour) detection and continuous high (minutes) cadence NUV light curves for hundreds of core-collapse supernovae, including for rarer supernova progenitor types.

Submitted 27 April, 2023; originally announced April 2023.

Comments: 40 pages, 16 figures, 3 tables. Submitted to the AAS journals

arXiv:2304.03813 [pdf, ps, other]

Leveraging the Hankel norm approximation and block-AAA algorithms in reduced order modeling

Authors: Annan Yu, Alex Townsend

Abstract: Large-scale linear, time-invariant (LTI) dynamical systems are widely used to characterize complicated physical phenomena. We propose a two-stage algorithm to reduce the order of a large-scale LTI system given samples of its transfer function for a target degree $k$ of the reduced system. In the first stage, a modified adaptive Antoulas--Anderson (AAA) algorithm is used to construct a degree $d$ rational approximation of the transfer function that corresponds to an intermediate system, which can be numerically stably reduced in the second stage using ideas from the theory on Hankel norm approximation (HNA). We also study the numerical issues of Glover's HNA algorithm and provide a remedy for its numerical instabilities. A carefully computed rational approximation of degree $d$ gives us a numerically stable algorithm for reducing an LTI system, which is more efficient than SVD-based algorithms and more accurate than moment-matching algorithms.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 25 pages, 5 figures

MSC Class: 41A20; 65D15; 93C05

arXiv:2302.12888 [pdf, other]

doi 10.1073/pnas.2303904120

Elliptic PDE learning is provably data-efficient

Authors: Nicolas Boullé, Diana Halikias, Alex Townsend

Abstract: PDE learning is an emerging field that combines physics and machine learning to recover unknown physical systems from experimental data. While deep learning models traditionally require copious amounts of training data, recent PDE learning techniques achieve spectacular results with limited data availability. Still, these results are empirical. Our work provides theoretical guarantees on the number of input-output training pairs required in PDE learning. Specifically, we exploit randomized numerical linear algebra and PDE theory to derive a provably data-efficient algorithm that recovers solution operators of 3D uniformly elliptic PDEs from input-output data and achieves an exponential convergence rate of the error with respect to the size of the training dataset with an exceptionally high probability of success.

Submitted 19 September, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

Comments: 25 pages, 2 figures

Journal ref: Proc. Natl. Acad. Sci. USA 120(39) (2023), e2303904120

arXiv:2302.08552 [pdf, other]

doi 10.1117/1.JATIS.8.4.045004

Initial Fabrication and Characterization of Chemically-Etched Silicon Slits for KOSMOS

Authors: Debby Tran, Sarah Tuttle, Kal Kadlec, Rishi Pahuja, Ali C. Jones, William Ketzeback, Russet McMillan, Amanda Townsend

Abstract: KOSMOS is a low-resolution, long-slit, optical spectrograph that has been upgraded at the University of Washington for its move from Kitt Peak National Observatory's Mayall 4m telescope to the Apache Point Observatory's ARC 3.5m telescope. One of the additions to KOSMOS is a slitviewer, which requires the fabrication of reflective slits, as KOSMOS previously used matte slits machined via wire EDM. We explore a novel method of slit fabrication using nanofabrication methods and compare the slit edge roughness, width uniformity, and the resulting scattering of the new fabricated slits to the original slits. We find the kerf surface of the chemically-etched reflective silicon slits are generally smoother than the machined matte slits, with an upper limit average roughness of 0.42 $\pm$ 0.03 $μ$m versus 1.06 $\pm$ 0.04 $μ$m respectively. The etched slits have width standard deviations of 6 $\pm$ 3 $μ$m versus 10 $\pm$ 6 $μ$m, respectively. The scattering for the chemically-etched slits is higher than that of the machined slits, showing that the reflectivity is the major contributor to scattering, not the roughness. This scattering, however, can be effectively reduced to zero with proper background subtraction. As slit widths increase, scattering increases for both types of slits, as expected. Future work will consist of testing and comparing the throughput and spectrophotometric data quality of these nanofabricated slits to the machined slits with on-sky data, in addition to making the etched slits more robust against breakage and finalizing the slit manufacturing process.

Submitted 16 February, 2023; originally announced February 2023.

Comments: 36 pages, 12 figures, accepted by SPIE

Journal ref: J. Astron. Telesc. Instrum. Syst. 8(4), 045004 (2022)

arXiv:2302.07202 [pdf, other]

Are sketch-and-precondition least squares solvers numerically stable?

Authors: Maike Meier, Yuji Nakatsukasa, Alex Townsend, Marcus Webb

Abstract: Sketch-and-precondition techniques are efficient and popular for solving large least squares (LS) problems of the form $Ax=b$ with $A\in\mathbb{R}^{m\times n}$ and $m\gg n$. This is where $A$ is ``sketched" to a smaller matrix $SA$ with $S\in\mathbb{R}^{\lceil cn\rceil\times m}$ for some constant $c>1$ before an iterative LS solver computes the solution to $Ax=b$ with a right preconditioner $P$, where $P$ is constructed from $SA$. Prominent sketch-and-precondition LS solvers are Blendenpik and LSRN. We show that the sketch-and-precondition technique in its most commonly used form is not numerically stable for ill-conditioned LS problems. For provable and practical backward stability and optimal residuals, we suggest using an unpreconditioned iterative LS solver on $(AP)z=b$ with $x=Pz$. Provided the condition number of $A$ is smaller than the reciprocal of the unit round-off, we show that this modification ensures that the computed solution has a backward error comparable to the iterative LS solver applied to a well-conditioned matrix. Using smoothed analysis, we model floating-point rounding errors to argue that our modification is expected to compute a backward stable solution even for arbitrarily ill-conditioned LS problems. Additionally, we provide experimental evidence that using the sketch-and-solve solution as a starting vector in sketch-and-precondition algorithms (as suggested by Rokhlin and Tygert in 2008) should be highly preferred over the zero vector. The initialization often results in much more accurate solutions -- albeit not always backward stable ones.

Submitted 10 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

Comments: 25 pages

arXiv:2212.09841 [pdf, ps, other]

Structured matrix recovery from matrix-vector products

Authors: Diana Halikias, Alex Townsend

Abstract: Can one recover a matrix efficiently from only matrix-vector products? If so, how many are needed? This paper describes algorithms to recover matrices with known structures, such as tridiagonal, Toeplitz, Toeplitz-like, and hierarchical low-rank, from matrix-vector products. In particular, we derive a randomized algorithm for recovering an $N \times N$ unknown hierarchical low-rank matrix from only $\mathcal{O}((k+p)\log(N))$ matrix-vector products with high probability, where $k$ is the rank of the off-diagonal blocks, and $p$ is a small oversampling parameter. We do this by carefully constructing randomized input vectors for our matrix-vector products that exploit the hierarchical structure of the matrix. While existing algorithms for hierarchical matrix recovery use a recursive "peeling" procedure based on elimination, our approach uses a recursive projection procedure.

Submitted 29 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.
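
Illustrative note on the entry above: the simplest structured case named in the abstract, a tridiagonal matrix, can be recovered from three matrix-vector products. The sketch below uses a standard probing construction with ones at indices congruent to 0, 1, and 2 modulo 3. It is not necessarily the algorithm given in the paper, and the name `recover_tridiagonal` and the 5 x 5 example matrix are assumptions made for illustration.

```python
def recover_tridiagonal(matvec, n):
    """Recover an n x n tridiagonal matrix from three products v -> A v."""
    recovered = [[0.0] * n for _ in range(n)]
    for j in range(3):
        # Probe vector with ones at indices congruent to j modulo 3.
        probe = [1.0 if i % 3 == j else 0.0 for i in range(n)]
        response = matvec(probe)
        for r in range(n):
            # Row r has nonzeros only in columns r-1, r, r+1, and at most
            # one of those columns is hit by this probe, so response[r]
            # is exactly that single entry.
            for c in (r - 1, r, r + 1):
                if 0 <= c < n and c % 3 == j:
                    recovered[r][c] = response[r]
    return recovered


# Hypothetical tridiagonal example, accessed only through products.
A = [[4.0, 1.0, 0.0, 0.0, 0.0],
     [2.0, 5.0, 3.0, 0.0, 0.0],
     [0.0, 6.0, 7.0, 8.0, 0.0],
     [0.0, 0.0, 9.0, 1.0, 2.0],
     [0.0, 0.0, 0.0, 3.0, 4.0]]

calls = []


def matvec(v):
    calls.append(1)  # record each product so the total can be counted
    return [sum(a * x for a, x in zip(row, v)) for row in A]


recovered = recover_tridiagonal(matvec, n=5)
print(recovered == A, len(calls))
```

The number of products here does not depend on the matrix size, which is the kind of saving that known structure makes possible.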

arXiv:2211.00656 [pdf, other]

doi 10.1038/s41550-023-01981-3

Uncovering a population of gravitational lens galaxies with magnified standard candle SN Zwicky

Authors: Ariel Goobar, Joel Johansson, Steve Schulze, Nikki Arendse, Ana Sagués Carracedo, Suhail Dhawan, Edvard Mörtsell, Christoffer Fremling, Lin Yan, Daniel Perley, Jesper Sollerman, Rémy Joseph, K-Ryan Hinds, William Meynardie, Igor Andreoni, Eric Bellm, Josh Bloom, Thomas E. Collett, Andrew Drake, Matthew Graham, Mansi Kasliwal, Shri Kulkarni, Cameron Lemon, Adam A. Miller, James D. Neill , et al. (13 additional authors not shown)

Abstract: Detecting gravitationally lensed supernovae is among the biggest challenges in astronomy. It involves a combination of two very rare phenomena: catching the transient signal of a stellar explosion in a distant galaxy and observing it through a nearly perfectly aligned foreground galaxy that deflects light towards the observer. High-cadence optical observations with the Zwicky Transient Facility, with an unparalleled large field of view, led to the detection of a multiply-imaged Type Ia supernova (SN Ia), ``SN Zwicky", a.k.a. SN 2022qmx. Magnified nearly twenty-five times, the system was found thanks to the ``standard candle" nature of SNe Ia. High-spatial resolution imaging with the Keck telescope resolved four images of the supernova with very small angular separation, corresponding to an Einstein radius of only $θ_E =0.167"$ and almost identical arrival times. The small $θ_E$ and faintness of the lensing galaxy is very unusual, highlighting the importance of supernovae to fully characterise the properties of galaxy-scale gravitational lenses, including the impact of galaxy substructures.

Submitted 14 June, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

Comments: Matches published version in Nature Astronomy

arXiv:2211.00093 [pdf, other]

doi 10.1016/j.physletb.2023.137950

Collins and Sivers transverse-spin asymmetries in inclusive muoproduction of $ρ^0$ mesons

Authors: G. D. Alexeev, M. G. Alexeev, C. Alice, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, J. Barth, R. Beck, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, V. E. Burtsev, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin, P. M. M. Correia , et al. (167 additional authors not shown)

Abstract: The production of vector mesons in deep inelastic scattering is an interesting yet scarcely explored channel to study the transverse spin structure of the nucleon and the related phenomena. The COMPASS collaboration has performed the first measurement of the Collins and Sivers asymmetries for inclusively produced $ρ^0$ mesons. The analysis is based on the data set collected in deep inelastic scattering in $2010$ using a $160\,\,\rm{GeV}/c$ $μ^+$ beam impinging on a transversely polarized $\rm{NH}_3$ target. The $ρ^{0}$ mesons are selected from oppositely charged hadron pairs, and the asymmetries are extracted as a function of the Bjorken-$x$ variable, the transverse momentum of the pair and the fraction of the energy $z$ carried by the pair. Indications for positive Collins and Sivers asymmetries are observed.

Submitted 29 July, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

Report number: CERN-EP-2022--234

arXiv:2210.16932 [pdf, other]

Spin Density Matrix Elements in Exclusive $ρ^0$ Meson Muoproduction

Authors: G. D. Alexeev, M. G. Alexeev, C. Alice, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, J. Barth, R. Beck, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, V. E. Burtsev, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin, P. M. M. Correia , et al. (165 additional authors not shown)

Abstract: We report on a measurement of Spin Density Matrix Elements (SDMEs) in hard exclusive $ρ^0$ meson muoproduction at COMPASS using 160~GeV/$c$ polarised $ μ^{+}$ and $ μ^{-}$ beams impinging on a liquid hydrogen target. The measurement covers the kinematic range 5.0~GeV/$c^2$ $< W <$ 17.0~GeV/$c^2$, 1.0 (GeV/$c$)$^2$ $< Q^2 <$ 10.0 (GeV/$c$)$^2$ and 0.01 (GeV/$c$)$^2$ $< p_{\rm{T}}^2 <$ 0.5 (GeV/$c$)$^2$. Here, $W$ denotes the mass of the final hadronic system, $Q^2$ the virtuality of the exchanged photon, and $p_{\rm{T}}$ the transverse momentum of the $ρ^0$ meson with respect to the virtual-photon direction. The measured non-zero SDMEs for the transitions of transversely polarised virtual photons to longitudinally polarised vector mesons ($γ^*_T \to V^{ }_L$) indicate a violation of $s$-channel helicity conservation. Additionally, we observe a dominant contribution of natural-parity-exchange transitions and a very small contribution of unnatural-parity-exchange transitions, which is compatible with zero within experimental uncertainties. The results provide important input for modelling Generalised Parton Distributions (GPDs). In particular, they may allow one to evaluate in a model-dependent way the role of parton helicity-flip GPDs in exclusive $ρ^0$ production.

Submitted 29 July, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

Report number: CERN-EP-2022-231

arXiv:2210.12788 [pdf, other]

Expander graphs are globally synchronizing

Authors: Pedro Abdalla, Afonso S. Bandeira, Martin Kassabov, Victor Souza, Steven H. Strogatz, Alex Townsend

Abstract: The Kuramoto model is fundamental to the study of synchronization. It consists of a collection of oscillators with interactions given by a network, which we identify respectively with vertices and edges of a graph. In this paper, we show that a graph with sufficient expansion must be globally synchronizing, meaning that a homogeneous Kuramoto model of identical oscillators on such a graph will converge to the fully synchronized state with all the oscillators having the same phase, for every initial state up to a set of measure zero. In particular, we show that for any $\varepsilon > 0$ and $p \geq (1 + \varepsilon) (\log n) / n$, the homogeneous Kuramoto model on the Erdős-Rényi random graph $G(n, p)$ is globally synchronizing with probability tending to one as $n$ goes to infinity. This improves on a previous result of Kassabov, Strogatz, and Townsend and solves a conjecture of Ling, Xu, and Bandeira. We also show that the model is globally synchronizing on any $d$-regular Ramanujan graph, and on typical $d$-regular graphs, for large enough degree $d$.

Submitted 10 April, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

Comments: 34 pages, 3 figures
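
Illustrative note on the entry above: the homogeneous Kuramoto model on a graph with adjacency matrix $A$ is commonly written as $dθ_i/dt = \sum_j A_{ij} \sin(θ_j - θ_i)$. The sketch below integrates this system with forward Euler on a complete graph and reports the order parameter, which equals 1 when all phases coincide. The function names, the step size, the step count, and the choice of a complete graph on 8 vertices are assumptions made for illustration. A numerical run like this illustrates synchronization but proves nothing about global behaviour.

```python
import cmath
import math
import random


def simulate_kuramoto(adjacency, theta0, dt=0.01, steps=5000):
    """Forward-Euler steps of d(theta_i)/dt = sum_j A_ij sin(theta_j - theta_i)."""
    theta = list(theta0)
    n = len(theta)
    for _ in range(steps):
        velocity = [
            sum(adjacency[i][j] * math.sin(theta[j] - theta[i]) for j in range(n))
            for i in range(n)
        ]
        theta = [t + dt * v for t, v in zip(theta, velocity)]
    return theta


def order_parameter(theta):
    """Return |mean of exp(i*theta)|; equals 1 when all phases coincide."""
    return abs(sum(cmath.exp(1j * t) for t in theta) / len(theta))


# Hypothetical example: complete graph on 8 vertices, random initial phases.
n = 8
complete = [[0 if i == j else 1 for j in range(n)] for i in range(n)]
rng = random.Random(1)
start = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n)]
final = simulate_kuramoto(complete, start)
print(order_parameter(start), order_parameter(final))
```

Replacing `complete` with the adjacency matrix of a sparser graph is the natural experiment to try next, since the abstract concerns which graphs still force synchronization.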

arXiv:2210.06673 [pdf, other]

Probabilistic Missing Value Imputation for Mixed Categorical and Ordered Data

Authors: Yuxuan Zhao, Alex Townsend, Madeleine Udell

Abstract: Many real-world datasets contain missing entries and mixed data types including categorical and ordered (e.g. continuous and ordinal) variables. Imputing the missing entries is necessary, since many data analysis pipelines require complete data, but this is challenging especially for mixed data. This paper proposes a probabilistic imputation method using an extended Gaussian copula model that supports both single and multiple imputation. The method models mixed categorical and ordered data using a latent Gaussian distribution. The unordered characteristics of categorical variables are explicitly modeled using the argmax operator. The method makes no assumptions on the data marginals nor does it require tuning any hyperparameters. Experimental results on synthetic and real datasets show that imputation with the extended Gaussian copula outperforms the current state-of-the-art for both categorical and ordered variables in mixed data.

Submitted 12 October, 2022; originally announced October 2022.

Comments: Accepted by NeurIPS 2022

arXiv:2206.08042 [pdf, other]

doi 10.1093/mnras/stac1702

Forecasting cosmic acceleration measurements using the Lyman-$α$ forest

Authors: Chenxing Dong, Anthony Gonzalez, Stephen Eikenberry, Sarik Jeram, Manunya Likamonsavad, Jochen Liske, Deno Stelter, Amanda Townsend

Abstract: We present results from end-to-end simulations of observations designed to constrain the rate of change in the expansion history of the Universe using the redshift drift of the Lyman-$α$ forest absorption lines along the lines-of-sight toward bright quasars. For our simulations we take Lyman-$α$ forest lines extracted from Keck/HIRES spectra of bright quasars at $z>3$, and compare the results from these real quasar spectra with mock spectra generated via Monte Carlo realizations. We use the results of these simulations to assess the potential for a dedicated observatory to detect redshift drift, and quantify the telescope and spectrograph requirements for these observations. Relative to Liske et al. (2008), two main refinements in the current work are inclusion of quasars from more recent catalogs and consideration of a realistic observing strategy for a dedicated redshift drift experiment that maximizes $\dot{v}/σ_{\dot{v}}$. We find that using a dedicated facility and our designed observing plan, the redshift drift can be detected at $3σ$ significance in 15 years with a 25m telescope, given a spectrograph with long term stability with $R=50,000$ and 25% total system efficiency. To achieve this significance, the optimal number of targets is four quasars, with observing time weighted based upon $\dot{v}/σ_{\dot{v}}$ and object visibility. This optimized strategy leads to a 9% decrease in the telescope diameter or a 6% decrease in the required time to achieve the same S/N as for the idealized case of uniformly distributing time to the same quasars.

Submitted 16 July, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: 13 pages, 12 figures, accepted for publication in MNRAS

arXiv:2205.14300 [pdf, ps, other]

Tuning Frequency Bias in Neural Network Training with Nonuniform Data

Authors: Annan Yu, Yunan Yang, Alex Townsend

Abstract: Small generalization errors of over-parameterized neural networks (NNs) can be partially explained by the frequency biasing phenomenon, where gradient-based algorithms minimize the low-frequency misfit before reducing the high-frequency residuals. Using the Neural Tangent Kernel (NTK), one can provide a theoretically rigorous analysis for training where data are drawn from constant or piecewise-constant probability densities. Since most training data sets are not drawn from such distributions, we use the NTK model and a data-dependent quadrature rule to theoretically quantify the frequency biasing of NN training given fully nonuniform data. By replacing the loss function with a carefully selected Sobolev norm, we can further amplify, dampen, counterbalance, or reverse the intrinsic frequency biasing in NN training.

Submitted 25 September, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

MSC Class: 68T07; 68Q32

arXiv:2204.12789 [pdf, other]

Learning Green's functions associated with time-dependent partial differential equations

Authors: Nicolas Boullé, Seick Kim, Tianyi Shi, Alex Townsend

Abstract: Neural operators are a popular technique in scientific machine learning to learn a mathematical model of the behavior of unknown physical systems from data. Neural operators are especially useful to learn solution operators associated with partial differential equations (PDEs) from pairs of forcing functions and solutions when numerical solvers are not available or the underlying physics is poorly understood. In this work, we attempt to provide theoretical foundations to understand the amount of training data needed to learn time-dependent PDEs. Given input-output pairs from a parabolic PDE in any spatial dimension $n\geq 1$, we derive the first theoretically rigorous scheme for learning the associated solution operator, which takes the form of a convolution with a Green's function $G$. Until now, rigorously learning Green's functions associated with time-dependent PDEs has been a major challenge in the field of scientific machine learning because $G$ may not be square-integrable when $n>1$, and time-dependent PDEs have transient dynamics. By combining the hierarchical low-rank structure of $G$ together with randomized numerical linear algebra, we construct an approximant to $G$ that achieves a relative error of $\smash{\mathcal{O}(Γ_ε^{-1/2}ε)}$ in the $L^1$-norm with high probability by using at most $\smash{\mathcal{O}(ε^{-\frac{n+2}{2}}\log(1/ε))}$ input-output training pairs, where $Γ_ε$ is a measure of the quality of the training dataset for learning $G$, and $ε>0$ is sufficiently small.

Submitted 4 August, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

Comments: 34 pages, 3 figures

Journal ref: Journal of Machine Learning Research 23 (2022) 1-34

arXiv:2204.10295 [pdf, other]

Exploring the electric field around a loop of static charge: Rectangles, stadiums, ellipses, and knots

Authors: Max Lipton, Steven H Strogatz, Alex Townsend

Abstract: We study the electric field around a continuous one-dimensional loop of static charge, under the assumption that the charge is distributed uniformly along the loop. For rectangular or stadium-shaped loops in the plane, we find that the electric field can undergo a symmetry-breaking pitchfork bifurcation as the loop is elongated; the field can have either one or three zeros, depending on the loop's aspect ratio. For knotted charge distributions in three-dimensional space, we compute the electric field numerically and compare our results to previously published theoretical bounds on the number of equilibrium points around charged knots. Our computations reveal that the previous bounds are far from sharp. The numerics also suggest conjectures for the actual minimum number of equilibrium points for all charged knots with five or fewer crossings. In addition, we provide the first images of the equipotential surfaces around charged knots, and visualize their topological transitions as the level of the potential is varied.

Submitted 17 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

arXiv:2204.01817 [pdf, other]

doi 10.1016/j.physletb.2023.137702

Double $J/ψ$ production in pion-nucleon scattering at COMPASS

Authors: G. D. Alexeev, M. G. Alexeev, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, M. Ball, J. Barth, R. Beck, Y. Bedfer, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, V. E. Burtsev, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung, A. Cicuttin, P. M. M. Correia , et al. (170 additional authors not shown)

Abstract: We present the study of the production of double $J/ψ$ mesons using COMPASS data collected with a 190 GeV/$c$ $π^-$ beam scattering off NH$_{3}$, Al and W targets. Kinematic distributions of the collected double $J/ψ$ events are analysed, and the double $J/ψ$ production cross section is estimated for each of the COMPASS targets. The results are compared to predictions from single- and double-parton scattering models as well as the pion intrinsic charm and the tetraquark exotic resonance hypotheses. It is demonstrated that the single parton scattering production mechanism gives the dominant contribution that is sufficient to describe the data. An upper limit on the double intrinsic charm content of pion is evaluated. No significant signatures that could be associated with exotic tetraquarks are found in the double $J/ψ$ mass spectrum.

Submitted 4 April, 2022; originally announced April 2022.

Comments: 12 pages, 4 figures

Report number: CERN-EP-2022-073

arXiv:2203.03152 [pdf, ps, other]

doi 10.1063/5.0090443

A global synchronization theorem for oscillators on a random graph

Authors: Martin Kassabov, Steven H. Strogatz, Alex Townsend

Abstract: Consider $n$ identical Kuramoto oscillators on a random graph. Specifically, consider Erdős–Rényi random graphs in which any two oscillators are bidirectionally coupled with unit strength, independently and at random, with probability $0\leq p\leq 1$. We say that a network is globally synchronizing if the oscillators converge to the all-in-phase synchronous state for almost all initial conditions. Is there a critical threshold for $p$ above which global synchrony is extremely likely but below which it is extremely rare? It is suspected that a critical threshold exists and is close to the so-called connectivity threshold, namely, $p\sim \log(n)/n$ for $n \gg 1$. Ling, Xu, and Bandeira made the first progress toward proving a result in this direction: they showed that if $p\gg \log(n)/n^{1/3}$, then Erdős–Rényi networks of Kuramoto oscillators are globally synchronizing with high probability as $n\rightarrow\infty$. Here we improve that result by showing that $p\gg \log^2(n)/n$ suffices. Our estimates are explicit: for example, we can say that there is more than a $99.9996\%$ chance that a random network with $n = 10^6$ and $p>0.01117$ is globally synchronizing.

Submitted 7 March, 2022; originally announced March 2022.

Comments: 9 pages, 2 figures

arXiv:2202.04722 [pdf, other]

doi 10.1007/s10543-023-00965-z

On the stability of unevenly spaced samples for interpolation and quadrature

Authors: Annan Yu, Alex Townsend

Abstract: Unevenly spaced samples from a periodic function are common in signal processing and can often be viewed as a perturbed equally spaced grid. In this paper, we analyze how the uneven distribution of the samples impacts the quality of interpolation and quadrature. Starting with equally spaced nodes on $[-π,π)$ with grid spacing $h$, suppose the unevenly spaced nodes are obtained by perturbing each uniform node by an arbitrary amount $\leq αh$, where $0 \leq α< 1/2$ is a fixed constant. We prove a discrete version of the Kadec-1/4 theorem, which states that the nonuniform discrete Fourier transform associated with perturbed nodes has a bounded condition number independent of $h$, for any $α< 1/4$. We go on to show that unevenly spaced quadrature rules converge for all continuous functions and interpolants converge uniformly for all differentiable functions whose derivative has bounded variation when $0 \leq α< 1/4$. Though quadrature rules at perturbed nodes can have negative weights for any $α> 0$, we provide a bound on the absolute sum of the quadrature weights. Therefore, we show that perturbed equally spaced grids with small $α$ can be used without numerical woes. While our proof techniques work primarily when $0 \leq α< 1/4$, we show that a small amount of oversampling extends our results to the case when $1/4 \leq α< 1/2$.

Submitted 9 February, 2022; originally announced February 2022.

MSC Class: 42A15; 65D32; 94A20

Journal ref: Bit. Numer. Math. 63, 23 (2023)

arXiv:2111.14889 [pdf, other]

Rigorous data-driven computation of spectral properties of Koopman operators for dynamical systems

Authors: Matthew J. Colbrook, Alex Townsend

Abstract: Koopman operators are infinite-dimensional operators that globally linearize nonlinear dynamical systems, making their spectral information valuable for understanding dynamics. However, Koopman operators can have continuous spectra and infinite-dimensional invariant subspaces, making computing their spectral information a considerable challenge. This paper describes data-driven algorithms with rigorous convergence guarantees for computing spectral information of Koopman operators from trajectory data. We introduce residual dynamic mode decomposition (ResDMD), which provides the first scheme for computing the spectra and pseudospectra of general Koopman operators from snapshot data without spectral pollution. Using the resolvent operator and ResDMD, we compute smoothed approximations of spectral measures associated with general measure-preserving dynamical systems. We prove explicit convergence theorems for our algorithms, which can achieve high-order convergence even for chaotic systems when computing the density of the continuous spectrum and the discrete spectrum. Since our algorithms come with error control, ResDMD allows a posteriori verification of spectral quantities, Koopman mode decompositions, and learned dictionaries. We demonstrate our algorithms on the tent map, circle rotations, Gauss iterated map, nonlinear pendulum, double pendulum, and Lorenz system. Finally, we provide kernelized variants of our algorithms for dynamical systems with a high-dimensional state space. This allows us to compute the spectral measure associated with the dynamics of a protein molecule with a 20,046-dimensional state space and compute nonlinear Koopman modes with error bounds for turbulent flow past aerofoils with Reynolds number $>10^5$ that has a 295,122-dimensional state space.

Submitted 11 May, 2023; v1 submitted 29 November, 2021; originally announced November 2021.

MSC Class: 37M10; 65P99; 65F99; 65T99; 37A30; 47A10; 47B33; 37N10; 37N25

arXiv:2111.10448 [pdf, other]

Parallel algorithms for computing the tensor-train decomposition

Authors: Tianyi Shi, Maximilian Ruth, Alex Townsend

Abstract: The tensor-train (TT) decomposition expresses a tensor in a data-sparse format used in molecular simulations, high-order correlation functions, and optimization. In this paper, we propose four parallelizable algorithms that compute the TT format from various tensor inputs: (1) Parallel-TTSVD for traditional format, (2) PSTT and its variants for streaming data, (3) Tucker2TT for Tucker format, and (4) TT-fADI for solutions of Sylvester tensor equations. We provide theoretical guarantees of accuracy, parallelization methods, scaling analysis, and numerical results. For example, for a $d$-dimension tensor in $\mathbb{R}^{n\times\dots\times n}$, a two-sided sketching algorithm PSTT2 is shown to have a memory complexity of $\mathcal{O}(n^{\lfloor d/2 \rfloor})$, improving upon $\mathcal{O}(n^{d-1})$ from previous algorithms.

Submitted 19 November, 2021; originally announced November 2021.

Comments: 23 pages, 8 figures

MSC Class: 15A69; 65Y05; 65F55

arXiv:2109.11354 [pdf, other]

Arbitrary-Depth Universal Approximation Theorems for Operator Neural Networks

Authors: Annan Yu, Chloé Becquey, Diana Halikias, Matthew Esmaili Mallory, Alex Townsend

Abstract: The standard Universal Approximation Theorem for operator neural networks (NNs) holds for arbitrary width and bounded depth. Here, we prove that operator NNs of bounded width and arbitrary depth are universal approximators for continuous nonlinear operators. In our main result, we prove that for non-polynomial activation functions that are continuously differentiable at a point with a nonzero derivative, one can construct an operator NN of width five, whose inputs are real numbers with finite decimal representations, that is arbitrarily close to any given continuous nonlinear operator. We derive an analogous result for non-affine polynomial activation functions. We also show that depth has theoretical advantages by constructing operator ReLU NNs of depth $2k^3+8$ and constant width that cannot be well-approximated by any operator ReLU NN of depth $k$, unless its width is exponential in $k$.

Submitted 23 September, 2021; originally announced September 2021.

Comments: 12 pages

arXiv:2108.07293 [pdf, other]

doi 10.1051/0004-6361/202141227

High-resolution imaging with the International LOFAR Telescope: Observations of the gravitational lenses MG 0751+2716 and CLASS B1600+434

Authors: Shruti Badole, Deepika Venkattu, Neal Jackson, Sarah Wallace, Jiten Dhandha, Philippa Hartley, Christopher Riddell-Rovira, Alice Townsend, Leah K. Morabito, J. P. McKean

Abstract: We present Low-Frequency Array (LOFAR) telescope observations of the radio-loud gravitational lens systems MG 0751+2716 and CLASS B1600+434. These observations produce images at 300 milliarcseconds (mas) resolution at 150 MHz. In the case of MG 0751+2716, lens modelling is used to derive a size estimate of around 2 kpc for the low-frequency source, which is consistent with a previous 27.4 GHz study in the radio continuum with the Karl G. Jansky Very Large Array (VLA). This consistency implies that the low-frequency radio source is cospatial with the core-jet structure that forms the radio structure at higher frequencies, and no significant lobe emission or further components associated with star formation are detected within the magnified region of the lens. CLASS B1600+434 is a two-image lens where one of the images passes through the edge-on spiral lensing galaxy, and the low radio frequency allows us to derive limits on propagation effects, namely scattering, in the lensing galaxy. The observed flux density ratio of the two lensed images is 1.19 +/- 0.04 at an observed frequency of 150 MHz. The widths of the two images give an upper limit of 0.035 kpc m^-20/3 on the integrated scattering column through the galaxy at a distance approximately 1 kpc above its plane, under the assumption that image A is not affected by scattering. This is relatively small compared to limits derived through very long baseline interferometry (VLBI) studies of differential scattering in lens systems. These observations demonstrate that LOFAR is an excellent instrument for studying gravitational lenses. We also report on the inability to calibrate three further lens observations: two from early observations that have less well determined station calibration, and a third observation impacted by phase transfer problems.

Submitted 16 August, 2021; originally announced August 2021.

Comments: Accepted to a special issue of A&A on sub-arcsecond imaging with LOFAR

arXiv:2108.01744 [pdf, other]

doi 10.1103/PhysRevD.105.012005

The exotic meson $π_1(1600)$ with $J^{PC} = 1^{-+}$ and its decay into $ρ(770)π$

Authors: M. G. Alexeev, G. D. Alexeev, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, F. Balestra, M. Ball, J. Barth, R. Beck, Y. Bedfer, J. Berenguer Antequera, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, V. E. Burtsev, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung , et al. (171 additional authors not shown)

Abstract: We study the spin-exotic $J^{PC} = 1^{-+}$ amplitude in single-diffractive dissociation of 190 GeV$/c$ pions into $π^-π^-π^+$ using a hydrogen target and confirm the $π_1(1600) \to ρ(770) π$ amplitude, which interferes with a nonresonant $1^{-+}$ amplitude. We demonstrate that conflicting conclusions from previous studies on these amplitudes can be attributed to different analysis models and different treatment of the dependence of the amplitudes on the squared four-momentum transfer and we thus reconcile their experimental findings. We study the nonresonant contributions to the $π^-π^-π^+$ final state using pseudo-data generated on the basis of a Deck model. Subjecting pseudo-data and real data to the same partial-wave analysis, we find good agreement concerning the spectral shape and its dependence on the squared four-momentum transfer for the $J^{PC} = 1^{-+}$ amplitude and also for amplitudes with other $J^{PC}$ quantum numbers. We investigate for the first time the amplitude of the $π^-π^+$ subsystem with $J^{PC} = 1^{--}$ in the $3π$ amplitude with $J^{PC} = 1^{-+}$ employing the novel freed-isobar analysis scheme. We reveal this $π^-π^+$ amplitude to be dominated by the $ρ(770)$ for both the $π_1(1600)$ and the nonresonant contribution. We determine the $ρ(770)$ resonance parameters within the three-pion final state. These findings largely confirm the underlying assumptions for the isobar model used in all previous partial-wave analyses addressing the $J^{PC} = 1^{-+}$ amplitude.

Submitted 18 January, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

Journal ref: Phys.Rev.D 105 (2022) 1, 012005

arXiv:2105.13052 [pdf, other]

A generalization of the randomized singular value decomposition

Authors: Nicolas Boullé, Alex Townsend

Abstract: The randomized singular value decomposition (SVD) is a popular and effective algorithm for computing a near-best rank $k$ approximation of a matrix $A$ using matrix-vector products with standard Gaussian vectors. Here, we generalize the randomized SVD to multivariate Gaussian vectors, allowing one to incorporate prior knowledge of $A$ into the algorithm. This enables us to explore the continuous analogue of the randomized SVD for Hilbert--Schmidt (HS) operators using operator-function products with functions drawn from a Gaussian process (GP). We then construct a new covariance kernel for GPs, based on weighted Jacobi polynomials, which allows us to rapidly sample the GP and control the smoothness of the randomly generated functions. Numerical examples on matrices and HS operators demonstrate the applicability of the algorithm.

Submitted 21 January, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

Comments: Accepted at ICLR 2022

arXiv:2105.11406 [pdf, ps, other]

doi 10.1063/5.0057659

Sufficiently dense Kuramoto networks are globally synchronizing

Authors: Martin Kassabov, Steven H. Strogatz, Alex Townsend

Abstract: Consider any network of $n$ identical Kuramoto oscillators in which each oscillator is coupled bidirectionally with unit strength to at least $μ(n-1)$ other oscillators. There is a critical value of the connectivity, $μ_c$, such that whenever $μ>μ_c$, the system is guaranteed to converge to the all-in-phase synchronous state for almost all initial conditions, but when $μ<μ_c$, there are networks with other stable states. The precise value of the critical connectivity remains unknown, but it has been conjectured to be $μ_c=0.75$. In 2020, Lu and Steinerberger proved that $μ_c\leq 0.7889$, and Yoneda, Tatsukawa, and Teramae proved in 2021 that $μ_c > 0.6838$. In this paper, we prove that $μ_c\leq 0.75$ and explain why this is the best upper bound that one can obtain by a purely linear stability analysis.

Submitted 24 May, 2021; originally announced May 2021.

Comments: 6 pages, 1 figure

arXiv:2105.07324 [pdf, other]

Data-driven Algorithms for signal processing with trigonometric rational functions

Authors: Heather Wilber, Anil Damle, Alex Townsend

Abstract: Rational approximation schemes for reconstructing periodic signals from samples with poorly separated spectral content are described. These methods are automatic and adaptive, requiring no tuning or manual parameter selection. Collectively, they form a framework for fitting trigonometric rational models to data that is robust to various forms of corruption, including additive Gaussian noise, perturbed sampling grids, and missing data. Our approach combines a variant of Prony's method with a modified version of the AAA algorithm. Using representations in both frequency and time space, a collection of algorithms is described for adaptively computing with trigonometric rationals. This includes procedures for differentiation, filtering, convolution, and more. A new MATLAB software system based on these algorithms is introduced. Its effectiveness is illustrated with synthetic and practical examples drawn from applications including biomedical monitoring, acoustic denoising, and feature detection.

Submitted 9 December, 2021; v1 submitted 15 May, 2021; originally announced May 2021.

Comments: 25 pages, 7 figures

MSC Class: 41A20; 94A12

ACM Class: G.1.2

arXiv:2105.00266 [pdf, other]

doi 10.1038/s41598-022-08745-5

Data-driven discovery of Green's functions with human-understandable deep learning

Authors: Nicolas Boullé, Christopher J. Earls, Alex Townsend

Abstract: There is an opportunity for deep learning to revolutionize science and technology by revealing its findings in a human interpretable manner. To do this, we develop a novel data-driven approach for creating a human-machine partnership to accelerate scientific discovery. By collecting physical system responses under excitations drawn from a Gaussian process, we train rational neural networks to learn Green's functions of hidden linear partial differential equations. These functions reveal human-understandable properties and features, such as linear conservation laws and symmetries, along with shock and singularity locations, boundary effects, and dominant modes. We illustrate the technique on several examples and capture a range of physics, including advection-diffusion, viscous shocks, and Stokes flow in a lid-driven cavity.

Submitted 11 March, 2022; v1 submitted 1 May, 2021; originally announced May 2021.

Comments: 54 pages, 23 figures

arXiv:2104.13585 [pdf, other]

doi 10.1016/j.physletb.2021.136834

Probing transversity by measuring $Λ$ polarisation in SIDIS

Authors: M. G. Alexeev, G. D. Alexeev, A. Amoroso, V. Andrieux, V. Anosov, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, F. Balestra, M. Ball, J. Barth, R. Beck, Y. Bedfer, J. Berenguer Antequera, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, V. E. Burtsev, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov, S. -U. Chung , et al. (175 additional authors not shown)

Abstract: Based on the observation of sizeable target-transverse-spin asymmetries in single-hadron and hadron-pair production in Semi-Inclusive measurements of Deep Inelastic Scattering (SIDIS), the chiral-odd transversity quark distribution functions $h_1^q$ are nowadays well established. Several possible channels to access these functions were originally proposed. One candidate is the measurement of the polarisation of $Λ$ hyperons produced in SIDIS off transversely polarised nucleons, where the transverse polarisation of the struck quark might be transferred to the final-state hyperon. In this article, we present the COMPASS results on the transversity-induced polarisation of $Λ$ and $\bar{Λ}$ hyperons produced in SIDIS off transversely polarised protons. Within the experimental uncertainties, no significant deviation from zero was observed. The results are discussed in the context of different models taking into account previous experimental results on $h_1^u$ and $h_1^d$.

Submitted 29 April, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

Comments: 18 pages, 6 figures

Report number: CERN-EP-2021-072

arXiv:2103.16638 [pdf, other]

An optimal complexity spectral method for Navier--Stokes simulations in the ball

Authors: Nicolas Boullé, Jonasz Słomka, Alex Townsend

Abstract: We develop a spectral method for solving the incompressible generalized Navier--Stokes equations in the ball with no-flux and prescribed slip boundary conditions. The algorithm achieves an optimal complexity per time step of $\mathcal{O}(N\log^2(N))$, where $N$ is the number of spatial degrees of freedom. The method relies on the poloidal-toroidal decomposition of solenoidal vector fields, the double Fourier sphere method, the Fourier and ultraspherical spectral method, and the spherical harmonics transform to decouple the Navier--Stokes equations and achieve the desired complexity and spectral accuracy.

Submitted 20 April, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

Comments: 17 pages, 7 figures

arXiv:2102.00491 [pdf, other]

doi 10.1007/s10208-022-09556-w

Learning elliptic partial differential equations with randomized linear algebra

Authors: Nicolas Boullé, Alex Townsend

Abstract: Given input-output pairs of an elliptic partial differential equation (PDE) in three dimensions, we derive the first theoretically-rigorous scheme for learning the associated Green's function $G$. By exploiting the hierarchical low-rank structure of $G$, we show that one can construct an approximant to $G$ that converges almost surely and achieves a relative error of $\mathcal{O}(Γ_ε^{-1/2}\log^3(1/ε)ε)$ using at most $\mathcal{O}(ε^{-6}\log^4(1/ε))$ input-output training pairs with high probability, for any $0<ε<1$. The quantity $0<Γ_ε\leq 1$ characterizes the quality of the training dataset. Along the way, we extend the randomized singular value decomposition algorithm for learning matrices to Hilbert--Schmidt operators and characterize the quality of covariance kernels for PDE learning.

Submitted 21 January, 2022; v1 submitted 31 January, 2021; originally announced February 2021.

Comments: 25 pages, 4 figures

MSC Class: 65N80; 35J08; 35R30; 60G15; 65F55

arXiv:2010.15959 [pdf, other]

Over-parametrized neural networks as under-determined linear systems

Authors: Austin R. Benson, Anil Damle, Alex Townsend

Abstract: We draw connections between simple neural networks and under-determined linear systems to comprehensively explore several interesting theoretical questions in the study of neural networks. First, we emphatically show that it is unsurprising such networks can achieve zero training loss. More specifically, we provide lower bounds on the width of a single hidden layer neural network such that only training the last linear layer suffices to reach zero training loss. Our lower bounds grow more slowly with data set size than existing work that trains the hidden layer weights. Second, we show that kernels typically associated with the ReLU activation function have fundamental flaws -- there are simple data sets where it is impossible for widely studied bias-free models to achieve zero training loss irrespective of how the parameters are chosen or trained. Lastly, our analysis of gradient descent clearly illustrates how spectral properties of certain matrices impact both the early iteration and long-term training behavior. We propose new activation functions that avoid the pitfalls of ReLU in that they admit zero training loss solutions for any set of distinct data points and experimentally exhibit favorable spectral properties.

Submitted 29 October, 2020; originally announced October 2020.

Comments: 25 pages, 4 figures

MSC Class: 68T05; 68Q32; 46E22

arXiv:2009.03271 [pdf, other]

doi 10.1140/epjc/s10052-020-08740-y

Spin Density Matrix Elements in Exclusive $ω$ Meson Muoproduction $^*$

Authors: M. G. Alexeev, G. D. Alexeev, A. Amoroso, V. Andrieux, V. Anosov, A. Antoshkin, K. Augsten, W. Augustyniak, C. D. R. Azevedo, B. Badelek, F. Balestra, M. Ball, J. Barth, R. Beck, Y. Bedfer, J. Berenguer Antequera, J. Bernhard, M. Bodlak, F. Bradamante, A. Bressan, V. E. Burtsev, W. -C. Chang, C. Chatterjee, M. Chiosso, A. G. Chumakov , et al. (176 additional authors not shown)

Abstract: We report on a measurement of Spin Density Matrix Elements (SDMEs) in hard exclusive $ω$ meson muoproduction on the proton at COMPASS using 160 GeV/$c$ polarised $μ^{+}$ and $μ^{-}$ beams impinging on a liquid hydrogen target. The measurement covers the range 5.0 GeV/$c^2$ $< W <$ 17.0 GeV/$c^2$, with the average kinematics $\langle Q^{2} \rangle=$ 2.1 (GeV/$c$)$^2$, $\langle W \rangle= 7.6$ GeV/$c^2$, and $\langle p^{2}_{\rm T} \rangle = 0.16$ (GeV/$c$)$^2$. Here, $Q^2$ denotes the virtuality of the exchanged photon, $W$ the mass of the final hadronic system and $p_T$ the transverse momentum of the $ω$ meson with respect to the virtual-photon direction. The measured non-zero SDMEs for the transitions of transversely polarised virtual photons to longitudinally polarised vector mesons ($γ^*_T \to V_L$) indicate a violation of $s$-channel helicity conservation. Additionally, we observe a sizeable contribution of unnatural-parity-exchange (UPE) transitions that decreases with increasing $W$. The results provide important input for modelling Generalised Parton Distributions (GPDs). In particular, they may allow to evaluate in a model-dependent way the contribution of UPE transitions and assess the role of parton helicity-flip GPDs in exclusive $ω$ production.

Submitted 7 December, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

Comments: 31 pages, 12 figures, 1 appendix

Report number: CERN-EP-2020-169

Showing 1–50 of 99 results for author: Townsend, A