Search | arXiv e-print repository

arXiv:2203.11992 [pdf, other]

Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum

Authors: Kirby Banman, Liam Peet-Pare, Nidhi Hegde, Alona Fyshe, Martha White

Abstract: Most convergence guarantees for stochastic gradient descent with momentum (SGDm) rely on iid sampling. Yet, SGDm is often used outside this regime, in settings with temporally correlated input samples such as continual learning and reinforcement learning. Existing work has shown that SGDm with a decaying step-size can converge under Markovian temporal correlation. In this work, we show that SGDm under covariate shift with a fixed step-size can be unstable and diverge. In particular, we show SGDm under covariate shift is a parametric oscillator, and so can suffer from a phenomenon known as resonance. We approximate the learning system as a time varying system of ordinary differential equations, and leverage existing theory to characterize the system's divergence/convergence as resonant/nonresonant modes. The theoretical result is limited to the linear setting with periodic covariate shift, so we empirically supplement this result to show that resonance phenomena persist even under non-periodic covariate shift, nonlinear dynamics with neural networks, and optimizers other than SGDm.

Submitted 22 March, 2022; originally announced March 2022.

Comments: In International Conference on Learning Representations. 2021

arXiv:2203.08024 [pdf, other]

Snowmass 2021 CMB-S4 White Paper

Authors: Kevork Abazajian, Arwa Abdulghafour, Graeme E. Addison, Peter Adshead, Zeeshan Ahmed, Marco Ajello, Daniel Akerib, Steven W. Allen, David Alonso, Marcelo Alvarez, Mustafa A. Amin, Mandana Amiri, Adam Anderson, Behzad Ansarinejad, Melanie Archipley, Kam S. Arnold, Matt Ashby, Han Aung, Carlo Baccigalupi, Carina Baker, Abhishek Bakshi, Debbie Bard, Denis Barkats, Darcy Barron, Peter S. Barry, et al. (331 additional authors not shown)

Abstract: This Snowmass 2021 White Paper describes the Cosmic Microwave Background Stage 4 project CMB-S4, which is designed to cross critical thresholds in our understanding of the origin and evolution of the Universe, from the highest energies at the dawn of time through the growth of structure to the present day. We provide an overview of the science case, the technical design, and project plan.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021. arXiv admin note: substantial text overlap with arXiv:1908.01062, arXiv:1907.04473

arXiv:2203.07883 [pdf, other]

Global fit of 2HDM with future collider results

Authors: Ankit Beniwal, Filip Rajec, Markus Tobias Prim, Pat Scott, Wei Su, Martin White, Anthony G. Williams, Alex Woodcock

Abstract: In this work, we summarize a global fit study of Type-II two Higgs doublet models (2HDM), and explore the impact of future SM-like Higgs and Z-pole precision measurements on the allowed parameter space. The work is based on the study results of a global fit of 2HDMs with the tool GAMBIT, utilising various current constraints including theoretical constraints (unitarity, perturbativity and vacuum stability), Higgs searches at colliders, electroweak physics and flavour constraints. We further investigate the ability of future facilities, such as the HL-LHC, CEPC, ILC and FCC-ee to explore the 2HDM parameter space.

Submitted 15 March, 2022; originally announced March 2022.

Comments: 9 pages, 3 plots, contribution to Snowmass 2021

Report number: KIAS--P22014, gambit-proceedings-2022

arXiv:2203.07734 [pdf, other]

doi 10.1093/mnras/stac1413

Disc instability and bar formation: view from the IllustrisTNG simulations

Authors: David Izquierdo-Villalba, Silvia Bonoli, Yetli Rosas-Guevara, Volker Springel, Simon D. M. White, Tommaso Zana, Massimo Dotti, Daniele Spinoso, Matteo Bonetti, Alessandro Lupi

Abstract: We make use of z = 0 samples of strongly barred and unbarred disc galaxies from the TNG100 and TNG50 cosmological hydrodynamical simulations to assess the performance of the simple disc instability criterion proposed by Efstathiou, Lake & Negroponte (1982) (ELN-criterion). We find that strongly barred galaxies generally assemble earlier, are more star-dominated in their central regions, and have more massive and more compact discs than unbarred galaxies. The ELN-criterion successfully identifies ~75% and ~80% of the strongly barred and the unbarred galaxies, respectively. Strongly barred galaxies that the criterion fails to identify tend to have more extended discs, higher spin values and bars that assembled later than is typical for the bulk of the barred population. The bars in many of these cases appear to be produced by an interaction with a close neighbour (i.e. to be externally triggered) rather than to result from secular growth in the disc. On the other hand, we find that unbarred galaxies misclassified as barred by the ELN-criterion typically have stellar discs similar to those of barred galaxies, although more extended in the vertical direction and less star-dominated in their central regions, possibly reflecting later formation times. In addition, the bulge component of these galaxies is significantly more prominent at early times than in the strongly barred sample. Thus, the ELN-criterion robustly identifies secular bar instabilities in most simulated disc galaxies, but additional environmental criteria are needed to account for interaction-induced bar formation.

Submitted 19 May, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: 15 pages, 12 Figures, Accepted by MNRAS

arXiv:2203.07506 [pdf, other]

Snowmass2021 Cosmic Frontier White Paper: Cosmology and Fundamental Physics from the three-dimensional Large Scale Structure

Authors: Simone Ferraro, Noah Sailer, Anze Slosar, Martin White

Abstract: Advances in experimental techniques make it possible to map the high redshift Universe in three dimensions at high fidelity in the near future. This will increase the observed volume by many-fold, while providing unprecedented access to very large scales, which hold key information about primordial physics. Recently developed theoretical techniques, together with the smaller size of non-linearities at high redshift, allow the reconstruction of an order of magnitude more "primordial modes", and should improve our understanding of the early Universe through measurements of primordial non-Gaussianity and features in the primordial power spectrum. In addition to probing the first epoch of accelerated expansion, such measurements can probe the Dark Energy density in the dark matter domination era, tightly constraining broad classes of dynamical Dark Energy models. The shape of the matter power spectrum itself has the potential to detect sub-percent fractional amounts of Early Dark Energy to $z \sim 10^5$, probing Dark Energy all the way to when the Universe was only a few years old. The precision of these measurements, combined with CMB observations, also has the promise of greatly improving our constraints on the effective number of relativistic species, the masses of neutrinos, the amount of spatial curvature and the gravitational slip. Studies of linear or quasi-linear large-scale structure with redshift surveys and the CMB currently provide our tightest constraints on cosmology and fundamental physics. Pushing the redshift and volume frontier will provide guaranteed, significant improvements in the state-of-the-art in a manner that is easy to forecast and optimize.

Submitted 9 September, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

Comments: 26 pages, 8 figures; Snowmass2021 Cosmic Frontier White Paper

arXiv:2203.07291 [pdf, other]

Snowmass2021 Cosmic Frontier White Paper: High Density Galaxy Clustering in the Regime of Cosmic Acceleration

Authors: Kyle Dawson, Andrew Hearin, Katrin Heitmann, Mustapha Ishak, Johannes Ulf Lange, Martin White, Rongpu Zhou

Abstract: Joint studies of imaging and spectroscopic samples, informed by theory and simulations, offer the potential for comprehensive tests of the cosmological model over redshifts z<1.5. Spectroscopic galaxy samples at these redshifts can be increased beyond the planned Dark Energy Spectroscopic Instrument (DESI) program by at least an order of magnitude, thus offering significantly more constraining power for these joint studies. Spectroscopic observations of these galaxies in the latter half of the 2020's and beyond would leverage the theory and simulation effort in this regime. In turn, these high density observations will allow enhanced tests of dark energy, physics beyond the standard model, and neutrino masses that will greatly exceed what is currently possible. Here, we present a coordinated program of simulations, theoretical modeling, and future spectroscopy that would enable precise cosmological studies in the accelerating epoch where the effects of dark energy are most apparent.

Submitted 14 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2022, CF04: Dark energy and cosmic acceleration: the modern universe

arXiv:2203.06795 [pdf, other]

Snowmass2021: Opportunities from Cross-survey Analyses of Static Probes

Authors: Eric J. Baxter, Chihway Chang, Andrew Hearin, Jonathan Blazek, Lindsey E. Bleem, Simone Ferraro, Mustapha Ishak, Kirit S. Karkare, Alexie Leauthaud, Jia Liu, Rachel Mandelbaum, Joel Meyers, Azadeh Moradinezhad Dizgah, Daisuke Nagai, Jeffrey A. Newman, Yuuki Omori, Neelima Sehgal, Martin White, Joe Zuntz, Marcelo A. Alvarez, Camille Avestruz, Federico Bianchini, Sebastian Bocquet, Boris Bolliet, John E. Carlstrom, et al. (15 additional authors not shown)

Abstract: Cosmological data in the next decade will be characterized by high-precision, multi-wavelength measurements of thousands of square degrees of the same patches of sky. By performing multi-survey analyses that harness the correlated nature of these datasets, we will gain access to new science, and increase the precision and robustness of science being pursued by each individual survey. However, effective application of such analyses requires a qualitatively new level of investment in cross-survey infrastructure, including simulations, associated modeling, coordination of data sharing, and survey strategy. The scientific gains from this new level of investment are multiplicative, as the benefits can be reaped by even present-day instruments, and can be applied to new instruments as they come online.

Submitted 16 May, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021

arXiv:2203.02840 [pdf, other]

doi 10.1051/0004-6361/202143015

Dielectric properties and stratigraphy of regolith in the lunar South Pole-Aitken basin: Observations from the Lunar Penetrating Radar

Authors: Jianqing Feng, Matthew A. Siegler, Mackenzie N. White

Abstract: We examine data obtained by the Lunar Penetrating Radar (LPR) onboard the Chang'E-4 (CE-4) mission to study the dielectric properties and stratigraphy of lunar regolith on the far side of the Moon. The data collected from January 2019 to September 2020 were processed to generate a 540 m radargram. The travel velocity of the radar signal and the permittivity of the regolith were deduced from hyperbolas in the radargram. As CE-4 LPR detected distinct planar reflectors, we evaluated the dielectric loss from the maximum penetration depth based on the radar equation. The derived dielectric properties are compared with the measurements of Apollo samples and Chang'E-2 microwave radiometer observations. The results suggest that regolith at the landing site has a permittivity of 2.64-3.85 and a loss tangent of 0.0032-0.0044, indicating that the local regolith is composed of a fine-grained, low-loss material that is much more homogeneous than that found at the Chang'E-3 landing site. The total thickness of weathered material is 40 m, with several regolith layers and a buried crater identified in the reconstructed subsurface structure. These layers clearly record a series of impact events from the adjacent regions. We suggest that the top layer is primarily made up of the ejecta from a large crater 140 km away. In contrast, the material source of other thinner layers comes from nearby smaller craters.

Submitted 5 March, 2022; originally announced March 2022.

Comments: 9 pages, 9 figures

Journal ref: A&A 661, A47 (2022)

arXiv:2203.02419 [pdf, other]

doi 10.1103/PhysRevLett.130.046202

Momentum-Resolved Exciton Coupling and Valley Polarization Dynamics in Monolayer WS$_2$

Authors: Alice Kunin, Sergey Chernov, ** Bakalis, Ziling Li, Shuyu Cheng, Zachary H. Withers, Michael G. White, Gerd Schönhense, Xu Du, Roland K. Kawakami, Thomas K. Allison

Abstract: Coupling between exciton states across the Brillouin zone in monolayer transition metal dichalcogenides can lead to ultrafast valley depolarization. Using time- and angle-resolved photoemission, we present momentum- and energy-resolved measurements of exciton coupling in monolayer WS$_2$. By comparing full 4D ($k_x, k_y, E, t$) data sets after both linearly and circularly polarized excitation, we are able to disentangle intervalley and intravalley exciton coupling dynamics. Recording in the exciton binding energy basis instead of excitation energy, we observe strong mixing between the B$_{1s}$ exciton and A$_{n>1}$ states. The photoelectron energy and momentum distributions observed from excitons populated via intervalley coupling (e.g. K$^-$ $\rightarrow$ K$^+$) indicate that the dominant valley depolarization mechanism conserves the exciton binding energy and center-of-mass momentum, consistent with intervalley Coulomb exchange. On longer timescales, exciton relaxation is accompanied by contraction of the momentum space distribution.

Submitted 4 March, 2022; originally announced March 2022.

Comments: 16 pages: 8 pages main text with 5 figures, 8 pages SI with 6 figures

Journal ref: Phys. Rev. Lett. 130, 046202 (2023)

arXiv:2203.00814 [pdf, other]

doi 10.1021/jacs.1c11335

Stacking Faults Assist Lithium-Ion Conduction in a Halide-Based Superionic Conductor

Authors: Elias Sebti, Hayden A. Evans, Hengning Chen, Peter M. Richardson, Kelly M. White, Raynald Giovine, Krishna Prasad Koirala, Yaobin Xu, Eliovardo Gonzalez-Correa, Chongmin Wang, Craig M. Brown, Anthony K. Cheetham, Pieremanuele Canepa, Raphaële J. Clément

Abstract: In the pursuit of urgently-needed, energy dense solid-state batteries for electric vehicle and portable electronics applications, halide solid electrolytes offer a promising path forward with exceptional compatibility against high-voltage oxide electrodes, tunable ionic conductivities, and facile processing. For this family of compounds, synthesis protocols strongly affect cation site disorder and modulate Li+ mobility. In this work, we reveal the presence of a high concentration of stacking faults in the superionic conductor Li3YCl6 and demonstrate a method of controlling its Li+ conductivity by tuning the defect concentration with synthesis and heat treatments at select temperatures. Leveraging complementary insights from variable temperature synchrotron X-ray diffraction, neutron diffraction, cryogenic transmission electron microscopy, solid-state nuclear magnetic resonance, density functional theory, and electrochemical impedance spectroscopy, we identify the nature of planar defects and the role of nonstoichiometry in lowering Li+ migration barriers and increasing Li site connectivity in mechanochemically-synthesized Li3YCl6. We harness paramagnetic relaxation enhancement to enable 89Y solid-state NMR, and directly contrast the Y cation site disorder resulting from different preparation methods, demonstrating a potent tool for other researchers studying Y-containing compositions. With heat treatments at temperatures as low as 333 K (60°C), we decrease the concentration of planar defects, demonstrating a simple method for tuning the Li+ conductivity. Findings from this work are expected to be generalizable to other halide solid electrolyte candidates and provide an improved understanding of defect-enabled Li+ conduction in this class of Li-ion conductors.

Submitted 1 March, 2022; originally announced March 2022.

arXiv:2202.11133 [pdf, other]

Continual Auxiliary Task Learning

Authors: Matthew McLeod, Chunlok Lo, Matthew Schlegel, Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White

Abstract: Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there is little work on how to adapt the behavior to gather useful data for those off-policy predictions. In this work, we investigate a reinforcement learning system designed to learn a collection of auxiliary tasks, with a behavior policy learning to take actions to improve those auxiliary predictions. We highlight the inherent non-stationarity in this continual auxiliary task learning problem, for both prediction learners and the behavior learner. We develop an algorithm based on successor features that facilitates tracking under non-stationary rewards, and prove the separation into learning successor features and rewards provides convergence rate improvements. We conduct an in-depth study into the resulting multi-prediction learning system.

Submitted 22 February, 2022; originally announced February 2022.

Comments: Neural Information Processing Systems 2021

arXiv:2202.07440 [pdf, other]

doi 10.1093/mnras/stac2938

Consistent lensing and clustering in a low-$S_8$ Universe with BOSS, DES Year 3, HSC Year 1 and KiDS-1000

Authors: A. Amon, N. C. Robertson, H. Miyatake, C. Heymans, M. White, J. DeRose, S. Yuan, R. H. Wechsler, T. N. Varga, S. Bocquet, A. Dvornik, S. More, A. J. Ross, H. Hoekstra, A. Alarcon, M. Asgari, J. Blazek, A. Campos, R. Chen, A. Choi, M. Crocce, H. T. Diehl, C. Doux, K. Eckert, J. Elvin-Poole, et al. (83 additional authors not shown)

Abstract: We evaluate the consistency between lensing and clustering probes of large-scale structure based on measurements of projected galaxy clustering from BOSS combined with overlapping galaxy-galaxy lensing from three surveys: DES Y3, HSC Y1, and KiDS-1000. An intra-lensing-survey study finds good agreement between these lensing data. We model the observations using the Dark Emulator and fit the data at two fixed cosmologies: Planck, with $S_8=0.83$, and a Lensing cosmology with $S_8=0.76$. For a joint analysis limited to scales with $R>5.25h^{-1}$Mpc, we find that both cosmologies provide an acceptable fit to the data. Full utilisation of the small-scale clustering and lensing measurements is hindered by uncertainty in the impact of baryon feedback and assembly bias, which we account for with a reasoned theoretical error budget. We incorporate a systematic scaling parameter for each redshift bin, $A$, that decouples the lensing and clustering to capture any inconsistency. When a wide range of scales ($0.15<R<60h^{-1}$Mpc) are incorporated, we find different results for the consistency of clustering and lensing between the two cosmologies. Limiting the analysis to the bins for which the impact of the selection of the lens sample is expected to be minimal, for the low-$S_8$ Lensing cosmology, the measurements are consistent with $A$=1; $A=0.91\pm0.04$ using DES+KiDS and $A=0.97\pm0.06$ using HSC. For the Planck cosmology case, we find a discrepancy: $A=0.79\pm0.03$ using DES+KiDS and $A=0.84\pm0.05$ using HSC. We demonstrate that a kSZ-based estimate for baryonic effects alleviates some of the discrepancy in the Planck cosmology. This analysis demonstrates the statistical power of these small-scale measurements, but also indicates that caution is still warranted given current uncertainties in modelling baryonic effects, assembly bias, and selection effects in the foreground sample.

Submitted 13 October, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

Comments: 28 pages, 11 figures

arXiv:2202.06074 [pdf, other]

doi 10.1093/mnras/stac1501

The DESI $N$-body Simulation Project -- II. Suppressing sample variance with fast simulations

Authors: Zhejie Ding, Chia-Hsun Chuang, Yu Yu, Lehman H. Garrison, Adrian E. Bayer, Yu Feng, Chirag Modi, Daniel J. Eisenstein, Martin White, Andrei Variu, Cheng Zhao, Hanyu Zhang, Jennifer Meneses Rizo, David Brooks, Kyle Dawson, Peter Doel, Enrique Gaztanaga, Robert Kehoe, Alex Krolewski, Martin Landriau, Nathalie Palanque-Delabrouille, Claire Poppett

Abstract: The Dark Energy Spectroscopic Instrument (DESI) will construct a large and precise three-dimensional map of our Universe. The survey effective volume reaches $\sim 20\,h^{-3}\,\mathrm{Gpc}^3$. It is a great challenge to prepare high-resolution simulations with a much larger volume for validating the DESI analysis pipelines. AbacusSummit is a suite of high-resolution dark-matter-only simulations designed for this purpose, with $200\,h^{-3}\,\mathrm{Gpc}^3$ (10 times DESI volume) for the base cosmology. However, further effort is needed to provide a more precise analysis of the data and to cover other cosmologies as well. Recently, the CARPool method was proposed to use paired accurate and approximate simulations to achieve high statistical precision with a limited number of high-resolution simulations. Relying on this technique, we propose to use fast quasi-$N$-body solvers combined with accurate simulations to produce accurate summary statistics. This enables us to obtain 100 times smaller variance than the expected DESI statistical variance at the scales we are interested in, e.g. $k < 0.3\,h\,\mathrm{Mpc}^{-1}$ for the halo power spectrum. In addition, it can significantly suppress the sample variance of the halo bispectrum. We further generalize the method for other cosmologies with only one realization in the AbacusSummit suite to extend the effective volume $\sim 20$ times. In summary, our proposed strategy of combining high-fidelity simulations with fast approximate gravity solvers and a series of variance suppression techniques sets the path for a robust cosmological analysis of galaxy survey data.

Submitted 18 June, 2022; v1 submitted 12 February, 2022; originally announced February 2022.

Comments: Matched version accepted by MNRAS, should be clearer

arXiv:2202.03955 [pdf, other]

doi 10.1051/0004-6361/202243191

Heating of the solar chromosphere through current dissipation

Authors: J. M. da Silva Santos, S. Danilovic, J. Leenaarts, J. de la Cruz Rodríguez, X. Zhu, S. M. White, G. J. M. Vissers, M. Rempel

Abstract: The solar chromosphere is heated to temperatures higher than predicted by radiative equilibrium. This excess heating is greater in active regions where the magnetic field is stronger. We aim to investigate the magnetic topology associated with an area of enhanced millimeter (mm) brightness temperatures in a solar active region mapped by the Atacama Large Millimeter/submillimeter Array (ALMA) using spectropolarimetric co-observations with the 1-m Swedish Solar Telescope (SST). We used Milne-Eddington inversions, nonlocal thermodynamic equilibrium (non-LTE) inversions, and a magnetohydrostatic extrapolation to obtain constraints on the three-dimensional stratification of temperature, magnetic field, and radiative energy losses. We compared the observations to a snapshot of a magnetohydrodynamics simulation and investigate the formation of the thermal continuum at 3 mm using contribution functions. We find enhanced heating rates in the upper chromosphere of up to $\sim 5\rm\,kW\,m^{-2}$, where small-scale emerging loops interact with the overlying magnetic canopy leading to current sheets as shown by the magnetic field extrapolation. Our estimates are about a factor of two higher than canonical values, but they are limited by the ALMA spatial resolution ($\sim 1.2^{\prime\prime}$). Band 3 brightness temperatures reach about $\sim10^{4}\,$K in the region, and the transverse magnetic field strength inferred from the non-LTE inversions is on the order of $\sim 500\,$G in the chromosphere. We are able to quantitatively reproduce many of the observed features, including the integrated radiative losses in our numerical simulation. We conclude that the heating is caused by dissipation in current sheets. However, the simulation shows a complex stratification in the flux emergence region where distinct layers may contribute significantly to the emission in the mm continuum.

Submitted 19 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: 18 pages, 13 figures. Accepted for publication in A&A. Typos and bibtex corrected

Journal ref: A&A 661, A59 (2022)

arXiv:2202.02396 [pdf, other]

A Temporal-Difference Approach to Policy Gradient Estimation

Authors: Samuele Tosatto, Andrew Patterson, Martha White, A. Rupam Mahmood

Abstract: The policy gradient theorem (Sutton et al., 2000) prescribes the usage of a cumulative discounted state distribution under the target policy to approximate the gradient. Most algorithms based on this theorem, in practice, break this assumption, introducing a distribution shift that can cause the convergence to poor solutions. In this paper, we propose a new approach of reconstructing the policy gradient from the start state without requiring a particular sampling strategy. The policy gradient calculation in this form can be simplified in terms of a gradient critic, which can be recursively estimated due to a new Bellman equation of gradients. By using temporal-difference updates of the gradient critic from an off-policy data stream, we develop the first estimator that sidesteps the distribution shift issue in a model-free way. We prove that, under certain realizability conditions, our estimator is unbiased regardless of the sampling strategy. We empirically show that our technique achieves a superior bias-variance trade-off and performance in presence of off-policy samples.

Submitted 7 July, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

arXiv:2202.01251 [pdf, other]

doi 10.1093/mnras/stac316

Pulsar Observations at Low Frequencies: Applications to Pulsar Timing and Solar Wind Models

Authors: P. Kumar, S. M. White, K. Stovall, J. Dowell, G. B. Taylor

Abstract: Efforts are underway to use high-precision timing of pulsars in order to detect low-frequency gravitational waves. A limit to this technique is the timing noise generated by dispersion in the plasma along the line of sight to the pulsar, including the solar wind. The effects due to the solar wind vary with time, influenced by the change in solar activity on different time scales, ranging up to $\sim 11$ years for a solar cycle. The solar wind contribution depends strongly on the angle between the pulsar line of sight and the solar disk, and is a dominant effect at small separations. Although solar wind models to mitigate these effects do exist, they do not account for all the effects of the solar wind and its temporal changes. Since low-frequency pulsar observations are most sensitive to these dispersive delays, they are most suited to test the efficacy of these models and identify alternative approaches. Here, we investigate the efficacy of some solar wind models commonly used in pulsar timing using long-term, high-cadence data on 6 pulsars taken with the Long Wavelength Array, and compare them with an operational solar wind model. Our results show that stationary models of the solar wind correction are insufficient to achieve the timing noise desired by pulsar timing experiments, and we need to use non-stationary models, which are informed by other solar wind observations, to obtain accurate timing residuals.

Submitted 2 February, 2022; originally announced February 2022.

Comments: Accepted for publication in MNRAS. 15 pages, 7 figures

arXiv:2201.12644 [pdf]

doi 10.2196/40667

Associations between depression symptom severity and daily-life gait characteristics derived from long-term acceleration signals in real-world settings

Authors: Yuezhou Zhang, Amos A Folarin, Shaoxiong Sun, Nicholas Cummins, Srinivasan Vairavan, Linglong Qian, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Callum Stewart, Petroula Laiou, Heet Sankesara, Faith Matcham, Katie M White, Carolin Oetzmann, Alina Ivan, Femke Lamers, Sara Siddi, Sara Simblett, Aki Rintala, David C Mohr, Inez Myin-Germeys, Til Wykes, Josep Maria Haro, Brenda WJH Penninx , et al. (5 additional authors not shown)

Abstract: Gait is an essential manifestation of depression. Laboratory gait characteristics have been found to be closely associated with depression. However, the gait characteristics of daily walking in real-world scenarios and their relationships with depression are yet to be fully explored. This study aimed to explore associations between depression symptom severity and daily-life gait characteristics derived from acceleration signals in real-world settings. In this study, we used two ambulatory datasets: a public dataset with 71 elder adults' 3-day acceleration signals collected by a wearable device, and a subset of an EU longitudinal depression study with 215 participants and their phone-collected acceleration signals (average 463 hours per participant). We detected participants' gait cycles and force from acceleration signals and extracted 20 statistics-based daily-life gait features to describe the distribution and variance of gait cadence and force over a long-term period corresponding to the self-reported depression score. The gait cadence of faster steps (75th percentile) over a long-term period has a significant negative association with the depression symptom severity of this period in both datasets. Daily-life gait features could significantly improve the goodness of fit of evaluating depression severity relative to laboratory gait patterns and demographics, which was assessed by likelihood-ratio tests in both datasets.
This study indicated that the significant links between daily-life walking characteristics and depression symptom severity could be captured by both wearable devices and mobile phones. The gait cadence of faster steps in daily-life walking has the potential to be a biomarker for evaluating depression severity, which may contribute to clinical tools to remotely monitor mental health in real-world settings.

Submitted 29 January, 2022; originally announced January 2022.

arXiv:2201.10373 [pdf]

doi 10.1088/1757-899X/1240/1/012087

Fabrication and installation of the Mu2e cryogenic distribution system

Authors: M. White, M. Lamm, A. Hocker, D. Arnold, G. Tatkowski, J. Kilmer, V. Poloubotko, T. Tope, Y. Huang, L. Elementi, K. Badgley, E. Voirin, I. Young, J. Brandt, S. Feher, C. Hess, D. Markley

Abstract: The muon-to-electron conversion (Mu2e) experiment at Fermilab will be used to search for the charged lepton flavor-violating conversion of muons to electrons in the field of an atomic nucleus. The Mu2e experiment is currently in the construction stage. The scope of this paper is the cryogenic distribution system and superconducting power leads for four superconducting solenoid magnets: Production Solenoid (PS), an Upstream and Downstream Transport Solenoids (TSu and TSd) and Detector Solenoid (DS). The design of the cryogenic distribution system and the fabrication of several sub-systems was reported previously. This paper reports on additional fabrication and installation progress that has been performed over the past two years. Lessons learned during fabrication and testing of the cryogenic distribution system components are described. In particular, the challenges and solutions implemented for aluminum welding are reported. A description of the process used to qualify the welding procedure and welders for welding the aluminium stabilized NbTi superconducting power leads is provided. Additionally, the progress made with regards to installing the power leads into the cryogenic Feedboxes is covered.

Submitted 25 January, 2022; originally announced January 2022.

Report number: FERMILAB-CONF-21-733-TD

arXiv:2201.03567 [pdf, other]

doi 10.1093/mnrasl/slac011

Dark matter annihilation and the Galactic Centre Excess

Authors: Robert J. J. Grand, Simon D. M. White

Abstract: We compare the surface brightness profile and morphology of the Galactic Centre Excess (GCE) identified in wide-angle $γ$-ray maps from the Fermi-Large Area Telescope to dark matter annihilation predictions derived from high-resolution $Λ$CDM magnetohydrodynamic simulations of galaxy formation. These simulations produce isolated, disc-dominated galaxies with structure, stellar populations, gas content, and stellar and halo masses comparable to those of the Milky Way. For a specific choice of annihilation cross-section, they agree well with the Fermi-LAT data over the full observed angular range, $1^{\circ}$ to $15^{\circ}$, whereas their dark-matter only counterparts, lacking any compression of the inner halo by the gravitational effects of the baryons, fail to predict emission as centrally concentrated as observed. These results provide additional support to the hypothesis that the GCE is produced by annihilating dark matter. If, however, it is produced by a different mechanism, they imply a strong upper limit on annihilation rates which can be translated into upper limits on the expected $γ$-ray flux not only from the inner Galaxy but also from any substructure, with or without stars, in the Galactic halo.

Submitted 28 January, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

Comments: Accepted to MNRAS letters following minor revision

arXiv:2112.11622 [pdf, other]

An Alternate Policy Gradient Estimator for Softmax Policies

Authors: Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, A. Rupam Mahmood

Abstract: Policy gradient (PG) estimators are ineffective in dealing with softmax policies that are sub-optimally saturated, which refers to the situation when the policy concentrates its probability mass on sub-optimal actions. Sub-optimal policy saturation may arise from bad policy initialization or sudden changes in the environment that occur after the policy has already converged. Current softmax PG estimators require a large number of updates to overcome policy saturation, which causes low sample efficiency and poor adaptability to new situations. To mitigate this problem, we propose a novel PG estimator for softmax policies that utilizes the bias in the critic estimate and the noise present in the reward signal to escape the saturated regions of the policy parameter space. Our theoretical analysis and experiments, conducted on bandits and various reinforcement learning environments, show that this new estimator is significantly more robust to policy saturation.

Submitted 24 February, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

Comments: Accepted to AISTATS 2022. 60 pages, 50 figures. This updated version has an additional experiment and minor corrections

arXiv:2112.07806 [pdf, other]

Representation Alignment in Neural Networks

Authors: Ehsan Imani, Wei Hu, Martha White

Abstract: It is now a standard for neural network representations to be trained on large, publicly available datasets, and used for new problems. The reasons for why neural network representations have been so successful for transfer, however, are still not fully understood. In this paper we show that, after training, neural network representations align their top singular vectors to the targets. We investigate this representation alignment phenomenon in a variety of neural network architectures and find that (a) alignment emerges across a variety of different architectures and optimizers, with more alignment arising from depth (b) alignment increases for layers closer to the output and (c) existing high-performance deep CNNs exhibit high levels of alignment. We then highlight why alignment between the top singular vectors and the targets can speed up learning and show in a classic synthetic transfer problem that representation alignment correlates with positive and negative transfer to similar and dissimilar tasks.

Submitted 17 September, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

Comments: 26 pages, 21 figures

arXiv:2112.05889 [pdf, other]

doi 10.1088/1475-7516/2022/04/056

Neural Network Acceleration of Large-scale Structure Theory Calculations

Authors: Joseph DeRose, Shi-Fan Chen, Martin White, Nickolas Kokron

Abstract: We make use of neural networks to accelerate the calculation of power spectra required for the analysis of galaxy clustering and weak gravitational lensing data. For modern perturbation theory codes, evaluation time for a single cosmology and redshift can take on the order of two seconds. In combination with the comparable time required to compute linear predictions using a Boltzmann solver, these calculations are the bottleneck for many contemporary large-scale structure analyses. In this work, we construct neural network-based surrogate models for Lagrangian perturbation theory (LPT) predictions of matter power spectra, real and redshift space galaxy power spectra, and galaxy--matter cross power spectra that attain $\sim 0.1\%$ (at one sigma) accuracy over a broad range of scales in a $w$CDM parameter space. The neural network surrogates can be evaluated in approximately one millisecond, a factor of 1000 times faster than the full Boltzmann code and LPT computations. In a simulated full-shape redshift space galaxy power spectrum analysis, we demonstrate that the posteriors obtained using our surrogates are accurate compared to those obtained using the full LPT model. We make our surrogate models public at https://github.com/sfschen/EmulateLSS, so that others may take advantage of the speed gains they provide to enable rapid iteration on analysis settings, something that is essential in complex contemporary large-scale structure analyses.

Submitted 14 December, 2021; v1 submitted 10 December, 2021; originally announced December 2021.

Comments: 12 pages, 4 figures, models available at https://github.com/sfschen/EmulateLSS

arXiv:2112.04946 [pdf, other]

Limits on atomic qubit control from laser noise

Authors: Matthew L Day, Pei Jiang Low, Brendan M White, Rajibul Islam, Crystal Senko

Abstract: Technical noise present in laser systems can limit their ability to perform high fidelity quantum control of atomic qubits. The ultimate fidelity floor for atomic qubits driven with laser radiation is due to spontaneous emission from excited energy levels. The goal is to suppress the technical noise from the laser source to below the spontaneous emission floor such that it is no longer a limiting factor. It has been shown that the spectral structure of control noise can have a large influence on achievable control fidelities, while prior studies of laser noise contributions have been restricted to noise magnitudes. Here, we study the unique spectral structure of laser noise and introduce a new metric that determines when a stabilised laser source has been optimised for quantum control of atomic qubits. We find requirements on stabilisation bandwidths that can be orders of magnitude higher than those required to simply narrow the linewidth of a laser. We introduce a new metric, the $χ$-separation line, that provides a tool for the study and engineering of laser sources for quantum control of atomic qubits below the spontaneous emission floor.

Submitted 9 December, 2021; originally announced December 2021.

arXiv:2112.00012 [pdf, other]

doi 10.1093/mnras/stac1420

Priors on red galaxy stochasticity from hybrid effective field theory

Authors: Nickolas Kokron, Joseph DeRose, Shi-Fan Chen, Martin White, Risa H. Wechsler

Abstract: We investigate the stochastic properties of typical red galaxy samples in a controlled numerical environment. We use Halo Occupation Distribution (HOD) modelling to create mock realizations of three separate bright red galaxy samples consistent with datasets used for clustering and lensing analyses in modern galaxy surveys. Second-order Hybrid Effective Field Theory (HEFT) is used as a field-level forward model to describe the full statistical distribution of these tracer samples, and their stochastic power spectra are directly measured and compared to the Poisson shot-noise prediction. While all of the galaxy samples we consider are hosted within haloes with sub-Poisson stochasticity, we observe that the galaxy samples themselves possess stochasticities that range from sub-Poisson to super-Poisson, in agreement with predictions from the halo model. As an application of our methodology, we place priors on the expected degree of non-Poisson stochasticity in cosmological analyses using such samples. We expect these priors will be useful in reducing the complexity of the full parameter space for future analyses using second-order Lagrangian bias models. More generally, the techniques outlined here present the first application of hybrid EFT methods to characterize models of the galaxy--halo connection at the field level, revealing new connections between once-disparate modelling frameworks.

Submitted 18 May, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

Comments: 16 pages, 10 figures. Revised version accepted to MNRAS. Revisions include a new appendix on the covariance of the field-level bias estimator

arXiv:2111.15051 [pdf, other]

Lower pressure phases and metastable states of superconducting photo-induced carbonaceous sulfur hydride

Authors: G. Alexander Smith, Ines E. Collings, Elliot Snider, Dean Smith, Sylvain Petitgirard, Jesse Smith, Melanie White, Elyse Jones, Paul Ellison, Keith V. Lawler, Ranga P. Dias, Ashkan Salamat

Abstract: Room-temperature superconductivity was recently discovered in carbonaceous sulfur hydride (C-S-H) close to 3\,Mbar. We report significant differences in the superconducting response of C-S-H, with a maximum $T_{C}$ of 191(1)\,K, below 1\,Mbar. Variations in intensity of the C-H Raman modes reveal carbon content can vary between crystals synthesized with the same photo-induced method. Synchrotron single crystal x-ray diffraction identifies polymorphism with increasing degrees of covalency. These unique metastable states are highly sensitive to thermodynamic pathways.

Submitted 29 November, 2021; originally announced November 2021.

arXiv:2111.09898 [pdf, other]

doi 10.1088/1475-7516/2022/02/007

Cosmological constraints from the tomographic cross-correlation of DESI Luminous Red Galaxies and Planck CMB lensing

Authors: Martin White, Rongpu Zhou, Joseph DeRose, Simone Ferraro, Shi-Fan Chen, Nickolas Kokron, Stephen Bailey, David Brooks, Juan Garcia-Bellido, Julien Guy, Klaus Honscheid, Robert Kehoe, Anthony Kremin, Michael Levi, Nathalie Palanque-Delabrouille, Claire Poppett, David Schlegel, Gregory Tarle

Abstract: We use luminous red galaxies selected from the imaging surveys that are being used for targeting by the Dark Energy Spectroscopic Instrument (DESI) in combination with CMB lensing maps from the Planck collaboration to probe the amplitude of large-scale structure over $0.4\le z\le 1$. Our galaxy sample, with an angular number density of approximately $500\,\mathrm{deg}^{-2}$ over 18,000 sq.deg., is divided into 4 tomographic bins by photometric redshift and the redshift distributions are calibrated using spectroscopy from DESI. We fit the galaxy autospectra and galaxy-convergence cross-spectra using models based on cosmological perturbation theory, restricting to large scales that are expected to be well described by such models. Within the context of $Λ$CDM, combining all 4 samples and using priors on the background cosmology from supernova and baryon acoustic oscillation measurements, we find $S_8=σ_8(Ω_m/0.3)^{0.5}=0.73\pm 0.03$. This result is lower than the prediction of the $Λ$CDM model conditioned on the Planck data. Our data prefer a slower growth of structure at low redshift than the model predictions, though at only modest significance.

Submitted 18 January, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

Comments: 44 pages, 16 figures. Matches version accepted by journal: more details on analysis, updated references, link to data added

arXiv:2111.08718 [pdf, other]

doi 10.1093/mnras/stab3537

Detecting low-mass haloes with strong gravitational lensing I: the effect of data quality and lensing configuration

Authors: Giulia Despali, Simona Vegetti, Simon D. M. White, Devon M. Powell, Hannah R. Stacey, Christopher D. Fassnacht, Francesca Rizzo, Wolfgang Enzi

Abstract: This paper aims to quantify how the lowest halo mass that can be detected with galaxy-galaxy strong gravitational lensing depends on the quality of the observations and the characteristics of the observed lens systems. Using simulated data, we measure the lowest detectable NFW mass at each location of the lens plane, in the form of detailed \emph{sensitivity maps}. In summary, we find that: (i) the lowest detectable mass $M_{\rm low}$ decreases linearly as the signal-to-noise ratio (SNR) increases and the sensitive area is larger when we decrease the noise; (ii) a moderate increase in angular resolution (0.07" vs 0.09") and pixel scale (0.01" vs 0.04") improves the sensitivity by on average 0.25 dex in halo mass, with more significant improvement around the most sensitive regions; (iii) the sensitivity to low-mass objects is largest for bright and complex lensed galaxies located inside the caustic curves and lensed into larger Einstein rings (i.e $r_{E}\geq1.0"$). We find that for the sensitive mock images considered in this work, the minimum mass that we can detect at the redshift of the lens lies between $1.5\times10^{8}$ and $3\times10^{9}M_{\odot}$. We derive analytic relations between $M_{\rm low}$, the SNR and resolution and discuss the impact of the lensing configuration and source structure. Our results start to fill the gap between approximate predictions and real data and demonstrate the challenging nature of calculating precise forecasts for gravitational imaging.
In light of our findings, we discuss possible strategies for designing strong lensing surveys and the prospects for HST, Keck, ALMA, Euclid and other future observations.

Submitted 4 December, 2021; v1 submitted 16 November, 2021; originally announced November 2021.

Comments: 17 pages, 11 figures, accepted for publication in MNRAS. Comments welcome

arXiv:2111.08172 [pdf, other]

Off-Policy Actor-Critic with Emphatic Weightings

Authors: Eric Graves, Ehsan Imani, Raksha Kumaraswamy, Martha White

Abstract: A variety of theoretically-sound policy gradient algorithms exist for the on-policy setting due to the policy gradient theorem, which provides a simplified form for the gradient. The off-policy setting, however, has been less clear due to the existence of multiple objectives and the lack of an explicit off-policy policy gradient theorem. In this work, we unify these objectives into one off-policy objective, and provide a policy gradient theorem for this unified objective. The derivation involves emphatic weightings and interest functions. We show multiple strategies to approximate the gradients, in an algorithm called Actor Critic with Emphatic weightings (ACE). We prove in a counterexample that previous (semi-gradient) off-policy actor-critic methods--particularly Off-Policy Actor-Critic (OffPAC) and Deterministic Policy Gradient (DPG)--converge to the wrong solution whereas ACE finds the optimal solution. We also highlight why these semi-gradient approaches can still perform well in practice, suggesting strategies for variance reduction in ACE. We empirically study several variants of ACE on two classic control environments and an image-based environment designed to illustrate the tradeoffs made by each gradient approximation. We find that by approximating the emphatic weightings directly, ACE performs as well as or better than OffPAC in all settings tested.

Submitted 13 April, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

Comments: 63 pages

Journal ref: Journal of Machine Learning Research 24 (2023) 1-63

arXiv:2111.08066 [pdf, other]

Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning

Authors: Vincent Liu, James R. Wright, Martha White

Abstract: Offline reinforcement learning -- learning a policy from a batch of data -- is known to be hard for general MDPs. These results motivate the need to look at specific classes of MDPs where offline reinforcement learning might be feasible. In this work, we explore a restricted class of MDPs to obtain guarantees for offline reinforcement learning. The key property, which we call Action Impact Regularity (AIR), is that actions primarily impact a part of the state (an endogenous component) and have limited impact on the remaining part of the state (an exogenous component). AIR is a strong assumption, but it nonetheless holds in a number of real-world domains including financial markets. We discuss algorithms that exploit the AIR property, and provide a theoretical analysis for an algorithm based on Fitted-Q Iteration. Finally, we demonstrate that the algorithm outperforms existing offline reinforcement learning algorithms across different data collection policies in simulated and real world environments where the regularity holds.

Submitted 3 May, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

Journal ref: Journal of Artificial Intelligence Research, 77 (2023) 71-101

arXiv:2110.14580 [pdf]

LCLS-II-HE verification cryomodule high gradient performance and quench behavior

Authors: S. Posen, A. Cravatta, M. Checchin, S. Aderhold, C. Adolphsen, T. Arkan, D. Bafia, A. Benwell, D. Bice, B. Chase, C. Contreras-Martinez, L. Dootlittle, J. Fuerst, D. Gonnella, A. Grassellino, C. Grimm, B. Hansen, E. Harms, B. Hartsell, G. Hays, J. Holzbauer, S. Hoobler, J. Kaluzny, T. Khabiboulline, M. Kucera , et al. (21 additional authors not shown)

Abstract: An 8-cavity, 1.3 GHz, LCLS-II-HE cryomodule was assembled and tested at Fermilab to verify performance before the start of production. Its cavities were processed with a novel nitrogen doping treatment to improve gradient performance. The cryomodule was tested with a modified protocol to process sporadic quenches, which were observed in LCLS-II production cryomodules and are attributed to multipacting. Dedicated vertical test experiments support the attribution to multipacting. The verification cryomodule achieved an acceleration voltage of 200 MV in continuous wave mode, corresponding to an average accelerating gradient of 24.1 MV/m, significantly exceeding the specification of 173 MV. The average Q0 (3.0x10^10) also exceeded its specification (2.7x10^10). After processing, no field emission was observed up to the maximum gradient of each cavity. This paper reviews the cryomodule performance and discusses operational issues and mitigations implemented during the several month program.

Submitted 27 October, 2021; originally announced October 2021.

Comments: 15 pages, 24 figures

arXiv:2110.08456 [pdf, other]

doi 10.1103/PhysRevA.105.033102

Isotope-Selective Laser Ablation Ion-Trap Loading of $\mathbf{^{137}\mathrm{Ba}^+}$ using a $\mathbf{\mathrm{BaCl}_2}$ Target

Authors: Brendan M. White, Pei Jiang Low, Yvette de Sereville, Matthew L. Day, Noah Greenberg, Richard Rademacher, Crystal Senko

Abstract: The $^{133}\mathrm{Ba}^+$ ion is a promising candidate as a high-fidelity qubit, and the $^{137}\mathrm{Ba}^+$ isotope is promising as a high-fidelity qudit ($d>2$). Barium metal is very reactive, and $^{133}\mathrm{Ba}^+$ is radioactive and can only be sourced in small quantities, so the most commonly used loading method, oven heating, is less suited for barium, and is currently not possible for $^{133}\mathrm{Ba}^+$. Pulsed laser ablation solves both of these problems by utilizing compound barium sources, while also giving some distinct advantages, such as fast loading, less displaced material, and lower heat load near the ion trap. Because of the relatively low abundances of the isotopes of interest, a two-step photoionization technique is used, which gives us the ability to selectively load isotopes. Characterization of the ablation process for our $\mathrm{BaCl}_2$ targets is presented, including observation of neutral and ion ablation-fluence regimes, preparation/conditioning and lifetimes of ablation spots, and plume velocity distributions. We show that using laser ablation on $\mathrm{BaCl}_2$ salt targets with a two-step photoionization method, we can produce and trap barium ions reliably. Further, we demonstrate that with our photoionization method, we can trap $^{137}\mathrm{Ba}^+$ with an enhanced selectivity compared to its natural abundance.

Submitted 15 October, 2021; originally announced October 2021.

Comments: 24 pages, 21 figures

arXiv:2110.08345 [pdf, other]

Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction

Authors: Lingbo Mo, Ashley Lewis, Huan Sun, Michael White

Abstract: Existing studies on semantic parsing focus primarily on mapping a natural-language utterance to a corresponding logical form in one turn. However, because natural language can contain a great deal of ambiguity and variability, this is a difficult challenge. In this work, we investigate an interactive semantic parsing framework that explains the predicted logical form step by step in natural language and enables the user to make corrections through natural-language feedback for individual steps. We focus on question answering over knowledge bases (KBQA) as an instantiation of our framework, aiming to increase the transparency of the parsing process and help the user appropriately trust the final answer. To do so, we construct INSPIRED, a crowdsourced dialogue dataset derived from the ComplexWebQuestions dataset. Our experiments show that the interactive framework with human feedback has the potential to greatly improve overall parse accuracy. Furthermore, we develop a pipeline for dialogue simulation to evaluate our framework w.r.t. a variety of state-of-the-art KBQA models without involving further crowdsourcing effort. The results demonstrate that our interactive semantic parsing framework promises to be effective across such models.

Submitted 27 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

Comments: Accepted by Findings of ACL 2022

arXiv:2110.05530 [pdf, other]

doi 10.1088/1475-7516/2022/02/008

A new analysis of galaxy 2-point functions in the BOSS survey, including full-shape information and post-reconstruction BAO

Authors: Shi-Fan Chen, Zvonimir Vlah, Martin White

Abstract: We present a new method for consistent, joint analysis of the pre- and post-reconstruction two-point functions of the BOSS survey. The post-reconstruction correlation function is used to accurately measure the distance-redshift relation and expansion history, while the pre-reconstruction power spectrum multipoles constrain the broad-band shape and the rate-of-growth of large-scale structure. Our technique uses Lagrangian perturbation theory to self-consistently work at the level of two-point functions, i.e. directly with the measured data, without approximating the constraints with summary statistics normalized by the drag scale. Combining galaxies across the full redshift range and both hemispheres we constrain $Ω_m=0.303 \pm 0.0082$, $H_0=69.23 \pm 0.77$ and $σ_8=0.733 \pm 0.047$ within the context of $Λ$CDM. These constraints are in good agreement both with the Planck primary CMB anisotropy data and recent cosmic shear surveys.

Submitted 18 January, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

Comments: 39 pages, 10 figures, updated to correct minor typos and match version in JCAP. Note updated results from v1 fixing bug in code

arXiv:2109.09760 [pdf, other]

doi 10.1093/mnras/stab3078

Simulating the complexity of the dark matter sheet II: halo and subhalo mass functions for non-cold dark matter models

Authors: Jens Stücker, Raul E. Angulo, Oliver Hahn, Simon D. M. White

Abstract: We present "sheet+release" simulations that reliably follow the evolution of dark matter structure at and below the dark matter free-streaming scale, where instabilities in traditional N-body simulations create a large population of spurious artificial haloes. Our simulations sample a large range of power-spectrum cutoff functions, parameterized through the half-mode scale $k_{\rm{hm}}$ and a slope parameter $β$. This parameter space can represent many non-cold dark matter models, including thermal relic warm dark matter, sterile-neutrinos, fuzzy dark matter, and a significant fraction of ETHOS models. Combining these simulations with additional N-body simulations, we find the following results. (1) Even after eliminating spurious haloes, the halo mass function in the strongly suppressed regime ($n_{\rm{X}}/n_{\rm{CDM}} < 5\%$) remains uncertain because it depends strongly on the definition of a halo. At these mass scales traditional halo finders primarily identify overdensities that are unbound, highly elongated, dominated by tidal fields, or far from virialized. (2) The regime where the suppression is smaller than a factor of 20 is quite robust to these uncertainties, however, and can be inferred reliably from suitable N-body simulations. (3) Parameterizing the suppression in the halo- and subhalo mass functions through the scales where the suppression reaches $20\%$, $50\%$ and $80\%$, we provide simple formulae which enable predictions for many non-cold dark matter models. (4) The halo mass-concentration relations in our sheet+release simulations agree well with previous results based on N-body simulations. (5) In general, we confirm the validity of previous N-body studies of warm dark matter models, largely eliminating concerns about the effects of artificial haloes.

Submitted 19 October, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

Comments: 17 pages, 15 figures, submitted to MNRAS, for extra material see https://bacco.dipc.org/ncdm.html

arXiv:2109.09660 [pdf, other]

doi 10.3847/1538-4365/ac982d

Second Data Release of the COSMOS Lyman-alpha Mapping and Tomographic Observation: The First 3D Maps of the Detailed Cosmic Web at 2.05<z<2.55

Authors: Benjamin Horowitz, Khee-Gan Lee, Metin Ata, Thomas Müller, Alex Krolewski, J. Xavier Prochaska, Joseph F. Hennawi, Martin White, David Schlegel, R. Michael Rich, Peter E. Nugent, Nao Suzuki, Daichi Kashino, Anton M. Koekemoer, Brian C. Lemaux

Abstract: We present the second data release of the COSMOS Lyman-Alpha Mapping And Tomography Observations (CLAMATO) Survey conducted with the LRIS spectrograph on the Keck-I telescope. This project used Lyman-alpha forest absorption in the spectra of faint star forming galaxies and quasars at z ~ 2-3 to trace neutral hydrogen in the intergalactic medium. In particular, we use 320 objects over a footprint of ~0.2 deg^2 to reconstruct the absorption field at 2.05 < z < 2.55 at ~2 h^{-1}Mpc resolution. We apply a Wiener filtering technique to the observed data to reconstruct three dimensional maps of the field over a volume of 4.1 x 10^5 comoving cubic Mpc. In addition to the filtered flux maps, for the first time we infer the underlying dark matter field through a forward modeling framework from a joint likelihood of galaxy and Lyman-alpha forest data, finding clear examples of the detailed cosmic web consisting of cosmic voids, sheets, filaments, and nodes. In addition to traditional figures, we present a number of interactive three dimensional models to allow exploration of the data and qualitative comparisons to known galaxy surveys. We find that our inferred over-densities are consistent with those found from galaxy fields. Our reduced spectra, extracted Lyman-alpha forest pixel data, and reconstructed tomographic maps are available publicly at https://doi.org/10.5281/zenodo.7524313

Submitted 12 January, 2023; v1 submitted 20 September, 2021; originally announced September 2021.

Comments: 20 pages, 15 figures. Data is available at https://doi.org/10.5281/zenodo.7524313 ; arXiv admin note: text overlap with arXiv:1710.02894

arXiv:2109.00263 [pdf, other]

doi 10.1093/mnras/stab2283

NuSTAR observations of a repeatedly microflaring active region

Authors: Kristopher Cooper, Iain G. Hannah, Brian W. Grefenstette, Lindsay Glesener, Säm Krucker, Hugh S. Hudson, Stephen M. White, David M. Smith, Jessie Duncan

Abstract: We investigate the spatial, temporal, and spectral properties of 10 microflares from AR12721 on 2018 September 9 and 10 observed in X-rays using the Nuclear Spectroscopic Telescope ARray (NuSTAR) and the Solar Dynamics Observatory's Atmospheric Imaging Assembly and Helioseismic and Magnetic Imager (SDO/AIA and HMI). We find GOES sub-A class equivalent microflare energies of 10$^{26}$-10$^{28}$ erg reaching temperatures up to 10 MK with consistent quiescent or hot active region core plasma temperatures of 3-4 MK. One microflare (SOL2018-09-09T10:33), with an equivalent GOES class of A0.1, has non-thermal HXR emission during its impulsive phase (of non-thermal power $\sim$7$\times$10$^{24}$ erg s$^{-1}$) making it one of the faintest X-ray microflares to have direct evidence for accelerated electrons. In 4 of the 10 microflares, we find that the X-ray time profile matches fainter and more transient sources in the EUV, highlighting the need for observations sensitive to only the hottest material that reaches temperatures higher than those of the active region core ($>$5 MK). Evidence for corresponding photospheric magnetic flux cancellation/emergence present at the footpoints of 8 microflares is also observed.

Submitted 1 September, 2021; originally announced September 2021.

Comments: Accepted for publication in MNRAS

arXiv:2108.13637 [pdf, other]

When are Deep Networks really better than Decision Forests at small sample sizes, and how?

Authors: Haoyin Xu, Kaleab A. Kinfu, Will LeVine, Sambit Panda, Jayanta Dey, Michael Ainsworth, Yu-Chung Peng, Madi Kusmanov, Florian Engert, Christopher M. White, Joshua T. Vogelstein, Carey E. Priebe

Abstract: Deep networks and decision forests (such as random forests and gradient boosted trees) are the leading machine learning methods for structured and tabular data, respectively. Many papers have empirically compared large numbers of classifiers on one or two different domains (e.g., on 100 different tabular data settings). However, a careful conceptual and empirical comparison of these two strategies using the most contemporary best practices has yet to be performed. Conceptually, we illustrate that both can be profitably viewed as "partition and vote" schemes. Specifically, the representation space that they both learn is a partitioning of feature space into a union of convex polytopes. For inference, each decides on the basis of votes from the activated nodes. This formulation allows for a unified basic understanding of the relationship between these methods. Empirically, we compare these two strategies on hundreds of tabular data settings, as well as several vision and auditory settings. Our focus is on datasets with at most 10,000 samples, which represent a large fraction of scientific and biomedical datasets. In general, we found forests to excel at tabular and structured data (vision and audition) with small sample sizes, whereas deep nets performed better on structured data with larger sample sizes. This suggests that further gains in both scenarios may be realized via further combining aspects of forests and networks. We will continue revising this technical report in the coming months with updated results.

Submitted 2 November, 2021; v1 submitted 31 August, 2021; originally announced August 2021.

arXiv:2108.00639 [pdf, other]

doi 10.1109/LSP.2021.3116510

Bespoke Fractal Sampling Patterns for Discrete Fourier Space via the Kaleidoscope Transform

Authors: Jacob M. White, Stuart Crozier, Shekhar S. Chandra

Abstract: Sampling strategies are important for sparse imaging methodologies, especially those employing the discrete Fourier transform (DFT). Chaotic sensing is one such methodology that employs deterministic, fractal sampling in conjunction with finite, iterative reconstruction schemes to form an image from limited samples. Using a sampling pattern constructed entirely from periodic lines in DFT space, chaotic sensing was found to outperform traditional compressed sensing for magnetic resonance imaging; however, only one such sampling pattern was presented and the reason for its fractal nature was not proven. Through the introduction of a novel image transform known as the kaleidoscope transform, which formalises and extends upon the concept of downsampling and concatenating an image with itself, this paper: (1) demonstrates a fundamental relationship between multiplication in modular arithmetic and downsampling; (2) provides a rigorous mathematical explanation for the fractal nature of the sampling pattern in the DFT; and (3) leverages this understanding to develop a collection of novel fractal sampling patterns for the 2D DFT with customisable properties. The ability to design tailor-made fractal sampling patterns expands the utility of the DFT in chaotic imaging and may form the basis for a bespoke chaotic sensing methodology, in which the fractal sampling matches the imaging task for improved reconstruction.

Submitted 2 August, 2021; originally announced August 2021.

Comments: 6 pages, 7 figures

arXiv:2107.10364 [pdf, other]

doi 10.3847/1538-4365/ac9838

CCAT-prime Collaboration: Science Goals and Forecasts with Prime-Cam on the Fred Young Submillimeter Telescope

Authors: CCAT-Prime collaboration, M. Aravena, J. E. Austermann, K. Basu, N. Battaglia, B. Beringue, F. Bertoldi, F. Bigiel, J. R. Bond, P. C. Breysse, C. Broughton, R. Bustos, S. C. Chapman, M. Charmetant, S. K. Choi, D. T. Chung, S. E. Clark, N. F. Cothard, A. T. Crites, A. Dev, K. Douglas, C. J. Duell, R. Dunner, H. Ebina, J. Erler, et al. (62 additional authors not shown)

Abstract: We present a detailed overview of the science goals and predictions for the Prime-Cam direct detection camera/spectrometer being constructed by the CCAT-prime collaboration for dedicated use on the Fred Young Submillimeter Telescope (FYST). The FYST is a wide-field, 6-m aperture submillimeter telescope being built (first light in mid-2024) by an international consortium of institutions led by Cornell University and sited at more than 5600 meters on Cerro Chajnantor in northern Chile. Prime-Cam is one of two instruments planned for FYST and will provide unprecedented spectroscopic and broadband measurement capabilities to address important astrophysical questions ranging from Big Bang cosmology through reionization and the formation of the first galaxies to star formation within our own Milky Way galaxy. Prime-Cam on the FYST will have a mapping speed that is over ten times greater than existing and near-term facilities for high-redshift science and broadband polarimetric imaging at frequencies above 300 GHz. We describe details of the science program enabled by this system and our preliminary survey strategies.

Submitted 8 August, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

Comments: 61 pages, 16 figures. Resubmitted to ApJSS July 11, 2022

arXiv:2107.08285 [pdf, other]

Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences

Authors: Alan Chan, Hugo Silva, Sungsu Lim, Tadashi Kozuno, A. Rupam Mahmood, Martha White

Abstract: Approximate Policy Iteration (API) algorithms alternate between (approximate) policy evaluation and (approximate) greedification. Many different approaches have been explored for approximate policy evaluation, but less is understood about approximate greedification and what choices guarantee policy improvement. In this work, we investigate approximate greedification when reducing the KL divergence between the parameterized policy and the Boltzmann distribution over action values. In particular, we investigate the difference between the forward and reverse KL divergences, with varying degrees of entropy regularization. We show that the reverse KL has stronger policy improvement guarantees, but that reducing the forward KL can result in a worse policy. We also demonstrate, however, that a large enough reduction of the forward KL can induce improvement under additional assumptions. Empirically, we show on simple continuous-action environments that the forward KL can induce more exploration, but at the cost of a more suboptimal policy. No significant differences were observed in the discrete-action setting or on a suite of benchmark problems. Throughout, we highlight that many policy gradient methods can be seen as an instance of API, with either the forward or reverse KL for the policy update, and discuss next steps for understanding and improving our policy optimization algorithms.

Submitted 18 April, 2022; v1 submitted 17 July, 2021; originally announced July 2021.

Comments: Updated the paper with more theory in Section 5 and moved some experiments to the Appendix

arXiv:2106.12621 [pdf, other]

Leveraging semantically similar queries for ranking via combining representations

Authors: Hayden S. Helm, Marah Abdin, Benjamin D. Pedigo, Shweti Mahajan, Vince Lyzinski, Youngser Park, Amitabh Basu, Piali Choudhury, Christopher M. White, Weiwei Yang, Carey E. Priebe

Abstract: In modern ranking problems, different and disparate representations of the items to be ranked are often available. It is sensible, then, to try to combine these representations to improve ranking. Indeed, learning to rank via combining representations is both principled and practical for learning a ranking function for a particular query. In extremely data-scarce settings, however, the amount of labeled data available for a particular query can lead to a highly variable and ineffective ranking function. One way to mitigate the effect of the small amount of data is to leverage information from semantically similar queries. Indeed, as we demonstrate in simulation settings and real data examples, when semantically similar queries are available it is possible to gainfully use them when ranking with respect to a particular query. We describe and explore this phenomenon in the context of the bias-variance trade-off and apply it to the data-scarce settings of a Bing navigational graph and the Drosophila larva connectome.

Submitted 23 June, 2021; originally announced June 2021.

arXiv:2106.09713 [pdf, other]

doi 10.1088/1475-7516/2021/12/049

Cosmology at high redshift -- a probe of fundamental physics

Authors: Noah Sailer, Emanuele Castorina, Simone Ferraro, Martin White

Abstract: An observational program focused on the high redshift ($2<z<6$) Universe has the opportunity to dramatically improve over upcoming LSS and CMB surveys on measurements of both the standard cosmological model and its extensions. Using a Fisher matrix formalism that builds upon recent advances in Lagrangian perturbation theory, we forecast constraints for future spectroscopic and 21-cm surveys on the standard cosmological model, curvature, neutrino mass, relativistic species, primordial features, primordial non-Gaussianity, dynamical dark energy, and gravitational slip. We compare these constraints with those achievable by current or near-future surveys such as DESI and Euclid, all under the same forecasting formalism, and compare our formalism with traditional linear methods. Our Python code FishLSS (used to calculate the Fisher information of the full shape power spectrum, CMB lensing, the cross-correlation of CMB lensing with galaxies, and combinations thereof) is publicly available.

Submitted 2 February, 2022; v1 submitted 17 June, 2021; originally announced June 2021.

Comments: published in JCAP

arXiv:2106.02056 [pdf, other]

doi 10.1140/epjc/s10052-021-09712-6

Thermal WIMPs and the Scale of New Physics: Global Fits of Dirac Dark Matter Effective Field Theories

Authors: The GAMBIT Collaboration, Peter Athron, Neal Avis Kozar, Csaba Balázs, Ankit Beniwal, Sanjay Bloor, Torsten Bringmann, Joachim Brod, Christopher Chang, Jonathan M. Cornell, Ben Farmer, Andrew Fowlie, Tomás E. Gonzalo, Will Handley, Felix Kahlhoefer, Anders Kvellestad, Farvah Mahmoudi, Markus T. Prim, Are Raklev, Janina J. Renk, Andre Scaffidi, Pat Scott, Patrick Stöcker, Aaron C. Vincent, Martin White, et al. (2 additional authors not shown)

Abstract: We assess the status of a wide class of WIMP dark matter (DM) models in light of the latest experimental results using the global fitting framework $\textsf{GAMBIT}$. We perform a global analysis of effective field theory (EFT) operators describing the interactions between a gauge-singlet Dirac fermion and the Standard Model quarks, the gluons and the photon. In this bottom-up approach, we simultaneously vary the coefficients of 14 such operators up to dimension 7, along with the DM mass, the scale of new physics and several nuisance parameters. Our likelihood functions include the latest data from $\mathit{Planck}$, direct and indirect detection experiments, and the LHC. For DM masses below 100 GeV, we find that it is impossible to satisfy all constraints simultaneously while maintaining EFT validity at LHC energies. For new physics scales around 1 TeV, our results are influenced by several small excesses in the LHC data and depend on the prescription that we adopt to ensure EFT validity. Furthermore, we find large regions of viable parameter space where the EFT is valid and the relic density can be reproduced, implying that WIMPs can still account for the DM of the universe while being consistent with the latest data.

Submitted 13 November, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

Comments: 37 pages, 11 figures, 5 tables; v2: matches EPJC version

Report number: ADP-21-9/T1156, CERN-TH-2021-084, CP3-21-15, P3H-21-038, TTK-21-19, gambit-physics-2021

Journal ref: Eur. Phys. J. C 81, 992 (2021)

arXiv:2105.14214 [pdf, other]

Predictive Representation Learning for Language Modeling

Authors: Qingfeng Lan, Luke Kumar, Martha White, Alona Fyshe

Abstract: To effectively perform the task of next-word prediction, long short-term memory networks (LSTMs) must keep track of many types of information. Some information is directly related to the next word's identity, but some is more secondary (e.g. discourse-level features or features of downstream words). Correlates of secondary information appear in LSTM representations even though they are not part of an explicitly supervised prediction task. In contrast, in reinforcement learning (RL), techniques that explicitly supervise representations to predict secondary information have been shown to be beneficial. Inspired by that success, we propose Predictive Representation Learning (PRL), which explicitly constrains LSTMs to encode specific predictions, like those that might need to be learned implicitly. We show that PRL 1) significantly improves two strong language modeling methods, 2) converges more quickly, and 3) performs better when data is limited. Our work shows that explicitly encoding a simple predictive task facilitates the search for a more effective language model.

Submitted 29 May, 2021; originally announced May 2021.

arXiv:2105.14027 [pdf, other]

doi 10.21468/SciPostPhys.12.1.043

The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider

Authors: T. Aarrestad, M. van Beekveld, M. Bona, A. Boveia, S. Caron, J. Davies, A. De Simone, C. Doglioni, J. M. Duarte, A. Farbin, H. Gupta, L. Hendriks, L. Heinrich, J. Howarth, P. Jawahar, A. Jueid, J. Lastow, A. Leinweber, J. Mamuzic, E. Merényi, A. Morandini, P. Moskvitina, C. Nellist, J. Ngadiuba, B. Ostdiek, et al. (14 additional authors not shown)

Abstract: We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenge aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 Billion simulated LHC events corresponding to $10~\rm{fb}^{-1}$ of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge.

Submitted 9 December, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

Comments: v1: 54 pages, 24 figures. v2: 56 pages, citations added, extended discussion of look-elsewhere effect, results unchanged. v3: minor typos and updated references

Journal ref: SciPost Phys. 12, 043 (2022)

arXiv:2105.04560 [pdf, other]

doi 10.1093/mnras/stab2492

Determining the full satellite population of a Milky Way-mass halo in a highly resolved cosmological hydrodynamic simulation

Authors: Robert J. J. Grand, Federico Marinacci, Rüdiger Pakmor, Christine M. Simpson, Ashley J. Kelly, Facundo A. Gómez, Adrian Jenkins, Volker Springel, Carlos S. Frenk, Simon D. M. White

Abstract: We investigate the formation of the satellite galaxy population of a Milky Way-mass halo in a very highly resolved magneto-hydrodynamic cosmological zoom-in simulation (baryonic mass resolution $m_b =$ 800 $\rm M_{\odot}$). We show that the properties of the central star-forming galaxy, such as the radial stellar surface density profile and star formation history, are: i) robust to stochastic variations associated with the so-called "Butterfly Effect"; and ii) well converged over 3.5 orders of magnitude in mass resolution. We find that there are approximately five times as many satellite galaxies at this high resolution compared to a standard ($m_b\sim 10^{4-5}\, \rm M_{\odot}$) resolution simulation of the same system. This is primarily because 2/3rds of the high resolution satellites do not form at standard resolution. A smaller fraction (1/6th) of the satellites present at high resolution form and disrupt at standard resolution; these objects are preferentially low-mass satellites on intermediate- to low-eccentricity orbits with impact parameters $\lesssim 30$ kpc. As a result, the radial distribution of satellites becomes substantially more centrally concentrated at higher resolution, in better agreement with recent observations of satellites around Milky Way-mass haloes. Finally, we show that our galaxy formation model successfully forms ultra-faint galaxies and reproduces the stellar velocity dispersion, half-light radii, and $V$-band luminosities of observed Milky Way and Local Group dwarf galaxies across 6 orders of magnitude in luminosity ($10^3$-$10^{9}$ $\rm L_{\odot}$).

Submitted 3 September, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

Comments: accepted for publication in MNRAS. Main changes include new figure 4 and figure 11

arXiv:2105.03421 [pdf, other]

doi 10.1088/1475-7516/2021/12/028

Cosmological constraints from unWISE and Planck CMB lensing tomography

Authors: Alex Krolewski, Simone Ferraro, Martin White

Abstract: A number of recent, low-redshift, lensing measurements hint at a universe in which the amplitude of lensing is lower than that predicted from the $Λ$CDM model fit to the data of the Planck CMB mission. Here we use the auto- and cross-correlation signal of unWISE galaxies and Planck CMB lensing maps to infer cosmological parameters at low redshift. In particular, we consider three unWISE samples (denoted as "blue", "green" and "red") at median redshifts $z \sim 0.6$, $1.1$ and $1.5$, which fully cover the Dark Energy dominated era. Our cross-correlation measurements, with combined significance $S/N \sim 80$, are used to infer the amplitude of low-redshift fluctuations, $σ_8$; the fraction of matter in the Universe, $Ω_m$; and the combination $S_8 \equiv σ_8 (Ω_m / 0.3)^{0.5}$ to which these low-redshift lensing measurements are most sensitive. The combination of blue, green and red samples gives a value $S_8=0.784\pm 0.015$, that is fully consistent with other low-redshift lensing measurements and in 2.4$σ$ tension with the CMB predictions from Planck. This is noteworthy, because CMB lensing probes the same physics as previous galaxy lensing measurements, but with very different systematics, thus providing an excellent complement to previous measurements.

Submitted 13 December, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

Comments: 40 pages, 17 figures. Small changes to cosmological parameters from v1

arXiv:2104.13844 [pdf, other]

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Authors: Andrew Patterson, Adam White, Martha White

Abstract: Many reinforcement learning algorithms rely on value estimation, however, the most widely used algorithms -- namely temporal difference algorithms -- can diverge under both off-policy sampling and nonlinear function approximation. Many algorithms have been developed for off-policy value estimation based on the linear mean squared projected Bellman error (MSPBE) and are sound under linear function approximation. Extending these methods to the nonlinear case has been largely unsuccessful. Recently, several methods have been introduced that approximate a different objective -- the mean-squared Bellman error (MSBE) -- which naturally facilitate nonlinear approximation. In this work, we build on these insights and introduce a new generalized MSPBE that extends the linear MSPBE to the nonlinear setting. We show how this generalized objective unifies previous work and obtain new bounds for the value error of the solutions of the generalized objective. We derive an easy-to-use, but sound, algorithm to minimize the generalized objective, and show that it is more stable across runs, is less sensitive to hyperparameters, and performs favorably across four control domains with neural network function approximation.

Submitted 28 March, 2022; v1 submitted 28 April, 2021; originally announced April 2021.

Comments: Accepted for publication in JMLR 2022

arXiv:2104.08600 [pdf]

doi 10.21437/Interspeech.2021-1240

Remote smartphone-based speech collection: acceptance and barriers in individuals with major depressive disorder

Authors: Judith Dineley, Grace Lavelle, Daniel Leightley, Faith Matcham, Sara Siddi, Maria Teresa Peñarrubia-María, Katie M. White, Alina Ivan, Carolin Oetzmann, Sara Simblett, Erin Dawe-Lane, Stuart Bruce, Daniel Stahl, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Amos A. Folarin, Josep Maria Haro, Til Wykes, Richard J. B. Dobson, Vaibhav A. Narayan, Matthew Hotopf, Björn W. Schuller, Nicholas Cummins, The RADAR-CNS Consortium

Abstract: The ease of in-the-wild speech recording using smartphones has sparked considerable interest in the combined application of speech, remote measurement technology (RMT) and advanced analytics as a research and healthcare tool. For this to be realised, the acceptability of remote speech collection to the user must be established, in addition to feasibility from an analytical perspective. To understand the acceptance, facilitators, and barriers of smartphone-based speech recording, we invited 384 individuals with major depressive disorder (MDD) from the Remote Assessment of Disease and Relapse - Central Nervous System (RADAR-CNS) research programme in Spain and the UK to complete a survey on their experiences recording their speech. In this analysis, we demonstrate that study participants were more comfortable completing a scripted speech task than a free speech task. For both speech tasks, we found depression severity and country to be significant predictors of comfort. Not seeing smartphone notifications of the scheduled speech tasks, low mood and forgetfulness were the most commonly reported obstacles to providing speech recordings.

Submitted 30 August, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

Comments: Accepted to Interspeech 2021. Formatting changes + minor language edits

ACM Class: H.1.2

Journal ref: Proc. Interspeech 2021, pp. 631-635

arXiv:2103.13498 [pdf, other]

doi 10.1088/1475-7516/2021/05/053

The Ly$α$ forest flux correlation function: a perturbation theory perspective

Authors: Shi-Fan Chen, Zvonimir Vlah, Martin White

Abstract: The Ly$α$ forest provides one of the best means of mapping large-scale structure at high redshift, including our tightest constraint on the distance-redshift relation before cosmic noon. We describe how the large-scale correlations in the Ly$α$ forest can be understood as an expansion in cumulants of the optical depth field, which itself can be related to the density field by a bias expansion. This provides a direct connection between the observable and the statistics of the matter fluctuations which can be computed in a systematic manner. We discuss the way in which complex, small-scale physics enters the predictions, the origin of the much-discussed velocity bias and the `renormalization' of the large-scale bias coefficients. Our calculations are within the context of perturbation theory, but we also make contact with earlier work using the peak-background split. Using the structure of the equations of motion we demonstrate, to all orders in perturbation theory, that the large-scale flux power spectrum becomes the linear spectrum times the square of a quadratic in the cosine of the angle to the line of sight. Unlike the case of galaxies, both the isotropic and anisotropic pieces receive contributions from small-scale physics.

Submitted 10 May, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

Comments: 28 pages, 4 figures, updated to match version accepted by JCAP

Showing 151–200 of 1,204 results for author: White, M