Search | arXiv e-print repository

Parnassus: An Automated Approach to Accurate, Precise, and Fast Detector Simulation and Reconstruction

Authors: Etienne Dreyer, Eilam Gross, Dmitrii Kobylianskii, Vinicius Mikuni, Benjamin Nachman, Nathalie Soybelman

Abstract: Detector simulation and reconstruction are a significant computational bottleneck in particle physics. We develop Particle-flow Neural Assisted Simulations (Parnassus) to address this challenge. Our deep learning model takes as input a point cloud (particles im**ing on a detector) and produces a point cloud (reconstructed particles). By combining detector simulations and reconstruction into one… ▽ More Detector simulation and reconstruction are a significant computational bottleneck in particle physics. We develop Particle-flow Neural Assisted Simulations (Parnassus) to address this challenge. Our deep learning model takes as input a point cloud (particles im**ing on a detector) and produces a point cloud (reconstructed particles). By combining detector simulations and reconstruction into one step, we aim to minimize resource utilization and enable fast surrogate models suitable for application both inside and outside large collaborations. We demonstrate this approach using a publicly available dataset of jets passed through the full simulation and reconstruction pipeline of the CMS experiment. We show that Parnassus accurately mimics the CMS particle flow algorithm on the (statistically) same events it was trained on and can generalize to jet momentum and type outside of the training distribution. △ Less

Submitted 31 May, 2024; originally announced June 2024.

Comments: 9 pages, 3 figures, 2 tables

arXiv:2405.15847 [pdf, other]

Constraining the Higgs Potential with Neural Simulation-based Inference for Di-Higgs Production

Authors: Radha Mastandrea, Benjamin Nachman, Tilman Plehn

Abstract: Determining the form of the Higgs potential is one of the most exciting challenges of modern particle physics. Higgs pair production directly probes the Higgs self-coupling and should be observed in the near future at the High-Luminosity LHC. We explore how to improve the sensitivity to physics beyond the Standard Model through per-event kinematics for di-Higgs events. In particular, we employ mac… ▽ More Determining the form of the Higgs potential is one of the most exciting challenges of modern particle physics. Higgs pair production directly probes the Higgs self-coupling and should be observed in the near future at the High-Luminosity LHC. We explore how to improve the sensitivity to physics beyond the Standard Model through per-event kinematics for di-Higgs events. In particular, we employ machine learning through simulation-based inference to estimate per-event likelihood ratios and gauge potential sensitivity gains from including this kinematic information. In terms of the Standard Model Effective Field Theory, we find that adding a limited number of observables can help to remove degeneracies in Wilson coefficient likelihoods and significantly improve the experimental sensitivity. △ Less

Submitted 24 May, 2024; originally announced May 2024.

Comments: 19 pages, 14 figures

arXiv:2405.10106 [pdf, other]

Advancing Set-Conditional Set Generation: Diffusion Models for Fast Simulation of Reconstructed Particles

Authors: Dmitrii Kobylianskii, Nathalie Soybelman, Nilotpal Kakati, Etienne Dreyer, Benjamin Nachman, Eilam Gross

Abstract: The computational intensity of detector simulation and event reconstruction poses a significant difficulty for data analysis in collider experiments. This challenge inspires the continued development of machine learning techniques to serve as efficient surrogate models. We propose a fast emulation approach that combines simulation and reconstruction. In other words, a neural network generates a se… ▽ More The computational intensity of detector simulation and event reconstruction poses a significant difficulty for data analysis in collider experiments. This challenge inspires the continued development of machine learning techniques to serve as efficient surrogate models. We propose a fast emulation approach that combines simulation and reconstruction. In other words, a neural network generates a set of reconstructed objects conditioned on input particle sets. To make this possible, we advance set-conditional set generation with diffusion models. Using a realistic, generic, and public detector simulation and reconstruction package (COCOA), we show how diffusion models can accurately model the complex spectrum of reconstructed particles inside jets. △ Less

Submitted 31 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

Comments: 15 pages, 10 figures, 2 tables

arXiv:2405.08889 [pdf, other]

Incorporating Physical Priors into Weakly-Supervised Anomaly Detection

Authors: Chi Lung Cheng, Gurpreet Singh, Benjamin Nachman

Abstract: We propose a new machine-learning-based anomaly detection strategy for comparing data with a background-only reference (a form of weak supervision). The sensitivity of previous strategies degrades significantly when the signal is too rare or there are many unhelpful features. Our Prior-Assisted Weak Supervision (PAWS) method incorporates information from a class of signal models in order to signif… ▽ More We propose a new machine-learning-based anomaly detection strategy for comparing data with a background-only reference (a form of weak supervision). The sensitivity of previous strategies degrades significantly when the signal is too rare or there are many unhelpful features. Our Prior-Assisted Weak Supervision (PAWS) method incorporates information from a class of signal models in order to significantly enhance the search sensitivity of weakly supervised approaches. As long as the true signal is in the pre-specified class, PAWS matches the sensitivity of a dedicated, fully supervised method without specifying the exact parameters ahead of time. On the benchmark LHC Olympics anomaly detection dataset, our mix of semi-supervised and weakly supervised learning is able to extend the sensitivity over previous methods by a factor of 10 in cross section. Furthermore, if we add irrelevant (noise) dimensions to the inputs, classical methods degrade by another factor of 10 in cross section while PAWS remains insensitive to noise. This new approach could be applied in a number of scenarios and pushes the frontier of sensitivity between completely model-agnostic approaches and fully model-specific searches. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: 7 pages, 2 figures

arXiv:2404.18992 [pdf, other]

Unifying Simulation and Inference with Normalizing Flows

Authors: Haoxing Du, Claudius Krause, Vinicius Mikuni, Benjamin Nachman, Ian Pang, David Shih

Abstract: There have been many applications of deep neural networks to detector calibrations and a growing number of studies that propose deep generative models as automated fast detector simulators. We show that these two tasks can be unified by using maximum likelihood estimation (MLE) from conditional generative models for energy regression. Unlike direct regression techniques, the MLE approach is prior-… ▽ More There have been many applications of deep neural networks to detector calibrations and a growing number of studies that propose deep generative models as automated fast detector simulators. We show that these two tasks can be unified by using maximum likelihood estimation (MLE) from conditional generative models for energy regression. Unlike direct regression techniques, the MLE approach is prior-independent and non-Gaussian resolutions can be determined from the shape of the likelihood near the maximum. Using an ATLAS-like calorimeter simulation, we demonstrate this concept in the context of calorimeter energy calibration. △ Less

Submitted 9 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

Comments: 12 pages, 7 figures

Report number: HEPHY-ML-24-01

arXiv:2404.18807 [pdf, other]

The Landscape of Unfolding with Machine Learning

Authors: Nathan Huetsch, Javier Mariño Villadamigo, Alexander Shmakov, Sascha Diefenbacher, Vinicius Mikuni, Theo Heimel, Michael Fenton, Kevin Greif, Benjamin Nachman, Daniel Whiteson, Anja Butter, Tilman Plehn

Abstract: Recent innovations from machine learning allow for data unfolding, without binning and including correlations across many dimensions. We describe a set of known, upgraded, and new methods for ML-based unfolding. The performance of these approaches are evaluated on the same two datasets. We find that all techniques are capable of accurately reproducing the particle-level spectra across complex obse… ▽ More Recent innovations from machine learning allow for data unfolding, without binning and including correlations across many dimensions. We describe a set of known, upgraded, and new methods for ML-based unfolding. The performance of these approaches are evaluated on the same two datasets. We find that all techniques are capable of accurately reproducing the particle-level spectra across complex observables. Given that these approaches are conceptually diverse, they offer an exciting toolkit for a new class of measurements that can probe the Standard Model with an unprecedented level of detail and may enable sensitivity to new phenomena. △ Less

Submitted 17 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.16091 [pdf, other]

OmniLearn: A Method to Simultaneously Facilitate All Jet Physics Tasks

Authors: Vinicius Mikuni, Benjamin Nachman

Abstract: Machine learning has become an essential tool in jet physics. Due to their complex, high-dimensional nature, jets can be explored holistically by neural networks in ways that are not possible manually. However, innovations in all areas of jet physics are proceeding in parallel. We show that specially constructed machine learning models trained for a specific jet classification task can improve the… ▽ More Machine learning has become an essential tool in jet physics. Due to their complex, high-dimensional nature, jets can be explored holistically by neural networks in ways that are not possible manually. However, innovations in all areas of jet physics are proceeding in parallel. We show that specially constructed machine learning models trained for a specific jet classification task can improve the accuracy, precision, or speed of all other jet physics tasks. This is demonstrated by training on a particular multiclass classification task and then using the learned representation for different classification tasks, for datasets with a different (full) detector simulation, for jets from a different collision system ($pp$ versus $ep$), for generative models, for likelihood ratio estimation, and for anomaly detection. Our OmniLearn approach is thus a foundation model and is made publicly available for use in any area where state-of-the-art precision is required for analyses involving jets and their substructure. △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: 19 pages, 12 figures

arXiv:2403.10134 [pdf, other]

Measurement of groomed event shape observables in deep-inelastic electron-proton scattering at HERA

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (123 additional authors not shown)

Abstract: The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurem… ▽ More The H1 Collaboration at HERA reports the first measurement of groomed event shape observables in deep inelastic electron-proton scattering (DIS) at $\sqrt{s}=319$ GeV, using data recorded between the years 2003 and 2007 with an integrated luminosity of $351$ pb$^{-1}$. Event shapes provide incisive probes of perturbative and non-perturbative QCD. Grooming techniques have been used for jet measurements in hadronic collisions; this paper presents the first application of grooming to DIS data. The analysis is carried out in the Breit frame, utilizing the novel Centauro jet clustering algorithm that is designed for DIS event topologies. Events are required to have squared momentum-transfer $Q^2 > 150$ GeV$^2$ and inelasticity $ 0.2 < y < 0.7$. We report measurements of the production cross section of groomed event 1-jettiness and groomed invariant mass for several choices of grooming parameter. Monte Carlo model calculations and analytic calculations based on Soft Collinear Effective Theory are compared to the measurements. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: 32 pages, 17 tables, 7 figures, submitted to EPJ C

Report number: DESY-24-036

arXiv:2403.10109 [pdf, other]

Measurement of the 1-jettiness event shape observable in deep-inelastic electron-proton scattering at HERA

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (124 additional authors not shown)

Abstract: The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corres… ▽ More The H1 Collaboration reports the first measurement of the 1-jettiness event shape observable $τ_1^b$ in neutral-current deep-inelastic electron-proton scattering (DIS). The observable $τ_1^b$ is equivalent to a thrust observable defined in the Breit frame. The data sample was collected at the HERA $ep$ collider in the years 2003-2007 with center-of-mass energy of $\sqrt{s}=319\,\text{GeV}$, corresponding to an integrated luminosity of $351.1\,\text{pb}^{-1}$. Triple differential cross sections are provided as a function of $τ_1^b$, event virtuality $Q^2$, and inelasticity $y$, in the kinematic region $Q^2>150\,\text{GeV}^{2}$. Single differential cross section are provided as a function of $τ_1^b$ in a limited kinematic range. Double differential cross sections are measured, in contrast, integrated over $τ_1^b$ and represent the inclusive neutral-current DIS cross section measured as a function of $Q^2$ and $y$. The data are compared to a variety of predictions and include classical and modern Monte Carlo event generators, predictions in fixed-order perturbative QCD where calculations up to $\mathcal{O}(α_s^3)$ are available for $τ_1^b$ or inclusive DIS, and resummed predictions at next-to-leading logarithmic accuracy matched to fixed order predictions at $\mathcal{O}(α_s^2)$. These comparisons reveal sensitivity of the 1-jettiness observable to QCD parton shower and resummation effects, as well as the modeling of hadronization and fragmentation. Within their range of validity, the fixed-order predictions provide a good description of the data. Monte Carlo event generators are predictive over the full measured range and hence their underlying models and parameters can be constrained by comparing to the presented data. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: 45 pages, 38 tables, 13 figures

Report number: DESY-24-035

arXiv:2403.08982 [pdf, other]

Observation and differential cross section measurement of neutral current DIS events with an empty hemisphere in the Breit frame

Authors: The H1 collaboration, V. Andreev, M. Arratia, A. Baghdasaryan, A. Baty, K. Begzsuren, A. Bolz, V. Boudry, G. Brandt, D. Britzger, A. Buniatyan, L. Bystritskaya, A. J. Campbell, K. B. Cantun Avila, K. Cerny, V. Chekelian, Z. Chen, J. G. Contreras, J. Cvach, J. B. Dainton, K. Daum, A. Deshpande, C. Diaconu, A. Drees, G. Eckerlin , et al. (124 additional authors not shown)

Abstract: The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interactions between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can chang… ▽ More The Breit frame provides a natural frame to analyze lepton-proton scattering events. In this reference frame, the parton model hard interactions between a quark and an exchanged boson defines the coordinate system such that the struck quark is back-scattered along the virtual photon momentum direction. In Quantum Chromodynamics (QCD), higher order perturbative or non-perturbative effects can change this picture drastically. As Bjorken-$x$ decreases below one half, a rather peculiar event signature is predicted with increasing probability, where no radiation is present in one of the two Breit-frame hemispheres and all emissions are to be found in the other hemisphere. At higher orders in $α_s$ or in the presence of soft QCD effects, predictions of the rate of these events are far from trivial, and that motivates measurements with real data. We report on the first observation of the empty current hemisphere events in electron-proton collisions at the HERA collider using data recorded with the H1 detector at a center-of-mass energy of 319 GeV. The fraction of inclusive neutral-current DIS events with an empty hemisphere is found to be $0.0112 \pm 3.9\,\%_\text{stat} \pm 4.5\,\%_\text{syst} \pm 1.6\,\%_\text{mod}$ in the selected kinematic region of $150< Q^2<1500$ GeV$^2$ and inelasticity $0.14< y<0.7$. The data sample corresponds to an integrated luminosity of 351.1 pb$^{-1}$, sufficient to enable differential cross section measurements of these events. The results show an enhanced discriminating power at lower Bjorken-$x$ among different Monte Carlo event generator predictions. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 13 pages, 5 figures, 2 Tables

Report number: DESY-24-034

arXiv:2402.14067 [pdf, other]

Seeing Double: Calibrating Two Jets at Once

Authors: Rikab Gambhir, Benjamin Nachman

Abstract: Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$… ▽ More Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$ asymmetry of dijet events in simulation, while remaining agnostic to the $p_T$ spectra themselves, we are able to obtain correlation-improved maximum likelihood estimates. This approach is demonstrated with simulated jets from the CMS Detector, yielding a $3$-$5\%$ relative improvement in the jet energy resolution, corresponding to a quadrature improvement of approximately 35\%. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 14 pages, 10 figures, 1 table. Code available at https://github.com/rikab/SeeingDouble

Report number: MIT-CTP 5680

arXiv:2312.11618 [pdf, other]

Anomaly detection with flow-based fast calorimeter simulators

Authors: Claudius Krause, Benjamin Nachman, Ian Pang, David Shih, Yunhao Zhu

Abstract: Recently, several normalizing flow-based deep generative models have been proposed to accelerate the simulation of calorimeter showers. Using CaloFlow as an example, we show that these models can simultaneously perform unsupervised anomaly detection with no additional training cost. As a demonstration, we consider electromagnetic showers initiated by one (background) or multiple (signal) photons.… ▽ More Recently, several normalizing flow-based deep generative models have been proposed to accelerate the simulation of calorimeter showers. Using CaloFlow as an example, we show that these models can simultaneously perform unsupervised anomaly detection with no additional training cost. As a demonstration, we consider electromagnetic showers initiated by one (background) or multiple (signal) photons. The CaloFlow model is designed to generate single photon showers, but it also provides access to the shower likelihood. We use this likelihood as an anomaly score and study the showers tagged as being unlikely. As expected, the tagger struggles when the signal photons are nearly collinear, but is otherwise effective. This approach is complementary to a supervised classifier trained on only specific signal models using the same low-level calorimeter inputs. While the supervised classifier is also highly effective at unseen signal models, the unsupervised method is more sensitive in certain regions and thus we expect that the ultimate performance will require a combination of approaches. △ Less

Submitted 18 December, 2023; originally announced December 2023.

Comments: 12 pages, 6 figures

arXiv:2312.08453 [pdf, other]

Integrating Particle Flavor into Deep Learning Models for Hadronization

Authors: Jay Chan, Xiangyang Ju, Adam Kania, Benjamin Nachman, Vishnu Sangli, Andrzej Siodmok

Abstract: Hadronization models used in event generators are physics-inspired functions with many tunable parameters. Since we do not understand hadronization from first principles, there have been multiple proposals to improve the accuracy of hadronization models by utilizing more flexible parameterizations based on neural networks. These recent proposals have focused on the kinematic properties of hadrons,… ▽ More Hadronization models used in event generators are physics-inspired functions with many tunable parameters. Since we do not understand hadronization from first principles, there have been multiple proposals to improve the accuracy of hadronization models by utilizing more flexible parameterizations based on neural networks. These recent proposals have focused on the kinematic properties of hadrons, but a full model must also include particle flavor. In this paper, we show how to build a deep learning-based hadronization model that includes both kinematic (continuous) and flavor (discrete) degrees of freedom. Our approach is based on Generative Adversarial Networks and we show the performance within the context of the cluster hadronization model within the Herwig event generator. △ Less

Submitted 13 December, 2023; originally announced December 2023.

Comments: 9 pages, 4 figures

arXiv:2311.12924 [pdf, other]

doi 10.1007/JHEP04(2024)059

Non-resonant Anomaly Detection with Background Extrapolation

Authors: Kehang Bai, Radha Mastandrea, Benjamin Nachman

Abstract: Complete anomaly detection strategies that are both signal sensitive and compatible with background estimation have largely focused on resonant signals. Non-resonant new physics scenarios are relatively under-explored and may arise from off-shell effects or final states with significant missing energy. In this paper, we extend a class of weakly supervised anomaly detection strategies developed for… ▽ More Complete anomaly detection strategies that are both signal sensitive and compatible with background estimation have largely focused on resonant signals. Non-resonant new physics scenarios are relatively under-explored and may arise from off-shell effects or final states with significant missing energy. In this paper, we extend a class of weakly supervised anomaly detection strategies developed for resonant physics to the non-resonant case. Machine learning models are trained to reweight, generate, or morph the background, extrapolated from a control region. A classifier is then trained in a signal region to distinguish the estimated background from the data. The new methods are demonstrated using a semi-visible jet signature as a benchmark signal model, and are shown to automatically identify the anomalous events without specifying the signal ahead of time. △ Less

Submitted 7 May, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

Comments: 25 pages, 11 figures; v2: added two appendices; v3: additional discussion to match JHEP version

Journal ref: JHEP 04 (2024) 059

arXiv:2311.07652 [pdf, other]

Safe but Incalculable: Energy-weighting is not all you need

Authors: Samuel Bright-Thonney, Benjamin Nachman, Jesse Thaler

Abstract: Infrared and collinear (IRC) safety has long been used a proxy for robustness when develo** new jet substructure observables. This guiding philosophy has been carried into the deep learning era, where IRC-safe neural networks have been used for many jet studies. For graph-based neural networks, the most straightforward way to achieve IRC safety is to weight particle inputs by their energies. How… ▽ More Infrared and collinear (IRC) safety has long been used a proxy for robustness when develo** new jet substructure observables. This guiding philosophy has been carried into the deep learning era, where IRC-safe neural networks have been used for many jet studies. For graph-based neural networks, the most straightforward way to achieve IRC safety is to weight particle inputs by their energies. However, energy-weighting by itself does not guarantee that perturbative calculations of machine-learned observables will enjoy small non-perturbative corrections. In this paper, we demonstrate the sensitivity of IRC-safe networks to non-perturbative effects, by training an energy flow network (EFN) to maximize its sensitivity to hadronization. We then show how to construct Lipschitz Energy Flow Networks (L-EFNs), which are both IRC safe and relatively insensitive to non-perturbative corrections. We demonstrate the performance of L-EFNs on generated samples of quark and gluon jets, and showcase fascinating differences between the learned latent representations of EFNs and L-EFNs. △ Less

Submitted 13 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: 11 pages, 7 figures. v2: Added short appendix on quark/gluon discrimination with L-EFNs

Report number: MIT-CTP 5641

arXiv:2310.06897 [pdf, other]

Full Phase Space Resonant Anomaly Detection

Authors: Erik Buhmann, Cedric Ewen, Gregor Kasieczka, Vinicius Mikuni, Benjamin Nachman, David Shih

Abstract: Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model backgrou… ▽ More Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model background. Until now, this approach was only able to accommodate a relatively small number of dimensions, limiting the breadth of the search sensitivity. Using recent innovations in point cloud generative models, we show that this strategy can also be applied to the full phase space, using all relevant particles for the anomaly detection. As a proof of principle, we show that the signal from the R\&D dataset from the LHC Olympics is findable with this method, opening up the door to future studies that explore the interplay between depth and breadth in the representation of the data for anomaly detection. △ Less

Submitted 9 February, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: 10 pages, 7 figures

Journal ref: Phys. Rev. D 109, 055015 (2024)

arXiv:2310.04442 [pdf, other]

The Optimal use of Segmentation for Sampling Calorimeters

Authors: Fernando Torales Acosta, Bishnu Karki, Piyush Karande, Aaron Angerami, Miguel Arratia, Kenneth Barish, Ryan Milton, Sebastián Morán, Benjamin Nachman, Anshuman Sinha

Abstract: One of the key design choices of any sampling calorimeter is how fine to make the longitudinal and transverse segmentation. To inform this choice, we study the impact of calorimeter segmentation on energy reconstruction. To ensure that the trends are due entirely to hardware and not to a sub-optimal use of segmentation, we deploy deep neural networks to perform the reconstruction. These networks m… ▽ More One of the key design choices of any sampling calorimeter is how fine to make the longitudinal and transverse segmentation. To inform this choice, we study the impact of calorimeter segmentation on energy reconstruction. To ensure that the trends are due entirely to hardware and not to a sub-optimal use of segmentation, we deploy deep neural networks to perform the reconstruction. These networks make use of all available information by representing the calorimeter as a point cloud. To demonstrate our approach, we simulate a detector similar to the forward calorimeter system intended for use in the ePIC detector, which will operate at the upcoming Electron Ion Collider. We find that for the energy estimation of isolated charged pion showers, relatively fine longitudinal segmentation is key to achieving an energy resolution that is better than 10% across the full phase space. These results provide a valuable benchmark for ongoing EIC detector optimizations and may also inform future studies involving high-granularity calorimeters in other experiments at various facilities. △ Less

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.06472 [pdf, other]

doi 10.1103/PhysRevD.108.096018

Flows for Flows: Morphing one Dataset into another with Maximum Likelihood Estimation

Authors: Tobias Golling, Samuel Klein, Radha Mastandrea, Benjamin Nachman, John Andrew Raine

Abstract: Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for m… ▽ More Many components of data analysis in high energy physics and beyond require morphing one dataset into another. This is commonly solved via reweighting, but there are many advantages of preserving weights and shifting the data points instead. Normalizing flows are machine learning models with impressive precision on a variety of particle physics tasks. Naively, normalizing flows cannot be used for morphing because they require knowledge of the probability density of the starting dataset. In most cases in particle physics, we can generate more examples, but we do not know densities explicitly. We propose a protocol called flows for flows for training normalizing flows to morph one dataset into another even if the underlying probability density of neither dataset is known explicitly. This enables a morphing strategy trained with maximum likelihood estimation, a setup that has been shown to be highly effective in related tasks. We study variations on this protocol to explore how far the data points are moved to statistically match the two datasets. Furthermore, we show how to condition the learned flows on particular features in order to create a morphing function for every value of the conditioning feature. For illustration, we demonstrate flows for flows for toy examples as well as a collider physics example involving dijet events △ Less

Submitted 12 September, 2023; originally announced September 2023.

Comments: 15 pages, 17 figures. This work is a merger of arXiv:2211.02487 and arXiv:2212.06155

arXiv:2308.12351 [pdf, other]

Improving Generative Model-based Unfolding with Schrödinger Bridges

Authors: Sascha Diefenbacher, Guan-Horng Liu, Vinicius Mikuni, Benjamin Nachman, Weili Nie

Abstract: Machine learning-based unfolding has enabled unbinned and high-dimensional differential cross section measurements. Two main approaches have emerged in this research area: one based on discriminative models and one based on generative models. The main advantage of discriminative models is that they learn a small correction to a starting simulation while generative models scale better to regions of… ▽ More Machine learning-based unfolding has enabled unbinned and high-dimensional differential cross section measurements. Two main approaches have emerged in this research area: one based on discriminative models and one based on generative models. The main advantage of discriminative models is that they learn a small correction to a starting simulation while generative models scale better to regions of phase space with little data. We propose to use Schroedinger Bridges and diffusion models to create SBUnfold, an unfolding approach that combines the strengths of both discriminative and generative models. The key feature of SBUnfold is that its generative model maps one set of events into another without having to go through a known probability density as is the case for normalizing flows and standard diffusion models. We show that SBUnfold achieves excellent performance compared to state of the art methods on a synthetic Z+jets dataset. △ Less

Submitted 22 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

Comments: 9 pages, 5 figures

arXiv:2308.12339 [pdf, other]

Refining Fast Calorimeter Simulations with a Schrödinger Bridge

Authors: Sascha Diefenbacher, Vinicius Mikuni, Benjamin Nachman

Abstract: Machine learning-based simulations, especially calorimeter simulations, are promising tools for approximating the precision of classical high energy physics simulations with a fraction of the generation time. Nearly all methods proposed so far learn neural networks that map a random variable with a known probability density, like a Gaussian, to realistic-looking events. In many cases, physics even… ▽ More Machine learning-based simulations, especially calorimeter simulations, are promising tools for approximating the precision of classical high energy physics simulations with a fraction of the generation time. Nearly all methods proposed so far learn neural networks that map a random variable with a known probability density, like a Gaussian, to realistic-looking events. In many cases, physics events are not close to Gaussian and so these neural networks have to learn a highly complex function. We study an alternative approach: Schrödinger bridge Quality Improvement via Refinement of Existing Lightweight Simulations (SQuIRELS). SQuIRELS leverages the power of diffusion-based neural networks and Schrödinger bridges to map between samples where the probability density is not known explicitly. We apply SQuIRELS to the task of refining a classical fast simulation to approximate a full classical simulation. On simulated calorimeter events, we find that SQuIRELS is able to reproduce highly non-trivial features of the full simulation with a fraction of the generation time. △ Less

Submitted 23 August, 2023; originally announced August 2023.

Comments: 10 pages, 5 figures

arXiv:2308.03847 [pdf, other]

doi 10.1088/1748-0221/19/02/P02001

CaloScore v2: Single-shot Calorimeter Shower Simulation with Diffusion Models

Authors: Vinicius Mikuni, Benjamin Nachman

Abstract: Diffusion generative models are promising alternatives for fast surrogate models, producing high-fidelity physics simulations. However, the generation time often requires an expensive denoising process with hundreds of function evaluations, restricting the current applicability of these models in a realistic setting. In this work, we report updates on the CaloScore architecture, detailing the chan… ▽ More Diffusion generative models are promising alternatives for fast surrogate models, producing high-fidelity physics simulations. However, the generation time often requires an expensive denoising process with hundreds of function evaluations, restricting the current applicability of these models in a realistic setting. In this work, we report updates on the CaloScore architecture, detailing the changes in the diffusion process, which produces higher quality samples, and the use of progressive distillation, resulting in a diffusion model capable of generating new samples with a single function evaluation. We demonstrate these improvements using the Calorimeter Simulation Challenge 2022 dataset. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 10 pages, 5 figures

arXiv:2307.11157 [pdf, other]

doi 10.1140/epjc/s10052-024-12607-x

The Interplay of Machine Learning--based Resonant Anomaly Detection Methods

Authors: Tobias Golling, Gregor Kasieczka, Claudius Krause, Radha Mastandrea, Benjamin Nachman, John Andrew Raine, Debajyoti Sengupta, David Shih, Manuel Sommerhalder

Abstract: Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal… ▽ More Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal that make use of simulated or detected data in different ways, there has not yet been a study of the methods' complementarity. To this end, we address two questions. First, in the absence of any signal, do different methods pick the same events as signal-like? If not, then we can significantly reduce the false-positive rate by comparing different methods on the same dataset. Second, if there is a signal, are different methods fully correlated? Even if their maximum performance is the same, since we do not know how much signal is present, it may be beneficial to combine approaches. Using the Large Hadron Collider (LHC) Olympics dataset, we provide quantitative answers to these questions. We find that there are significant gains possible by combining multiple methods, which will strengthen the search program at the LHC and beyond. △ Less

Submitted 14 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

Comments: 27 pages, 21 figures. Updated with revisions for journal acceptance

arXiv:2307.04780 [pdf, other]

Comparison of Point Cloud and Image-based Models for Calorimeter Fast Simulation

Authors: Fernando Torales Acosta, Vinicius Mikuni, Benjamin Nachman, Miguel Arratia, Bishnu Karki, Ryan Milton, Piyush Karande, Aaron Angerami

Abstract: Score based generative models are a new class of generative models that have been shown to accurately generate high dimensional calorimeter datasets. Recent advances in generative models have used images with 3D voxels to represent and model complex calorimeter showers. Point clouds, however, are likely a more natural representation of calorimeter showers, particularly in calorimeters with high gr… ▽ More Score based generative models are a new class of generative models that have been shown to accurately generate high dimensional calorimeter datasets. Recent advances in generative models have used images with 3D voxels to represent and model complex calorimeter showers. Point clouds, however, are likely a more natural representation of calorimeter showers, particularly in calorimeters with high granularity. Point clouds preserve all of the information of the original simulation, more naturally deal with sparse datasets, and can be implemented with more compact models and data files. In this work, two state-of-the-art score based models are trained on the same set of calorimeter simulation and directly compared. △ Less

Submitted 31 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

Comments: 11 pages, 6 figures, 1 table

arXiv:2306.03933 [pdf, other]

doi 10.21468/SciPostPhys.16.3.062

High-dimensional and Permutation Invariant Anomaly Detection

Authors: Vinicius Mikuni, Benjamin Nachman

Abstract: Methods for anomaly detection of new physics processes are often limited to low-dimensional spaces due to the difficulty of learning high-dimensional probability densities. Particularly at the constituent level, incorporating desirable properties such as permutation invariance and variable-length inputs becomes difficult within popular density estimation methods. In this work, we introduce a permu… ▽ More Methods for anomaly detection of new physics processes are often limited to low-dimensional spaces due to the difficulty of learning high-dimensional probability densities. Particularly at the constituent level, incorporating desirable properties such as permutation invariance and variable-length inputs becomes difficult within popular density estimation methods. In this work, we introduce a permutation-invariant density estimator for particle physics data based on diffusion models, specifically designed to handle variable-length inputs. We demonstrate the efficacy of our methodology by utilizing the learned density as a permutation-invariant anomaly detection score, effectively identifying jets with low likelihood under the background-only hypothesis. To validate our density estimation method, we investigate the ratio of learned densities and compare to those obtained by a supervised classification algorithm. △ Less

Submitted 7 February, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

Comments: 7 pages, 5 figures

Journal ref: SciPost Phys. 16, 062 (2024)

arXiv:2305.17169 [pdf, other]

Fitting a Deep Generative Hadronization Model

Authors: Jay Chan, Xiangyang Ju, Adam Kania, Benjamin Nachman, Vishnu Sangli, Andrzej Siodmok

Abstract: Hadronization is a critical step in the simulation of high-energy particle and nuclear physics experiments. As there is no first principles understanding of this process, physically-inspired hadronization models have a large number of parameters that are fit to data. Deep generative models are a natural replacement for classical techniques, since they are more flexible and may be able to improve t… ▽ More Hadronization is a critical step in the simulation of high-energy particle and nuclear physics experiments. As there is no first principles understanding of this process, physically-inspired hadronization models have a large number of parameters that are fit to data. Deep generative models are a natural replacement for classical techniques, since they are more flexible and may be able to improve the overall precision. Proof of principle studies have shown how to use neural networks to emulate specific hadronization when trained using the inputs and outputs of classical methods. However, these approaches will not work with data, where we do not have a matching between observed hadrons and partons. In this paper, we develop a protocol for fitting a deep generative hadronization model in a realistic setting, where we only have access to a set of hadrons in data. Our approach uses a variation of a Generative Adversarial Network with a permutation invariant discriminator. We find that this setup is able to match the hadronization model in Herwig with multiple sets of parameters. This work represents a significant step forward in a longer term program to develop, train, and integrate machine learning-based hadronization models into parton shower Monte Carlo programs. △ Less

Submitted 24 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

Comments: 14 pages, 4 figures

arXiv:2305.10500 [pdf, other]

Learning Likelihood Ratios with Neural Network Classifiers

Authors: Shahzar Rizvi, Mariel Pettee, Benjamin Nachman

Abstract: The likelihood ratio is a crucial quantity for statistical inference in science that enables hypothesis testing, construction of confidence intervals, reweighting of distributions, and more. Many modern scientific applications, however, make use of data- or simulation-driven models for which computing the likelihood ratio can be very difficult or even impossible. By applying the so-called ``likeli… ▽ More The likelihood ratio is a crucial quantity for statistical inference in science that enables hypothesis testing, construction of confidence intervals, reweighting of distributions, and more. Many modern scientific applications, however, make use of data- or simulation-driven models for which computing the likelihood ratio can be very difficult or even impossible. By applying the so-called ``likelihood ratio trick,'' approximations of the likelihood ratio may be computed using clever parametrizations of neural network-based classifiers. A number of different neural network setups can be defined to satisfy this procedure, each with varying performance in approximating the likelihood ratio when using finite training data. We present a series of empirical studies detailing the performance of several common loss functionals and parametrizations of the classifier output in approximating the likelihood ratio of two univariate and multivariate Gaussian distributions as well as simulated high-energy particle physics datasets. △ Less

Submitted 8 January, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

arXiv:2305.07696 [pdf, other]

doi 10.1140/epjc/s10052-023-11989-8

ELSA -- Enhanced latent spaces for improved collider simulations

Authors: Benjamin Nachman, Ramon Winterhalder

Abstract: Simulations play a key role for inference in collider physics. We explore various approaches for enhancing the precision of simulations using machine learning, including interventions at the end of the simulation chain (reweighting), at the beginning of the simulation chain (pre-processing), and connections between the end and beginning (latent space refinement). To clearly illustrate our approach… ▽ More Simulations play a key role for inference in collider physics. We explore various approaches for enhancing the precision of simulations using machine learning, including interventions at the end of the simulation chain (reweighting), at the beginning of the simulation chain (pre-processing), and connections between the end and beginning (latent space refinement). To clearly illustrate our approaches, we use W+jets matrix element surrogate simulations based on normalizing flows as a prototypical example. First, weights in the data space are derived using machine learning classifiers. Then, we pull back the data-space weights to the latent space to produce unweighted examples and employ the Latent Space Refinement (LASER) protocol using Hamiltonian Monte Carlo. An alternative approach is an augmented normalizing flow, which allows for different dimensions in the latent and target spaces. These methods are studied for various pre-processing strategies, including a new and general method for massive particles at hadron colliders that is a tweak on the widely-used RAMBO-on-diet map**. We find that modified simulations can achieve sub-percent precision across a wide range of phase space. △ Less

Submitted 21 October, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

Comments: 17 pages, 9 figures, 2 tables, code and data at https://github.com/ramonpeter/elsa, v2: journal version

Report number: IRMP-CP3-23-20

Journal ref: Eur. Phys. J. C 83, 843 (2023)

arXiv:2305.03761 [pdf, other]

Weakly-Supervised Anomaly Detection in the Milky Way

Authors: Mariel Pettee, Sowmya Thanvantri, Benjamin Nachman, David Shih, Matthew R. Buckley, Jack H. Collins

Abstract: Large-scale astrophysics datasets present an opportunity for new machine learning techniques to identify regions of interest that might otherwise be overlooked by traditional searches. To this end, we use Classification Without Labels (CWoLa), a weakly-supervised anomaly detection method, to identify cold stellar streams within the more than one billion Milky Way stars observed by the Gaia satelli… ▽ More Large-scale astrophysics datasets present an opportunity for new machine learning techniques to identify regions of interest that might otherwise be overlooked by traditional searches. To this end, we use Classification Without Labels (CWoLa), a weakly-supervised anomaly detection method, to identify cold stellar streams within the more than one billion Milky Way stars observed by the Gaia satellite. CWoLa operates without the use of labeled streams or knowledge of astrophysical principles. Instead, we train a classifier to distinguish between mixed samples for which the proportions of signal and background samples are unknown. This computationally lightweight strategy is able to detect both simulated streams and the known stream GD-1 in data. Originally designed for high-energy collider physics, this technique may have broad applicability within astrophysics as well as other domains interested in identifying localized anomalies. △ Less

Submitted 5 May, 2023; originally announced May 2023.

arXiv:2304.09208 [pdf, other]

doi 10.1140/epjc/s10052-023-11809-z

Parton Labeling without Matching: Unveiling Emergent Labelling Capabilities in Regression Models

Authors: Shikai Qiu, Shuo Han, Xiangyang Ju, Benjamin Nachman, Haichen Wang

Abstract: Parton labeling methods are widely used when reconstructing collider events with top quarks or other massive particles. State-of-the-art techniques are based on machine learning and require training data with events that have been matched using simulations with truth information. In nature, there is no unique matching between partons and final state objects due to the properties of the strong forc… ▽ More Parton labeling methods are widely used when reconstructing collider events with top quarks or other massive particles. State-of-the-art techniques are based on machine learning and require training data with events that have been matched using simulations with truth information. In nature, there is no unique matching between partons and final state objects due to the properties of the strong force and due to acceptance effects. We propose a new approach to parton labeling that circumvents these challenges by recycling regression models. The final state objects that are most relevant for a regression model to predict the properties of a particular top quark are assigned to said parent particle without having any parton-matched training data. This approach is demonstrated using simulated events with top quarks and outperforms the widely-used $χ^2$ method. △ Less

Submitted 7 July, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

Comments: 6 pages, 4 figures; v2: matches minor changes from journal version

Journal ref: Eur. Phys. J. C. 83 (2023) 622

arXiv:2304.01266 [pdf, other]

doi 10.1103/PhysRevD.108.036025

Fast Point Cloud Generation with Diffusion Models in High Energy Physics

Authors: Vinicius Mikuni, Benjamin Nachman, Mariel Pettee

Abstract: Many particle physics datasets like those generated at colliders are described by continuous coordinates (in contrast to grid points like in an image), respect a number of symmetries (like permutation invariance), and have a stochastic dimensionality. For this reason, standard deep generative models that produce images or at least a fixed set of features are limiting. We introduce a new neural net… ▽ More Many particle physics datasets like those generated at colliders are described by continuous coordinates (in contrast to grid points like in an image), respect a number of symmetries (like permutation invariance), and have a stochastic dimensionality. For this reason, standard deep generative models that produce images or at least a fixed set of features are limiting. We introduce a new neural network simulation based on a diffusion model that addresses these limitations named Fast Point Cloud Diffusion (FPCD). We show that our approach can reproduce the complex properties of hadronic jets from proton-proton collisions with competitive precision to other recently proposed models. Additionally, we use a procedure called progressive distillation to accelerate the generation time of our method, which is typically a significant challenge for diffusion models despite their state-of-the-art precision. △ Less

Submitted 17 July, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

Comments: 11 pages, 8 figures

arXiv:2302.05390 [pdf, other]

doi 10.1103/PhysRevD.108.016002

Unbinned Profiled Unfolding

Authors: Jay Chan, Benjamin Nachman

Abstract: Unfolding is an important procedure in particle physics experiments which corrects for detector effects and provides differential cross section measurements that can be used for a number of downstream tasks, such as extracting fundamental physics parameters. Traditionally, unfolding is done by discretizing the target phase space into a finite number of bins and is limited in the number of unfolded… ▽ More Unfolding is an important procedure in particle physics experiments which corrects for detector effects and provides differential cross section measurements that can be used for a number of downstream tasks, such as extracting fundamental physics parameters. Traditionally, unfolding is done by discretizing the target phase space into a finite number of bins and is limited in the number of unfolded variables. Recently, there have been a number of proposals to perform unbinned unfolding with machine learning. However, none of these methods (like most unfolding methods) allow for simultaneously constraining (profiling) nuisance parameters. We propose a new machine learning-based unfolding method that results in an unbinned differential cross section and can profile nuisance parameters. The machine learning loss function is the full likelihood function, based on binned inputs at detector-level. We first demonstrate the method with simple Gaussian examples and then show the impact on a simulated Higgs boson cross section measurement. △ Less

Submitted 7 July, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

Comments: Fixed a reference

arXiv:2301.06581 [pdf, other]

Report of the 2021 U.S. Community Study on the Future of Particle Physics (Snowmass 2021) Summary Chapter

Authors: Joel N. Butler, R. Sekhar Chivukula, André de Gouvêa, Tao Han, Young-Kee Kim, Priscilla Cushman, Glennys R. Farrar, Yury G. Kolomensky, Sergei Nagaitsev, Nicolás Yunes, Stephen Gourlay, Tor Raubenheimer, Vladimir Shiltsev, Kétévi A. Assamagan, Breese Quinn, V. Daniel Elvira, Steven Gottlieb, Benjamin Nachman, Aaron S. Chou, Marcelle Soares-Santos, Tim M. P. Tait, Meenakshi Narain, Laura Reina, Alessandro Tricoli, Phillip S. Barbeau , et al. (18 additional authors not shown)

Abstract: The 2021-22 High-Energy Physics Community Planning Exercise (a.k.a. ``Snowmass 2021'') was organized by the Division of Particles and Fields of the American Physical Society. Snowmass 2021 was a scientific study that provided an opportunity for the entire U.S. particle physics community, along with its international partners, to identify the most important scientific questions in High Energy Physi… ▽ More The 2021-22 High-Energy Physics Community Planning Exercise (a.k.a. ``Snowmass 2021'') was organized by the Division of Particles and Fields of the American Physical Society. Snowmass 2021 was a scientific study that provided an opportunity for the entire U.S. particle physics community, along with its international partners, to identify the most important scientific questions in High Energy Physics for the following decade, with an eye to the decade after that, and the experiments, facilities, infrastructure, and R&D needed to pursue them. This Snowmass summary report synthesizes the lessons learned and the main conclusions of the Community Planning Exercise as a whole and presents a community-informed synopsis of U.S. particle physics at the beginning of 2023. This document, along with the Snowmass reports from the various subfields, will provide input to the 2023 Particle Physics Project Prioritization Panel (P5) subpanel of the U.S. High-Energy Physics Advisory Panel (HEPAP), and will help to guide and inform the activity of the U.S. particle physics community during the next decade and beyond. △ Less

Submitted 3 December, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

Comments: 75 pages, 3 figures, 2 tables. This is the first chapter and summary of the full report of the Snowmass 2021 Workshop. This version fixes an important omission from Table 2, adds two references that were not available at the time of the original version, fixes a minor few typos, and adds a small amount of material to section 1.1.3

Report number: FERMILAB-CONF-23-008

arXiv:2212.11285 [pdf, other]

doi 10.1103/PhysRevD.107.096025

FETA: Flow-Enhanced Transportation for Anomaly Detection

Authors: Tobias Golling, Samuel Klein, Radha Mastandrea, Benjamin Nachman

Abstract: Resonant anomaly detection is a promising framework for model-independent searches for new particles. Weakly supervised resonant anomaly detection methods compare data with a potential signal against a template of the Standard Model (SM) background inferred from sideband regions. We propose a means to generate this background template that uses a flow-based model to create a map** between high-f… ▽ More Resonant anomaly detection is a promising framework for model-independent searches for new particles. Weakly supervised resonant anomaly detection methods compare data with a potential signal against a template of the Standard Model (SM) background inferred from sideband regions. We propose a means to generate this background template that uses a flow-based model to create a map** between high-fidelity SM simulations and the data. The flow is trained in sideband regions with the signal region blinded, and the flow is conditioned on the resonant feature (mass) such that it can be interpolated into the signal region. To illustrate this approach, we use simulated collisions from the Large Hadron Collider (LHC) Olympics Dataset. We find that our flow-constructed background method has competitive sensitivity with other recent proposals and can therefore provide complementary information to improve future searches. △ Less

Submitted 14 June, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

Comments: 13 pages, 11 figures. minor updates, v2 (published version)

arXiv:2212.10579 [pdf, other]

doi 10.1007/JHEP07(2023)188

Resonant Anomaly Detection with Multiple Reference Datasets

Authors: Mayee F. Chen, Benjamin Nachman, Frederic Sala

Abstract: An important class of techniques for resonant anomaly detection in high energy physics builds models that can distinguish between reference and target datasets, where only the latter has appreciable signal. Such techniques, including Classification Without Labels (CWoLa) and Simulation Assisted Likelihood-free Anomaly Detection (SALAD) rely on a single reference dataset. They cannot take advantage… ▽ More An important class of techniques for resonant anomaly detection in high energy physics builds models that can distinguish between reference and target datasets, where only the latter has appreciable signal. Such techniques, including Classification Without Labels (CWoLa) and Simulation Assisted Likelihood-free Anomaly Detection (SALAD) rely on a single reference dataset. They cannot take advantage of commonly-available multiple datasets and thus cannot fully exploit available information. In this work, we propose generalizations of CWoLa and SALAD for settings where multiple reference datasets are available, building on weak supervision techniques. We demonstrate improved performance in a number of settings with realistic and synthetic data. As an added benefit, our generalizations enable us to provide finite-sample guarantees, improving on existing asymptotic analyses. △ Less

Submitted 20 December, 2022; originally announced December 2022.

arXiv:2212.06155 [pdf, other]

Efficiently Moving Instead of Reweighting Collider Events with Machine Learning

Authors: Radha Mastandrea, Benjamin Nachman

Abstract: There are many cases in collider physics and elsewhere where a calibration dataset is used to predict the known physics and / or noise of a target region of phase space. This calibration dataset usually cannot be used out-of-the-box but must be tweaked, often with conditional importance weights, to be maximally realistic. Using resonant anomaly detection as an example, we compare a number of alter… ▽ More There are many cases in collider physics and elsewhere where a calibration dataset is used to predict the known physics and / or noise of a target region of phase space. This calibration dataset usually cannot be used out-of-the-box but must be tweaked, often with conditional importance weights, to be maximally realistic. Using resonant anomaly detection as an example, we compare a number of alternative approaches based on transporting events with normalizing flows instead of reweighting them. We find that the accuracy of the morphed calibration dataset depends on the degree to which the transport task is set up to carry out optimal transport, which motivates future research into this area. △ Less

Submitted 12 December, 2022; originally announced December 2022.

Comments: 7 pages, 3 figures. Presented at the Machine Learning and the Physical Sciences Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS)

arXiv:2211.10497 [pdf, other]

Efficient quantum implementation of 2+1 U(1) lattice gauge theories with Gauss law constraints

Authors: Christopher Kane, Dorota M. Grabowska, Benjamin Nachman, Christian W. Bauer

Abstract: The study of real-time evolution of lattice quantum field theories using classical computers is known to scale exponentially with the number of lattice sites. Due to a fundamentally different computational strategy, quantum computers hold the promise of allowing for detailed studies of these dynamics from first principles. However, much like with classical computations, it is important that quantu… ▽ More The study of real-time evolution of lattice quantum field theories using classical computers is known to scale exponentially with the number of lattice sites. Due to a fundamentally different computational strategy, quantum computers hold the promise of allowing for detailed studies of these dynamics from first principles. However, much like with classical computations, it is important that quantum algorithms do not have a cost that scales exponentially with the volume. Recently, it was shown how to break the exponential scaling of a naive implementation of a U(1) gauge theory in two spatial dimensions through an operator redefinition. In this work, we describe modifications to how operators must be sampled in the new operator basis to keep digitization errors small. We compare the precision of the energies and plaquette expectation value between the two operator bases and find they are comparable. Additionally, we provide an explicit circuit construction for the Suzuki-Trotter implementation of the theory using the Walsh function formalism. The gate count scaling is studied as a function of the lattice volume, for both exact circuits and approximate circuits where rotation gates with small arguments have been dropped. We study the errors from finite Suzuki-Trotter time-step, circuit approximation, and quantum noise in a calculation of an explicit observable using IBMQ superconducting qubit hardware. We find the gate count scaling for the approximate circuits can be further reduced by up to a power of the volume without introducing larger errors. △ Less

Submitted 18 November, 2022; originally announced November 2022.

Comments: 19 pages, 8 appendices, 15 figures

Report number: CERN-TH-2022-195

arXiv:2211.08450 [pdf, other]

Geometry Optimization for Long-lived Particle Detectors

Authors: Thomas Gorordo, Simon Knapen, Benjamin Nachman, Dean J. Robinson, Adi Suresh

Abstract: The proposed designs of many auxiliary long-lived particle (LLP) detectors at the LHC call for the instrumentation of a large surface area inside the detector volume, in order to reliably reconstruct tracks and LLP decay vertices. Taking the CODEX-b detector as an example, we provide a proof-of-concept optimization analysis that demonstrates the required instrumented surface area can be substantia… ▽ More The proposed designs of many auxiliary long-lived particle (LLP) detectors at the LHC call for the instrumentation of a large surface area inside the detector volume, in order to reliably reconstruct tracks and LLP decay vertices. Taking the CODEX-b detector as an example, we provide a proof-of-concept optimization analysis that demonstrates the required instrumented surface area can be substantially reduced for many LLP models, while only marginally affecting the LLP signal efficiency. This optimization permits a significant reduction in cost and installation time, and may also inform the installation order for modular detector elements. We derive a branch-and-bound based optimization algorithm that permits highly computationally efficient determination of optimal detector configurations, subject to any specified LLP vertex and track reconstruction requirements. We outline the features of a newly-developed generalized simulation framework, for the computation of LLP signal efficiencies across a range of LLP models and detector geometries. △ Less

Submitted 15 November, 2022; originally announced November 2022.

Comments: 46 pages, 11 figures, 3 tables

arXiv:2210.15167 [pdf, other]

Statistical Patterns of Theory Uncertainties

Authors: Aishik Ghosh, Benjamin Nachman, Tilman Plehn, Lily Shire, Tim M. P. Tait, Daniel Whiteson

Abstract: A comprehensive uncertainty estimation is vital for the precision program of the LHC. While experimental uncertainties are often described by stochastic processes and well-defined nuisance parameters, theoretical uncertainties lack such a description. We study uncertainty estimates for cross-section predictions based on scale variations across a large set of processes. We find patterns similar to… ▽ More A comprehensive uncertainty estimation is vital for the precision program of the LHC. While experimental uncertainties are often described by stochastic processes and well-defined nuisance parameters, theoretical uncertainties lack such a description. We study uncertainty estimates for cross-section predictions based on scale variations across a large set of processes. We find patterns similar to a stochastic origin, with accurate uncertainties for processes mediated by the strong force, but a systematic underestimate for electroweak processes. We propose an improved scheme, based on the scale variation of reference processes, which reduces outliers in the map** from leading order to next-to-leading-order in perturbation theory. △ Less

Submitted 4 May, 2023; v1 submitted 27 October, 2022; originally announced October 2022.

Comments: UCI-HEP-TH-2022-21

arXiv:2210.11489 [pdf, other]

Machine-Learning Compression for Particle Physics Discoveries

Authors: Jack H. Collins, Yifeng Huang, Simon Knapen, Benjamin Nachman, Daniel Whiteson

Abstract: In collider-based particle and nuclear physics experiments, data are produced at such extreme rates that only a subset can be recorded for later analysis. Typically, algorithms select individual collision events for preservation and store the complete experimental response. A relatively new alternative strategy is to additionally save a partial record for a larger subset of events, allowing for la… ▽ More In collider-based particle and nuclear physics experiments, data are produced at such extreme rates that only a subset can be recorded for later analysis. Typically, algorithms select individual collision events for preservation and store the complete experimental response. A relatively new alternative strategy is to additionally save a partial record for a larger subset of events, allowing for later specific analysis of a larger fraction of events. We propose a strategy that bridges these paradigms by compressing entire events for generic offline analysis but at a lower fidelity. An optimal-transport-based $β$ Variational Autoencoder (VAE) is used to automate the compression and the hyperparameter $β$ controls the compression fidelity. We introduce a new approach for multi-objective learning functions by simultaneously learning a VAE appropriate for all values of $β$ through parameterization. We present an example use case, a di-muon resonance search at the Large Hadron Collider (LHC), where we show that simulated data compressed by our $β$-VAE has enough fidelity to distinguish distinct signal morphologies. △ Less

Submitted 18 December, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

Comments: 9 pages, 3 figures

Report number: SLAC-PUB-17704

arXiv:2210.05822 [pdf, other]

The Future of High Energy Physics Software and Computing

Authors: V. Daniel Elvira, Steven Gottlieb, Oliver Gutsche, Benjamin Nachman, S. Bailey, W. Bhimji, P. Boyle, G. Cerati, M. Carrasco Kind, K. Cranmer, G. Davies, V. D. Elvira, R. Gardner, K. Heitmann, M. Hildreth, W. Hopkins, T. Humble, M. Lin, P. Onyisi, J. Qiang, K. Pedro, G. Perdue, A. Roberts, M. Savage, P. Shanahan , et al. (3 additional authors not shown)

Abstract: Software and Computing (S&C) are essential to all High Energy Physics (HEP) experiments and many theoretical studies. The size and complexity of S&C are now commensurate with that of experimental instruments, playing a critical role in experimental design, data acquisition/instrumental control, reconstruction, and analysis. Furthermore, S&C often plays a leading role in driving the precision of th… ▽ More Software and Computing (S&C) are essential to all High Energy Physics (HEP) experiments and many theoretical studies. The size and complexity of S&C are now commensurate with that of experimental instruments, playing a critical role in experimental design, data acquisition/instrumental control, reconstruction, and analysis. Furthermore, S&C often plays a leading role in driving the precision of theoretical calculations and simulations. Within this central role in HEP, S&C has been immensely successful over the last decade. This report looks forward to the next decade and beyond, in the context of the 2021 Particle Physics Community Planning Exercise ("Snowmass") organized by the Division of Particles and Fields (DPF) of the American Physical Society. △ Less

Submitted 8 November, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

Comments: Computational Frontier Report Contribution to Snowmass 2021; 41 pages, 1 figure. v2: missing ref and added missing topical group conveners. v3: fixed typos

arXiv:2209.14872 [pdf, other]

Precision QCD, Hadronic Structure & Forward QCD, Heavy Ions: Report of Energy Frontier Topical Groups 5, 6, 7 submitted to Snowmass 2021

Authors: M. Begel, S. Hoeche, M. Schmitt, H. -W. Lin, P. M. Nadolsky, C. Royon, Y-J. Lee, S. Mukherjee, C. Baldenegro, J. Campbell, G. Chachamis, F. G. Celiberto, A. M. Cooper-Sarkar, D. d'Enterria, M. Diefenthaler, M. Fucilla, M. V. Garzelli, M. Guzzi, M. Hentschinski, T. J. Hobbs, J. Huston, J. Isaacson, S. R. Klein, F. Kling, P. Kotko , et al. (25 additional authors not shown)

Abstract: This report was prepared on behalf of three Energy Frontier Topical Groups of the Snowmass 2021 Community Planning Exercise. It summarizes the status and implications of studies of strong interactions in high-energy experiments and QCD theory. We emphasize the rich landscape and broad impact of these studies in the decade ahead. Hadronic interactions play a central role in the high-luminosity Larg… ▽ More This report was prepared on behalf of three Energy Frontier Topical Groups of the Snowmass 2021 Community Planning Exercise. It summarizes the status and implications of studies of strong interactions in high-energy experiments and QCD theory. We emphasize the rich landscape and broad impact of these studies in the decade ahead. Hadronic interactions play a central role in the high-luminosity Large Hadron Collider (LHC) physics program, and strong synergies exist between the (HL-)LHC and planned or proposed experiments at the U.S. Electron-Ion Collider, CERN forward physics experiments, high-intensity facilities, and future TeV-range lepton and hadron colliders. Prospects for precision determinations of the strong coupling and a variety of nonperturbative distribution and fragmentation functions are examined. We also review the potential of envisioned tests of new dynamical regimes of QCD in high-energy and high-density scattering processes with nucleon, ion, and photon initial states. The important role of the high-energy heavy-ion program in studies of nuclear structure and the nuclear medium, and its connections with QCD involving nucleons are summarized. We address ongoing and future theoretical advancements in multi-loop QCD computations, lattice QCD, jet substructure, and event generators. Cross-cutting connections between experimental measurements, theoretical predictions, large-scale data analysis, and high-performance computing are emphasized. △ Less

Submitted 19 November, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

Comments: 95 pages (bibliography 30 pages), 28 figures; v.2: minor changes, authors and references added

Report number: FERMILAB-CONF-22-733-SCD-T, SMU-HEP-22-06

arXiv:2209.06225 [pdf, other]

doi 10.1103/PhysRevD.107.015009

Anomaly Detection under Coordinate Transformations

Authors: Gregor Kasieczka, Radha Mastandrea, Vinicius Mikuni, Benjamin Nachman, Mariel Pettee, David Shih

Abstract: There is a growing need for machine learning-based anomaly detection strategies to broaden the search for Beyond-the-Standard-Model (BSM) physics at the Large Hadron Collider (LHC) and elsewhere. The first step of any anomaly detection approach is to specify observables and then use them to decide on a set of anomalous events. One common choice is to select events that have low probability density… ▽ More There is a growing need for machine learning-based anomaly detection strategies to broaden the search for Beyond-the-Standard-Model (BSM) physics at the Large Hadron Collider (LHC) and elsewhere. The first step of any anomaly detection approach is to specify observables and then use them to decide on a set of anomalous events. One common choice is to select events that have low probability density. It is a well-known fact that probability densities are not invariant under coordinate transformations, so the sensitivity can depend on the initial choice of coordinates. The broader machine learning community has recently connected coordinate sensitivity with anomaly detection and our goal is to bring awareness of this issue to the growing high energy physics literature on anomaly detection. In addition to analytical explanations, we provide numerical examples from simple random variables and from the LHC Olympics Dataset that show how using probability density as an anomaly score can lead to events being classified as anomalous or not depending on the coordinate frame. △ Less

Submitted 13 September, 2022; originally announced September 2022.

Comments: 10 pages, 6 figures

arXiv:2208.07910 [pdf, other]

When, Where, and How to Open Data: A Personal Perspective

Authors: Benjamin Nachman

Abstract: This is a personal perspective on data sharing in the context of public data releases suitable for generic analysis. These open data can be a powerful tool for expanding the science of high energy physics, but care must be taken in when, where, and how they are utilized. I argue that data preservation even within collaborations needs additional support in order to maximize our science potential. A… ▽ More This is a personal perspective on data sharing in the context of public data releases suitable for generic analysis. These open data can be a powerful tool for expanding the science of high energy physics, but care must be taken in when, where, and how they are utilized. I argue that data preservation even within collaborations needs additional support in order to maximize our science potential. Additionally, it should also be easier for non-collaboration members to engage with collaborations. Finally, I advocate that we recognize a new type of high energy physicist: the 'data physicist', who would be optimally suited to analyze open data as well as develop and deploy new advanced data science tools so that we can use our precious data to their fullest potential. This document has been coordinated with a white paper on open data commissioned by the American Physical Society's (APS) Division of Particles and Field (DPS) Community Planning Exercise ('Snowmass') Theory Frontier [1] and relevant also for the Computational Frontier. △ Less

Submitted 16 August, 2022; originally announced August 2022.

Comments: 11 pages, 2 figures, contribution to Snowmass 2021

arXiv:2208.03333 [pdf, other]

Overcoming exponential scaling with system size in Trotter-Suzuki implementations of constrained Hamiltonians: 2+1 U(1) lattice gauge theories

Authors: Dorota M. Grabowska, Christopher Kane, Benjamin Nachman, Christian W. Bauer

Abstract: For many quantum systems of interest, the classical computational cost of simulating their time evolution scales exponentially in the system size. At the same time, quantum computers have been shown to allow for simulations of some of these systems using resources that scale polynomially with the system size. Given the potential for using quantum computers for simulations that are not feasible usi… ▽ More For many quantum systems of interest, the classical computational cost of simulating their time evolution scales exponentially in the system size. At the same time, quantum computers have been shown to allow for simulations of some of these systems using resources that scale polynomially with the system size. Given the potential for using quantum computers for simulations that are not feasible using classical devices, it is paramount that one studies the scaling of quantum algorithms carefully. This work identifies a term in the Hamiltonian of a class of constrained systems that naively requires quantum resources that scale exponentially in the system size. An important example is a compact U(1) gauge theory on lattices with periodic boundary conditions. Imposing the magnetic Gauss' law a priori introduces a constraint into that Hamiltonian that naively results in an exponentially deep circuit. A method is then developed that reduces this scaling to polynomial in the system size, using a redefinition of the operator basis. An explicit construction of the matrices defining the change of operator basis, as well as the scaling of the associated computational cost, is given. △ Less

Submitted 24 January, 2023; v1 submitted 5 August, 2022; originally announced August 2022.

Comments: 9 pages, 1 Figure. V2 clarifies how to calculate the Degree of Coupling and how weaved matrices are constructed to reduce the Degree of Coupling

Report number: CERN-TH-2022-133

arXiv:2208.02274 [pdf, other]

Morphing parton showers with event derivatives

Authors: Benjamin Nachman, Stefan Prestel

Abstract: We develop EventMover, a differentiable parton shower event generator. This tool generates high- and variable-length scattering events that can be moved with simulation derivatives to change the value of the scale $Λ_\mathrm{QCD}$ defining the strong coupling constant, without introducing statistical variations between samples. To demonstrate the potential for EventMover, we compare the output of… ▽ More We develop EventMover, a differentiable parton shower event generator. This tool generates high- and variable-length scattering events that can be moved with simulation derivatives to change the value of the scale $Λ_\mathrm{QCD}$ defining the strong coupling constant, without introducing statistical variations between samples. To demonstrate the potential for EventMover, we compare the output of the simulation with $e^+e^-$ data to show how one could fit $Λ_\mathrm{QCD}$ with only a single event sample. This is a critical step towards a fully differentiable event generator for particle and nuclear physics. △ Less

Submitted 3 August, 2022; originally announced August 2022.

Comments: Implementation available at https://gitlab.com/discreteqcd/eventmover

arXiv:2207.12411 [pdf, other]

doi 10.1007/JHEP12(2022)021

Systematic Quark/Gluon Identification with Ratios of Likelihoods

Authors: Samuel Bright-Thonney, Ian Moult, Benjamin Nachman, Stefan Prestel

Abstract: Discriminating between quark- and gluon-initiated jets has long been a central focus of jet substructure, leading to the introduction of numerous observables and calculations to high perturbative accuracy. At the same time, there have been many attempts to fully exploit the jet radiation pattern using tools from statistics and machine learning. We propose a new approach that combines a deep analyt… ▽ More Discriminating between quark- and gluon-initiated jets has long been a central focus of jet substructure, leading to the introduction of numerous observables and calculations to high perturbative accuracy. At the same time, there have been many attempts to fully exploit the jet radiation pattern using tools from statistics and machine learning. We propose a new approach that combines a deep analytic understanding of jet substructure with the optimality promised by machine learning and statistics. After specifying an approximation to the full emission phase space, we show how to construct the optimal observable for a given classification task. This procedure is demonstrated for the case of quark and gluons jets, where we show how to systematically capture sub-eikonal corrections in the splitting functions, and prove that linear combinations of weighted multiplicity is the optimal observable. In addition to providing a new and powerful framework for systematically improving jet substructure observables, we demonstrate the performance of several quark versus gluon jet tagging observables in parton-level Monte Carlo simulations, and find that they perform at or near the level of a deep neural network classifier. Combined with the rapid recent progress in the development of higher order parton showers, we believe that our approach provides a basis for systematically exploiting subleading effects in jet substructure analyses at the Large Hadron Collider (LHC) and beyond. △ Less

Submitted 4 November, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: 25 pages, 6 figures

arXiv:2206.11898 [pdf, other]

doi 10.1103/PhysRevD.106.092009

Score-based Generative Models for Calorimeter Shower Simulation

Authors: Vinicius Mikuni, Benjamin Nachman

Abstract: Score-based generative models are a new class of generative algorithms that have been shown to produce realistic images even in high dimensional spaces, currently surpassing other state-of-the-art models for different benchmark categories and applications. In this work we introduce CaloScore, a score-based generative model for collider physics applied to calorimeter shower generation. Three differ… ▽ More Score-based generative models are a new class of generative algorithms that have been shown to produce realistic images even in high dimensional spaces, currently surpassing other state-of-the-art models for different benchmark categories and applications. In this work we introduce CaloScore, a score-based generative model for collider physics applied to calorimeter shower generation. Three different diffusion models are investigated using the Fast Calorimeter Simulation Challenge 2022 dataset. CaloScore is the first application of a score-based generative model in collider physics and is able to produce high-fidelity calorimeter images for all datasets, providing an alternative paradigm for calorimeter shower simulation. △ Less

Submitted 19 October, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

arXiv:2206.10642 [pdf, other]

doi 10.1007/JHEP02(2023)150

Going off topics to demix quark and gluon jets in $α_S$ extractions

Authors: Matt LeBlanc, Benjamin Nachman, Christof Sauer

Abstract: Quantum chromodynamics is the theory of the strong interaction between quarks and gluons; the coupling strength of the interaction, $α_S$, is the least precisely-known of all interactions in nature. An extraction of the strong coupling from the radiation pattern within jets would provide a complementary approach to conventional extractions from jet production rates and hadronic event shapes, and w… ▽ More Quantum chromodynamics is the theory of the strong interaction between quarks and gluons; the coupling strength of the interaction, $α_S$, is the least precisely-known of all interactions in nature. An extraction of the strong coupling from the radiation pattern within jets would provide a complementary approach to conventional extractions from jet production rates and hadronic event shapes, and would be a key achievement of jet substructure at the Large Hadron Collider (LHC). Presently, the relative fraction of quark and gluon jets in a sample is the limiting factor in such extractions, as this fraction is degenerate with the value of $α_S$ for the most well-understood observables. To overcome this limitation, we apply recently proposed techniques to statistically demix multiple mixtures of jets and obtain purified quark and gluon distributions based on an operational definition. We illustrate that studying quark and gluon jet substructure separately can significantly improve the sensitivity of such extractions of the strong coupling. We also discuss how using machine learning techniques or infrared- and collinear-unsafe information can improve the demixing performance without the loss of theoretical control. While theoretical research is required to connect the extract topics with the quark and gluon objects in cross section calculations, our study illustrates the potential of demixing to reduce the dominant uncertainty for the $α_S$ extraction from jet substructure at the LHC. △ Less

Submitted 7 March, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

Comments: 20 pages, 7 figures

Journal ref: J. High Energ. Phys. 2023, 150 (2023)

arXiv:2206.08391 [pdf, other]

doi 10.1007/JHEP02(2023)220

Quantum Anomaly Detection for Collider Physics

Authors: Sulaiman Alvi, Christian Bauer, Benjamin Nachman

Abstract: Quantum Machine Learning (QML) is an exciting tool that has received significant recent attention due in part to advances in quantum computing hardware. While there is currently no formal guarantee that QML is superior to classical ML for relevant problems, there have been many claims of an empirical advantage with high energy physics datasets. These studies typically do not claim an exponential s… ▽ More Quantum Machine Learning (QML) is an exciting tool that has received significant recent attention due in part to advances in quantum computing hardware. While there is currently no formal guarantee that QML is superior to classical ML for relevant problems, there have been many claims of an empirical advantage with high energy physics datasets. These studies typically do not claim an exponential speedup in training, but instead usually focus on an improved performance with limited training data. We explore an analysis that is characterized by a low statistics dataset. In particular, we study an anomaly detection task in the four-lepton final state at the Large Hadron Collider that is limited by a small dataset. We explore the application of QML in a semi-supervised mode to look for new physics without specifying a particular signal model hypothesis. We find no evidence that QML provides any advantage over classical ML. It could be that a case where QML is superior to classical ML for collider physics will be established in the future, but for now, classical ML is a powerful tool that will continue to expand the science of the LHC and beyond. △ Less

Submitted 7 November, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: 18 pages, 6 figures v2: updated acknowledgment, fixed typos related to the output of VQC and QCL

arXiv:2205.10380 [pdf, other]

doi 10.1103/PhysRevD.106.056005

Self-supervised Anomaly Detection for New Physics

Authors: Barry M. Dillon, Radha Mastandrea, Benjamin Nachman

Abstract: We investigate a method of model-agnostic anomaly detection through studying jets, collimated sprays of particles produced in high-energy collisions. We train a transformer neural network to encode simulated QCD "event space" dijets into a low-dimensional "latent space" representation. We optimize the network using the self-supervised contrastive loss, which encourages the preservation of known ph… ▽ More We investigate a method of model-agnostic anomaly detection through studying jets, collimated sprays of particles produced in high-energy collisions. We train a transformer neural network to encode simulated QCD "event space" dijets into a low-dimensional "latent space" representation. We optimize the network using the self-supervised contrastive loss, which encourages the preservation of known physical symmetries of the dijets. We then train a binary classifier to discriminate a BSM resonant dijet signal from a QCD dijet background both in the event space and the latent space representations. We find the classifier performances on the event and latent spaces to be comparable. We finally perform an anomaly detection search using a weakly supervised bump hunt on the latent space dijets, finding again a comparable performance to a search run on the physical space dijets. This opens the door to using low-dimensional latent representations as a computationally efficient space for resonant anomaly detection in generic particle collision events. △ Less

Submitted 15 May, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

Comments: 13 pages, 12 figures. minor updates, v2 (published version)

Journal ref: Phys. Rev. D 106, 056005 (2022)

Showing 1–50 of 153 results for author: Nachman, B