-
Moments of Clarity: Streamlining Latent Spaces in Machine Learning using Moment Pooling
Authors:
Rikab Gambhir,
Athis Osathapan,
Jesse Thaler
Abstract:
Many machine learning applications involve learning a latent representation of data, which is often high-dimensional and difficult to directly interpret. In this work, we propose "Moment Pooling", a natural extension of Deep Sets networks which drastically decrease latent space dimensionality of these networks while maintaining or even improving performance. Moment Pooling generalizes the summatio…
▽ More
Many machine learning applications involve learning a latent representation of data, which is often high-dimensional and difficult to directly interpret. In this work, we propose "Moment Pooling", a natural extension of Deep Sets networks which drastically decrease latent space dimensionality of these networks while maintaining or even improving performance. Moment Pooling generalizes the summation in Deep Sets to arbitrary multivariate moments, which enables the model to achieve a much higher effective latent dimensionality for a fixed latent dimension. We demonstrate Moment Pooling on the collider physics task of quark/gluon jet classification by extending Energy Flow Networks (EFNs) to Moment EFNs. We find that Moment EFNs with latent dimensions as small as 1 perform similarly to ordinary EFNs with higher latent dimension. This small latent dimension allows for the internal representation to be directly visualized and interpreted, which in turn enables the learned internal jet representation to be extracted in closed form.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Seeing Double: Calibrating Two Jets at Once
Authors:
Rikab Gambhir,
Benjamin Nachman
Abstract:
Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$…
▽ More
Jet energy calibration is an important aspect of many measurements and searches at the LHC. Currently, these calibrations are performed on a per-jet basis, i.e. agnostic to the properties of other jets in the same event. In this work, we propose taking advantage of the correlations induced by momentum conservation between jets in order to improve their jet energy calibration. By fitting the $p_T$ asymmetry of dijet events in simulation, while remaining agnostic to the $p_T$ spectra themselves, we are able to obtain correlation-improved maximum likelihood estimates. This approach is demonstrated with simulated jets from the CMS Detector, yielding a $3$-$5\%$ relative improvement in the jet energy resolution, corresponding to a quadrature improvement of approximately 35\%.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
The New Physics Case for Beam-Dump Experiments with Accelerated Muon Beams
Authors:
Cari Cesarotti,
Rikab Gambhir
Abstract:
As the field examines a future muon collider as a possible successor to the LHC, we must consider how to fully utilize not only the high-energy particle collisions, but also any lower-energy staging facilities necessary in the R&D process. An economical and efficient possibility is to use the accelerated muon beam from either the full experiment or from cooling and acceleration tests in beam-dump…
▽ More
As the field examines a future muon collider as a possible successor to the LHC, we must consider how to fully utilize not only the high-energy particle collisions, but also any lower-energy staging facilities necessary in the R&D process. An economical and efficient possibility is to use the accelerated muon beam from either the full experiment or from cooling and acceleration tests in beam-dump experiments.Beam-dump experiments are complementary to the main collider as they achieve sensitivity to very small couplings with minimal instrumentation. We demonstrate the utility of muon beam-dump experiments for new physics searches at energies from 10 GeV to 5 TeV. We find that, even at low energies like those accessible at staging or demonstrator facilities, it is possible to probe new regions of parameter space for a variety of generic BSM models, including muonphilic, leptophilic, $L_μ- L_τ$, and dark photon scenarios. Such experiments could therefore provide opportunities for discovery of new physics well before the completion of the full multi-TeV collider.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
SHAPER: Can You Hear the Shape of a Jet?
Authors:
Demba Ba,
Akshunna S. Dogra,
Rikab Gambhir,
Abiy Tasissa,
Jesse Thaler
Abstract:
The identification of interesting substructures within jets is an important tool for searching for new physics and probing the Standard Model at colliders. Many of these substructure tools have previously been shown to take the form of optimal transport problems, in particular the Energy Mover's Distance (EMD). In this work, we show that the EMD is in fact the natural structure for comparing colli…
▽ More
The identification of interesting substructures within jets is an important tool for searching for new physics and probing the Standard Model at colliders. Many of these substructure tools have previously been shown to take the form of optimal transport problems, in particular the Energy Mover's Distance (EMD). In this work, we show that the EMD is in fact the natural structure for comparing collider events, which accounts for its recent success in understanding event and jet substructure. We then present a Shape Hunting Algorithm using Parameterized Energy Reconstruction (SHAPER), which is a general framework for defining and computing shape-based observables. SHAPER generalizes N-jettiness from point clusters to any extended, parametrizable shape. This is accomplished by efficiently minimizing the EMD between events and parameterized manifolds of energy flows representing idealized shapes, implemented using the dual-potential Sinkhorn approximation of the Wasserstein metric. We show how the geometric language of observables as manifolds can be used to define novel observables with built-in infrared-and-collinear safety. We demonstrate the efficacy of the SHAPER framework by performing empirical jet substructure studies using several examples of new shape-based observables.
△ Less
Submitted 20 July, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Bias and Priors in Machine Learning Calibrations for High Energy Physics
Authors:
Rikab Gambhir,
Benjamin Nachman,
Jesse Thaler
Abstract:
Machine learning offers an exciting opportunity to improve the calibration of nearly all reconstructed objects in high-energy physics detectors. However, machine learning approaches often depend on the spectra of examples used during training, an issue known as prior dependence. This is an undesirable property of a calibration, which needs to be applicable in a variety of environments. The purpose…
▽ More
Machine learning offers an exciting opportunity to improve the calibration of nearly all reconstructed objects in high-energy physics detectors. However, machine learning approaches often depend on the spectra of examples used during training, an issue known as prior dependence. This is an undesirable property of a calibration, which needs to be applicable in a variety of environments. The purpose of this paper is to explicitly highlight the prior dependence of some machine learning-based calibration strategies. We demonstrate how some recent proposals for both simulation-based and data-based calibrations inherit properties of the sample used for training, which can result in biases for downstream analyses. In the case of simulation-based calibration, we argue that our recently proposed Gaussian Ansatz approach can avoid some of the pitfalls of prior dependence, whereas prior-independent data-based calibration remains an open problem.
△ Less
Submitted 31 August, 2022; v1 submitted 10 May, 2022;
originally announced May 2022.
-
Learning Uncertainties the Frequentist Way: Calibration and Correlation in High Energy Physics
Authors:
Rikab Gambhir,
Benjamin Nachman,
Jesse Thaler
Abstract:
Calibration is a common experimental physics problem, whose goal is to infer the value and uncertainty of an unobservable quantity Z given a measured quantity X. Additionally, one would like to quantify the extent to which X and Z are correlated. In this paper, we present a machine learning framework for performing frequentist maximum likelihood inference with Gaussian uncertainty estimation, whic…
▽ More
Calibration is a common experimental physics problem, whose goal is to infer the value and uncertainty of an unobservable quantity Z given a measured quantity X. Additionally, one would like to quantify the extent to which X and Z are correlated. In this paper, we present a machine learning framework for performing frequentist maximum likelihood inference with Gaussian uncertainty estimation, which also quantifies the mutual information between the unobservable and measured quantities. This framework uses the Donsker-Varadhan representation of the Kullback-Leibler divergence -- parametrized with a novel Gaussian Ansatz -- to enable a simultaneous extraction of the maximum likelihood values, uncertainties, and mutual information in a single training. We demonstrate our framework by extracting jet energy corrections and resolution factors from a simulation of the CMS detector at the Large Hadron Collider. By leveraging the high-dimensional feature space inside jets, we improve upon the nominal CMS jet resolution by upwards of 15%.
△ Less
Submitted 24 September, 2023; v1 submitted 6 May, 2022;
originally announced May 2022.