-
Shower Separation in Five Dimensions for Highly Granular Calorimeters using Machine Learning
Authors:
S. Lai,
J. Utehs,
A. Wilhahn,
M. C. Fouz,
O. Bach,
E. Brianne,
A. Ebrahimi,
K. Gadow,
P. Göttlicher,
O. Hartbrich,
D. Heuchel,
A. Irles,
K. Krüger,
J. Kvasnicka,
S. Lu,
C. Neubüser,
A. Provenza,
M. Reinecke,
F. Sefkow,
S. Schuwalow,
M. De Silva,
Y. Sudo,
H. L. Tran,
L. Liu,
R. Masuda
, et al. (26 additional authors not shown)
Abstract:
To achieve state-of-the-art jet energy resolution for Particle Flow, sophisticated energy clustering algorithms must be developed that can fully exploit available information to separate energy deposits from charged and neutral particles. Three published neural network-based shower separation models were applied to simulation and experimental data to measure the performance of the highly granular…
▽ More
To achieve state-of-the-art jet energy resolution for Particle Flow, sophisticated energy clustering algorithms must be developed that can fully exploit available information to separate energy deposits from charged and neutral particles. Three published neural network-based shower separation models were applied to simulation and experimental data to measure the performance of the highly granular CALICE Analogue Hadronic Calorimeter (AHCAL) technological prototype in distinguishing the energy deposited by a single charged and single neutral hadron for Particle Flow. The performance of models trained using only standard spatial and energy and charged track position information from an event was compared to models trained using timing information available from AHCAL, which is expected to improve sensitivity to shower development and, therefore, aid in clustering. Both simulation and experimental data were used to train and test the models and their performances were compared. The best-performing neural network achieved significantly superior event reconstruction when timing information was utilised in training for the case where the charged hadron had more energy than the neutral one, motivating temporally sensitive calorimeters. All models under test were observed to tend to allocate energy deposited by the more energetic of the two showers to the less energetic one. Similar shower reconstruction performance was observed for a model trained on simulation and applied to data and a model trained and applied to data.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Software Compensation for Highly Granular Calorimeters using Machine Learning
Authors:
S. Lai,
J. Utehs,
A. Wilhahn,
O. Bach,
E. Brianne,
A. Ebrahimi,
K. Gadow,
P. Göttlicher,
O. Hartbrich,
D. Heuchel,
A. Irles,
K. Krüger,
J. Kvasnicka,
S. Lu,
C. Neubüser,
A. Provenza,
M. Reinecke,
F. Sefkow,
S. Schuwalow,
M. De Silva,
Y. Sudo,
H. L. Tran,
E. Buhmann,
E. Garutti,
S. Huck
, et al. (39 additional authors not shown)
Abstract:
A neural network for software compensation was developed for the highly granular CALICE Analogue Hadronic Calorimeter (AHCAL). The neural network uses spatial and temporal event information from the AHCAL and energy information, which is expected to improve sensitivity to shower development and the neutron fraction of the hadron shower. The neural network method produced a depth-dependent energy w…
▽ More
A neural network for software compensation was developed for the highly granular CALICE Analogue Hadronic Calorimeter (AHCAL). The neural network uses spatial and temporal event information from the AHCAL and energy information, which is expected to improve sensitivity to shower development and the neutron fraction of the hadron shower. The neural network method produced a depth-dependent energy weighting and a time-dependent threshold for enhancing energy deposits consistent with the timescale of evaporation neutrons. Additionally, it was observed to learn an energy-weighting indicative of longitudinal leakage correction. In addition, the method produced a linear detector response and outperformed a published control method regarding resolution for every particle energy studied.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Flow Matching Beyond Kinematics: Generating Jets with Particle-ID and Trajectory Displacement Information
Authors:
Joschka Birk,
Erik Buhmann,
Cedric Ewen,
Gregor Kasieczka,
David Shih
Abstract:
We introduce the first generative model trained on the JetClass dataset. Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique. It is conditioned on the jet type, so that a single model can be used to generate the ten different jet types of JetClass. For the first time, we also introduce a g…
▽ More
We introduce the first generative model trained on the JetClass dataset. Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique. It is conditioned on the jet type, so that a single model can be used to generate the ten different jet types of JetClass. For the first time, we also introduce a generative model that goes beyond the kinematic features of jet constituents. The JetClass dataset includes more features, such as particle-ID and track impact parameter, and we demonstrate that our CNF can accurately model all of these additional features as well. Our generative model for JetClass expands on the versatility of existing jet generation techniques, enhancing their potential utility in high-energy physics research, and offering a more comprehensive understanding of the generated jets.
△ Less
Submitted 30 November, 2023;
originally announced December 2023.
-
Full Phase Space Resonant Anomaly Detection
Authors:
Erik Buhmann,
Cedric Ewen,
Gregor Kasieczka,
Vinicius Mikuni,
Benjamin Nachman,
David Shih
Abstract:
Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model backgrou…
▽ More
Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model background. Until now, this approach was only able to accommodate a relatively small number of dimensions, limiting the breadth of the search sensitivity. Using recent innovations in point cloud generative models, we show that this strategy can also be applied to the full phase space, using all relevant particles for the anomaly detection. As a proof of principle, we show that the signal from the R\&D dataset from the LHC Olympics is findable with this method, opening up the door to future studies that explore the interplay between depth and breadth in the representation of the data for anomaly detection.
△ Less
Submitted 9 February, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
CaloClouds II: Ultra-Fast Geometry-Independent Highly-Granular Calorimeter Simulation
Authors:
Erik Buhmann,
Frank Gaede,
Gregor Kasieczka,
Anatolii Korol,
William Korcari,
Katja Krüger,
Peter McKeown
Abstract:
Fast simulation of the energy depositions in high-granular detectors is needed for future collider experiments with ever-increasing luminosities. Generative machine learning (ML) models have been shown to speed up and augment the traditional simulation chain in physics analysis. However, the majority of previous efforts were limited to models relying on fixed, regular detector readout geometries.…
▽ More
Fast simulation of the energy depositions in high-granular detectors is needed for future collider experiments with ever-increasing luminosities. Generative machine learning (ML) models have been shown to speed up and augment the traditional simulation chain in physics analysis. However, the majority of previous efforts were limited to models relying on fixed, regular detector readout geometries. A major advancement is the recently introduced CaloClouds model, a geometry-independent diffusion model, which generates calorimeter showers as point clouds for the electromagnetic calorimeter of the envisioned International Large Detector (ILD).
In this work, we introduce CaloClouds II which features a number of key improvements. This includes continuous time score-based modelling, which allows for a 25-step sampling with comparable fidelity to CaloClouds while yielding a $6\times$ speed-up over Geant4 on a single CPU ($5\times$ over CaloClouds). We further distill the diffusion model into a consistency model allowing for accurate sampling in a single step and resulting in a $46\times$ ($37\times$ over CaloClouds) speed-up. This constitutes the first application of consistency distillation for the generation of calorimeter showers.
△ Less
Submitted 26 February, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation
Authors:
Erik Buhmann,
Sascha Diefenbacher,
Engin Eren,
Frank Gaede,
Gregor Kasieczka,
Anatolii Korol,
William Korcari,
Katja Krüger,
Peter McKeown
Abstract:
Simulating showers of particles in highly-granular detectors is a key frontier in the application of machine learning to particle physics. Achieving high accuracy and speed with generative machine learning models would enable them to augment traditional simulations and alleviate a major computing constraint. This work achieves a major breakthrough in this task by, for the first time, directly gene…
▽ More
Simulating showers of particles in highly-granular detectors is a key frontier in the application of machine learning to particle physics. Achieving high accuracy and speed with generative machine learning models would enable them to augment traditional simulations and alleviate a major computing constraint. This work achieves a major breakthrough in this task by, for the first time, directly generating a point cloud of a few thousand space points with energy depositions in the detector in 3D space without relying on a fixed-grid structure. This is made possible by two key innovations: i) Using recent improvements in generative modeling we apply a diffusion model to generate photon showers as high-cardinality point clouds. ii) These point clouds of up to $6,000$ space points are largely geometry-independent as they are down-sampled from initial even higher-resolution point clouds of up to $40,000$ so-called Geant4 steps. We showcase the performance of this approach using the specific example of simulating photon showers in the planned electromagnetic calorimeter of the International Large Detector (ILD) and achieve overall good modeling of physically relevant distributions.
△ Less
Submitted 26 February, 2024; v1 submitted 8 May, 2023;
originally announced May 2023.
-
EPiC-GAN: Equivariant Point Cloud Generation for Particle Jets
Authors:
Erik Buhmann,
Gregor Kasieczka,
Jesse Thaler
Abstract:
With the vast data-collecting capabilities of current and future high-energy collider experiments, there is an increasing demand for computationally efficient simulations. Generative machine learning models enable fast event generation, yet so far these approaches are largely constrained to fixed data structures and rigid detector geometries. In this paper, we introduce EPiC-GAN - equivariant poin…
▽ More
With the vast data-collecting capabilities of current and future high-energy collider experiments, there is an increasing demand for computationally efficient simulations. Generative machine learning models enable fast event generation, yet so far these approaches are largely constrained to fixed data structures and rigid detector geometries. In this paper, we introduce EPiC-GAN - equivariant point cloud generative adversarial network - which can produce point clouds of variable multiplicity. This flexible framework is based on deep sets and is well suited for simulating sprays of particles called jets. The generator and discriminator utilize multiple EPiC layers with an interpretable global latent vector. Crucially, the EPiC layers do not rely on pairwise information sharing between particles, which leads to a significant speed-up over graph- and transformer-based approaches with more complex relation diagrams. We demonstrate that EPiC-GAN scales well to large particle multiplicities and achieves high generation fidelity on benchmark jet generation tasks.
△ Less
Submitted 12 July, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.
-
Performance of the CMS High Granularity Calorimeter prototype to charged pion beams of 20$-$300 GeV/c
Authors:
B. Acar,
G. Adamov,
C. Adloff,
S. Afanasiev,
N. Akchurin,
B. Akgün,
M. Alhusseini,
J. Alison,
J. P. Figueiredo de sa Sousa de Almeida,
P. G. Dias de Almeida,
A. Alpana,
M. Alyari,
I. Andreev,
U. Aras,
P. Aspell,
I. O. Atakisi,
O. Bach,
A. Baden,
G. Bakas,
A. Bakshi,
S. Banerjee,
P. DeBarbaro,
P. Bargassa,
D. Barney,
F. Beaudette
, et al. (435 additional authors not shown)
Abstract:
The upgrade of the CMS experiment for the high luminosity operation of the LHC comprises the replacement of the current endcap calorimeter by a high granularity sampling calorimeter (HGCAL). The electromagnetic section of the HGCAL is based on silicon sensors interspersed between lead and copper (or copper tungsten) absorbers. The hadronic section uses layers of stainless steel as an absorbing med…
▽ More
The upgrade of the CMS experiment for the high luminosity operation of the LHC comprises the replacement of the current endcap calorimeter by a high granularity sampling calorimeter (HGCAL). The electromagnetic section of the HGCAL is based on silicon sensors interspersed between lead and copper (or copper tungsten) absorbers. The hadronic section uses layers of stainless steel as an absorbing medium and silicon sensors as an active medium in the regions of high radiation exposure, and scintillator tiles directly readout by silicon photomultipliers in the remaining regions. As part of the development of the detector and its readout electronic components, a section of a silicon-based HGCAL prototype detector along with a section of the CALICE AHCAL prototype was exposed to muons, electrons and charged pions in beam test experiments at the H2 beamline at the CERN SPS in October 2018. The AHCAL uses the same technology as foreseen for the HGCAL but with much finer longitudinal segmentation. The performance of the calorimeters in terms of energy response and resolution, longitudinal and transverse shower profiles is studied using negatively charged pions, and is compared to GEANT4 predictions. This is the first report summarizing results of hadronic showers measured by the HGCAL prototype using beam test data.
△ Less
Submitted 27 May, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Hadrons, Better, Faster, Stronger
Authors:
Erik Buhmann,
Sascha Diefenbacher,
Engin Eren,
Frank Gaede,
Daniel Hundhausen,
Gregor Kasieczka,
William Korcari,
Katja Krüger,
Peter McKeown,
Lennart Rustige
Abstract:
Motivated by the computational limitations of simulating interactions of particles in highly-granular detectors, there exists a concerted effort to build fast and exact machine-learning-based shower simulators. This work reports progress on two important fronts. First, the previously investigated WGAN and BIB-AE generative models are improved and successful learning of hadronic showers initiated b…
▽ More
Motivated by the computational limitations of simulating interactions of particles in highly-granular detectors, there exists a concerted effort to build fast and exact machine-learning-based shower simulators. This work reports progress on two important fronts. First, the previously investigated WGAN and BIB-AE generative models are improved and successful learning of hadronic showers initiated by charged pions in a segment of the hadronic calorimeter of the International Large Detector (ILD) is demonstrated for the first time. Second, we consider how state-of-the-art reconstruction software applied to generated shower energies affects the obtainable energy response and resolution. While many challenges remain, these results constitute an important milestone in using generative models in a realistic setting.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.
-
Shared Data and Algorithms for Deep Learning in Fundamental Physics
Authors:
Lisa Benato,
Erik Buhmann,
Martin Erdmann,
Peter Fackeldey,
Jonas Glombitza,
Nikolai Hartmann,
Gregor Kasieczka,
William Korcari,
Thomas Kuhr,
Jan Steinheimer,
Horst Stöcker,
Tilman Plehn,
Kai Zhou
Abstract:
We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level historie…
▽ More
We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level histories. While public datasets from multiple fundamental physics disciplines already exist, the common interface and provided reference models simplify future work on cross-disciplinary machine learning and transfer learning in fundamental physics. We discuss the design and structure and line out how additional datasets can be submitted for inclusion.
As showcase application, we present a simple yet flexible graph-based neural network architecture that can easily be applied to a wide range of supervised learning tasks. We show that our approach reaches performance close to dedicated methods on all datasets. To simplify adaptation for various problems, we provide easy-to-follow instructions on how graph-based representations of data structures, relevant for fundamental physics, can be constructed and provide code implementations for several of them. Implementations are also provided for our proposed method and all reference algorithms.
△ Less
Submitted 24 March, 2022; v1 submitted 1 July, 2021;
originally announced July 2021.
-
Decoding Photons: Physics in the Latent Space of a BIB-AE Generative Network
Authors:
Erik Buhmann,
Sascha Diefenbacher,
Engin Eren,
Frank Gaede,
Gregor Kasieczka,
Anatolii Korol,
Katja Krüger
Abstract:
Given the increasing data collection capabilities and limited computing resources of future collider experiments, interest in using generative neural networks for the fast simulation of collider events is growing. In our previous study, the Bounded Information Bottleneck Autoencoder (BIB-AE) architecture for generating photon showers in a high-granularity calorimeter showed a high accuracy modelin…
▽ More
Given the increasing data collection capabilities and limited computing resources of future collider experiments, interest in using generative neural networks for the fast simulation of collider events is growing. In our previous study, the Bounded Information Bottleneck Autoencoder (BIB-AE) architecture for generating photon showers in a high-granularity calorimeter showed a high accuracy modeling of various global differential shower distributions. In this work, we investigate how the BIB-AE encodes this physics information in its latent space. Our understanding of this encoding allows us to propose methods to optimize the generation performance further, for example, by altering latent space sampling or by suggesting specific changes to hyperparameters. In particular, we improve the modeling of the shower shape along the particle incident axis.
△ Less
Submitted 29 June, 2021; v1 submitted 24 February, 2021;
originally announced February 2021.
-
Getting High: High Fidelity Simulation of High Granularity Calorimeters with High Speed
Authors:
Erik Buhmann,
Sascha Diefenbacher,
Engin Eren,
Frank Gaede,
Gregor Kasieczka,
Anatolii Korol,
Katja Krüger
Abstract:
Accurate simulation of physical processes is crucial for the success of modern particle physics. However, simulating the development and interaction of particle showers with calorimeter detectors is a time consuming process and drives the computing needs of large experiments at the LHC and future colliders. Recently, generative machine learning models based on deep neural networks have shown promi…
▽ More
Accurate simulation of physical processes is crucial for the success of modern particle physics. However, simulating the development and interaction of particle showers with calorimeter detectors is a time consuming process and drives the computing needs of large experiments at the LHC and future colliders. Recently, generative machine learning models based on deep neural networks have shown promise in speeding up this task by several orders of magnitude. We investigate the use of a new architecture -- the Bounded Information Bottleneck Autoencoder -- for modelling electromagnetic showers in the central region of the Silicon-Tungsten calorimeter of the proposed International Large Detector. Combined with a novel second post-processing network, this approach achieves an accurate simulation of differential distributions including for the first time the shape of the minimum-ionizing-particle peak compared to a full GEANT4 simulation for a high-granularity calorimeter with 27k simulated channels. The results are validated by comparing to established architectures. Our results further strengthen the case of using generative networks for fast simulation and demonstrate that physically relevant differential distributions can be described with high accuracy.
△ Less
Submitted 3 February, 2021; v1 submitted 11 May, 2020;
originally announced May 2020.