Skip to main content

Showing 1–50 of 57 results for author: Kasieczka, G

Searching in archive hep-ph. Search in all archives.
.
  1. arXiv:2405.20407  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    Convolutional L2LFlows: Generating Accurate Showers in Highly Granular Calorimeters Using Convolutional Normalizing Flows

    Authors: Thorsten Buss, Frank Gaede, Gregor Kasieczka, Claudius Krause, David Shih

    Abstract: In the quest to build generative surrogate models as computationally efficient alternatives to rule-based simulations, the quality of the generated samples remains a crucial frontier. So far, normalizing flows have been among the models with the best fidelity. However, as the latent space in such models is required to have the same dimensionality as the data space, scaling up normalizing flows to… ▽ More

    Submitted 3 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Report number: HEPHY-ML-24-02

  2. arXiv:2405.12972  [pdf, other

    hep-ph hep-ex physics.data-an

    Accelerating Resonance Searches via Signature-Oriented Pre-training

    Authors: Congqiao Li, Antonios Agapitos, Jovin Drews, Javier Duarte, Dawei Fu, Leyun Gao, Raghav Kansal, Gregor Kasieczka, Louis Moureaux, Huilin Qu, Cristina Mantilla Suarez, Qiang Li

    Abstract: The search for heavy resonances beyond the Standard Model (BSM) is a key objective at the LHC. While the recent use of advanced deep neural networks for boosted-jet tagging significantly enhances the sensitivity of dedicated searches, it is limited to specific final states, leaving vast potential BSM phase space underexplored. We introduce a novel experimental method, Signature-Oriented Pre-traini… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 14 pages, 5 figures

  3. arXiv:2404.07258  [pdf, other

    hep-ph hep-ex physics.data-an

    Complete Optimal Non-Resonant Anomaly Detection

    Authors: Gregor Kasieczka, John Andrew Raine, David Shih, Aman Upadhyay

    Abstract: We propose the first-ever complete, model-agnostic search strategy based on the optimal anomaly score, for new physics on the tails of distributions. Signal sensitivity is achieved via a classifier trained on auxiliary features in a weakly-supervised fashion, and backgrounds are predicted using the ABCD method in the classifier output and the primary tail feature. The independence between the clas… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 9 pages, 9 figures

  4. arXiv:2403.05618  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    OmniJet-$α$: The first cross-task foundation model for particle physics

    Authors: Joschka Birk, Anna Hallin, Gregor Kasieczka

    Abstract: Foundation models are multi-dataset and multi-task machine learning methods that once pre-trained can be fine-tuned for a large variety of downstream applications. The successful development of such general-purpose models for physics data would be a major breakthrough as they could improve the achievable physics performance while at the same time drastically reduce the required amount of training… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  5. arXiv:2402.15558  [pdf, other

    hep-ph hep-ex physics.data-an

    Classifier Surrogates: Sharing AI-based Searches with the World

    Authors: Sebastian Bieringer, Gregor Kasieczka, Jan Kieseler, Mathias Trabs

    Abstract: In recent years, neural network-based classification has been used to improve data analysis at collider experiments. While this strategy proves to be hugely successful, the underlying models are not commonly shared with the public and rely on experiment-internal data as well as full detector simulations. We show a concrete implementation of a newly proposed strategy, so-called Classifier Surrogate… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 10 pages, 6 Figures, 1 Table

  6. arXiv:2312.14575  [pdf, ps, other

    hep-ph hep-ex

    Les Houches guide to reusable ML models in LHC analyses

    Authors: Jack Y. Araz, Andy Buckley, Gregor Kasieczka, Jan Kieseler, Sabine Kraml, Anders Kvellestad, Andre Lessa, Tomasz Procter, Are Raklev, Humberto Reyes-Gonzalez, Krzysztof Rolbiecki, Sezen Sekmen, Gokhan Unel

    Abstract: With the increasing usage of machine-learning in high-energy physics analyses, the publication of the trained models in a reusable form has become a crucial question for analysis preservation and reuse. The complexity of these models creates practical issues for both reporting them accurately and for ensuring the stability of their behaviours in different environments and over extended timescales.… ▽ More

    Submitted 10 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 12 pages; v2: added funding acknowledgement

  7. arXiv:2312.11629  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Residual ANODE

    Authors: Ranit Das, Gregor Kasieczka, David Shih

    Abstract: We present R-ANODE, a new method for data-driven, model-agnostic resonant anomaly detection that raises the bar for both performance and interpretability. The key to R-ANODE is to enhance the inductive bias of the anomaly detection task by fitting a normalizing flow directly to the small and unknown signal component, while holding fixed a background model (also a normalizing flow) learned from sid… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures

  8. arXiv:2312.00123  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Flow Matching Beyond Kinematics: Generating Jets with Particle-ID and Trajectory Displacement Information

    Authors: Joschka Birk, Erik Buhmann, Cedric Ewen, Gregor Kasieczka, David Shih

    Abstract: We introduce the first generative model trained on the JetClass dataset. Our model generates jets at the constituent level, and it is a permutation-equivariant continuous normalizing flow (CNF) trained with the flow matching technique. It is conditioned on the jet type, so that a single model can be used to generate the ten different jet types of JetClass. For the first time, we also introduce a g… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  9. arXiv:2310.06897  [pdf, other

    hep-ph hep-ex physics.data-an

    Full Phase Space Resonant Anomaly Detection

    Authors: Erik Buhmann, Cedric Ewen, Gregor Kasieczka, Vinicius Mikuni, Benjamin Nachman, David Shih

    Abstract: Physics beyond the Standard Model that is resonant in one or more dimensions has been a longstanding focus of countless searches at colliders and beyond. Recently, many new strategies for resonant anomaly detection have been developed, where sideband information can be used in conjunction with modern machine learning, in order to generate synthetic datasets representing the Standard Model backgrou… ▽ More

    Submitted 9 February, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 10 pages, 7 figures

    Journal ref: Phys. Rev. D 109, 055015 (2024)

  10. arXiv:2310.00049  [pdf, other

    hep-ph cs.LG

    EPiC-ly Fast Particle Cloud Generation with Flow-Matching and Diffusion

    Authors: Erik Buhmann, Cedric Ewen, Darius A. Faroughy, Tobias Golling, Gregor Kasieczka, Matthew Leigh, Guillaume Quétant, John Andrew Raine, Debajyoti Sengupta, David Shih

    Abstract: Jets at the LHC, typically consisting of a large number of highly correlated particles, are a fascinating laboratory for deep generative modeling. In this paper, we present two novel methods that generate LHC jets as point clouds efficiently and accurately. We introduce \epcjedi, which combines score-matching diffusion models with the Equivariant Point Cloud (EPiC) architecture based on the deep s… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: 21 pages, 8 figures

  11. arXiv:2309.13111  [pdf, other

    hep-ph hep-ex physics.data-an

    Back To The Roots: Tree-Based Algorithms for Weakly Supervised Anomaly Detection

    Authors: Thorben Finke, Marie Hein, Gregor Kasieczka, Michael Krämer, Alexander Mück, Parada Prangchaikul, Tobias Quadfasel, David Shih, Manuel Sommerhalder

    Abstract: Weakly supervised methods have emerged as a powerful tool for model-agnostic anomaly detection at the Large Hadron Collider (LHC). While these methods have shown remarkable performance on specific signatures such as di-jet resonances, their application in a more model-agnostic manner requires dealing with a larger number of potentially noisy input features. In this paper, we show that using booste… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 11 pages, 9 figures

    Report number: TTK-23-26

  12. Combining Resonant and Tail-based Anomaly Detection

    Authors: Gerrit Bickendorf, Manuel Drees, Gregor Kasieczka, Claudius Krause, David Shih

    Abstract: In many well-motivated models of the electroweak scale, cascade decays of new particles can result in highly boosted hadronic resonances (e.g. $Z/W/h$). This can make these models rich and promising targets for recently developed resonant anomaly detection methods powered by modern machine learning. We demonstrate this using the state-of-the-art CATHODE method applied to supersymmetry scenarios wi… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 13 pages, 15 figures

  13. arXiv:2309.05704  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    CaloClouds II: Ultra-Fast Geometry-Independent Highly-Granular Calorimeter Simulation

    Authors: Erik Buhmann, Frank Gaede, Gregor Kasieczka, Anatolii Korol, William Korcari, Katja Krüger, Peter McKeown

    Abstract: Fast simulation of the energy depositions in high-granular detectors is needed for future collider experiments with ever-increasing luminosities. Generative machine learning (ML) models have been shown to speed up and augment the traditional simulation chain in physics analysis. However, the majority of previous efforts were limited to models relying on fixed, regular detector readout geometries.… ▽ More

    Submitted 26 February, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 30 pages, 7 figures, 3 tables, code available at https://github.com/FLC-QU-hep/CaloClouds-2

    Report number: DESY-23-130

  14. arXiv:2307.11157  [pdf, other

    hep-ph hep-ex physics.data-an

    The Interplay of Machine Learning--based Resonant Anomaly Detection Methods

    Authors: Tobias Golling, Gregor Kasieczka, Claudius Krause, Radha Mastandrea, Benjamin Nachman, John Andrew Raine, Debajyoti Sengupta, David Shih, Manuel Sommerhalder

    Abstract: Machine learning--based anomaly detection (AD) methods are promising tools for extending the coverage of searches for physics beyond the Standard Model (BSM). One class of AD methods that has received significant attention is resonant anomaly detection, where the BSM is assumed to be localized in at least one known variable. While there have been many methods proposed to identify such a BSM signal… ▽ More

    Submitted 14 March, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 27 pages, 21 figures. Updated with revisions for journal acceptance

  15. arXiv:2305.04847  [pdf, other

    physics.ins-det cs.LG hep-ex hep-ph physics.data-an

    CaloClouds: Fast Geometry-Independent Highly-Granular Calorimeter Simulation

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, William Korcari, Katja Krüger, Peter McKeown

    Abstract: Simulating showers of particles in highly-granular detectors is a key frontier in the application of machine learning to particle physics. Achieving high accuracy and speed with generative machine learning models would enable them to augment traditional simulations and alleviate a major computing constraint. This work achieves a major breakthrough in this task by, for the first time, directly gene… ▽ More

    Submitted 26 February, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: 25 pages, 11 figures

    Report number: DESY-23-061

    Journal ref: JINST 18 (2023) 11, P11025

  16. arXiv:2303.18150  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    New Angles on Fast Calorimeter Shower Simulation

    Authors: Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, Katja Krüger, Peter McKeown, Lennart Rustige

    Abstract: The demands placed on computational resources by the simulation requirements of high energy physics experiments motivate the development of novel simulation tools. Machine learning based generative models offer a solution that is both fast and accurate. In this work we extend the Bounded Information Bottleneck Autoencoder (BIB-AE) architecture, designed for the simulation of particle showers in hi… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: 26 pages, 19 figures

    Report number: DESY-23-039

  17. arXiv:2302.11594  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    L2LFlows: Generating High-Fidelity 3D Calorimeter Images

    Authors: Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Claudius Krause, Imahn Shekhzadeh, David Shih

    Abstract: We explore the use of normalizing flows to emulate Monte Carlo detector simulations of photon showers in a high-granularity electromagnetic calorimeter prototype for the International Large Detector (ILD). Our proposed method -- which we refer to as "Layer-to-Layer-Flows" (L$2$LFlows) -- is an evolution of the CaloFlow architecture adapted to a higher-dimensional setting (30 layers of… ▽ More

    Submitted 20 October, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: v2: 28 pages, 13 figures; matches version accepted for publication in JINST. Neither SISSA Medialab Srl nor IOP Publishing Ltd is responsible for any errors or omissions in this version of the manuscript or any version derived from it. Published version available via DOI

    Journal ref: 2023 JINST 18 P10017

  18. arXiv:2301.08128  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    EPiC-GAN: Equivariant Point Cloud Generation for Particle Jets

    Authors: Erik Buhmann, Gregor Kasieczka, Jesse Thaler

    Abstract: With the vast data-collecting capabilities of current and future high-energy collider experiments, there is an increasing demand for computationally efficient simulations. Generative machine learning models enable fast event generation, yet so far these approaches are largely constrained to fixed data structures and rigid detector geometries. In this paper, we introduce EPiC-GAN - equivariant poin… ▽ More

    Submitted 12 July, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 18 pages, 8 figures, 3 tables, code available at: https://github.com/uhh-pd-ml/EPiC-GAN

    Report number: MIT-CTP 5519

    Journal ref: SciPost Phys. 15, 130 (2023)

  19. arXiv:2212.00046  [pdf, other

    hep-ph cs.LG hep-ex physics.data-an

    Feature Selection with Distance Correlation

    Authors: Ranit Das, Gregor Kasieczka, David Shih

    Abstract: Choosing which properties of the data to use as input to multivariate decision algorithms -- a.k.a. feature selection -- is an important step in solving any problem with machine learning. While there is a clear trend towards training sophisticated deep networks on large numbers of relatively unprocessed inputs (so-called automated feature engineering), for many tasks in physics, sets of theoretica… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

    Comments: 14 pages, 8 figures, 3 tables

  20. arXiv:2210.14924  [pdf, other

    hep-ph hep-ex physics.data-an

    Resonant anomaly detection without background sculpting

    Authors: Anna Hallin, Gregor Kasieczka, Tobias Quadfasel, David Shih, Manuel Sommerhalder

    Abstract: We introduce a new technique named Latent CATHODE (LaCATHODE) for performing "enhanced bump hunts", a type of resonant anomaly search that combines conventional one-dimensional bump hunts with a model-agnostic anomaly score in an auxiliary feature space where potential signals could also be localized. The main advantage of LaCATHODE over existing methods is that it provides an anomaly score that i… ▽ More

    Submitted 10 July, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: 11 pages, 8 figures; v2 (published version): referencing code and minor style updates

    Journal ref: Phys. Rev. D 107, 114012 (2023)

  21. arXiv:2209.06225  [pdf, other

    hep-ph hep-ex physics.data-an

    Anomaly Detection under Coordinate Transformations

    Authors: Gregor Kasieczka, Radha Mastandrea, Vinicius Mikuni, Benjamin Nachman, Mariel Pettee, David Shih

    Abstract: There is a growing need for machine learning-based anomaly detection strategies to broaden the search for Beyond-the-Standard-Model (BSM) physics at the Large Hadron Collider (LHC) and elsewhere. The first step of any anomaly detection approach is to specify observables and then use them to decide on a set of anomalous events. One common choice is to select events that have low probability density… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures

  22. arXiv:2203.08806  [pdf, other

    hep-ph cs.LG hep-ex physics.comp-ph physics.ins-det

    New directions for surrogate models and differentiable programming for High Energy Physics detector simulation

    Authors: Andreas Adelmann, Walter Hopkins, Evangelos Kourlitis, Michael Kagan, Gregor Kasieczka, Claudius Krause, David Shih, Vinicius Mikuni, Benjamin Nachman, Kevin Pedro, Daniel Winklehner

    Abstract: The computational cost for high energy physics detector simulation in future experimental facilities is going to exceed the current available resources. To overcome this challenge, new ideas on surrogate models using machine learning methods are being explored to replace computationally expensive components. Additionally, differentiable programming has been proposed as a complementary approach, pr… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Comments: contribution to Snowmass 2021

    Report number: FERMILAB-CONF-22-199-SCD

  23. Machine Learning and LHC Event Generation

    Authors: Anja Butter, Tilman Plehn, Steffen Schumann, Simon Badger, Sascha Caron, Kyle Cranmer, Francesco Armando Di Bello, Etienne Dreyer, Stefano Forte, Sanmay Ganguly, Dorival Gonçalves, Eilam Gross, Theo Heimel, Gudrun Heinrich, Lukas Heinrich, Alexander Held, Stefan Höche, Jessica N. Howard, Philip Ilten, Joshua Isaacson, Timo Janßen, Stephen Jones, Marumi Kado, Michael Kagan, Gregor Kasieczka , et al. (26 additional authors not shown)

    Abstract: First-principle simulations are at the heart of the high-energy physics research program. They link the vast data output of multi-purpose detectors with fundamental theory predictions and interpretation. This review illustrates a wide range of applications of modern machine learning to event generation and simulation-based inference, including conceptional developments driven by the specific requi… ▽ More

    Submitted 28 December, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Review article based on a Snowmass 2021 contribution

    Journal ref: SciPost Phys. 14, 079 (2023)

  24. arXiv:2202.09375  [pdf, other

    hep-ph hep-ex physics.data-an

    Ephemeral Learning -- Augmenting Triggers with Online-Trained Normalizing Flows

    Authors: Anja Butter, Sascha Diefenbacher, Gregor Kasieczka, Benjamin Nachman, Tilman Plehn, David Shih, Ramon Winterhalder

    Abstract: The large data rates at the LHC require an online trigger system to select relevant collisions. Rather than compressing individual events, we propose to compress an entire data set at once. We use a normalizing flow as a deep generative model to learn the probability density of the data online. The events are then represented by the generative neural network and can be inspected offline for anomal… ▽ More

    Submitted 28 June, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: 17 pages, 9 figures, minor changes to text, addressed referee comments

    Report number: CP3-22-10

    Journal ref: SciPost Phys. 13, 087 (2022)

  25. Calomplification -- The Power of Generative Calorimeter Models

    Authors: Sebastian Bieringer, Anja Butter, Sascha Diefenbacher, Engin Eren, Frank Gaede, Daniel Hundhausen, Gregor Kasieczka, Benjamin Nachman, Tilman Plehn, Mathias Trabs

    Abstract: Motivated by the high computational costs of classical simulations, machine-learned generative models can be extremely useful in particle physics and elsewhere. They become especially attractive when surrogate models can efficiently learn the underlying distribution, such that a generated sample outperforms a training sample of limited size. This kind of GANplification has been observed for simple… ▽ More

    Submitted 25 January, 2023; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: 17 pages, 10 figures

    Report number: DESY-22-031

    Journal ref: JINST 17 P09028 (2022)

  26. arXiv:2112.09709  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    Hadrons, Better, Faster, Stronger

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Daniel Hundhausen, Gregor Kasieczka, William Korcari, Katja Krüger, Peter McKeown, Lennart Rustige

    Abstract: Motivated by the computational limitations of simulating interactions of particles in highly-granular detectors, there exists a concerted effort to build fast and exact machine-learning-based shower simulators. This work reports progress on two important fronts. First, the previously investigated WGAN and BIB-AE generative models are improved and successful learning of hadronic showers initiated b… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: 20 pages, 8 figures

  27. arXiv:2112.03769  [pdf, other

    hep-ph hep-ex physics.data-an stat.ML

    Machine Learning in the Search for New Fundamental Physics

    Authors: Georgia Karagiorgi, Gregor Kasieczka, Scott Kravitz, Benjamin Nachman, David Shih

    Abstract: Machine learning plays a crucial role in enhancing and accelerating the search for new fundamental physics. We review the state of machine learning methods and applications for new physics searches in the context of terrestrial high energy physics experiments, including the Large Hadron Collider, rare event searches, and neutrino experiments. While machine learning has a long history in these fiel… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Preprint of article submitted to Nature Reviews Physics, 19 pages, 1 figure

  28. arXiv:2109.00546  [pdf, other

    hep-ph hep-ex physics.data-an

    Classifying Anomalies THrough Outer Density Estimation (CATHODE)

    Authors: Anna Hallin, Joshua Isaacson, Gregor Kasieczka, Claudius Krause, Benjamin Nachman, Tobias Quadfasel, Matthias Schlaffer, David Shih, Manuel Sommerhalder

    Abstract: We propose a new model-agnostic search strategy for physics beyond the standard model (BSM) at the LHC, based on a novel application of neural density estimation to anomaly detection. Our approach, which we call Classifying Anomalies THrough Outer Density Estimation (CATHODE), assumes the BSM signal is localized in a signal region (defined e.g. using invariant mass). By training a conditional dens… ▽ More

    Submitted 11 September, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 17 pages, 12 figures; v2: minor updates; v3 (published version): added study of background sculpting and minor fixes

    Report number: EFI-20-5, FERMILAB-PUB-21-389-T

    Journal ref: Phys. Rev. D 106, 055006 (2022)

  29. Symmetries, Safety, and Self-Supervision

    Authors: Barry M. Dillon, Gregor Kasieczka, Hans Olischlager, Tilman Plehn, Peter Sorrenson, Lorenz Vogel

    Abstract: Collider searches face the challenge of defining a representation of high-dimensional data such that physical symmetries are manifest, the discriminating features are retained, and the choice of representation is new-physics agnostic. We introduce JetCLR to solve the map** from low-level data to optimized observables though self-supervised contrastive learning. As an example, we construct a data… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Journal ref: SciPost Phys. 12, 188 (2022)

  30. Unsupervised Hadronic SUEP at the LHC

    Authors: Jared Barron, David Curtin, Gregor Kasieczka, Tilman Plehn, Aris Spourdalakis

    Abstract: Confining dark sectors with pseudo-conformal dynamics produce SUEP, or Soft Unclustered Energy Patterns, at colliders: isotropic dark hadrons with soft and democratic energies. We target the experimental nightmare scenario, SUEPs in exotic Higgs decays, where all dark hadrons decay promptly to SM hadrons. First, we identify three promising observables, the charged particle multiplicity, the event… ▽ More

    Submitted 4 November, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 10 pages, 7 figures + references and appendix v2: Added graph to appendix and fixed typos

  31. arXiv:2107.02821  [pdf, other

    stat.ML cs.LG hep-ex hep-ph

    New Methods and Datasets for Group Anomaly Detection From Fundamental Physics

    Authors: Gregor Kasieczka, Benjamin Nachman, David Shih

    Abstract: The identification of anomalous overdensities in data - group or collective anomaly detection - is a rich problem with a large number of real world applications. However, it has received relatively little attention in the broader ML community, as compared to point anomalies or other types of single instance outliers. One reason for this is the lack of powerful benchmark datasets. In this paper, we… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: Accepted for ANDEA (Anomaly and Novelty Detection, Explanation and Accommodation) Workshop at KDD 2021

  32. arXiv:2107.00656  [pdf, other

    cs.LG astro-ph.IM hep-ph nucl-th physics.data-an stat.ML

    Shared Data and Algorithms for Deep Learning in Fundamental Physics

    Authors: Lisa Benato, Erik Buhmann, Martin Erdmann, Peter Fackeldey, Jonas Glombitza, Nikolai Hartmann, Gregor Kasieczka, William Korcari, Thomas Kuhr, Jan Steinheimer, Horst Stöcker, Tilman Plehn, Kai Zhou

    Abstract: We introduce a Python package that provides simply and unified access to a collection of datasets from fundamental physics research - including particle physics, astroparticle physics, and hadron- and nuclear physics - for supervised machine learning studies. The datasets contain hadronic top quarks, cosmic-ray induced air showers, phase transitions in hadronic matter, and generator-level historie… ▽ More

    Submitted 24 March, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: 14 pages, 3 figures, 5 tables - Version accepted by Computing and Software for Big Science

    Journal ref: Comput Softw Big Sci 6, 9 (2022)

  33. arXiv:2102.12491  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    Decoding Photons: Physics in the Latent Space of a BIB-AE Generative Network

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, Katja Krüger

    Abstract: Given the increasing data collection capabilities and limited computing resources of future collider experiments, interest in using generative neural networks for the fast simulation of collider events is growing. In our previous study, the Bounded Information Bottleneck Autoencoder (BIB-AE) architecture for generating photon showers in a high-granularity calorimeter showed a high accuracy modelin… ▽ More

    Submitted 29 June, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: 13 pages, 9 figures, 2 tables, accepted by vCHEP 2021

    Report number: DESY 21-029

    Journal ref: EPJ Web of Conferences 251, 03003 (2021)

  34. arXiv:2101.08320  [pdf, other

    hep-ph hep-ex physics.data-an

    The LHC Olympics 2020: A Community Challenge for Anomaly Detection in High Energy Physics

    Authors: Gregor Kasieczka, Benjamin Nachman, David Shih, Oz Amram, Anders Andreassen, Kees Benkendorfer, Blaz Bortolato, Gustaaf Brooijmans, Florencia Canelli, Jack H. Collins, Biwei Dai, Felipe F. De Freitas, Barry M. Dillon, Ioan-Mihail Dinu, Zhongtian Dong, Julien Donini, Javier Duarte, D. A. Faroughy, Julia Gonski, Philip Harris, Alan Kahn, Jernej F. Kamenik, Charanjit K. Khosa, Patrick Komiske, Luc Le Pottier , et al. (22 additional authors not shown)

    Abstract: A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: 108 pages, 53 figures, 3 tables

  35. arXiv:2012.11944  [pdf, other

    hep-ph

    How to GAN Higher Jet Resolution

    Authors: Pierre Baldi, Lukas Blecher, Anja Butter, Julian Collado, Jessica N. Howard, Fabian Keilbach, Tilman Plehn, Gregor Kasieczka, Daniel Whiteson

    Abstract: QCD-jets at the LHC are described by simple physics principles. We show how super-resolution generative networks can learn the underlying structures and use them to improve the resolution of jet images. We test this approach on massless QCD-jets and on fat top-jets and find that the network reproduces their main features even without training on pure samples. In addition, we show how a slim networ… ▽ More

    Submitted 2 December, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: 25 pages, 11 figures; implemented SciPost reviewer comments, clarified definitions and expanded discussion in high-level observable benchmarking subsection (section 3.3 and Fig. 7)

  36. arXiv:2009.03796  [pdf, other

    hep-ph hep-ex physics.data-an physics.ins-det stat.ML

    DCTRGAN: Improving the Precision of Generative Models with Reweighting

    Authors: Sascha Diefenbacher, Engin Eren, Gregor Kasieczka, Anatolii Korol, Benjamin Nachman, David Shih

    Abstract: Significant advances in deep learning have led to more widely used and precise neural network-based generative models such as Generative Adversarial Networks (GANs). We introduce a post-hoc correction to deep generative models to further improve their fidelity, based on the Deep neural networks using the Classification for Tuning and Reweighting (DCTR) protocol. The correction takes the form of a… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: 14 pages, 8 figures

  37. arXiv:2008.06545  [pdf, other

    hep-ph hep-ex physics.data-an stat.ML

    GANplifying Event Samples

    Authors: Anja Butter, Sascha Diefenbacher, Gregor Kasieczka, Benjamin Nachman, Tilman Plehn

    Abstract: A critical question concerning generative networks applied to event generation in particle physics is if the generated events add statistical precision beyond the training sample. We show for a simple example with increasing dimensionality how generative networks indeed amplify the training statistics. We quantify their impact through an amplification factor or equivalent numbers of sampled events… ▽ More

    Submitted 25 March, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: 15 pages, 7 figures, fixed two equations, extended acknowledgments, addressed referee comments, improved figure readability

    Journal ref: SciPost Phys. 10, 139 (2021)

  38. arXiv:2007.14400  [pdf, other

    hep-ph hep-ex physics.data-an

    ABCDisCo: Automating the ABCD Method with Machine Learning

    Authors: Gregor Kasieczka, Benjamin Nachman, Matthew D. Schwartz, David Shih

    Abstract: The ABCD method is one of the most widely used data-driven background estimation techniques in high energy physics. Cuts on two statistically-independent classifiers separate signal and background into four regions, so that background in the signal region can be estimated simply using the other three control regions. Typically, the independent classifiers are chosen "by hand" to be intuitive and p… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 37 pages, 12 figures

    Journal ref: Phys. Rev. D 103, 035021 (2021)

  39. Towards Machine Learning Analytics for Jet Substructure

    Authors: Gregor Kasieczka, Simone Marzani, Gregory Soyez, Giovanni Stagnitto

    Abstract: The past few years have seen a rapid development of machine-learning algorithms. While surely augmenting performance, these complex tools are often treated as black-boxes and may impair our understanding of the physical processes under study. The aim of this paper is to move a first step into the direction of applying expert-knowledge in particle physics to calculate the optimal decision function… ▽ More

    Submitted 22 September, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: 32 pages, 9 figures; v2 brings extra clarifications, as accepted by JHEP

    Report number: TIF-UNIMI-2020-12

    Journal ref: JHEP 09 (2020) 195

  40. Invertible Networks or Partons to Detector and Back Again

    Authors: Marco Bellagente, Anja Butter, Gregor Kasieczka, Tilman Plehn, Armand Rousselot, Ramon Winterhalder, Lynton Ardizzone, Ullrich Köthe

    Abstract: For simulations where the forward and the inverse directions have a physics meaning, invertible neural networks are especially useful. A conditional INN can invert a detector simulation in terms of high-level observables, specifically for ZW production at the LHC. It allows for a per-event statistical interpretation. Next, we allow for a variable number of QCD jets. We unfold detector effects and… ▽ More

    Submitted 1 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 25 pages, 10 figures

    Journal ref: SciPost Phys. 9, 074 (2020)

  41. arXiv:2005.05334  [pdf, other

    physics.ins-det hep-ex hep-ph physics.data-an

    Getting High: High Fidelity Simulation of High Granularity Calorimeters with High Speed

    Authors: Erik Buhmann, Sascha Diefenbacher, Engin Eren, Frank Gaede, Gregor Kasieczka, Anatolii Korol, Katja Krüger

    Abstract: Accurate simulation of physical processes is crucial for the success of modern particle physics. However, simulating the development and interaction of particle showers with calorimeter detectors is a time consuming process and drives the computing needs of large experiments at the LHC and future colliders. Recently, generative machine learning models based on deep neural networks have shown promi… ▽ More

    Submitted 3 February, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: 17 pages, 12 figures

    Report number: DESY 20-075

    Journal ref: Computing and Software for Big Science 5, 13 (2021)

  42. Per-Object Systematics using Deep-Learned Calibration

    Authors: Gregor Kasieczka, Michel Luchmann, Florian Otterpohl, Tilman Plehn

    Abstract: We show how to treat systematic uncertainties using Bayesian deep networks for regression. First, we analyze how these networks separately trace statistical and systematic uncertainties on the momenta of boosted top quarks forming fat jets. Next, we propose a novel calibration procedure by training on labels and their error bars. Again, the network cleanly separates the different uncertainties. As… ▽ More

    Submitted 2 November, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

    Journal ref: SciPost Phys. 9, 089 (2020)

  43. arXiv:2001.05310  [pdf, other

    hep-ph hep-ex physics.data-an

    DisCo Fever: Robust Networks Through Distance Correlation

    Authors: Gregor Kasieczka, David Shih

    Abstract: While deep learning has proven to be extremely successful at supervised classification tasks at the LHC and beyond, for practical applications, raw classification accuracy is often not the only consideration. One crucial issue is the stability of network predictions, either versus changes of individual features of the input data, or against systematic perturbations. We present a new method based o… ▽ More

    Submitted 30 September, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 9 pages, v2: essentially the journal version (refs added, typos fixed, minor improvements)

    Journal ref: Phys. Rev. Lett. 125, 122001 (2020)

  44. How to GAN away Detector Effects

    Authors: Marco Bellagente, Anja Butter, Gregor Kasieczka, Tilman Plehn, Ramon Winterhalder

    Abstract: LHC analyses directly comparing data and simulated events bear the danger of using first-principle predictions only as a black-box part of event simulation. We show how simulations, for instance, of detector effects can instead be inverted using generative networks. This allows us to reconstruct parton level information from measured events. Our results illustrate how, in general, fully conditiona… ▽ More

    Submitted 17 March, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

    Comments: 16 pages, 13 figures

    Journal ref: SciPost Phys. 8, 070 (2020)

  45. CapsNets Continuing the Convolutional Quest

    Authors: Sascha Diefenbacher, Hermann Frost, Gregor Kasieczka, Tilman Plehn, Jennifer M. Thompson

    Abstract: Capsule networks are ideal tools to combine event-level and subjet information at the LHC. After benchmarking our capsule network against standard convolutional networks, we show how multi-class capsules extract a resonance decaying to top quarks from both, QCD di-jet and the top continuum backgrounds. We then show how its results can be easily interpreted. Finally, we use associated top-Higgs pro… ▽ More

    Submitted 31 October, 2019; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: Minor changes to text

    Journal ref: SciPost Phys. 8, 023 (2020)

  46. Deep-Learning Jets with Uncertainties and More

    Authors: Sven Bollweg, Manuel Haussmann, Gregor Kasieczka, Michel Luchmann, Tilman Plehn, Jennifer Thompson

    Abstract: Bayesian neural networks allow us to keep track of uncertainties, for example in top tagging, by learning a tagger output together with an error band. We illustrate the main features of Bayesian versions of established deep-learning taggers. We show how they capture statistical uncertainties from finite training samples, systematics related to the jet energy scale, and stability issues through pil… ▽ More

    Submitted 15 August, 2019; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: 15 figures

    Journal ref: SciPost Phys. 8, 006 (2020)

  47. Searching for long-lived particles beyond the Standard Model at the Large Hadron Collider

    Authors: Juliette Alimena, James Beacham, Martino Borsato, Yangyang Cheng, Xabier Cid Vidal, Giovanna Cottin, Albert De Roeck, Nishita Desai, David Curtin, Jared A. Evans, Simon Knapen, Sabine Kraml, Andre Lessa, Zhen Liu, Sascha Mehlhase, Michael J. Ramsey-Musolf, Heather Russell, Jessie Shelton, Brian Shuve, Monica Verducci, Jose Zurita, Todd Adams, Michael Adersberger, Cristiano Alpigiani, Artur Apresyan , et al. (176 additional authors not shown)

    Abstract: Particles beyond the Standard Model (SM) can generically have lifetimes that are long compared to SM particles at the weak scale. When produced at experiments such as the Large Hadron Collider (LHC) at CERN, these long-lived particles (LLPs) can decay far from the interaction vertex of the primary proton-proton collision. Such LLP signatures are distinct from those of promptly decaying particles t… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

    Journal ref: J. Phys. G: Nucl. Part. Phys. 47 090501 (2020)

  48. The Machine Learning Landscape of Top Taggers

    Authors: G. Kasieczka, T. Plehn, A. Butter, K. Cranmer, D. Debnath, B. M. Dillon, M. Fairbairn, D. A. Faroughy, W. Fedorko, C. Gay, L. Gouskos, J. F. Kamenik, P. T. Komiske, S. Leiss, A. Lister, S. Macaluso, E. M. Metodiev, L. Moore, B. Nachman, K. Nordstrom, J. Pearkes, H. Qu, Y. Rath, M. Rieger, D. Shih , et al. (2 additional authors not shown)

    Abstract: Based on the established task of identifying boosted, hadronically decaying top quarks, we compare a wide range of modern machine learning approaches. Unlike most established methods they rely on low-level input, for instance calorimeter output. While their network architectures are vastly different, their performance is comparatively similar. In general, we find that these new approaches are extr… ▽ More

    Submitted 23 July, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Yet another tagger included!

    Journal ref: SciPost Phys. 7, 014 (2019)

  49. Quark-Gluon Tagging: Machine Learning vs Detector

    Authors: Gregor Kasieczka, Nicholas Kiefer, Tilman Plehn, Jennifer M. Thompson

    Abstract: Distinguishing quarks from gluons based on low-level detector output is one of the most challenging applications of multi-variate and machine learning techniques at the LHC. We first show the performance of our 4-vector-based LoLa tagger without and after considering detector effects. We then discuss two benchmark applications, mono-jet searches with a gluon-rich signal and di-jet resonances with… ▽ More

    Submitted 20 May, 2019; v1 submitted 21 December, 2018; originally announced December 2018.

    Journal ref: SciPost Phys. 6, 069 (2019)

  50. QCD or What?

    Authors: Theo Heimel, Gregor Kasieczka, Tilman Plehn, Jennifer M Thompson

    Abstract: Autoencoder networks, trained only on QCD jets, can be used to search for anomalies in jet-substructure. We show how, based either on images or on 4-vectors, they identify jets from decays of arbitrary heavy resonances. To control the backgrounds and the underlying systematics we can de-correlate the jet mass using an adversarial network. Such an adversarial autoencoder allows for a general and at… ▽ More

    Submitted 14 January, 2019; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: 11 figures, added references

    Journal ref: SciPost Phys. 6, 030 (2019)