-
Robust Anomaly Detection for Particle Physics Using Multi-Background Representation Learning
Authors:
Abhijith Gandrakota,
Lily Zhang,
Aahlad Puli,
Kyle Cranmer,
Jennifer Ngadiuba,
Rajesh Ranganath,
Nhan Tran
Abstract:
Anomaly, or out-of-distribution, detection is a promising tool for aiding discoveries of new particles or processes in particle physics. In this work, we identify and address two overlooked opportunities to improve anomaly detection for high-energy physics. First, rather than train a generative model on the single most dominant background process, we build detection algorithms using representation…
▽ More
Anomaly, or out-of-distribution, detection is a promising tool for aiding discoveries of new particles or processes in particle physics. In this work, we identify and address two overlooked opportunities to improve anomaly detection for high-energy physics. First, rather than train a generative model on the single most dominant background process, we build detection algorithms using representation learning from multiple background types, thus taking advantage of more information to improve estimation of what is relevant for detection. Second, we generalize decorrelation to the multi-background setting, thus directly enforcing a more complete definition of robustness for anomaly detection. We demonstrate the benefit of the proposed robust multi-background anomaly detection algorithms on a high-dimensional dataset of particle decays at the Large Hadron Collider.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Physics Community Needs, Tools, and Resources for Machine Learning
Authors:
Philip Harris,
Erik Katsavounidis,
William Patrick McCormack,
Dylan Rankin,
Yongbin Feng,
Abhijith Gandrakota,
Christian Herwig,
Burt Holzman,
Kevin Pedro,
Nhan Tran,
Tingjun Yang,
Jennifer Ngadiuba,
Michael Coughlin,
Scott Hauck,
Shih-Chieh Hsu,
Elham E Khoda,
Deming Chen,
Mark Neubauer,
Javier Duarte,
Georgia Karagiorgi,
Mia Liu
Abstract:
Machine learning (ML) is becoming an increasingly important component of cutting-edge physics research, but its computational requirements present significant challenges. In this white paper, we discuss the needs of the physics community regarding ML across latency and throughput regimes, the tools and resources that offer the possibility of addressing these needs, and how these can be best utiliz…
▽ More
Machine learning (ML) is becoming an increasingly important component of cutting-edge physics research, but its computational requirements present significant challenges. In this white paper, we discuss the needs of the physics community regarding ML across latency and throughput regimes, the tools and resources that offer the possibility of addressing these needs, and how these can be best utilized and accessed in the coming years.
△ Less
Submitted 30 March, 2022;
originally announced March 2022.
-
Model selection and signal extraction using Gaussian Process regression
Authors:
Abhijith Gandrakota,
Amitabh Lath,
Alexandre V. Morozov,
Sindhu Murthy
Abstract:
We present a novel computational approach for extracting weak signals, whose exact location and width may be unknown, from complex background distributions with an arbitrary functional form. We focus on datasets that can be naturally presented as binned integer counts, demonstrating our approach on the CERN open dataset from the ATLAS collaboration at the Large Hadron Collider, which contains the…
▽ More
We present a novel computational approach for extracting weak signals, whose exact location and width may be unknown, from complex background distributions with an arbitrary functional form. We focus on datasets that can be naturally presented as binned integer counts, demonstrating our approach on the CERN open dataset from the ATLAS collaboration at the Large Hadron Collider, which contains the Higgs boson signature. Our approach is based on Gaussian Process (GP) regression - a powerful and flexible machine learning technique that allowed us to model the background without specifying its functional form explicitly, and to separate the background and signal contributions in a robust and reproducible manner. Unlike functional fits, our GP-regression-based approach does not need to be constantly updated as more data becomes available. We discuss how to select the GP kernel type, considering trade-offs between kernel complexity and its ability to capture the features of the background distribution. We show that our GP framework can be used to detect the Higgs boson resonance in the data with more statistical significance than a polynomial fit specifically tailored to the dataset. Finally, we use Markov Chain Monte Carlo (MCMC) sampling to confirm the statistical significance of the extracted Higgs signature.
△ Less
Submitted 11 February, 2022;
originally announced February 2022.