-
Insights into Dark Matter Direct Detection Experiments: Decision Trees versus Deep Learning
Authors:
Daniel E. Lopez-Fogliani,
Andres D. Perez,
Roberto Ruiz de Austri
Abstract:
The detection of Dark Matter (DM) remains a significant challenge in particle physics. This study exploits advanced machine learning models to improve detection capabilities of liquid xenon time projection chamber experiments, utilizing state-of-the-art transformers alongside traditional methods like Multilayer Perceptrons and Convolutional Neural Networks. We evaluate various data representations…
▽ More
The detection of Dark Matter (DM) remains a significant challenge in particle physics. This study exploits advanced machine learning models to improve detection capabilities of liquid xenon time projection chamber experiments, utilizing state-of-the-art transformers alongside traditional methods like Multilayer Perceptrons and Convolutional Neural Networks. We evaluate various data representations and find that simplified feature representations, particularly corrected S1 and S2 signals, retain critical information for classification. Our results show that while transformers offer promising performance, simpler models like XGBoost can achieve comparable results with optimal data representations. We also derive exclusion limits in the cross-section versus DM mass parameter space, showing minimal differences between XGBoost and the best performing deep learning models. The comparative analysis of different machine learning approaches provides a valuable reference for future experiments by guiding the choice of models and data representations to maximize detection capabilities.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Progress in End-to-End Optimization of Detectors for Fundamental Physics with Differentiable Programming
Authors:
Max Aehle,
Lorenzo Arsini,
R. Belén Barreiro,
Anastasios Belias,
Florian Bury,
Susana Cebrian,
Alexander Demin,
Jennet Dickinson,
Julien Donini,
Tommaso Dorigo,
Michele Doro,
Nicolas R. Gauger,
Andrea Giammanco,
Lindsey Gray,
Borja S. González,
Verena Kain,
Jan Kieseler,
Lisa Kusch,
Marcus Liwicki,
Gernot Maier,
Federico Nardi,
Fedor Ratnikov,
Ryan Roussel,
Roberto Ruiz de Austri,
Fredrik Sandin
, et al. (5 additional authors not shown)
Abstract:
In this article we examine recent developments in the research area concerning the creation of end-to-end models for the complete optimization of measuring instruments. The models we consider rely on differentiable programming methods and on the specification of a software pipeline including all factors impacting performance -- from the data-generating processes to their reconstruction and the ext…
▽ More
In this article we examine recent developments in the research area concerning the creation of end-to-end models for the complete optimization of measuring instruments. The models we consider rely on differentiable programming methods and on the specification of a software pipeline including all factors impacting performance -- from the data-generating processes to their reconstruction and the extraction of inference on the parameters of interest of a measuring instrument -- along with the careful specification of a utility function well aligned with the end goals of the experiment.
Building on previous studies originated within the MODE Collaboration, we focus specifically on applications involving instruments for particle physics experimentation, as well as industrial and medical applications that share the detection of radiation as their data-generating mechanism.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
Gradient-Annihilated PINNs for Solving Riemann Problems: Application to Relativistic Hydrodynamics
Authors:
Antonio Ferrer-Sánchez,
José D. Martín-Guerrero,
Roberto Ruiz de Austri,
Alejandro Torres-Forné,
José A. Font
Abstract:
We present a novel methodology based on Physics-Informed Neural Networks (PINNs) for solving systems of partial differential equations admitting discontinuous solutions. Our method, called Gradient-Annihilated PINNs (GA-PINNs), introduces a modified loss function that requires the model to partially ignore high-gradients in the physical variables, achieved by introducing a suitable weighting funct…
▽ More
We present a novel methodology based on Physics-Informed Neural Networks (PINNs) for solving systems of partial differential equations admitting discontinuous solutions. Our method, called Gradient-Annihilated PINNs (GA-PINNs), introduces a modified loss function that requires the model to partially ignore high-gradients in the physical variables, achieved by introducing a suitable weighting function. The method relies on a set of hyperparameters that control how gradients are treated in the physical loss and how the activation functions of the neural model are dynamically accounted for. The performance of our GA-PINN model is demonstrated by solving Riemann problems in special relativistic hydrodynamics, extending earlier studies with PINNs in the context of the classical Euler equations. The solutions obtained with our GA-PINN model correctly describe the propagation speeds of discontinuities and sharply capture the associated jumps. We use the relative $l^{2}$ error to compare our results with the exact solution of special relativistic Riemann problems, used as the reference ``ground truth'', and with the error obtained with a second-order, central, shock-capturing scheme. In all problems investigated, the accuracy reached by our GA-PINN model is comparable to that obtained with a shock-capturing scheme and significantly higher than that achieved by a baseline PINN algorithm. An additional benefit worth stressing is that our PINN-based approach sidesteps the costly recovery of the primitive variables from the state vector of conserved ones, a well-known drawback of grid-based solutions of the relativistic hydrodynamics equations. Due to its inherent generality and its ability to handle steep gradients, the GA-PINN method discussed could be a valuable tool to model relativistic flows in astrophysics and particle physics, characterized by the prevalence of discontinuous solutions.
△ Less
Submitted 19 May, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
Toward the End-to-End Optimization of Particle Physics Instruments with Differentiable Programming: a White Paper
Authors:
Tommaso Dorigo,
Andrea Giammanco,
Pietro Vischia,
Max Aehle,
Mateusz Bawaj,
Alexey Boldyrev,
Pablo de Castro Manzano,
Denis Derkach,
Julien Donini,
Auralee Edelen,
Federica Fanzago,
Nicolas R. Gauger,
Christian Glaser,
Atılım G. Baydin,
Lukas Heinrich,
Ralf Keidel,
Jan Kieseler,
Claudius Krause,
Maxime Lagrange,
Max Lamparth,
Lukas Layer,
Gernot Maier,
Federico Nardi,
Helge E. S. Pettersen,
Alberto Ramos
, et al. (11 additional authors not shown)
Abstract:
The full optimization of the design and operation of instruments whose functioning relies on the interaction of radiation with matter is a super-human task, given the large dimensionality of the space of possible choices for geometry, detection technology, materials, data-acquisition, and information-extraction techniques, and the interdependence of the related parameters. On the other hand, massi…
▽ More
The full optimization of the design and operation of instruments whose functioning relies on the interaction of radiation with matter is a super-human task, given the large dimensionality of the space of possible choices for geometry, detection technology, materials, data-acquisition, and information-extraction techniques, and the interdependence of the related parameters. On the other hand, massive potential gains in performance over standard, "experience-driven" layouts are in principle within our reach if an objective function fully aligned with the final goals of the instrument is maximized by means of a systematic search of the configuration space. The stochastic nature of the involved quantum processes make the modeling of these systems an intractable problem from a classical statistics point of view, yet the construction of a fully differentiable pipeline and the use of deep learning techniques may allow the simultaneous optimization of all design parameters.
In this document we lay down our plans for the design of a modular and versatile modeling tool for the end-to-end optimization of complex instruments for particle physics experiments as well as industrial and medical applications that share the detection of radiation as their basic ingredient. We consider a selected set of use cases to highlight the specific needs of different applications.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
The Dark Machines Anomaly Score Challenge: Benchmark Data and Model Independent Event Classification for the Large Hadron Collider
Authors:
T. Aarrestad,
M. van Beekveld,
M. Bona,
A. Boveia,
S. Caron,
J. Davies,
A. De Simone,
C. Doglioni,
J. M. Duarte,
A. Farbin,
H. Gupta,
L. Hendriks,
L. Heinrich,
J. Howarth,
P. Jawahar,
A. Jueid,
J. Lastow,
A. Leinweber,
J. Mamuzic,
E. Merényi,
A. Morandini,
P. Moskvitina,
C. Nellist,
J. Ngadiuba,
B. Ostdiek
, et al. (14 additional authors not shown)
Abstract:
We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We defin…
▽ More
We describe the outcome of a data challenge conducted as part of the Dark Machines Initiative and the Les Houches 2019 workshop on Physics at TeV colliders. The challenged aims at detecting signals of new physics at the LHC using unsupervised machine learning algorithms. First, we propose how an anomaly score could be implemented to define model-independent signal regions in LHC searches. We define and describe a large benchmark dataset, consisting of >1 Billion simulated LHC events corresponding to $10~\rm{fb}^{-1}$ of proton-proton collisions at a center-of-mass energy of 13 TeV. We then review a wide range of anomaly detection and density estimation algorithms, developed in the context of the data challenge, and we measure their performance in a set of realistic analysis environments. We draw a number of useful conclusions that will aid the development of unsupervised new physics searches during the third run of the LHC, and provide our benchmark dataset for future studies at https://www.phenoMLdata.org. Code to reproduce the analysis is provided at https://github.com/bostdiek/DarkMachines-UnsupervisedChallenge.
△ Less
Submitted 9 December, 2021; v1 submitted 28 May, 2021;
originally announced May 2021.
-
A comparison of optimisation algorithms for high-dimensional particle and astrophysics applications
Authors:
The DarkMachines High Dimensional Sampling Group,
Csaba Balázs,
Melissa van Beekveld,
Sascha Caron,
Barry M. Dillon,
Ben Farmer,
Andrew Fowlie,
Eduardo C. Garrido-Merchán,
Will Handley,
Luc Hendriks,
Guðlaugur Jóhannesson,
Adam Leinweber,
Judita Mamužić,
Gregory D. Martinez,
Sydney Otten,
Pat Scott,
Roberto Ruiz de Austri,
Zachary Searle,
Bob Stienen,
Joaquin Vanschoren,
Martin White
Abstract:
Optimisation problems are ubiquitous in particle and astrophysics, and involve locating the optimum of a complicated function of many parameters that may be computationally expensive to evaluate. We describe a number of global optimisation algorithms that are not yet widely used in particle astrophysics, benchmark them against random sampling and existing techniques, and perform a detailed compari…
▽ More
Optimisation problems are ubiquitous in particle and astrophysics, and involve locating the optimum of a complicated function of many parameters that may be computationally expensive to evaluate. We describe a number of global optimisation algorithms that are not yet widely used in particle astrophysics, benchmark them against random sampling and existing techniques, and perform a detailed comparison of their performance on a range of test functions. These include four analytic test functions of varying dimensionality, and a realistic example derived from a recent global fit of weak-scale supersymmetry. Although the best algorithm to use depends on the function being investigated, we are able to present general conclusions about the relative merits of random sampling, Differential Evolution, Particle Swarm Optimisation, the Covariance Matrix Adaptation Evolution Strategy, Bayesian Optimisation, Grey Wolf Optimisation, and the PyGMO Artificial Bee Colony, Gaussian Particle Filter and Adaptive Memory Programming for Global Optimisation algorithms.
△ Less
Submitted 1 April, 2021; v1 submitted 12 January, 2021;
originally announced January 2021.
-
Simple and statistically sound recommendations for analysing physical theories
Authors:
Shehu S. AbdusSalam,
Fruzsina J. Agocs,
Benjamin C. Allanach,
Peter Athron,
Csaba Balázs,
Emanuele Bagnaschi,
Philip Bechtle,
Oliver Buchmueller,
Ankit Beniwal,
Jihyun Bhom,
Sanjay Bloor,
Torsten Bringmann,
Andy Buckley,
Anja Butter,
José Eliel Camargo-Molina,
Marcin Chrzaszcz,
Jan Conrad,
Jonathan M. Cornell,
Matthias Danninger,
Jorge de Blas,
Albert De Roeck,
Klaus Desch,
Matthew Dolan,
Herbert Dreiner,
Otto Eberhardt
, et al. (50 additional authors not shown)
Abstract:
Physical theories that depend on many parameters or are tested against data from many different experiments pose unique challenges to statistical inference. Many models in particle physics, astrophysics and cosmology fall into one or both of these categories. These issues are often sidestepped with statistically unsound ad hoc methods, involving intersection of parameter intervals estimated by mul…
▽ More
Physical theories that depend on many parameters or are tested against data from many different experiments pose unique challenges to statistical inference. Many models in particle physics, astrophysics and cosmology fall into one or both of these categories. These issues are often sidestepped with statistically unsound ad hoc methods, involving intersection of parameter intervals estimated by multiple experiments, and random or grid sampling of model parameters. Whilst these methods are easy to apply, they exhibit pathologies even in low-dimensional parameter spaces, and quickly become problematic to use and interpret in higher dimensions. In this article we give clear guidance for going beyond these procedures, suggesting where possible simple methods for performing statistically sound inference, and recommendations of readily-available software tools and standards that can assist in doing so. Our aim is to provide any physicists lacking comprehensive statistical training with recommendations for reaching correct scientific conclusions, with only a modest increase in analysis burden. Our examples can be reproduced with the code publicly available at https://doi.org/10.5281/zenodo.4322283.
△ Less
Submitted 11 April, 2022; v1 submitted 17 December, 2020;
originally announced December 2020.
-
Event Generation and Statistical Sampling for Physics with Deep Generative Models and a Density Information Buffer
Authors:
Sydney Otten,
Sascha Caron,
Wieske de Swart,
Melissa van Beekveld,
Luc Hendriks,
Caspar van Leeuwen,
Damian Podareanu,
Roberto Ruiz de Austri,
Rob Verheyen
Abstract:
We present a study for the generation of events from a physical process with deep generative models. The simulation of physical processes requires not only the production of physical events, but also to ensure these events occur with the correct frequencies. We investigate the feasibility of learning the event generation and the frequency of occurrence with Generative Adversarial Networks (GANs) a…
▽ More
We present a study for the generation of events from a physical process with deep generative models. The simulation of physical processes requires not only the production of physical events, but also to ensure these events occur with the correct frequencies. We investigate the feasibility of learning the event generation and the frequency of occurrence with Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) to produce events like Monte Carlo generators. We study three processes: a simple two-body decay, the processes $e^+e^-\to Z \to l^+l^-$ and $p p \to t\bar{t} $ including the decay of the top quarks and a simulation of the detector response. We find that the tested GAN architectures and the standard VAE are not able to learn the distributions precisely. By buffering density information of encoded Monte Carlo events given the encoder of a VAE we are able to construct a prior for the sampling of new events from the decoder that yields distributions that are in very good agreement with real Monte Carlo events and are generated several orders of magnitude faster. Applications of this work include generic density estimation and sampling, targeted event generation via a principal component analysis of encoded ground truth data, anomaly detection and more efficient importance sampling, e.g. for the phase space integration of matrix elements in quantum field theories.
△ Less
Submitted 25 February, 2021; v1 submitted 3 January, 2019;
originally announced January 2019.
-
Challenges of Profile Likelihood Evaluation in Multi-Dimensional SUSY Scans
Authors:
F. Feroz,
K. Cranmer,
M. Hobson,
R. Ruiz de Austri,
R. Trotta
Abstract:
Statistical inference of the fundamental parameters of supersymmetric theories is a challenging and active endeavor. Several sophisticated algorithms have been employed to this end. While Markov-Chain Monte Carlo (MCMC) and nested sampling techniques are geared towards Bayesian inference, they have also been used to estimate frequentist confidence intervals based on the profile likelihood ratio. W…
▽ More
Statistical inference of the fundamental parameters of supersymmetric theories is a challenging and active endeavor. Several sophisticated algorithms have been employed to this end. While Markov-Chain Monte Carlo (MCMC) and nested sampling techniques are geared towards Bayesian inference, they have also been used to estimate frequentist confidence intervals based on the profile likelihood ratio. We investigate the performance and appropriate configuration of MultiNest, a nested sampling based algorithm, when used for profile likelihood-based analyses both on toy models and on the parameter space of the Constrained MSSM. We find that while the standard configuration is appropriate for an accurate reconstruction of the Bayesian posterior, the profile likelihood is poorly approximated. We identify a more appropriate MultiNest configuration for profile likelihood analyses, which gives an excellent exploration of the profile likelihood (albeit at a larger computational cost), including the identification of the global maximum likelihood value. We conclude that with the appropriate configuration MultiNest is a suitable tool for profile likelihood studies, indicating previous claims to the contrary are not well founded.
△ Less
Submitted 25 May, 2011; v1 submitted 17 January, 2011;
originally announced January 2011.
-
A Coverage Study of the CMSSM Based on ATLAS Sensitivity Using Fast Neural Networks Techniques
Authors:
M. Bridges,
K. Cranmer,
F. Feroz,
M. Hobson,
R. Ruiz de Austri,
R. Trotta
Abstract:
We assess the coverage properties of confidence and credible intervals on the CMSSM parameter space inferred from a Bayesian posterior and the profile likelihood based on an ATLAS sensitivity study. In order to make those calculations feasible, we introduce a new method based on neural networks to approximate the map** between CMSSM parameters and weak-scale particle masses. Our method reduces t…
▽ More
We assess the coverage properties of confidence and credible intervals on the CMSSM parameter space inferred from a Bayesian posterior and the profile likelihood based on an ATLAS sensitivity study. In order to make those calculations feasible, we introduce a new method based on neural networks to approximate the map** between CMSSM parameters and weak-scale particle masses. Our method reduces the computational effort needed to sample the CMSSM parameter space by a factor of ~ 10^4 with respect to conventional techniques. We find that both the Bayesian posterior and the profile likelihood intervals can significantly over-cover and identify the origin of this effect to physical boundaries in the parameter space. Finally, we point out that the effects intrinsic to the statistical procedure are conflated with simplifications to the likelihood functions from the experiments themselves.
△ Less
Submitted 28 February, 2011; v1 submitted 18 November, 2010;
originally announced November 2010.