-
Quantum Vision Transformers for Quark-Gluon Classification
Authors:
Marçal Comajoan Cara,
Gopal Ramesh Dahale,
Zhongtian Dong,
Roy T. Forestano,
Sergei Gleyzer,
Daniel Justice,
Kyoungchul Kong,
Tom Magorsch,
Konstantin T. Matchev,
Katia Matcheva,
Eyup B. Unlu
Abstract:
We introduce a hybrid quantum-classical vision transformer architecture, notable for its integration of variational quantum circuits within both the attention mechanism and the multi-layer perceptrons. The research addresses the critical challenge of computational efficiency and resource constraints in analyzing data from the upcoming High Luminosity Large Hadron Collider, presenting the architecture as a potential solution. In particular, we evaluate our method by applying the model to multi-detector jet images from CMS Open Data. The goal is to distinguish quark-initiated from gluon-initiated jets. We successfully train the quantum model and evaluate it via numerical simulations. Using this approach, we achieve classification performance almost on par with the one obtained with the completely classical architecture, considering a similar number of parameters.
Submitted 16 May, 2024;
originally announced May 2024.
-
$M_{TN}$ is all you need: production of multiple semi-invisible resonances at hadron colliders
Authors:
Zhongtian Dong,
Kyoungchul Kong,
Konstantin T. Matchev,
Katia Matcheva
Abstract:
The stransverse mass variable $M_{T2}$ was originally proposed for the study of hadron collider events in which $N=2$ parent particles are produced and then decay semi-invisibly. Here we consider the generalization to the case of $N\ge 3$ semi-invisibly decaying parent particles. We introduce the corresponding class of kinematic variables $M_{TN}$ and illustrate their mathematical properties. Many of the celebrated features of the $M_{T2}$ kinematic endpoint are retained in this more general case, including the ability to measure the mass of the invisible daughter particle from the stransverse mass kink. We describe and validate a numerical procedure for computing $M_{TN}$ in practice. We also identify the configurations of visible momenta which result in nontrivial ($M_{TN}\ne 0$) values, and derive a pure phase-space estimate for the fraction of such events for any $N$.
Submitted 6 February, 2024;
originally announced February 2024.
-
Hybrid Quantum Vision Transformers for Event Classification in High Energy Physics
Authors:
Eyup B. Unlu,
Marçal Comajoan Cara,
Gopal Ramesh Dahale,
Zhongtian Dong,
Roy T. Forestano,
Sergei Gleyzer,
Daniel Justice,
Kyoungchul Kong,
Tom Magorsch,
Konstantin T. Matchev,
Katia Matcheva
Abstract:
Models based on vision transformer architectures are considered state-of-the-art when it comes to image classification tasks. However, they require extensive computational resources both for training and deployment. The problem is exacerbated as the amount and complexity of the data increases. Quantum-based vision transformer models could potentially alleviate this issue by reducing the training and operating time while maintaining the same predictive power. Although current quantum computers are not yet able to perform high-dimensional tasks, they do offer one of the most efficient solutions for the future. In this work, we construct several variations of a quantum hybrid vision transformer for a classification problem in high energy physics (distinguishing photons and electrons in the electromagnetic calorimeter). We test them against classical vision transformer architectures. Our findings indicate that the hybrid models can achieve comparable performance to their classical analogues with a similar number of parameters.
Submitted 19 March, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Exploring the Truth and Beauty of Theory Landscapes with Machine Learning
Authors:
Konstantin T. Matchev,
Katia Matcheva,
Pierre Ramond,
Sarunas Verner
Abstract:
Theoretical physicists describe nature by i) building a theory model and ii) determining the model parameters. The latter step involves the dual aspect of both fitting to the existing experimental data and satisfying abstract criteria like beauty, naturalness, etc. We use the Yukawa quark sector as a toy example to demonstrate how both of those tasks can be accomplished with machine learning techniques. We propose loss functions whose minimization results in true models that are also beautiful as measured by three different criteria - uniformity, sparsity, or symmetry.
Submitted 21 January, 2024;
originally announced January 2024.
-
$\mathbb{Z}_2\times \mathbb{Z}_2$ Equivariant Quantum Neural Networks: Benchmarking against Classical Neural Networks
Authors:
Zhongtian Dong,
Marçal Comajoan Cara,
Gopal Ramesh Dahale,
Roy T. Forestano,
Sergei Gleyzer,
Daniel Justice,
Kyoungchul Kong,
Tom Magorsch,
Konstantin T. Matchev,
Katia Matcheva,
Eyup B. Unlu
Abstract:
This paper presents a comprehensive comparative analysis of the performance of Equivariant Quantum Neural Networks (EQNN) and Quantum Neural Networks (QNN), juxtaposed against their classical counterparts: Equivariant Neural Networks (ENN) and Deep Neural Networks (DNN). We evaluate the performance of each network with two toy examples for a binary classification task, focusing on model complexity (measured by the number of parameters) and the size of the training data set. Our results show that the $\mathbb{Z}_2\times \mathbb{Z}_2$ EQNN and the QNN provide superior performance for smaller parameter sets and modest training data samples.
Submitted 20 March, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
A Comparison Between Invariant and Equivariant Classical and Quantum Graph Neural Networks
Authors:
Roy T. Forestano,
Marçal Comajoan Cara,
Gopal Ramesh Dahale,
Zhongtian Dong,
Sergei Gleyzer,
Daniel Justice,
Kyoungchul Kong,
Tom Magorsch,
Konstantin T. Matchev,
Katia Matcheva,
Eyup B. Unlu
Abstract:
Machine learning algorithms are heavily relied on to understand the vast amounts of data from high-energy particle collisions at the CERN Large Hadron Collider (LHC). The data from such collision events can naturally be represented with graph structures. Therefore, deep geometric methods, such as graph neural networks (GNNs), have been leveraged for various data analysis tasks in high-energy physics. One typical task is jet tagging, where jets are viewed as point clouds with distinct features and edge connections between their constituent particles. The increasing size and complexity of the LHC particle datasets, as well as the computational models used for their analysis, greatly motivate the development of alternative fast and efficient computational paradigms such as quantum computation. In addition, to enhance the validity and robustness of deep networks, one can leverage the fundamental symmetries present in the data through the use of invariant inputs and equivariant layers. In this paper, we perform a fair and comprehensive comparison between classical graph neural networks (GNNs) and equivariant graph neural networks (EGNNs) and their quantum counterparts: quantum graph neural networks (QGNNs) and equivariant quantum graph neural networks (EQGNN). The four architectures were benchmarked on a binary classification task to classify the parton-level particle initiating the jet. Based on their AUC scores, the quantum networks were shown to outperform the classical networks. However, seeing the computational advantage of the quantum networks in practice may have to wait for the further development of quantum technology and its associated APIs.
Submitted 21 May, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
Seeking Truth and Beauty in Flavor Physics with Machine Learning
Authors:
Konstantin T. Matchev,
Katia Matcheva,
Pierre Ramond,
Sarunas Verner
Abstract:
The discovery process of building new theoretical physics models involves the dual aspect of both fitting to the existing experimental data and satisfying abstract theorists' criteria like beauty, naturalness, etc. We design loss functions for performing both of those tasks with machine learning techniques. We use the Yukawa quark sector as a toy example to demonstrate that the optimization of these loss functions results in true and beautiful models.
Submitted 31 October, 2023;
originally announced November 2023.
-
Reproducing Bayesian Posterior Distributions for Exoplanet Atmospheric Parameter Retrievals with a Machine Learning Surrogate Model
Authors:
Eyup B. Unlu,
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva
Abstract:
We describe a machine-learning-based surrogate model for reproducing the Bayesian posterior distributions for exoplanet atmospheric parameters derived from transmission spectra of transiting planets with typical retrieval software such as TauRex. The model is trained on ground truth distributions for seven parameters: the planet radius, the atmospheric temperature, and the mixing ratios for five common absorbers: $H_2O$, $CH_4$, $NH_3$, $CO$ and $CO_2$. The model performance is enhanced by domain-inspired preprocessing of the features and the use of semi-supervised learning in order to leverage the large amount of unlabelled training data available. The model was among the winning solutions in the 2023 Ariel Machine Learning Data Challenge.
Submitted 16 October, 2023;
originally announced October 2023.
-
Variance Reduction via Simultaneous Importance Sampling and Control Variates Techniques Using Vegas
Authors:
Prasanth Shyamsundar,
Jacob L. Scott,
Stephen Mrenna,
Konstantin T. Matchev,
Kyoungchul Kong
Abstract:
Monte Carlo (MC) integration is an important calculational technique in the physical sciences. Practical considerations require that the calculations are performed as accurately as possible for a given set of computational resources. To improve the accuracy of MC integration, a number of useful variance reduction algorithms have been developed, including importance sampling and control variates. In this work, we demonstrate how these two methods can be applied simultaneously, thus combining their benefits. We provide a python wrapper, named CoVVVR, which implements our approach in the Vegas program. The improvements are quantified with several benchmark examples from the literature.
Submitted 24 January, 2024; v1 submitted 19 September, 2023;
originally announced September 2023.
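As a rough illustration of the idea in the abstract above, the following minimal sketch combines importance sampling with a single control variate on a one-dimensional integral. It is not the CoVVVR wrapper and does not use Vegas: the integrand `f`, the sampling density `p`, the control variate `g`, and all function names are assumptions chosen only for this example.

```python
import math
import random

# Hypothetical toy problem: integrate f(x) = exp(x) over [0, 1] (exact value e - 1).
def f(x):
    return math.exp(x)

# Assumed importance-sampling density p(x) = (1 + x) / 1.5 on [0, 1].
def p(x):
    return (1.0 + x) / 1.5

def sample_p(rng):
    # Inverse-CDF sampling: solve (x + x^2 / 2) / 1.5 = u for x in [0, 1].
    return -1.0 + math.sqrt(1.0 + 3.0 * rng.random())

# Assumed control variate g(x) = x^2, whose integral over [0, 1] is known exactly.
def g(x):
    return x * x

G_EXACT = 1.0 / 3.0

def variance(values):
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

def estimate(n, seed=0):
    """Return (estimate, variance without control variate, variance with it)."""
    rng = random.Random(seed)
    xs = [sample_p(rng) for _ in range(n)]
    wf = [f(x) / p(x) for x in xs]  # importance-weighted integrand
    wg = [g(x) / p(x) for x in xs]  # importance-weighted control variate
    mf = sum(wf) / n
    mg = sum(wg) / n
    cov = sum((a - mf) * (b - mg) for a, b in zip(wf, wg)) / (n - 1)
    c = cov / variance(wg)          # variance-minimizing coefficient (sample estimate)
    combined = [a - c * (b - G_EXACT) for a, b in zip(wf, wg)]
    return sum(combined) / n, variance(wf), variance(combined)

est, var_is, var_cv = estimate(20000)
print(est, var_is, var_cv)
```

The coefficient `c` is estimated from the same sample, which introduces a small bias of order 1/n; a more careful treatment would estimate it from an independent sample.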
-
Identifying the Group-Theoretic Structure of Machine-Learned Symmetries
Authors:
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman,
Eyup B. Unlu,
Sarunas Verner
Abstract:
Deep learning was recently successfully used in deriving symmetry transformations that preserve important physics quantities. Being completely agnostic, these techniques postpone the identification of the discovered symmetries to a later stage. In this letter we propose methods for examining and identifying the group-theoretic structure of such machine-learned symmetries. We design loss functions which probe the subalgebra structure either during the deep learning stage of symmetry discovery or in a subsequent post-processing stage. We illustrate the new methods with examples from the U(n) Lie group family, obtaining the respective subalgebra decompositions. As an application to particle physics, we demonstrate the identification of the residual symmetries after the spontaneous breaking of non-Abelian gauge symmetries like SU(3) and SU(5) which are commonly used in model building.
Submitted 14 September, 2023;
originally announced September 2023.
-
Searching for Novel Chemistry in Exoplanetary Atmospheres using Machine Learning for Anomaly Detection
Authors:
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva,
Eyup B. Unlu
Abstract:
The next generation of telescopes will yield a substantial increase in the availability of high-resolution spectroscopic data for thousands of exoplanets. The sheer volume of data and number of planets to be analyzed greatly motivate the development of new, fast and efficient methods for flagging interesting planets for reobservation and detailed analysis. We advocate the application of machine learning (ML) techniques for anomaly (novelty) detection to exoplanet transit spectra, with the goal of identifying planets with unusual chemical composition and even searching for unknown biosignatures. We successfully demonstrate the feasibility of two popular anomaly detection methods (Local Outlier Factor and One Class Support Vector Machine) on a large public database of synthetic spectra. We consider several test cases, each with different levels of instrumental noise. In each case, we use ROC curves to quantify and compare the performance of the two ML techniques.
Submitted 15 August, 2023;
originally announced August 2023.
-
Accelerated Discovery of Machine-Learned Symmetries: Deriving the Exceptional Lie Groups G2, F4 and E6
Authors:
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman,
Eyup B. Unlu,
Sarunas Verner
Abstract:
Recent work has applied supervised deep learning to derive continuous symmetry transformations that preserve the data labels and to obtain the corresponding algebras of symmetry generators. This letter introduces two improved algorithms that significantly speed up the discovery of these symmetry transformations. The new methods are demonstrated by deriving the complete set of generators for the unitary groups U(n) and the exceptional Lie groups $G_2$, $F_4$, and $E_6$. A third post-processing algorithm renders the found generators in sparse form. We benchmark the performance improvement of the new algorithms relative to the standard approach. Given the significant complexity of the exceptional Lie groups, our results demonstrate that this machine-learning method for discovering symmetries is completely general and can be applied to a wide variety of labeled datasets.
Submitted 10 July, 2023;
originally announced July 2023.
-
Discovering Sparse Representations of Lie Groups with Machine Learning
Authors:
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman,
Eyup B. Unlu,
Sarunas Verner
Abstract:
Recent work has used deep learning to derive symmetry transformations, which preserve conserved quantities, and to obtain the corresponding algebras of generators. In this letter, we extend this technique to derive sparse representations of arbitrary Lie algebras. We show that our method reproduces the canonical (sparse) representations of the generators of the Lorentz group, as well as the $U(n)$ and $SU(n)$ families of Lie groups. This approach is completely general and can be used to find the infinitesimal generators for any Lie group.
Submitted 10 February, 2023;
originally announced February 2023.
-
Oracle-Preserving Latent Flows
Authors:
Alexander Roman,
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva,
Eyup B. Unlu
Abstract:
We develop a deep learning methodology for the simultaneous discovery of multiple nontrivial continuous symmetries across an entire labelled dataset. The symmetry transformations and the corresponding generators are modeled with fully connected neural networks trained with a specially constructed loss function ensuring the desired symmetry properties. The two new elements in this work are the use of a reduced-dimensionality latent space and the generalization to transformations invariant with respect to high-dimensional oracles. The method is demonstrated with several examples on the MNIST digit dataset.
Submitted 1 February, 2023;
originally announced February 2023.
-
Deep Learning Symmetries and Their Lie Groups, Algebras, and Subalgebras from First Principles
Authors:
Roy T. Forestano,
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman,
Eyup Unlu,
Sarunas Verner
Abstract:
We design a deep-learning algorithm for the discovery and identification of the continuous group of symmetries present in a labeled dataset. We use fully connected neural networks to model the symmetry transformations and the corresponding generators. We construct loss functions that ensure that the applied transformations are symmetries and that the corresponding set of generators forms a closed (sub)algebra. Our procedure is validated with several examples illustrating different types of conserved quantities preserved by symmetry. In the process of deriving the full set of symmetries, we analyze the complete subgroup structure of the rotation groups $SO(2)$, $SO(3)$, and $SO(4)$, and of the Lorentz group $SO(1,3)$. Other examples include squeeze mapping, piecewise discontinuous labels, and $SO(10)$, demonstrating that our method is completely general, with many possible applications in physics and data science. Our study also opens the door for using a machine learning approach in the mathematical study of Lie groups and their properties.
Submitted 13 January, 2023;
originally announced January 2023.
-
Is the Machine Smarter than the Theorist: Deriving Formulas for Particle Kinematics with Symbolic Regression
Authors:
Zhongtian Dong,
Kyoungchul Kong,
Konstantin T. Matchev,
Katia Matcheva
Abstract:
We demonstrate the use of symbolic regression in deriving analytical formulas, which are needed at various stages of a typical experimental analysis in collider phenomenology. As a first application, we consider kinematic variables like the stransverse mass, $M_{T2}$, which are defined algorithmically through an optimization procedure and not in terms of an analytical formula. We then train a symbolic regression and obtain the correct analytical expressions for all known special cases of $M_{T2}$ in the literature. As a second application, we reproduce the correct analytical expression for a next-to-leading order (NLO) kinematic distribution from data, which is simulated with a NLO event generator. Finally, we derive analytical approximations for the NLO kinematic distributions after detector simulation, for which no known analytical formulas currently exist.
Submitted 15 November, 2022;
originally announced November 2022.
-
New Machine Learning Techniques for Simulation-Based Inference: InferoStatic Nets, Kernel Score Estimation, and Kernel Likelihood Ratio Estimation
Authors:
Kyoungchul Kong,
Konstantin T. Matchev,
Stephen Mrenna,
Prasanth Shyamsundar
Abstract:
We propose an intuitive, machine-learning approach to multiparameter inference, dubbed the InferoStatic Networks (ISN) method, to model the score and likelihood ratio estimators in cases when the probability density can be sampled but not computed directly. The ISN uses a backend neural network that models a scalar function called the inferostatic potential $\varphi$. In addition, we introduce new strategies, respectively called Kernel Score Estimation (KSE) and Kernel Likelihood Ratio Estimation (KLRE), to learn the score and the likelihood ratio functions from simulated data. We illustrate the new techniques with some toy examples and compare to existing approaches in the literature. We mention en passant some new loss functions that optimally incorporate latent information from simulations into the training procedure.
Submitted 3 February, 2023; v1 submitted 4 October, 2022;
originally announced October 2022.
-
Signatures and Detection Prospects for sub-GeV Dark Matter with Superfluid Helium
Authors:
Yining You,
Jordan Smolinsky,
Wei Xue,
Konstantin T. Matchev,
Keegan Gunther,
Yoonseok Lee,
Tarek Saab
Abstract:
We explore the possibility of using superfluid helium for direct detection of sub-GeV dark matter (DM). We discuss the relevant phenomenology resulting from the scattering of an incident dark matter particle on a Helium nucleus. Rather than directly exciting quasi-particles, DM in this mass range will interact with a single He atom, triggering an atomic cascade which eventually also includes emission and thermalization of quasi-particles. We present in detail the analytical framework needed for modeling these processes and determining the resulting flux of quasi-particles. We propose a novel method for detecting this flux with modern force-sensitive devices, such as nanoelectro-mechanical system (NEMS) oscillators, and derive the sensitivity projections for a generic sub-GeV DM detection experiment using such sensors.
Submitted 30 August, 2022;
originally announced August 2022.
-
Kinematic Variables and Feature Engineering for Particle Phenomenology
Authors:
Roberto Franceschini,
Doojin Kim,
Kyoungchul Kong,
Konstantin T. Matchev,
Myeonghun Park,
Prasanth Shyamsundar
Abstract:
Kinematic variables have been playing an important role in collider phenomenology, as they expedite discoveries of new particles by separating signal events from unwanted background events and allow for measurements of particle properties such as masses, couplings, spins, etc. For the past 10 years, an enormous number of kinematic variables have been designed and proposed, primarily for the experiments at the Large Hadron Collider, allowing for a drastic reduction of high-dimensional experimental data to lower-dimensional observables, from which one can readily extract underlying features of phase space and develop better-optimized data-analysis strategies. We review these recent developments in the area of phase space kinematics, summarizing the new kinematic variables with important phenomenological implications and physics applications. We also review recently proposed analysis methods and techniques specifically designed to leverage the new kinematic variables. As machine learning is nowadays percolating through many fields of particle physics including collider phenomenology, we discuss the interconnection and mutual complementarity of kinematic variables and machine learning techniques. We finally discuss how the utilization of kinematic variables originally developed for colliders can be extended to other high-energy physics experiments including neutrino experiments.
Submitted 27 June, 2022;
originally announced June 2022.
-
Transverse Vector Decomposition Method for Analytical Inversion of Exoplanet Transit Spectra
Authors:
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman
Abstract:
We develop a new method for analytical inversion of binned exoplanet transit spectra and for retrieval of planet parameters. The method has a geometrical interpretation and treats each observed spectrum as a single vector $\vec r$ in the multidimensional spectral space of observed bin values. We decompose the observed $\vec{r}$ into a wavelength-independent component $\vec{r}_\parallel$ corresponding to the spectral mean across all observed bins, and a transverse component $\vec{r}_\perp$ which is wavelength-dependent and contains the relevant information about the atmospheric chemistry. The method allows us to extract, without any prior assumptions or additional information, the relative mass (or volume) mixing ratios of the absorbers in the atmosphere, the scale height to stellar radius ratio, $H/R_S$, and the atmospheric temperature. The method is illustrated and validated with several examples of increasing complexity.
Submitted 11 March, 2022;
originally announced March 2022.
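The decomposition described in the abstract above can be sketched in a few lines. This is only an illustration of the geometric split into a spectral-mean component and a transverse remainder, not the authors' retrieval code; the function name `decompose_spectrum` and the bin values are made up for the example.

```python
def decompose_spectrum(r):
    """Split a binned spectrum r into (r_parallel, r_perp).

    r_parallel carries the spectral mean in every bin (wavelength-independent);
    r_perp is the remainder and sums to zero across the bins.
    """
    mean = sum(r) / len(r)
    r_parallel = [mean] * len(r)
    r_perp = [x - mean for x in r]
    return r_parallel, r_perp

# Hypothetical bin values, used only to exercise the function.
spectrum = [1.0, 2.0, 3.0, 6.0]
r_par, r_perp = decompose_spectrum(spectrum)

# The two components are orthogonal in the space of bin values.
dot = sum(a * b for a, b in zip(r_par, r_perp))
print(r_par, r_perp, dot)
```

Because the transverse component sums to zero, it is orthogonal to the uniform direction in which the mean component points.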
-
Unsupervised Machine Learning for Exploratory Data Analysis of Exoplanet Transmission Spectra
Authors:
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman
Abstract:
Transit spectroscopy is a powerful tool to decode the chemical composition of the atmospheres of extrasolar planets. In this paper we focus on unsupervised techniques for analyzing spectral data from transiting exoplanets. We demonstrate methods for i) cleaning and validating the data, ii) initial exploratory data analysis based on summary statistics (estimates of location and variability), iii) exploring and quantifying the existing correlations in the data, iv) pre-processing and linearly transforming the data to its principal components, v) dimensionality reduction and manifold learning, vi) clustering and anomaly detection, vii) visualization and interpretation of the data. To illustrate the proposed unsupervised methodology, we use a well-known public benchmark data set of synthetic transit spectra. We show that there is a high degree of correlation in the spectral data, which calls for appropriate low-dimensional representations. We explore a number of different techniques for such dimensionality reduction and identify several suitable options in terms of summary statistics, principal components, etc. We uncover interesting structures in the principal component basis, namely, well-defined branches corresponding to different chemical regimes of the underlying atmospheres. We demonstrate that those branches can be successfully recovered with a K-means clustering algorithm in fully unsupervised fashion. We advocate for a three-dimensional representation of the spectroscopic data in terms of the first three principal components, in order to reveal the existing structure in the data and quickly characterize the chemical class of a planet.
Submitted 7 January, 2022;
originally announced January 2022.
-
Analytical Modelling of Exoplanet Transit Spectroscopy with Dimensional Analysis and Symbolic Regression
Authors:
Konstantin T. Matchev,
Katia Matcheva,
Alexander Roman
Abstract:
The physical characteristics and atmospheric chemical composition of newly discovered exoplanets are often inferred from their transit spectra which are obtained from complex numerical models of radiative transfer. Alternatively, simple analytical expressions provide insightful physical intuition into the relevant atmospheric processes. The deep learning revolution has opened the door for deriving such analytical results directly with a computer algorithm fitting to the data. As a proof of concept, we successfully demonstrate the use of symbolic regression on synthetic data for the transit radii of generic hot Jupiter exoplanets to derive a corresponding analytical formula. As a preprocessing step, we use dimensional analysis to identify the relevant dimensionless combinations of variables and reduce the number of independent inputs, which improves the performance of the symbolic regression. The dimensional analysis also allowed us to mathematically derive and properly parametrize the most general family of degeneracies among the input atmospheric parameters which affect the characterization of an exoplanet atmosphere through transit spectroscopy.
Submitted 21 December, 2021;
originally announced December 2021.
-
Superfluid Effective Field Theory for Dark Matter Direct Detection
Authors:
Konstantin T. Matchev,
Jordan Smolinsky,
Wei Xue,
Yining You
Abstract:
We develop an effective field theory (EFT) framework for superfluid ${}^4$He to model the interactions among quasiparticles, helium atoms and probe particles. Our effective field theory approach brings together symmetry arguments and power-counting and matches to classical fluid dynamics. We then present the decay and scattering rates for the relevant processes involving quasiparticles and helium atoms. The presented EFT framework and results can be used to understand the dynamics of thermalization in the superfluid, and can be further applied to sub-GeV dark matter direct detection with superfluid ${}^4$He.
Submitted 10 September, 2021; v1 submitted 16 August, 2021;
originally announced August 2021.
-
Deep-Learned Event Variables for Collider Phenomenology
Authors:
Doojin Kim,
Kyoungchul Kong,
Konstantin T. Matchev,
Myeonghun Park,
Prasanth Shyamsundar
Abstract:
The choice of optimal event variables is crucial for achieving the maximal sensitivity of experimental analyses. Over time, physicists have derived suitable kinematic variables for many typical event topologies in collider physics. Here we introduce a deep learning technique to design good event variables, which are sensitive over a wide range of values for the unknown model parameters. We demonstrate that the neural networks trained with our technique on some simple event topologies are able to reproduce standard event variables like invariant mass, transverse mass, and stransverse mass. The method is automatable, completely general, and can be used to derive sensitive, previously unknown, event variables for other, more complex event topologies.
Submitted 21 May, 2021;
originally announced May 2021.
-
InClass Nets: Independent Classifier Networks for Nonparametric Estimation of Conditional Independence Mixture Models and Unsupervised Classification
Authors:
Konstantin T. Matchev,
Prasanth Shyamsundar
Abstract:
We introduce a new machine-learning-based approach, which we call the Independent Classifier networks (InClass nets) technique, for the nonparametric estimation of conditional independence mixture models (CIMMs). We approach the estimation of a CIMM as a multi-class classification problem, since dividing the dataset into different categories naturally leads to the estimation of the mixture model. InClass nets consist of multiple independent classifier neural networks (NNs), each of which handles one of the variates of the CIMM. Fitting the CIMM to the data is performed by simultaneously training the individual NNs using suitable cost functions. The ability of NNs to approximate arbitrary functions makes our technique nonparametric. Further leveraging the power of NNs, we allow the conditionally independent variates of the model to be individually high-dimensional, which is the main advantage of our technique over existing non-machine-learning-based approaches. We derive some new results on the nonparametric identifiability of bivariate CIMMs, in the form of a necessary and a (different) sufficient condition for a bivariate CIMM to be identifiable. We provide a public implementation of InClass nets as a Python package called RainDancesVI and validate our InClass nets technique with several worked-out examples. Our method also has applications in unsupervised and semi-supervised classification problems.
Submitted 31 August, 2020;
originally announced September 2020.
-
OASIS: Optimal Analysis-Specific Importance Sampling for event generation
Authors:
Konstantin T. Matchev,
Prasanth Shyamsundar
Abstract:
We propose a technique called Optimal Analysis-Specific Importance Sampling (OASIS) to reduce the number of simulated events required for a high-energy experimental analysis to reach a target sensitivity. We provide recipes to obtain the optimal sampling distributions which preferentially focus the event generation on the regions of phase space with high utility to the experimental analyses. OASIS leads to a conservation of resources at all stages of the Monte Carlo pipeline, including full-detector simulation, and is complementary to approaches which seek to speed up the simulation pipeline.
Submitted 23 December, 2020; v1 submitted 30 June, 2020;
originally announced June 2020.
-
Finding Wombling Boundaries in LHC Data with Voronoi and Delaunay Tessellations
Authors:
Konstantin T. Matchev,
Alexander Roman,
Prasanth Shyamsundar
Abstract:
We address the problem of finding a wombling boundary in point data generated by a general Poisson point process, a specific example of which is an LHC event sample distributed in the phase space of a final state signature, with the wombling boundary created by some new physics. We discuss the use of Voronoi and Delaunay tessellations of the point data for estimating the local gradients and investigate methods for sharpening the boundaries by reducing the statistical noise. The outcome from traditional wombling algorithms is a set of boundary cell candidates with relatively large gradients, whose spatial properties must then be scrutinized in order to construct the boundary and evaluate its significance. Here we propose an alternative approach where we simultaneously form and evaluate the significance of all possible boundaries in terms of the total gradient flux. We illustrate our method with several toy examples of both straight and curved boundaries with varying amounts of signal present in the data.
Submitted 10 January, 2021; v1 submitted 8 June, 2020;
originally announced June 2020.
-
A quantum algorithm for model independent searches for new physics
Authors:
Konstantin T. Matchev,
Prasanth Shyamsundar,
Jordan Smolinsky
Abstract:
We propose a novel quantum technique to search for unmodelled anomalies in multi-dimensional binned collider data. We propose to associate an Ising lattice spin site with each bin, with the Ising Hamiltonian suitably constructed from the observed data and a corresponding theoretical expectation. In order to capture spatially correlated anomalies in the data, we introduce spin-spin interactions between neighboring sites, as well as self-interactions. The ground state energy of the resulting Ising Hamiltonian can be used as a new test statistic, which can be computed either classically or via adiabatic quantum optimization. We demonstrate that our test statistic outperforms some of the most commonly used goodness-of-fit tests. The new approach greatly reduces the look-elsewhere effect by exploiting the typical differences between statistical noise and genuine new physics signals.
Submitted 2 August, 2020; v1 submitted 4 March, 2020;
originally announced March 2020.
-
Uncertainties associated with GAN-generated datasets in high energy physics
Authors:
Konstantin T. Matchev,
Alexander Roman,
Prasanth Shyamsundar
Abstract:
Recently, Generative Adversarial Networks (GANs) trained on samples of traditionally simulated collider events have been proposed as a way of generating larger simulated datasets at a reduced computational cost. In this paper we point out that data generated by a GAN cannot statistically be better than the data it was trained on, and critically examine the applicability of GANs in various situations, including a) for replacing the entire Monte Carlo pipeline or parts of it, and b) to produce datasets for usage in highly sensitive analyses or sub-optimal ones. We present our arguments using information theoretic demonstrations, a toy example, as well as in the form of a formal statement, and identify some potential valid uses of GANs in collider simulations.
Submitted 8 February, 2022; v1 submitted 14 February, 2020;
originally announced February 2020.
-
Optimal event selection and categorization in high energy physics, Part 1: Signal discovery
Authors:
Konstantin T. Matchev,
Prasanth Shyamsundar
Abstract:
We provide a prescription to train optimal machine-learning-based event selectors and categorizers that maximize the statistical significance of a potential signal excess in high energy physics (HEP) experiments, as quantified by any of six different performance measures. For analyses where the signal search is performed in the distribution of some event variables, our prescription ensures that only the information complementary to those event variables is used in event selection and categorization. This eliminates a major misalignment with the physics goals of the analysis (maximizing the significance of an excess) that exists in the training of typical ML-based event selectors and categorizers. In addition, this decorrelation of event selectors from the relevant event variables prevents the background distribution from becoming peaked in the signal region as a result of event selection, thereby ameliorating the challenges imposed on signal searches by systematic uncertainties. Our event selectors (categorizers) use the output of machine-learning-based classifiers as input and apply optimal selection cutoffs (categorization thresholds) that depend on the event variables being analyzed, as opposed to flat cutoffs (thresholds). These optimal cutoffs and thresholds are learned iteratively, using a novel approach with connections to Lloyd's k-means clustering algorithm. We provide a public, Python 3 implementation of our prescription called ThickBrick, along with usage examples.
Submitted 26 November, 2019;
originally announced November 2019.
-
Singularity Variables for Missing Energy Event Kinematics
Authors:
Konstantin T. Matchev,
Prasanth Shyamsundar
Abstract:
We discuss singularity variables which are properly suited for analyzing the kinematics of events with missing transverse energy at the LHC. We consider six of the simplest event topologies encountered in studies of leptonic W-bosons and top quarks, as well as in SUSY-like searches for new physics with dark matter particles. In each case, we illustrate the general prescription for finding the relevant singularity variable, which in turn helps delineate the visible parameter subspace on which the singularities are located. Our results can be used in two different ways - first, as a guide for targeting the signal-rich regions of parameter space during the stage of discovery, and second, as a sensitive focus point method for measuring the particle mass spectrum after the initial discovery.
Submitted 8 April, 2020; v1 submitted 5 November, 2019;
originally announced November 2019.
-
Higgs boson potential at colliders: status and perspectives
Authors:
B. Di Micco,
M. Gouzevitch,
J. Mazzitelli,
C. Vernieri,
J. Alison,
K. Androsov,
J. Baglio,
E. Bagnaschi,
S. Banerjee,
P. Basler,
A. Bethani,
A. Betti,
M. Blanke,
A. Blondel,
L. Borgonovi,
E. Brost,
P. Bryant,
G. Buchalla,
T. J. Burch,
V. M. M. Cairo,
F. Campanario,
M. Carena,
A. Carvalho,
N. Chernyavskaya,
V. D'Amico
, et al. (82 additional authors not shown)
Abstract:
This document summarises the current theoretical and experimental status of the di-Higgs boson production searches, and of the direct and indirect constraints on the Higgs boson self-coupling, with the wish to serve as a useful guide for the next years. The document discusses the theoretical status, including state-of-the-art predictions for di-Higgs cross sections, developments on the effective field theory approach, and studies on specific new physics scenarios that can show up in the di-Higgs final state. The status of di-Higgs searches and the direct and indirect constraints on the Higgs self-coupling at the LHC are presented, with an overview of the relevant experimental techniques, and covering all the variety of relevant signatures. Finally, the capabilities of future colliders in determining the Higgs self-coupling are addressed, comparing the projected precision that can be obtained in such facilities. The work has started as the proceedings of the Di-Higgs workshop at Colliders, held at Fermilab from the 4th to the 9th of September 2018, but it went beyond the topics discussed at that workshop and included further developments.
Submitted 18 May, 2020; v1 submitted 30 September, 2019;
originally announced October 2019.
-
Kinematic Focus Point Method for Particle Mass Measurements in Missing Energy Events
Authors:
Doojin Kim,
Konstantin T. Matchev,
Prasanth Shyamsundar
Abstract:
We investigate the solvability of the event kinematics in missing energy events at hadron colliders, as a function of the particle mass ansatz. To be specific, we reconstruct the neutrino momenta in dilepton $t\bar{t}$-like events, without assuming any prior knowledge of the mass spectrum. We identify a class of events, which we call extreme events, with the property that the kinematic boundary of their allowed region in mass parameter space passes through the true mass point. We develop techniques for recognizing extreme events in the data and demonstrate that they are abundant in a realistic data sample, due to expected singularities in phase space. We propose a new method for mass measurement whereby we obtain the true values of the mass parameters as the focus point of the kinematic boundaries for all events in the data sample. Since the masses are determined from a relatively sharp peak structure (the density of kinematic boundary curves), the method avoids some of the systematic errors associated with other techniques. We show that this new approach is complementary to previously considered methods in the literature where one studies the solvability of the kinematic constraints throughout the mass parameter space. In particular, we identify a problematic direction in mass space of nearly 100% solvability, and then show that the focus point method is effective in lifting the degeneracy.
Submitted 12 September, 2019; v1 submitted 6 June, 2019;
originally announced June 2019.
-
Portraying Double Higgs at the Large Hadron Collider
Authors:
Jeong Han Kim,
Minho Kim,
Kyoungchul Kong,
Konstantin T. Matchev,
Myeonghun Park
Abstract:
We examine the discovery potential for double Higgs production at the high luminosity LHC in the final state with two $b$-tagged jets, two leptons and missing transverse momentum. Although this dilepton final state has been considered a difficult channel due to the large backgrounds, we argue that it is possible to obtain sizable signal significance, by adopting a deep learning framework making full use of the relevant kinematics along with the jet images from the Higgs decay. For the relevant number of signal events we obtain a substantial increase in signal sensitivity over existing analyses. We discuss relative improvements at each stage and the correlations among the different input variables for the neural network. The proposed method can be easily generalized to the semi-leptonic channel of double Higgs production, as well as to other processes with similar final states.
Submitted 17 September, 2019; v1 submitted 17 April, 2019;
originally announced April 2019.
-
Dreaming Awake: Disentangling the Underlying Physics in Case of a SUSY-like Discovery at the LHC
Authors:
Konstantin T. Matchev,
Filip Moortgat,
Luc Pape
Abstract:
The purpose of this review is to investigate what kind of physics can be extracted at the LHC, assuming a discovery is made in events with missing transverse momentum, as generically expected in supersymmetry (SUSY) with R-parity conservation. To set the scene, we first discuss the collider phenomenology of the six possible electroweakino benchmark scenarios, as they provide valuable insight into what one might be facing at the LHC. We review the existing methods for mass reconstruction from measured kinematic endpoints in the distributions of suitable variables, e.g., the invariant masses of various sets of visible decay products, as well as the $M_{T2}$ and the $M_2$ types of variables. We propose to extend the application of these methods to the various topologies of fully hadronic final states, possibly with hadronically reconstructed massive bosons (W, Z or h). We test the idea with a simplified simulation of events in the main electroweakino benchmark scenarios. We find that the fully hadronic events allow the complete determination of the relevant mass spectrum. For comparison, we also review the potential of the standard kinematic endpoint methods for final states involving leptons from the decays of (on-shell or off-shell) sleptons. We find that with 300 fb$^{-1}$, the statistics for the leptonic events is very marginal and they look less promising than the fully hadronic channels. This corresponds to a complete reversal of the usual paradigm, where leptonic events comprised the gold-plated SUSY channels. Finally, we put together all available information and summarize what level of understanding of the underlying physics can be achieved. We show that, as a by-product of the mass reconstruction, it is also possible to determine the production cross sections and decay branching ratios, which in turn enable us to pinpoint the underlying model.
Submitted 28 February, 2019;
originally announced February 2019.
-
Higgs Physics at the HL-LHC and HE-LHC
Authors:
M. Cepeda,
S. Gori,
P. Ilten,
M. Kado,
F. Riva,
R. Abdul Khalek,
A. Aboubrahim,
J. Alimena,
S. Alioli,
A. Alves,
C. Asawatangtrakuldee,
A. Azatov,
P. Azzi,
S. Bailey,
S. Banerjee,
E. L. Barberio,
D. Barducci,
G. Barone,
M. Bauer,
C. Bautista,
P. Bechtle,
K. Becker,
A. Benaglia,
M. Bengala,
N. Berger
, et al. (352 additional authors not shown)
Abstract:
The discovery of the Higgs boson in 2012, by the ATLAS and CMS experiments, was a success achieved with only a percent of the entire dataset foreseen for the LHC. It opened a landscape of possibilities in the study of Higgs boson properties, Electroweak Symmetry breaking and the Standard Model in general, as well as new avenues in probing new physics beyond the Standard Model. Six years after the discovery, with a conspicuously larger dataset collected during LHC Run 2 at a 13 TeV centre-of-mass energy, the theory and experimental particle physics communities have started a meticulous exploration of the potential for precision measurements of its properties. This includes studies of Higgs boson production and decay processes, the search for rare decays and production modes, high energy observables, and searches for an extended electroweak symmetry breaking sector. This report summarises the potential reach and opportunities in Higgs physics during the High Luminosity phase of the LHC, with an expected dataset of pp collisions at 14 TeV, corresponding to an integrated luminosity of 3 ab$^{-1}$. These studies are performed in light of the most recent analyses from LHC collaborations and the latest theoretical developments. The potential of an LHC upgrade, colliding protons at a centre-of-mass energy of 27 TeV and producing a dataset corresponding to an integrated luminosity of 15 ab$^{-1}$, is also discussed.
Submitted 19 March, 2019; v1 submitted 31 January, 2019;
originally announced February 2019.
-
Enhancing the discovery prospects for SUSY-like decays with a forgotten kinematic variable
Authors:
Dipsikha Debnath,
James S. Gainer,
Can Kilic,
Doojin Kim,
Konstantin T. Matchev,
Yuan-Pao Yang
Abstract:
The lack of a new physics signal thus far at the Large Hadron Collider motivates us to consider how to look for challenging final states, with large Standard Model backgrounds and subtle kinematic features, such as cascade decays with compressed spectra. Adopting a benchmark SUSY-like decay topology with a four-body final state proceeding through a sequence of two-body decays via intermediate resonances, we focus our attention on the kinematic variable $\Delta_{4}$ which previously has been used to parameterize the boundary of the allowed four-body phase space. We highlight the advantages of using $\Delta_{4}$ as a discovery variable, and present an analysis suggesting that the pairing of $\Delta_{4}$ with another invariant mass variable leads to a significant improvement over more conventional variable choices and techniques.
Submitted 23 May, 2019; v1 submitted 12 September, 2018;
originally announced September 2018.
-
Adding Pseudo-Observables to the Four-Lepton Experimentalist's Toolbox
Authors:
James S. Gainer,
Martín González-Alonso,
Admir Greljo,
Senad Isaković,
Gino Isidori,
Andrey Korytov,
Joseph Lykken,
David Marzocca,
Konstantin T. Matchev,
Predrag Milenović,
Guenakh Mitselmakher,
Stephen Mrenna,
Myeonghun Park,
Aurelijus Rinkevicius,
Nudzeim Selimović
Abstract:
The "golden" channel, in which the newly-discovered Higgs boson decays to four leptons by means of intermediate vector bosons, is important for determining the properties of the Higgs boson and for searching for subtle new physics effects. Different approaches exist for parametrizing the relevant Higgs couplings in this channel; here we relate the use of pseudo-observables to methods based on specifying the most general amplitude or Lagrangian terms for the $HVV$ interactions. We also provide projections for sensitivity in this channel in several novel scenarios, illustrating the use of pseudo-observables, and analyze the role of kinematic distributions and (ratios of) rates in such $H\to4\ell$ studies.
Submitted 9 October, 2018; v1 submitted 2 August, 2018;
originally announced August 2018.
-
Probing the Triple Higgs Self-Interaction at the Large Hadron Collider
Authors:
Jeong Han Kim,
Kyoungchul Kong,
Konstantin T. Matchev,
Myeonghun Park
Abstract:
We propose a novel kinematic method to expedite the discovery of the double Higgs ($hh$) production in the $\ell^+\ell^- b \bar{b} + E_T \hspace{-0.52cm} \big / ~$ final state. We make full use of recently developed kinematic variables, as well as the variables $\it Topness$ for the dominant background (top quark pair production) and $\it Higgsness$ for the signal. We obtain a significant increase in sensitivity compared to the previous analyses which used sophisticated algorithms like boosted decision trees or neural networks. The method can be easily generalized to resonant $hh$ production as well as other non-resonant channels.
Submitted 8 March, 2019; v1 submitted 30 July, 2018;
originally announced July 2018.
-
How to prove that the LHC did not discover dark matter
Authors:
Doojin Kim,
Konstantin T. Matchev
Abstract:
If the LHC is able to produce dark matter particles, they would appear at the end of cascade decay chains, manifesting themselves as missing transverse energy. However, such "dark matter candidates" may decay invisibly later on. We propose to test for this possibility by studying the effect of particle widths on the observable invariant mass distributions of the visible particles seen in the detector. We consider the simplest non-trivial case of a two-step two-body cascade decay and derive analytically the shapes of the invariant mass distributions, for generic values of the widths of the new particles. We demonstrate that the resulting distortion in the shape of the invariant mass distribution can be significant enough to measure the width of the dark matter "candidate", ruling it out as the source of the cosmological dark matter.
Submitted 20 December, 2017;
originally announced December 2017.
-
Measuring the mass, width, and couplings of semi-invisible resonances with the Matrix Element Method
Authors:
Amalia Betancur,
Dipsikha Debnath,
James S. Gainer,
Konstantin T. Matchev,
Prasanth Shyamsundar
Abstract:
We demonstrate the use of the Matrix Element Method (MEM) for the measurement of masses, widths, and couplings in the case of single or pair production of semi-invisibly decaying resonances. For definiteness, we consider the two-body decay of a generic resonance to a visible particle from the Standard Model (SM) and a massive invisible particle. It is well known that the mass difference can be extracted from the endpoint of a transverse kinematic variable like the transverse mass, $M_T$, or the Cambridge $M_{T2}$ variable, but measuring the overall mass scale is a very difficult problem. We show that the MEM can be used to obtain not only the absolute mass scale, but also the width of the resonance and the tensor structure of its couplings. Apart from new physics searches, our results can be readily applied to the case of SM $W$ boson production at the CERN Large Hadron Collider (LHC), where one can repeat the measurements of the $W$ properties in a general and model-independent framework.
Submitted 22 June, 2019; v1 submitted 25 August, 2017;
originally announced August 2017.
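The transverse mass $M_T$ mentioned in the abstract above can be illustrated with a minimal sketch. It uses the standard definition $M_T^2 = m_v^2 + m_\chi^2 + 2\,(E_T^v E_T^\chi - \vec{p}_T^{\,v} \cdot \vec{p}_T^{\,\chi})$; the function name, the argument layout, and the choice to pass the invisible mass as a trial input are assumptions made for illustration and are not taken from the paper.

```python
import math

def transverse_mass(pt_vis, m_vis, pt_inv, m_inv):
    """Transverse mass of a visible + invisible pair.

    pt_vis, pt_inv: (px, py) transverse momentum components.
    m_vis, m_inv:   masses; m_inv is a trial (hypothesized) value,
                    since the invisible particle is not measured.
    """
    # Transverse energies E_T = sqrt(m^2 + pT^2)
    et_vis = math.sqrt(m_vis**2 + pt_vis[0]**2 + pt_vis[1]**2)
    et_inv = math.sqrt(m_inv**2 + pt_inv[0]**2 + pt_inv[1]**2)
    dot = pt_vis[0] * pt_inv[0] + pt_vis[1] * pt_inv[1]
    mt2 = m_vis**2 + m_inv**2 + 2.0 * (et_vis * et_inv - dot)
    # Guard against tiny negative values from floating-point rounding
    return math.sqrt(max(mt2, 0.0))

# Massless lepton and neutrino, back-to-back in the transverse plane
mt_back_to_back = transverse_mass((40.0, 0.0), 0.0, (-40.0, 0.0), 0.0)
# Same momenta but collinear: the transverse mass vanishes
mt_collinear = transverse_mass((40.0, 0.0), 0.0, (40.0, 0.0), 0.0)
```

In an analysis the invisible transverse momentum would typically be identified with the measured missing transverse momentum. The upper endpoint of the $M_T$ distribution is what constrains the masses, and only up to the dependence on the trial invisible mass, which is why the overall mass scale is hard to measure.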
-
Resolving Combinatorial Ambiguities in Dilepton $t\bar t$ Event Topologies with Constrained $M_2$ Variables
Authors:
Dipsikha Debnath,
Doojin Kim,
Jeong Han Kim,
Kyoungchul Kong,
Konstantin T. Matchev
Abstract:
We advocate the use of on-shell constrained $M_2$ variables in order to mitigate the combinatorial problem in SUSY-like events with two invisible particles at the LHC. We show that in comparison to other approaches in the literature, the constrained $M_2$ variables provide superior ansatze for the unmeasured invisible momenta and therefore can be usefully applied to discriminate combinatorial ambiguities. We illustrate our procedure with the example of dilepton $t\bar{t}$ events. We critically review the existing methods based on the Cambridge $M_{T2}$ variable and MAOS-reconstruction of invisible momenta, and show that their algorithm can be simplified without loss of sensitivity, due to a perfect correlation between events with complex solutions for the invisible momenta and events exhibiting a kinematic endpoint violation. Then we demonstrate that the efficiency for selecting the correct partition is further improved by utilizing the $M_2$ variables instead. Finally, we also consider the general case when the underlying mass spectrum is unknown, and no kinematic endpoint information is available.
Submitted 15 June, 2017;
originally announced June 2017.
-
Testing Invisible Momentum Ansatze in Missing Energy Events at the LHC
Authors:
Doojin Kim,
Konstantin T. Matchev,
Filip Moortgat,
Luc Pape
Abstract:
We consider SUSY-like events with two decay chains, each terminating in an invisible particle, whose true energy and momentum are not measured in the detector. Nevertheless, a useful educated guess about the invisible momenta can still be obtained by optimizing a suitable invariant mass function. We review and contrast several proposals in the literature for such ansatze: four versions of the $M_{T2}$-assisted on-shell reconstruction (MAOS), as well as several variants of the on-shell constrained $M_2$ variables. We compare the performance of these methods with regard to the mass determination of a new particle resonance along the decay chain from the peak of the reconstructed invariant mass distribution. For concreteness, we consider the event topology of dilepton $t\bar{t}$ events and study each of the three possible subsystems, in both a $t\bar{t}$ and a SUSY example. We find that the $M_2$ variables generally provide sharper peaks and therefore better ansatze for the invisible momenta. We show that the performance can be further improved by preselecting events near the kinematic endpoint of the corresponding variable from which the momentum ansatz originates.
Submitted 20 March, 2017;
originally announced March 2017.
-
LHC Collider Phenomenology of Minimal Universal Extra Dimensions
Authors:
Jyotiranjan Beuria,
AseshKrishna Datta,
Dipsikha Debnath,
Konstantin T. Matchev
Abstract:
We discuss the collider phenomenology of the model of Minimal Universal Extra Dimensions (MUED) at the Large Hadron Collider (LHC). We derive analytical results for all relevant strong pair-production processes of two level 1 Kaluza-Klein partners and use them to validate and correct the existing MUED implementation in the Fortran version of the PYTHIA event generator. We also develop a new implementation of the model in the C++ version of PYTHIA. We use our implementations in conjunction with the CHECKMATE package to derive the LHC bounds on MUED from a large number of published experimental analyses from Run 1 at the LHC.
Submitted 1 February, 2017;
originally announced February 2017.
-
Detecting kinematic boundary surfaces in phase space: particle mass measurements in SUSY-like events
Authors:
Dipsikha Debnath,
James S. Gainer,
Can Kilic,
Doojin Kim,
Konstantin T. Matchev,
Yuan-Pao Yang
Abstract:
We critically examine the classic endpoint method for particle mass determination, focusing on difficult corners of parameter space, where some of the measurements are not independent, while others are adversely affected by the experimental resolution. In such scenarios, mass differences can be measured relatively well, but the overall mass scale remains poorly constrained. Using the example of the standard SUSY decay chain $\tilde q\to \tilde\chi^0_2\to \tilde \ell \to \tilde\chi^0_1$, we demonstrate that sensitivity to the remaining mass scale parameter can be recovered by measuring the two-dimensional kinematical boundary in the relevant three-dimensional phase space of invariant masses squared. We develop an algorithm for detecting this boundary, which uses the geometric properties of the Voronoi tessellation of the data, and in particular, the relative standard deviation (RSD) of the volumes of the neighbors for each Voronoi cell in the tessellation. We propose a new observable, $\bar\Sigma$, which is the average RSD per unit area, calculated over the hypothesized boundary. We show that the location of the $\bar\Sigma$ maximum correlates very well with the true values of the new particle masses. Our approach represents the natural extension of the one-dimensional kinematic endpoint method to the relevant three dimensions of invariant mass phase space.
Submitted 27 May, 2018; v1 submitted 14 November, 2016;
originally announced November 2016.
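The neighbor-volume RSD described in the abstract above can be illustrated with a deliberately simplified toy. The sketch below works in one dimension, where the Voronoi cell of a point is the interval between the midpoints to its two adjacent points, and each interior cell has exactly two neighbors. The paper works in three dimensions; the function names, the toy data, and the reduction to one dimension are all assumptions made for illustration.

```python
import statistics

def interior_cell_lengths(points):
    """Lengths of the 1D Voronoi cells of all interior points.

    The two outermost points have unbounded cells and are skipped.
    Returns the sorted points and a dict {index: cell length}.
    """
    xs = sorted(points)
    lengths = {i: (xs[i + 1] - xs[i - 1]) / 2.0 for i in range(1, len(xs) - 1)}
    return xs, lengths

def neighbor_rsd(points):
    """Relative standard deviation of the neighbor cell lengths, per point."""
    xs, lengths = interior_cell_lengths(points)
    rsd = {}
    for i in lengths:
        # Only keep cells whose two neighbors are both bounded
        if (i - 1) in lengths and (i + 1) in lengths:
            neighbors = [lengths[i - 1], lengths[i + 1]]
            mean = statistics.mean(neighbors)
            if mean > 0.0:
                rsd[xs[i]] = statistics.pstdev(neighbors) / mean
    return rsd

# Toy sample: dense for x <= 0.5, sparse above, i.e. a density edge at 0.5
dense = [0.01 * i for i in range(51)]
sparse = [0.5 + 0.1 * k for k in range(1, 6)]
rsd = neighbor_rsd(dense + sparse)
edge = max(rsd, key=rsd.get)  # point with the largest neighbor RSD
```

In this toy the RSD is essentially zero inside each uniform region and peaks at the point sitting on the density change, which is the property that makes it usable as an edge detector. A realistic version would need a proper multi-dimensional tessellation and a treatment of statistical fluctuations.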
-
Identifying Phase Space Boundaries with Voronoi Tessellations
Authors:
Dipsikha Debnath,
James S. Gainer,
Can Kilic,
Doojin Kim,
Konstantin T. Matchev,
Yuan-Pao Yang
Abstract:
Determining the masses of new physics particles appearing in decay chains is an important and longstanding problem in high energy phenomenology. Recently it has been shown that these mass measurements can be improved by utilizing the boundary of the allowed region in the fully differentiable phase space in its full dimensionality. Here we show that the practical challenge of identifying this boundary can be solved using techniques based on the geometric properties of the cells resulting from Voronoi tessellations of the relevant data. The robust detection of such phase space boundaries in the data could also be used to corroborate a new physics discovery based on a cut-and-count analysis.
Submitted 5 May, 2017; v1 submitted 8 June, 2016;
originally announced June 2016.
-
The 750 GeV Diphoton Excess May Not Imply a 750 GeV Resonance
Authors:
Won Sang Cho,
Doojin Kim,
Kyoungchul Kong,
Sung Hak Lim,
Konstantin T. Matchev,
Jong-Chul Park,
Myeonghun Park
Abstract:
We discuss non-standard interpretations of the 750 GeV diphoton excess recently reported by the ATLAS and CMS Collaborations which do not involve a new, relatively broad, resonance with a mass near 750 GeV. Instead, we consider the sequential cascade decay of a much heavier, possibly quite narrow, resonance into two photons along with one or more invisible particles. The resulting diphoton invariant mass signal is generically rather broad, as suggested by the data. We examine three specific event topologies (the antler, the sandwich, and the 2-step cascade decay) and show that they all can provide a good fit to the observed published data. In each case, we delineate the preferred mass parameter space selected by the best fit. In spite of the presence of invisible particles in the final state, the measured missing transverse energy is moderate, due to its anti-correlation with the diphoton invariant mass. We comment on the future prospects of discriminating with higher statistics between our scenarios, as well as from more conventional interpretations.
Submitted 30 March, 2016; v1 submitted 21 December, 2015;
originally announced December 2015.
-
Using sorted invariant mass variables to evade combinatorial ambiguities in cascade decays
Authors:
Doojin Kim,
Konstantin T. Matchev,
Myeonghun Park
Abstract:
The classic method for mass determination in a SUSY-like cascade decay chain relies on measurements of the kinematic endpoints in the invariant mass distributions of suitable collections of visible decay products. However, the procedure is complicated by combinatorial ambiguities: e.g., the visible final state particles may be indistinguishable (as in the case of QCD jets), or one may not know the exact order in which they are emitted along the decay chain. In order to avoid such combinatorial ambiguities, we propose to treat the final state particles fully democratically and consider the sorted set of the invariant masses of all possible partitions of the visible particles in the decay chain. In particular, for a decay to $N$ visible particles, one considers the sorted sets of all possible $n$-body invariant mass combinations ($2 \le n \le N$) and determines the kinematic endpoint $m_{(n,r)}^{\max}$ of the distribution of the $r$-th largest $n$-body invariant mass $m_{(n,r)}$ for each possible value of $n$ and $r$. For the classic example of a squark decay in supersymmetry, we provide analytical formulas for the interpretation of these endpoints in terms of the underlying physical masses. We point out that these measurements can be used to determine the structure of the decay topology, e.g., the number and position of intermediate on-shell resonances.
Submitted 7 December, 2015;
originally announced December 2015.
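The sorted invariant mass construction described in the abstract above can be sketched as follows. For each event, every $n$-body combination of the $N$ visible particles is formed, its invariant mass is computed, and the masses for each $n$ are ranked so that the $r$-th entry is the $r$-th largest. The four-vector layout `(E, px, py, pz)`, the function names, and the use of the sample maximum as a crude endpoint estimate are assumptions made for illustration; the paper's analytical endpoint formulas are not reproduced here.

```python
import math
from itertools import combinations

def invariant_mass(momenta):
    """Invariant mass of a set of four-vectors given as (E, px, py, pz)."""
    e = sum(p[0] for p in momenta)
    px = sum(p[1] for p in momenta)
    py = sum(p[2] for p in momenta)
    pz = sum(p[3] for p in momenta)
    m2 = e * e - px * px - py * py - pz * pz
    # Guard against tiny negative values from floating-point rounding
    return math.sqrt(max(m2, 0.0))

def sorted_invariant_masses(particles):
    """Map n -> list of all n-body invariant masses, largest first.

    Entry r-1 of the list for a given n is the r-th largest n-body mass.
    """
    result = {}
    for n in range(2, len(particles) + 1):
        masses = [invariant_mass(c) for c in combinations(particles, n)]
        result[n] = sorted(masses, reverse=True)
    return result

def endpoint_estimates(events):
    """Crude endpoint estimate: sample maximum of each ranked mass."""
    best = {}
    for particles in events:
        for n, masses in sorted_invariant_masses(particles).items():
            for r, m in enumerate(masses, start=1):
                best[(n, r)] = max(best.get((n, r), 0.0), m)
    return best

# One toy event with three massless visible particles
event = [(1.0, 1.0, 0.0, 0.0), (1.0, -1.0, 0.0, 0.0), (1.0, 0.0, 1.0, 0.0)]
ranked = sorted_invariant_masses(event)
```

Because the ranking never refers to particle labels or emission order, the resulting variables are insensitive to the combinatorial ambiguity by construction. A real endpoint measurement would fit the shape of each distribution near its edge instead of taking the sample maximum.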
-
Discovering New Physics with Voronoi Tessellations
Authors:
Dipsikha Debnath,
James S. Gainer,
Doojin Kim,
Konstantin T. Matchev
Abstract:
High energy experimental data can be viewed as a sampling of the relevant phase space. We point out that one can apply Voronoi tessellations in order to understand the underlying probability distributions in this phase space. Interesting features in the data can then be discovered by studying the properties of the ensemble of Voronoi cells. For illustration, we demonstrate the detection of kinematic "edges" in two dimensions, which may signal physics beyond the standard model. We motivate the algorithm with some analytical results derived for perfect lattices, and show that the method is further improved with the addition of a few Voronoi relaxation steps via Lloyd's method.
Submitted 9 November, 2015;
originally announced November 2015.
-
OPTIMASS: A Package for the Minimization of Kinematic Mass Functions with Constraints
Authors:
Won Sang Cho,
James S. Gainer,
Doojin Kim,
Sung Hak Lim,
Konstantin T. Matchev,
Filip Moortgat,
Luc Pape,
Myeonghun Park
Abstract:
Reconstructed mass variables, such as $M_2$, $M_{2C}$, $M_T^\star$, and $M_{T2}^W$, play an essential role in searches for new physics at hadron colliders. The calculation of these variables generally involves constrained minimization in a large parameter space, which is numerically challenging. We provide a C++ code, OPTIMASS, which interfaces with the MINUIT library to perform this constrained minimization using the Augmented Lagrangian Method. The code can be applied to arbitrarily general event topologies and thus allows the user to significantly extend the existing set of kinematic variables. We describe this code and its physics motivation, and demonstrate its use in the analysis of the fully leptonic decay of pair-produced top quarks using the $M_2$ variables.
Submitted 12 January, 2016; v1 submitted 3 August, 2015;
originally announced August 2015.