-
Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter
Authors:
M. Aamir,
B. Acar,
G. Adamov,
T. Adams,
C. Adloff,
S. Afanasiev,
C. Agrawal,
C. Agrawal,
A. Ahmad,
H. A. Ahmed,
S. Akbar,
N. Akchurin,
B. Akgul,
B. Akgun,
R. O. Akpinar,
E. Aktas,
A. AlKadhim,
V. Alexakhin,
J. Alimena,
J. Alison,
A. Alpana,
W. Alshehri,
P. Alvarez Dominguez,
M. Alyari,
C. Amendola
, et al. (550 additional authors not shown)
Abstract:
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr…
▽ More
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadronic section. The shower reconstruction method is based on graph neural networks and it makes use of a dynamic reduction network architecture. It is shown that the algorithm is able to capture and mitigate the main effects that normally hinder the reconstruction of hadronic showers using classical reconstruction methods, by compensating for fluctuations in the multiplicity, energy, and spatial distributions of the shower's constituents. The performance of the algorithm is evaluated using test beam data collected in 2018 prototype of the CMS HGCAL accompanied by a section of the CALICE AHCAL prototype. The capability of the method to mitigate the impact of energy leakage from the calorimeter is also demonstrated.
△ Less
Submitted 30 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Analysis Facilities White Paper
Authors:
D. Ciangottini,
A. Forti,
L. Heinrich,
N. Skidmore,
C. Alpigiani,
M. Aly,
D. Benjamin,
B. Bockelman,
L. Bryant,
J. Catmore,
M. D'Alfonso,
A. Delgado Peris,
C. Doglioni,
G. Duckeck,
P. Elmer,
J. Eschle,
M. Feickert,
J. Frost,
R. Gardner,
V. Garonne,
M. Giffels,
J. Gooding,
E. Gramstad,
L. Gray,
B. Hegner
, et al. (41 additional authors not shown)
Abstract:
This white paper presents the current status of the R&D for Analysis Facilities (AFs) and attempts to summarize the views on the future direction of these facilities. These views have been collected through the High Energy Physics (HEP) Software Foundation's (HSF) Analysis Facilities forum, established in March 2022, the Analysis Ecosystems II workshop, that took place in May 2022, and the WLCG/HS…
▽ More
This white paper presents the current status of the R&D for Analysis Facilities (AFs) and attempts to summarize the views on the future direction of these facilities. These views have been collected through the High Energy Physics (HEP) Software Foundation's (HSF) Analysis Facilities forum, established in March 2022, the Analysis Ecosystems II workshop, that took place in May 2022, and the WLCG/HSF pre-CHEP workshop, that took place in May 2023. The paper attempts to cover all the aspects of an analysis facility.
△ Less
Submitted 15 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Smartpixels: Towards on-sensor inference of charged particle track parameters and uncertainties
Authors:
Jennet Dickinson,
Rachel Kovach-Fuentes,
Lindsey Gray,
Morris Swartz,
Giuseppe Di Guglielmo,
Alice Bean,
Doug Berry,
Manuel Blanco Valentin,
Karri DiPetrillo,
Farah Fahim,
James Hirschauer,
Shruti R. Kulkarni,
Ron Lipton,
Petar Maksimovic,
Corrinne Mills,
Mark S. Neubauer,
Benjamin Parpillon,
Gauri Pradhan,
Chinar Syal,
Nhan Tran,
Dahai Wen,
Jieun Yoo,
Aaron Young
Abstract:
The combinatorics of track seeding has long been a computational bottleneck for triggering and offline computing in High Energy Physics (HEP), and remains so for the HL-LHC. Next-generation pixel sensors will be sufficiently fine-grained to determine angular information of the charged particle passing through from pixel-cluster properties. This detector technology immediately improves the situatio…
▽ More
The combinatorics of track seeding has long been a computational bottleneck for triggering and offline computing in High Energy Physics (HEP), and remains so for the HL-LHC. Next-generation pixel sensors will be sufficiently fine-grained to determine angular information of the charged particle passing through from pixel-cluster properties. This detector technology immediately improves the situation for offline tracking, but any major improvements in physics reach are unrealized since they are dominated by lowest-level hardware trigger acceptance. We will demonstrate track angle and hit position prediction, including errors, using a mixture density network within a single layer of silicon as well as the progress towards and status of implementing the neural network in hardware on both FPGAs and ASICs.
△ Less
Submitted 18 December, 2023;
originally announced December 2023.
-
Optimizing High Throughput Inference on Graph Neural Networks at Shared Computing Facilities with the NVIDIA Triton Inference Server
Authors:
Claire Savard,
Nicholas Manganelli,
Burt Holzman,
Lindsey Gray,
Alexx Perloff,
Kevin Pedro,
Kevin Stenson,
Keith Ulmer
Abstract:
With machine learning applications now spanning a variety of computational tasks, multi-user shared computing facilities are devoting a rapidly increasing proportion of their resources to such algorithms. Graph neural networks (GNNs), for example, have provided astounding improvements in extracting complex signatures from data and are now widely used in a variety of applications, such as particle…
▽ More
With machine learning applications now spanning a variety of computational tasks, multi-user shared computing facilities are devoting a rapidly increasing proportion of their resources to such algorithms. Graph neural networks (GNNs), for example, have provided astounding improvements in extracting complex signatures from data and are now widely used in a variety of applications, such as particle jet classification in high energy physics (HEP). However, GNNs also come with an enormous computational penalty that requires the use of GPUs to maintain reasonable throughput. At shared computing facilities, such as those used by physicists at Fermi National Accelerator Laboratory (Fermilab), methodical resource allocation and high throughput at the many-user scale are key to ensuring that resources are being used as efficiently as possible. These facilities, however, primarily provide CPU-only nodes, which proves detrimental to time-to-insight and computational throughput for workflows that include machine learning inference. In this work, we describe how a shared computing facility can use the NVIDIA Triton Inference Server to optimize its resource allocation and computing structure, recovering high throughput while scaling out to multiple users by massively parallelizing their machine learning inference. To demonstrate the effectiveness of this system in a realistic multi-user environment, we use the Fermilab Elastic Analysis Facility augmented with the Triton Inference Server to provide scalable and high throughput access to a HEP-specific GNN and report on the outcome.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
The U.S. CMS HL-LHC R&D Strategic Plan
Authors:
Oliver Gutsche,
Tulika Bose,
Margaret Votava,
David Mason,
Andrew Melo,
Mia Liu,
Dirk Hufnagel,
Lindsey Gray,
Mike Hildreth,
Burt Holzman,
Kevin Lannon,
Saba Sehrish,
David Sperka,
James Letts,
Lothar Bauerdick,
Kenneth Bloom
Abstract:
The HL-LHC run is anticipated to start at the end of this decade and will pose a significant challenge for the scale of the HEP software and computing infrastructure. The mission of the U.S. CMS Software & Computing Operations Program is to develop and operate the software and computing resources necessary to process CMS data expeditiously and to enable U.S. physicists to fully participate in the…
▽ More
The HL-LHC run is anticipated to start at the end of this decade and will pose a significant challenge for the scale of the HEP software and computing infrastructure. The mission of the U.S. CMS Software & Computing Operations Program is to develop and operate the software and computing resources necessary to process CMS data expeditiously and to enable U.S. physicists to fully participate in the physics of CMS. We have developed a strategic plan to prioritize R&D efforts to reach this goal for the HL-LHC. This plan includes four grand challenges: modernizing physics software and improving algorithms, building infrastructure for exabyte-scale datasets, transforming the scientific data analysis process and transitioning from R&D to operations. We are involved in a variety of R&D projects that fall within these grand challenges. In this talk, we will introduce our four grand challenges and outline the R&D program of the U.S. CMS Software & Computing Operations Program.
△ Less
Submitted 4 December, 2023; v1 submitted 1 December, 2023;
originally announced December 2023.
-
Smart pixel sensors: towards on-sensor filtering of pixel clusters with deep learning
Authors:
Jieun Yoo,
Jennet Dickinson,
Morris Swartz,
Giuseppe Di Guglielmo,
Alice Bean,
Douglas Berry,
Manuel Blanco Valentin,
Karri DiPetrillo,
Farah Fahim,
Lindsey Gray,
James Hirschauer,
Shruti R. Kulkarni,
Ron Lipton,
Petar Maksimovic,
Corrinne Mills,
Mark S. Neubauer,
Benjamin Parpillon,
Gauri Pradhan,
Chinar Syal,
Nhan Tran,
Dahai Wen,
Aaron Young
Abstract:
Highly granular pixel detectors allow for increasingly precise measurements of charged particle tracks. Next-generation detectors require that pixel sizes will be further reduced, leading to unprecedented data rates exceeding those foreseen at the High Luminosity Large Hadron Collider. Signal processing that handles data incoming at a rate of O(40MHz) and intelligently reduces the data within the…
▽ More
Highly granular pixel detectors allow for increasingly precise measurements of charged particle tracks. Next-generation detectors require that pixel sizes will be further reduced, leading to unprecedented data rates exceeding those foreseen at the High Luminosity Large Hadron Collider. Signal processing that handles data incoming at a rate of O(40MHz) and intelligently reduces the data within the pixelated region of the detector at rate will enhance physics performance at high luminosity and enable physics analyses that are not currently possible. Using the shape of charge clusters deposited in an array of small pixels, the physical properties of the traversing particle can be extracted with locally customized neural networks. In this first demonstration, we present a neural network that can be embedded into the on-sensor readout and filter out hits from low momentum tracks, reducing the detector's data volume by 54.4-75.4%. The network is designed and simulated as a custom readout integrated circuit with 28 nm CMOS technology and is expected to operate at less than 300 $μW$ with an area of less than 0.2 mm$^2$. The temporal development of charge clusters is investigated to demonstrate possible future performance gains, and there is also a discussion of future algorithmic and technological improvements that could enhance efficiency, data reduction, and power per area.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Data Science and Machine Learning in Education
Authors:
Gabriele Benelli,
Thomas Y. Chen,
Javier Duarte,
Matthew Feickert,
Matthew Graham,
Lindsey Gray,
Dan Hackett,
Phil Harris,
Shih-Chieh Hsu,
Gregor Kasieczka,
Elham E. Khoda,
Matthias Komm,
Mia Liu,
Mark S. Neubauer,
Scarlet Norberg,
Alexx Perloff,
Marcel Rieger,
Claire Savard,
Kazuhiro Terao,
Savannah Thais,
Avik Roy,
Jean-Roch Vlimant,
Grigorios Chachamis
Abstract:
The growing role of data science (DS) and machine learning (ML) in high-energy physics (HEP) is well established and pertinent given the complex detectors, large data, sets and sophisticated analyses at the heart of HEP research. Moreover, exploiting symmetries inherent in physics data have inspired physics-informed ML as a vibrant sub-field of computer science research. HEP researchers benefit gr…
▽ More
The growing role of data science (DS) and machine learning (ML) in high-energy physics (HEP) is well established and pertinent given the complex detectors, large data, sets and sophisticated analyses at the heart of HEP research. Moreover, exploiting symmetries inherent in physics data have inspired physics-informed ML as a vibrant sub-field of computer science research. HEP researchers benefit greatly from materials widely available materials for use in education, training and workforce development. They are also contributing to these materials and providing software to DS/ML-related fields. Increasingly, physics departments are offering courses at the intersection of DS, ML and physics, often using curricula developed by HEP researchers and involving open software and data used in HEP. In this white paper, we explore synergies between HEP research and DS/ML education, discuss opportunities and challenges at this intersection, and propose community activities that will be mutually beneficial.
△ Less
Submitted 19 July, 2022;
originally announced July 2022.
-
Collaborative Computing Support for Analysis Facilities Exploiting Software as Infrastructure Techniques
Authors:
Maria Acosta Flechas,
Garhan Attebury,
Kenneth Bloom,
Brian Bockelman,
Lindsey Gray,
Burt Holzman,
Carl Lundstedt,
Oksana Shadura,
Nicholas Smith,
John Thiltges
Abstract:
Prior to the public release of Kubernetes it was difficult to conduct joint development of elaborate analysis facilities due to the highly non-homogeneous nature of hardware and network topology across compute facilities. However, since the advent of systems like Kubernetes and OpenShift, which provide declarative interfaces for building fault-tolerant and self-healing deployments of networked sof…
▽ More
Prior to the public release of Kubernetes it was difficult to conduct joint development of elaborate analysis facilities due to the highly non-homogeneous nature of hardware and network topology across compute facilities. However, since the advent of systems like Kubernetes and OpenShift, which provide declarative interfaces for building fault-tolerant and self-healing deployments of networked software, it is possible for multiple institutes to collaborate more effectively since resource details are abstracted away through various forms of hardware and software virtualization. In this whitepaper we will outline the development of two analysis facilities: "Coffea-casa" at University of Nebraska Lincoln and the "Elastic Analysis Facility" at Fermilab, and how utilizing platform abstraction has improved the development of common software for each of these facilities, and future development plans made possible by this methodology.
△ Less
Submitted 22 March, 2022; v1 submitted 18 March, 2022;
originally announced March 2022.
-
Reconstruction of Large Radius Tracks with the Exa.TrkX pipeline
Authors:
Chun-Yi Wang,
Xiangyang Ju,
Shih-Chieh Hsu,
Daniel Murnane,
Paolo Calafiura,
Steven Farrell,
Maria Spiropulu,
Jean-Roch Vlimant,
Adam Aurisano,
V Hewes,
Giuseppe Cerati,
Lindsey Gray,
Thomas Klijnsma,
Jim Kowalkowski,
Markus Atkinson,
Mark Neubauer,
Gage DeZoort,
Savannah Thais,
Alexandra Ballow,
Alina Lazar,
Sylvain Caillou,
Charline Rougier,
Jan Stark,
Alexis Vallier,
Jad Sardain
Abstract:
Particle tracking is a challenging pattern recognition task at the Large Hadron Collider (LHC) and the High Luminosity-LHC. Conventional algorithms, such as those based on the Kalman Filter, achieve excellent performance in reconstructing the prompt tracks from the collision points. However, they require dedicated configuration and additional computing time to efficiently reconstruct the large rad…
▽ More
Particle tracking is a challenging pattern recognition task at the Large Hadron Collider (LHC) and the High Luminosity-LHC. Conventional algorithms, such as those based on the Kalman Filter, achieve excellent performance in reconstructing the prompt tracks from the collision points. However, they require dedicated configuration and additional computing time to efficiently reconstruct the large radius tracks created away from the collision points. We developed an end-to-end machine learning-based track finding algorithm for the HL-LHC, the Exa.TrkX pipeline. The pipeline is designed so as to be agnostic about global track positions. In this work, we study the performance of the Exa.TrkX pipeline for finding large radius tracks. Trained with all tracks in the event, the pipeline simultaneously reconstructs prompt tracks and large radius tracks with high efficiencies. This new capability offered by the Exa.TrkX pipeline may enable us to search for new physics in real time.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
Strategy for Understanding the Higgs Physics: The Cool Copper Collider
Authors:
Sridhara Dasu,
Emilio A. Nanni,
Michael E. Peskin,
Caterina Vernieri,
Tim Barklow,
Rainer Bartoldus,
Pushpalatha C. Bhat,
Kevin Black,
Jim Brau,
Martin Breidenbach,
Nathaniel Craig,
Dmitri Denisov,
Lindsey Gray,
Philip C. Harris,
Michael Kagan,
Zhen Liu,
Patrick Meade,
Nathan Majernik,
Sergei Nagaitsev,
Isobel Ojalvo,
Christoph Paus,
Carl Schroeder,
Ariel G. Schwartzman,
Jan Strube,
Su Dong
, et al. (4 additional authors not shown)
Abstract:
A program to build a lepton-collider Higgs factory, to precisely measure the couplings of the Higgs boson to other particles, followed by a higher energy run to establish the Higgs self-coupling and expand the new physics reach, is widely recognized as a primary focus of modern particle physics. We propose a strategy that focuses on a new technology and preliminary estimates suggest that can lead…
▽ More
A program to build a lepton-collider Higgs factory, to precisely measure the couplings of the Higgs boson to other particles, followed by a higher energy run to establish the Higgs self-coupling and expand the new physics reach, is widely recognized as a primary focus of modern particle physics. We propose a strategy that focuses on a new technology and preliminary estimates suggest that can lead to a compact, affordable machine. New technology investigations will provide much needed enthusiasm for our field, resulting in trained workforce. This cost-effective, compact design, with technologies useful for a broad range of other accelerator applications, could be realized as a project in the US. Its technology innovations, both in the accelerator and the detector, will offer unique and exciting opportunities to young scientists. Moreover, cost effective compact designs, broadly applicable to other fields of research, are more likely to obtain financial support from our funding agencies.
△ Less
Submitted 7 June, 2022; v1 submitted 15 March, 2022;
originally announced March 2022.
-
The International Linear Collider: Report to Snowmass 2021
Authors:
Alexander Aryshev,
Ties Behnke,
Mikael Berggren,
James Brau,
Nathaniel Craig,
Ayres Freitas,
Frank Gaede,
Spencer Gessner,
Stefania Gori,
Christophe Grojean,
Sven Heinemeyer,
Daniel Jeans,
Katja Kruger,
Benno List,
Jenny List,
Zhen Liu,
Shinichiro Michizono,
David W. Miller,
Ian Moult,
Hitoshi Murayama,
Tatsuya Nakada,
Emilio Nanni,
Mihoko Nojiri,
Hasan Padamsee,
Maxim Perelstein
, et al. (487 additional authors not shown)
Abstract:
The International Linear Collider (ILC) is on the table now as a new global energy-frontier accelerator laboratory taking data in the 2030s. The ILC addresses key questions for our current understanding of particle physics. It is based on a proven accelerator technology. Its experiments will challenge the Standard Model of particle physics and will provide a new window to look beyond it. This docu…
▽ More
The International Linear Collider (ILC) is on the table now as a new global energy-frontier accelerator laboratory taking data in the 2030s. The ILC addresses key questions for our current understanding of particle physics. It is based on a proven accelerator technology. Its experiments will challenge the Standard Model of particle physics and will provide a new window to look beyond it. This document brings the story of the ILC up to date, emphasizing its strong physics motivation, its readiness for construction, and the opportunity it presents to the US and the global particle physics community.
△ Less
Submitted 16 January, 2023; v1 submitted 14 March, 2022;
originally announced March 2022.
-
GNN-based end-to-end reconstruction in the CMS Phase 2 High-Granularity Calorimeter
Authors:
Saptaparna Bhattacharya,
Nadezda Chernyavskaya,
Saranya Ghosh,
Lindsey Gray,
Jan Kieseler,
Thomas Klijnsma,
Kenneth Long,
Raheel Nawaz,
Kevin Pedro,
Maurizio Pierini,
Gauri Pradhan,
Shah Rukh Qasim,
Oleksander Viazlo,
Philipp Zehetner
Abstract:
We present the current stage of research progress towards a one-pass, completely Machine Learning (ML) based imaging calorimeter reconstruction. The model used is based on Graph Neural Networks (GNNs) and directly analyzes the hits in each HGCAL endcap. The ML algorithm is trained to predict clusters of hits originating from the same incident particle by labeling the hits with the same cluster ind…
▽ More
We present the current stage of research progress towards a one-pass, completely Machine Learning (ML) based imaging calorimeter reconstruction. The model used is based on Graph Neural Networks (GNNs) and directly analyzes the hits in each HGCAL endcap. The ML algorithm is trained to predict clusters of hits originating from the same incident particle by labeling the hits with the same cluster index. We impose simple criteria to assess whether the hits associated as a cluster by the prediction are matched to those hits resulting from any particular individual incident particles. The algorithm is studied by simulating two tau leptons in each of the two HGCAL endcaps, where each tau may decay according to its measured standard model branching probabilities. The simulation includes the material interaction of the tau decay products which may create additional particles incident upon the calorimeter. Using this varied multiparticle environment we can investigate the application of this reconstruction technique and begin to characterize energy containment and performance.
△ Less
Submitted 2 March, 2022;
originally announced March 2022.
-
Constraints on future analysis metadata systems in High Energy Physics
Authors:
T. J. Khoo,
A. Reinsvold Hall,
N. Skidmore,
S. Alderweireldt,
J. Anders,
C. Burr,
W. Buttinger,
P. David,
L. Gouskos,
L. Gray,
S. Hageboeck,
A. Krasznahorkay,
P. Laycock,
A. Lister,
Z. Marshall,
A. B. Meyer,
T. Novak,
S. Rappoccio,
M. Ritter,
E. Rodrigues,
J. Rumsevicius,
L. Sexton-Kennedy,
N. Smith,
G. A. Stewart,
S. Wertz
Abstract:
In High Energy Physics (HEP), analysis metadata comes in many forms -- from theoretical cross-sections, to calibration corrections, to details about file processing. Correctly applying metadata is a crucial and often time-consuming step in an analysis, but designing analysis metadata systems has historically received little direct attention. Among other considerations, an ideal metadata tool shoul…
▽ More
In High Energy Physics (HEP), analysis metadata comes in many forms -- from theoretical cross-sections, to calibration corrections, to details about file processing. Correctly applying metadata is a crucial and often time-consuming step in an analysis, but designing analysis metadata systems has historically received little direct attention. Among other considerations, an ideal metadata tool should be easy to use by new analysers, should scale to large data volumes and diverse processing paradigms, and should enable future analysis reinterpretation. This document, which is the product of community discussions organised by the HEP Software Foundation, categorises types of metadata by scope and format and gives examples of current metadata solutions. Important design considerations for metadata systems, including sociological factors, analysis preservation efforts, and technical factors, are discussed. A list of best practices and technical requirements for future analysis metadata systems is presented. These best practices could guide the development of a future cross-experimental effort for analysis metadata tools.
△ Less
Submitted 19 May, 2022; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Accelerating the Inference of the Exa.TrkX Pipeline
Authors:
Alina Lazar,
Xiangyang Ju,
Daniel Murnane,
Paolo Calafiura,
Steven Farrell,
Yaoyuan Xu,
Maria Spiropulu,
Jean-Roch Vlimant,
Giuseppe Cerati,
Lindsey Gray,
Thomas Klijnsma,
Jim Kowalkowski,
Markus Atkinson,
Mark Neubauer,
Gage DeZoort,
Savannah Thais,
Shih-Chieh Hsu,
Adam Aurisano,
V Hewes,
Alexandra Ballow,
Nirajan Acharya,
Chun-yi Wang,
Emma Liu,
Alberto Lucas
Abstract:
Recently, graph neural networks (GNNs) have been successfully used for a variety of particle reconstruction problems in high energy physics, including particle tracking. The Exa.TrkX pipeline based on GNNs demonstrated promising performance in reconstructing particle tracks in dense environments. It includes five discrete steps: data encoding, graph building, edge filtering, GNN, and track labelin…
▽ More
Recently, graph neural networks (GNNs) have been successfully used for a variety of particle reconstruction problems in high energy physics, including particle tracking. The Exa.TrkX pipeline based on GNNs demonstrated promising performance in reconstructing particle tracks in dense environments. It includes five discrete steps: data encoding, graph building, edge filtering, GNN, and track labeling. All steps were written in Python and run on both GPUs and CPUs. In this work, we accelerate the Python implementation of the pipeline through customized and commercial GPU-enabled software libraries, and develop a C++ implementation for inferencing the pipeline. The implementation features an improved, CUDA-enabled fixed-radius nearest neighbor search for graph building and a weakly connected component graph algorithm for track labeling. GNNs and other trained deep learning models are converted to ONNX and inferenced via the ONNX Runtime C++ API. The complete C++ implementation of the pipeline allows integration with existing tracking software. We report the memory usage and average event latency tracking performance of our implementation applied to the TrackML benchmark dataset.
△ Less
Submitted 14 February, 2022;
originally announced February 2022.
-
Performance of a Geometric Deep Learning Pipeline for HL-LHC Particle Tracking
Authors:
Xiangyang Ju,
Daniel Murnane,
Paolo Calafiura,
Nicholas Choma,
Sean Conlon,
Steve Farrell,
Yaoyuan Xu,
Maria Spiropulu,
Jean-Roch Vlimant,
Adam Aurisano,
V Hewes,
Giuseppe Cerati,
Lindsey Gray,
Thomas Klijnsma,
Jim Kowalkowski,
Markus Atkinson,
Mark Neubauer,
Gage DeZoort,
Savannah Thais,
Aditi Chauhan,
Alex Schuy,
Shih-Chieh Hsu,
Alex Ballow,
and Alina Lazar
Abstract:
The Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. Exa.TrkX's tracking pipeline groups detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-inspired tracking detector), has been demonstrated on other detectors, includ…
▽ More
The Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. Exa.TrkX's tracking pipeline groups detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-inspired tracking detector), has been demonstrated on other detectors, including DUNE Liquid Argon TPC and CMS High-Granularity Calorimeter. This paper documents new developments needed to study the physics and computing performance of the Exa.TrkX pipeline on the full TrackML dataset, a first step towards validating the pipeline using ATLAS and CMS data. The pipeline achieves tracking efficiency and purity similar to production tracking algorithms. Crucially for future HEP applications, the pipeline benefits significantly from GPU acceleration, and its computational requirements scale close to linearly with the number of particles in the event.
△ Less
Submitted 21 September, 2021; v1 submitted 11 March, 2021;
originally announced March 2021.
-
Graph Neural Network for Object Reconstruction in Liquid Argon Time Projection Chambers
Authors:
V Hewes,
Adam Aurisano,
Giuseppe Cerati,
Jim Kowalkowski,
Claire Lee,
Wei-keng Liao,
Alexandra Day,
Ankit Agrawal,
Maria Spiropulu,
Jean-Roch Vlimant,
Lindsey Gray,
Thomas Klijnsma,
Paolo Calafiura,
Sean Conlon,
Steve Farrell,
Xiangyang Ju,
Daniel Murnane
Abstract:
This paper presents a graph neural network (GNN) technique for low-level reconstruction of neutrino interactions in a Liquid Argon Time Projection Chamber (LArTPC). GNNs are still a relatively novel technique, and have shown great promise for similar reconstruction tasks in the LHC. In this paper, a multihead attention message passing network is used to classify the relationship between detector…
▽ More
This paper presents a graph neural network (GNN) technique for low-level reconstruction of neutrino interactions in a Liquid Argon Time Projection Chamber (LArTPC). GNNs are still a relatively novel technique, and have shown great promise for similar reconstruction tasks in the LHC. In this paper, a multihead attention message passing network is used to classify the relationship between detector hits by labelling graph edges, determining whether hits were produced by the same underlying particle, and if so, the particle type. The trained model is 84% accurate overall, and performs best on the EM shower and muon track classes. The model's strengths and weaknesses are discussed, and plans for develo** this technique further are summarised.
△ Less
Submitted 11 March, 2021; v1 submitted 10 March, 2021;
originally announced March 2021.
-
The Analog Front-end for the LGAD Based Precision Timing Application in CMS ETL
Authors:
Quan Sun,
Sunil M. Dogra,
Christopher Edwards,
Datao Gong,
Lindsey Gray,
Xing Huang,
Siddhartha Joshi,
Jongho Lee,
Chonghan Liu,
Tiehui Liu,
Tiankuan Liu,
Sergey Los,
Chang-Seong Moon,
Geonhee Oh,
Jamieson Olsen,
Luciano Ristori,
Hanhan Sun,
Xiao Wang,
**yuan Wu,
**gbo Ye,
Zhenyu Ye,
Li Zhang,
Wei Zhang
Abstract:
The analog front-end for the Low Gain Avalanche Detector (LGAD) based precision timing application in the CMS Endcap Timing Layer (ETL) has been prototyped in a 65 nm CMOS mini-ASIC named ETROC0. Serving as the very first prototype of ETL readout chip (ETROC), ETROC0 aims to study and demonstrate the performance of the analog frontend, with the goal to achieve 40 to 50 ps time resolution per hit w…
▽ More
The analog front-end for the Low Gain Avalanche Detector (LGAD) based precision timing application in the CMS Endcap Timing Layer (ETL) has been prototyped in a 65 nm CMOS mini-ASIC named ETROC0. Serving as the very first prototype of ETL readout chip (ETROC), ETROC0 aims to study and demonstrate the performance of the analog frontend, with the goal to achieve 40 to 50 ps time resolution per hit with LGAD (therefore reach about 30ps per track with two detector-layer hits per track). ETROC0 consists of preamplifier and discriminator stages, which amplifies the LGAD signal and generates digital pulses containing time of arrival and time over threshold information. This paper will focus on the design considerations that lead to the ETROC front-end architecture choice, the key design features of the building blocks, the methodology of using the LGAD simulation data to evaluate and optimize the front-end design. The ETROC0 prototype chips have been extensively tested using charge injection and the measured performance agrees well with simulation. The initial beam test results are also presented, with time resolution of around 33 ps observed from the preamplifier waveform analysis and around 41 ps from the discriminator pulses analysis. A subset of ETROC0 chips have also been tested to a total ionizing dose of 100 MRad with X-ray and no performance degradation been observed.
△ Less
Submitted 28 December, 2020;
originally announced December 2020.
-
Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs
Authors:
Aneesh Heintz,
Vesal Razavimaleki,
Javier Duarte,
Gage DeZoort,
Isobel Ojalvo,
Savannah Thais,
Markus Atkinson,
Mark Neubauer,
Lindsey Gray,
Sergo **dariani,
Nhan Tran,
Philip Harris,
Dylan Rankin,
Thea Aarrestad,
Vladimir Loncar,
Maurizio Pierini,
Sioni Summers,
Jennifer Ngadiuba,
Mia Liu,
Edward Kreinar,
Zhenbin Wu
Abstract:
We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, an…
▽ More
We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, and tracking performance of our implementations based on a benchmark dataset. We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing workflows and the FPGA-based Level-1 trigger at the CERN Large Hadron Collider.
△ Less
Submitted 30 November, 2020;
originally announced December 2020.
-
HL-LHC Computing Review: Common Tools and Community Software
Authors:
HEP Software Foundation,
:,
Thea Aarrestad,
Simone Amoroso,
Markus Julian Atkinson,
Joshua Bendavid,
Tommaso Boccali,
Andrea Bocci,
Andy Buckley,
Matteo Cacciari,
Paolo Calafiura,
Philippe Canal,
Federico Carminati,
Taylor Childers,
Vitaliano Ciulli,
Gloria Corti,
Davide Costanzo,
Justin Gage Dezoort,
Caterina Doglioni,
Javier Mauricio Duarte,
Agnieszka Dziurda,
Peter Elmer,
Markus Elsing,
V. Daniel Elvira,
Giulio Eulisse
, et al. (85 additional authors not shown)
Abstract:
Common and community software packages, such as ROOT, Geant4 and event generators have been a key part of the LHC's success so far and continued development and optimisation will be critical in the future. The challenges are driven by an ambitious physics programme, notably the LHC accelerator upgrade to high-luminosity, HL-LHC, and the corresponding detector upgrades of ATLAS and CMS. In this doc…
▽ More
Common and community software packages, such as ROOT, Geant4 and event generators have been a key part of the LHC's success so far and continued development and optimisation will be critical in the future. The challenges are driven by an ambitious physics programme, notably the LHC accelerator upgrade to high-luminosity, HL-LHC, and the corresponding detector upgrades of ATLAS and CMS. In this document we address the issues for software that is used in multiple experiments (usually even more widely than ATLAS and CMS) and maintained by teams of developers who are either not linked to a particular experiment or who contribute to common software within the context of their experiment activity. We also give space to general considerations for future software and projects that tackle upcoming challenges, no matter who writes it, which is an area where community convergence on best practice is extremely useful.
△ Less
Submitted 31 August, 2020;
originally announced August 2020.
-
Coffea -- Columnar Object Framework For Effective Analysis
Authors:
Nicholas Smith,
Lindsey Gray,
Matteo Cremonesi,
Bo Jayatilaka,
Oliver Gutsche,
Allison Hall,
Kevin Pedro,
Maria Acosta,
Andrew Melo,
Stefano Belforte,
Jim Pivarski
Abstract:
The coffea framework provides a new approach to High-Energy Physics analysis, via columnar operations, that improves time-to-insight, scalability, portability, and reproducibility of analysis. It is implemented with the Python programming language, the scientific python package ecosystem, and commodity big data technologies. To achieve this suite of improvements across many use cases, coffea takes…
▽ More
The coffea framework provides a new approach to High-Energy Physics analysis, via columnar operations, that improves time-to-insight, scalability, portability, and reproducibility of analysis. It is implemented with the Python programming language, the scientific python package ecosystem, and commodity big data technologies. To achieve this suite of improvements across many use cases, coffea takes a factorized approach, separating the analysis implementation and data delivery scheme. All analysis operations are implemented using the NumPy or awkward-array packages which are wrapped to yield user code whose purpose is quickly intuited. Various data delivery schemes are wrapped into a common front-end which accepts user inputs and code, and returns user defined outputs. We will discuss our experience in implementing analysis of CMS data using the coffea framework along with a discussion of the user experience and future directions.
△ Less
Submitted 6 August, 2021; v1 submitted 28 August, 2020;
originally announced August 2020.
-
Track Seeding and Labelling with Embedded-space Graph Neural Networks
Authors:
Nicholas Choma,
Daniel Murnane,
Xiangyang Ju,
Paolo Calafiura,
Sean Conlon,
Steven Farrell,
Prabhat,
Giuseppe Cerati,
Lindsey Gray,
Thomas Klijnsma,
Jim Kowalkowski,
Panagiotis Spentzouris,
Jean-Roch Vlimant,
Maria Spiropulu,
Adam Aurisano,
V Hewes,
Aristeidis Tsaris,
Kazuhiro Terao,
Tracy Usher
Abstract:
To address the unprecedented scale of HL-LHC data, the Exa.TrkX project is investigating a variety of machine learning approaches to particle track reconstruction. The most promising of these solutions, graph neural networks (GNN), process the event as a graph that connects track measurements (detector hits corresponding to nodes) with candidate line segments between the hits (corresponding to edg…
▽ More
To address the unprecedented scale of HL-LHC data, the Exa.TrkX project is investigating a variety of machine learning approaches to particle track reconstruction. The most promising of these solutions, graph neural networks (GNN), process the event as a graph that connects track measurements (detector hits corresponding to nodes) with candidate line segments between the hits (corresponding to edges). Detector information can be associated with nodes and edges, enabling a GNN to propagate the embedded parameters around the graph and predict node-, edge- and graph-level observables. Previously, message-passing GNNs have shown success in predicting doublet likelihood, and we here report updates on the state-of-the-art architectures for this task. In addition, the Exa.TrkX project has investigated innovations in both graph construction, and embedded representations, in an effort to achieve fully learned end-to-end track finding. Hence, we present a suite of extensions to the original model, with encouraging results for hitgraph classification. In addition, we explore increased performance by constructing graphs from learned representations which contain non-linear metric structure, allowing for efficient clustering and neighborhood queries of data points. We demonstrate how this framework fits in with both traditional clustering pipelines, and GNN approaches. The embedded graphs feed into high-accuracy doublet and triplet classifiers, or can be used as an end-to-end track classifier by clustering in an embedded space. A set of post-processing methods improve performance with knowledge of the detector physics. Finally, we present numerical results on the TrackML particle tracking challenge dataset, where our framework shows favorable results in both seeding and track finding.
△ Less
Submitted 30 June, 2020;
originally announced July 2020.
-
Graph Neural Networks for Particle Reconstruction in High Energy Physics detectors
Authors:
Xiangyang Ju,
Steven Farrell,
Paolo Calafiura,
Daniel Murnane,
Prabhat,
Lindsey Gray,
Thomas Klijnsma,
Kevin Pedro,
Giuseppe Cerati,
Jim Kowalkowski,
Gabriel Perdue,
Panagiotis Spentzouris,
Nhan Tran,
Jean-Roch Vlimant,
Alexander Zlokapa,
Joosep Pata,
Maria Spiropulu,
Sitong An,
Adam Aurisano,
V Hewes,
Aristeidis Tsaris,
Kazuhiro Terao,
Tracy Usher
Abstract:
Pattern recognition problems in high energy physics are notably different from traditional machine learning applications in computer vision. Reconstruction algorithms identify and measure the kinematic properties of particles produced in high energy collisions and recorded with complex detector systems. Two critical applications are the reconstruction of charged particle trajectories in tracking d…
▽ More
Pattern recognition problems in high energy physics are notably different from traditional machine learning applications in computer vision. Reconstruction algorithms identify and measure the kinematic properties of particles produced in high energy collisions and recorded with complex detector systems. Two critical applications are the reconstruction of charged particle trajectories in tracking detectors and the reconstruction of particle showers in calorimeters. These two problems have unique challenges and characteristics, but both have high dimensionality, high degree of sparsity, and complex geometric layouts. Graph Neural Networks (GNNs) are a relatively new class of deep learning architectures which can deal with such data effectively, allowing scientists to incorporate domain knowledge in a graph structure and learn powerful representations leveraging that structure to identify patterns of interest. In this work we demonstrate the applicability of GNNs to these two diverse particle reconstruction problems.
△ Less
Submitted 3 June, 2020; v1 submitted 25 March, 2020;
originally announced March 2020.
-
A Dynamic Reduction Network for Point Clouds
Authors:
Lindsey Gray,
Thomas Klijnsma,
Shamik Ghosh
Abstract:
Classifying whole images is a classic problem in machine learning, and graph neural networks are a powerful methodology to learn highly irregular geometries. It is often the case that certain parts of a point cloud are more important than others when determining overall classification. On graph structures this started by pooling information at the end of convolutional filters, and has evolved to a…
▽ More
Classifying whole images is a classic problem in machine learning, and graph neural networks are a powerful methodology to learn highly irregular geometries. It is often the case that certain parts of a point cloud are more important than others when determining overall classification. On graph structures this started by pooling information at the end of convolutional filters, and has evolved to a variety of staged pooling techniques on static graphs. In this paper, a dynamic graph formulation of pooling is introduced that removes the need for predetermined graph structure. It achieves this by dynamically learning the most important relationships between data via an intermediate clustering. The network architecture yields interesting results considering representation size and efficiency. It also adapts easily to a large number of tasks from image classification to energy regression in high energy particle physics.
△ Less
Submitted 17 March, 2020;
originally announced March 2020.
-
Novel deep learning methods for track reconstruction
Authors:
Steven Farrell,
Paolo Calafiura,
Mayur Mudigonda,
Prabhat,
Dustin Anderson,
Jean-Roch Vlimant,
Stephan Zheng,
Josh Bendavid,
Maria Spiropulu,
Giuseppe Cerati,
Lindsey Gray,
Jim Kowalkowski,
Panagiotis Spentzouris,
Aristeidis Tsaris
Abstract:
For the past year, the HEP.TrkX project has been investigating machine learning solutions to LHC particle track reconstruction problems. A variety of models were studied that drew inspiration from computer vision applications and operated on an image-like representation of tracking detector data. While these approaches have shown some promise, image-based methods face challenges in scaling up to r…
▽ More
For the past year, the HEP.TrkX project has been investigating machine learning solutions to LHC particle track reconstruction problems. A variety of models were studied that drew inspiration from computer vision applications and operated on an image-like representation of tracking detector data. While these approaches have shown some promise, image-based methods face challenges in scaling up to realistic HL-LHC data due to high dimensionality and sparsity. In contrast, models that can operate on the spacepoint representation of track measurements ("hits") can exploit the structure of the data to solve tasks efficiently. In this paper we will show two sets of new deep learning models for reconstructing tracks using space-point data arranged as sequences or connected graphs. In the first set of models, Recurrent Neural Networks (RNNs) are used to extrapolate, build, and evaluate track candidates akin to Kalman Filter algorithms. Such models can express their own uncertainty when trained with an appropriate likelihood loss function. The second set of models use Graph Neural Networks (GNNs) for the tasks of hit classification and segment classification. These models read a graph of connected hits and compute features on the nodes and edges. They adaptively learn which hit connections are important and which are spurious. The models are scaleable with simple architecture and relatively few parameters. Results for all models will be presented on ACTS generic detector simulated data.
△ Less
Submitted 14 October, 2018;
originally announced October 2018.
-
Machine Learning in High Energy Physics Community White Paper
Authors:
Kim Albertsson,
Piero Altoe,
Dustin Anderson,
John Anderson,
Michael Andrews,
Juan Pedro Araque Espinosa,
Adam Aurisano,
Laurent Basara,
Adrian Bevan,
Wahid Bhimji,
Daniele Bonacorsi,
Bjorn Burkle,
Paolo Calafiura,
Mario Campanelli,
Louis Capps,
Federico Carminati,
Stefano Carrazza,
Yi-fan Chen,
Taylor Childers,
Yann Coadou,
Elias Coniavitis,
Kyle Cranmer,
Claire David,
Douglas Davis,
Andrea De Simone
, et al. (103 additional authors not shown)
Abstract:
Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We d…
▽ More
Machine learning has been applied to several problems in particle physics research, beginning with applications to high-level physics analysis in the 1990s and 2000s, followed by an explosion of applications in particle and event identification and reconstruction in the 2010s. In this document we discuss promising future research and development areas for machine learning in particle physics. We detail a roadmap for their implementation, software and hardware resource requirements, collaborative initiatives with the data science community, academia and industry, and training the particle physics community in data science. The main objective of the document is to connect and motivate these areas of research and development with the physics drivers of the High-Luminosity Large Hadron Collider and future neutrino experiments and identify the resource needs for their implementation. Additionally we identify areas where collaboration with external communities will be of great benefit.
△ Less
Submitted 16 May, 2019; v1 submitted 8 July, 2018;
originally announced July 2018.
-
Initial performance studies of a general-purpose detector for multi-TeV physics at a 100 TeV pp collider
Authors:
S. V. Chekanov,
M. Beydler,
A. V. Kotwal,
L. Gray,
S. Sen,
N. V. Tran,
S. -S. Yu,
J. Zuzelski
Abstract:
This paper describes simulations of detector response to multi-TeV physics at the Future Circular Collider (FCC-hh) or Super proton-proton Collider (SppC) which aim to collide proton beams with a centre-of-mass energy of 100 TeV. The unprecedented energy regime of these future experiments imposes new requirements on detector technologies which can be studied using the detailed GEANT4 simulations p…
▽ More
This paper describes simulations of detector response to multi-TeV physics at the Future Circular Collider (FCC-hh) or Super proton-proton Collider (SppC) which aim to collide proton beams with a centre-of-mass energy of 100 TeV. The unprecedented energy regime of these future experiments imposes new requirements on detector technologies which can be studied using the detailed GEANT4 simulations presented in this paper. The initial performance of a detector designed for physics studies at the FCC-hh or SppC experiments is described with an emphasis on measurements of single particles up to 33 TeV in transverse momentum. The reconstruction of hadronic jets has also been studied in the transverse momentum range from 50 GeV to 26 TeV. The granularity requirements for calorimetry are investigated using the two-particle spatial resolution achieved for hadron showers.
△ Less
Submitted 9 June, 2017; v1 submitted 21 December, 2016;
originally announced December 2016.
-
Study Of Boosted W-Jets And Higgs-Jets With the SiFCC Detector
Authors:
Shin-Shan Yu,
Sergei Chekanov,
Lindsey Gray,
Ashutosh Kotwal,
Sourav Sen,
Nhan Viet Tran
Abstract:
We study the detector performance in the reconstruction of hadronically-decaying W bosons and Higgs bosons at very high energy proton colliders using a full GEANT4 simulation of the SiFCC detector. The W and Higgs bosons carry transverse momentum in the multi-TeV range, which results in collimated decay products that are reconstructed as a single jet. We present a measurement of the energy respons…
▽ More
We study the detector performance in the reconstruction of hadronically-decaying W bosons and Higgs bosons at very high energy proton colliders using a full GEANT4 simulation of the SiFCC detector. The W and Higgs bosons carry transverse momentum in the multi-TeV range, which results in collimated decay products that are reconstructed as a single jet. We present a measurement of the energy response and resolution of boosted W-jets and Higgs-jets and show the separation of two sub-jets within the boosted boson jet.
△ Less
Submitted 3 November, 2016;
originally announced November 2016.
-
Observation of the rare $B^0_s\toμ^+μ^-$ decay from the combined analysis of CMS and LHCb data
Authors:
The CMS,
LHCb Collaborations,
:,
V. Khachatryan,
A. M. Sirunyan,
A. Tumasyan,
W. Adam,
T. Bergauer,
M. Dragicevic,
J. Erö,
M. Friedl,
R. Frühwirth,
V. M. Ghete,
C. Hartl,
N. Hörmann,
J. Hrubec,
M. Jeitler,
W. Kiesenhofer,
V. Knünz,
M. Krammer,
I. Krätschmer,
D. Liko,
I. Mikulec,
D. Rabady,
B. Rahbaran
, et al. (2807 additional authors not shown)
Abstract:
A joint measurement is presented of the branching fractions $B^0_s\toμ^+μ^-$ and $B^0\toμ^+μ^-$ in proton-proton collisions at the LHC by the CMS and LHCb experiments. The data samples were collected in 2011 at a centre-of-mass energy of 7 TeV, and in 2012 at 8 TeV. The combined analysis produces the first observation of the $B^0_s\toμ^+μ^-$ decay, with a statistical significance exceeding six sta…
▽ More
A joint measurement is presented of the branching fractions $B^0_s\toμ^+μ^-$ and $B^0\toμ^+μ^-$ in proton-proton collisions at the LHC by the CMS and LHCb experiments. The data samples were collected in 2011 at a centre-of-mass energy of 7 TeV, and in 2012 at 8 TeV. The combined analysis produces the first observation of the $B^0_s\toμ^+μ^-$ decay, with a statistical significance exceeding six standard deviations, and the best measurement of its branching fraction so far. Furthermore, evidence for the $B^0\toμ^+μ^-$ decay is obtained with a statistical significance of three standard deviations. The branching fraction measurements are statistically compatible with SM predictions and impose stringent constraints on several theories beyond the SM.
△ Less
Submitted 17 August, 2015; v1 submitted 17 November, 2014;
originally announced November 2014.