-
Optimizing High Throughput Inference on Graph Neural Networks at Shared Computing Facilities with the NVIDIA Triton Inference Server
Authors:
Claire Savard,
Nicholas Manganelli,
Burt Holzman,
Lindsey Gray,
Alexx Perloff,
Kevin Pedro,
Kevin Stenson,
Keith Ulmer
Abstract:
With machine learning applications now spanning a variety of computational tasks, multi-user shared computing facilities are devoting a rapidly increasing proportion of their resources to such algorithms. Graph neural networks (GNNs), for example, have provided astounding improvements in extracting complex signatures from data and are now widely used in a variety of applications, such as particle jet classification in high energy physics (HEP). However, GNNs also come with an enormous computational penalty that requires the use of GPUs to maintain reasonable throughput. At shared computing facilities, such as those used by physicists at Fermi National Accelerator Laboratory (Fermilab), methodical resource allocation and high throughput at the many-user scale are key to ensuring that resources are being used as efficiently as possible. These facilities, however, primarily provide CPU-only nodes, which proves detrimental to time-to-insight and computational throughput for workflows that include machine learning inference. In this work, we describe how a shared computing facility can use the NVIDIA Triton Inference Server to optimize its resource allocation and computing structure, recovering high throughput while scaling out to multiple users by massively parallelizing their machine learning inference. To demonstrate the effectiveness of this system in a realistic multi-user environment, we use the Fermilab Elastic Analysis Facility augmented with the Triton Inference Server to provide scalable and high throughput access to a HEP-specific GNN and report on the outcome.
Submitted 11 December, 2023;
originally announced December 2023.
-
Evidence for Four-Top Quark Production at the LHC
Authors:
Nicholas Manganelli,
Melissa Quinnan
Abstract:
The standard model production of four top quarks is predicted to have a cross section of the order of 12 fb. The CMS Collaboration presents new results on this rare production mechanism for Run 2 data collected in 2016 through 2018 at 13 TeV, considering event signatures containing zero to four electrons or muons. This is the first time the all-hadronic channel is investigated in the study of four top quarks, made possible through novel machine-learning-based, data-driven background estimation techniques.
Submitted 12 December, 2022; v1 submitted 12 December, 2022;
originally announced December 2022.
-
Second Analysis Ecosystem Workshop Report
Authors:
Mohamed Aly,
Jackson Burzynski,
Bryan Cardwell,
Daniel C. Craik,
Tal van Daalen,
Tomas Dado,
Ayanabha Das,
Antonio Delgado Peris,
Caterina Doglioni,
Peter Elmer,
Engin Eren,
Martin B. Eriksen,
Jonas Eschle,
Giulio Eulisse,
Conor Fitzpatrick,
José Flix Molina,
Alessandra Forti,
Ben Galewsky,
Sean Gasiorowski,
Aman Goel,
Loukas Gouskos,
Enrico Guiraud,
Kanhaiya Gupta,
Stephan Hageboeck,
Allison Reinsvold Hall
, et al. (44 additional authors not shown)
Abstract:
The second workshop on the HEP Analysis Ecosystem took place 23-25 May 2022 at IJCLab in Orsay, to look at progress and continuing challenges in scaling up HEP analysis to meet the needs of HL-LHC and DUNE, as well as the very pressing needs of LHC Run 3 analysis.
The workshop was themed around six particular topics, which were felt to capture key questions, opportunities and challenges. Each topic arranged a plenary session introduction, often with speakers summarising the state of the art and the next steps for analysis. This was then followed by parallel sessions, which were much more discussion-focused, and where attendees could grapple with the challenges and propose solutions that could be tried. Where there was significant overlap between topics, a joint discussion between them was arranged.
In the weeks following the workshop the session conveners wrote this document, which is a summary of the main discussions, the key points raised and the conclusions and outcomes. The document was circulated amongst the participants for comments before being finalised here.
Submitted 9 December, 2022;
originally announced December 2022.
-
Learning from the Pandemic: the Future of Meetings in HEP and Beyond
Authors:
Mark S. Neubauer,
Todd Adams,
Jennifer Adelman-McCarthy,
Gabriele Benelli,
Tulika Bose,
David Britton,
Pat Burchat,
Joel Butler,
Timothy A. Cartwright,
Tomáš Davídek,
Jacques Dumarchez,
Peter Elmer,
Matthew Feickert,
Ben Galewsky,
Mandeep Gill,
Maciej Gladki,
Aman Goel,
Jonathan E. Guyer,
Bo Jayatilaka,
Brendan Kiburg,
Benjamin Krikler,
David Lange,
Claire Lee,
Nick Manganelli,
Giovanni Marchiori
, et al. (14 additional authors not shown)
Abstract:
The COVID-19 pandemic has by and large prevented in-person meetings since March 2020. While the increasing deployment of effective vaccines around the world is a very positive development, the timeline and pathway to "normality" are uncertain and the "new normal" we will settle into is anyone's guess. Particle physics, like many other scientific fields, has more than a year of experience in holding virtual meetings, workshops, and conferences. A great deal of experimentation and innovation to explore how to execute these meetings effectively has occurred. Therefore, it is an appropriate time to take stock of what we as a community learned from running virtual meetings and discuss possible strategies for the future. Continuing to develop effective strategies for meetings with a virtual component is likely to be important for reducing the carbon footprint of our research activities, while also enabling greater diversity and inclusion for participation. This report summarizes a virtual two-day workshop on Virtual Meetings held May 5-6, 2021, which brought together experts from both inside and outside of high-energy physics to share their experiences and practices with organizing and executing virtual workshops, and to develop possible strategies for future meetings as we begin to emerge from the COVID-19 pandemic. This report outlines some of the practices and tools that have worked well, which we hope will serve as a valuable resource for future virtual meeting organizers in all scientific fields.
Submitted 29 June, 2021;
originally announced June 2021.