Showing 1–40 of 40 results for author: Harris, P

  1. arXiv:2406.19522  [pdf, other]

    cs.LG

    Reliable edge machine learning hardware for scientific applications

    Authors: Tommaso Baldi, Javier Campos, Ben Hawks, Jennifer Ngadiuba, Nhan Tran, Daniel Diaz, Javier Duarte, Ryan Kastner, Andres Meza, Melissa Quinnan, Olivia Weng, Caleb Geniesse, Amir Gholami, Michael W. Mahoney, Vladimir Loncar, Philip Harris, Joshua Agar, Shuyu Qin

    Abstract: Extreme data rate scientific experiments create massive amounts of data that require efficient ML edge processing. This leads to unique validation challenges for VLSI implementations of ML algorithms: enabling bit-accurate functional simulations for performance validation in experimental software frameworks, verifying those ML models are robust under extreme quantization and pruning, and enabling…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: IEEE VLSI Test Symposium 2024 (VTS)

    Report number: FERMILAB-CONF-24-0116-CSAID

  2. arXiv:2403.07066  [pdf, other]

    hep-ph cs.LG hep-ex

    Re-Simulation-based Self-Supervised Learning for Pre-Training Foundation Models

    Authors: Philip Harris, Michael Kagan, Jeffrey Krupa, Benedikt Maier, Nathaniel Woodward

    Abstract: Self-Supervised Learning (SSL) is at the core of training modern large machine learning models, providing a scheme for learning powerful representations that can be used in a variety of downstream tasks. However, SSL strategies must be adapted to the type of training data and downstream tasks required. We propose RS3L, a novel simulation-based SSL strategy that employs a method of re-simulation to…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 24 pages, 9 figures

  3. arXiv:2402.01047  [pdf, other]

    cs.LG cs.AR hep-ex

    Ultra Fast Transformers on FPGAs for Particle Physics Experiments

    Authors: Zhixing Jiang, Dennis Yin, Elham E Khoda, Vladimir Loncar, Ekaterina Govorkova, Eric Moreno, Philip Harris, Scott Hauck, Shih-Chieh Hsu

    Abstract: This work introduces a highly efficient implementation of the transformer architecture on a Field-Programmable Gate Array (FPGA) by using the hls4ml tool. Given the demonstrated effectiveness of transformer models in addressing a wide range of problems, their application in experimental triggers within particle physics becomes a subject of significant interest. In this work, we have imple…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures

    Journal ref: Machine Learning and the Physical Sciences Workshop, NeurIPS 2023

  4. arXiv:2401.09949  [pdf, other]

    cs.LG hep-ex physics.ins-det

    SymbolNet: Neural Symbolic Regression with Adaptive Dynamic Pruning

    Authors: Ho Fung Tsoi, Vladimir Loncar, Sridhara Dasu, Philip Harris

    Abstract: Contrary to the use of genetic programming, the neural network approach to symbolic regression can scale well with high input dimension and leverage gradient methods for faster equation searching. Common ways of constraining expression complexity have relied on multistage pruning methods with fine-tuning, but these often lead to significant performance loss. In this work, we propose SymbolNet, a n…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 11 pages. Submitted to IEEE TNNLS, under review

  5. arXiv:2312.07615  [pdf, other]

    cs.LG astro-ph.IM

    Optimizing Likelihood-free Inference using Self-supervised Neural Symmetry Embeddings

    Authors: Deep Chatterjee, Philip C. Harris, Maanas Goel, Malina Desai, Michael W. Coughlin, Erik Katsavounidis

    Abstract: Likelihood-free inference is quickly emerging as a powerful tool to perform fast/effective parameter estimation. We demonstrate a technique of optimizing likelihood-free inference to make it even faster by marginalizing symmetries in a physical problem. In this approach, physical symmetries, for example, time-translation, are learned using joint-embedding via self-supervised learning with symmetry…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted for Machine Learning and the Physical Sciences Workshop (submission 69) at NeurIPS 2023; for codes, see https://github.com/ML4GW/summer-projects-2023/blob/neurips-2023/symmetry-informed-flows/README.md

  6. arXiv:2310.06047  [pdf, other]

    cs.LG

    Knowledge Distillation for Anomaly Detection

    Authors: Adrian Alan Pol, Ekaterina Govorkova, Sonja Gronroos, Nadezda Chernyavskaya, Philip Harris, Maurizio Pierini, Isobel Ojalvo, Peter Elmer

    Abstract: Unsupervised deep learning techniques are widely used to identify anomalous behaviour. The performance of such methods is a product of the amount of training data and the model size. However, the size is often a limiting factor for the deployment on resource-constrained devices. We present a novel procedure based on knowledge distillation for compressing an unsupervised anomaly detection model int…

    Submitted 9 October, 2023; originally announced October 2023.

  7. arXiv:2309.12265  [pdf, ps, other]

    math.CO cs.DM cs.GT

    Cost-sharing in Parking Games

    Authors: Jennifer Elder, Pamela E. Harris, Jan Kretschmann, J. Carlos Martínez Mori

    Abstract: In this paper, we study the total displacement statistic of parking functions from the perspective of cooperative game theory. We introduce parking games, which are coalitional cost-sharing games in characteristic function form derived from the total displacement statistic. We show that parking games are supermodular cost-sharing games, indicating that cooperation is difficult (i.e., their core is…

    Submitted 14 November, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 12 pages

    MSC Class: 05A05; 91A12; 91A46

  8. Identifying and Extracting Rare Disease Phenotypes with Large Language Models

    Authors: Cathy Shyr, Yan Hu, Paul A. Harris, Hua Xu

    Abstract: Rare diseases (RDs) are collectively common and affect 300 million people worldwide. Accurate phenotyping is critical for informing diagnosis and treatment, but RD phenotypes are often embedded in unstructured text and time-consuming to extract manually. While natural language processing (NLP) models can perform named entity recognition (NER) to automate extraction, a major bottleneck is the devel…

    Submitted 21 June, 2023; originally announced June 2023.

    Journal ref: J Healthc Inform Res 8, 438-461 (2024)

  9. arXiv:2305.04099  [pdf, other]

    cs.LG hep-ex physics.ins-det

    Symbolic Regression on FPGAs for Fast Machine Learning Inference

    Authors: Ho Fung Tsoi, Adrian Alan Pol, Vladimir Loncar, Ekaterina Govorkova, Miles Cranmer, Sridhara Dasu, Peter Elmer, Philip Harris, Isobel Ojalvo, Maurizio Pierini

    Abstract: The high-energy physics community is investigating the potential of deploying machine-learning-based solutions on Field-Programmable Gate Arrays (FPGAs) to enhance physics sensitivity while still meeting data processing time constraints. In this contribution, we introduce a novel end-to-end procedure that utilizes a machine learning technique called symbolic regression (SR). It searches the equati…

    Submitted 17 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 9 pages. Accepted to 26th International Conference on Computing in High Energy & Nuclear Physics (CHEP 2023)

    Journal ref: EPJ Web of Conferences 295, 09036 (2024)

  10. arXiv:2304.02577  [pdf, other]

    physics.med-ph cs.LG eess.SP

    ECG Feature Importance Rankings: Cardiologists vs. Algorithms

    Authors: Temesgen Mehari, Ashish Sundar, Alen Bosnjakovic, Peter Harris, Steven E. Williams, Axel Loewe, Olaf Doessel, Claudia Nagel, Nils Strodthoff, Philip J. Aston

    Abstract: Feature importance methods promise to provide a ranking of features according to importance for a given classification task. A wide range of methods exist but their rankings often disagree and they are inherently difficult to evaluate due to a lack of ground truth beyond synthetic datasets. In this work, we put feature importance methods to the test on real-world data in the domain of cardiology,…

    Submitted 5 April, 2023; originally announced April 2023.

  11. arXiv:2301.04633  [pdf, ps, other]

    hep-ex cs.DC physics.data-an

    Accelerating Machine Learning Inference with GPUs in ProtoDUNE Data Processing

    Authors: Tejin Cai, Kenneth Herner, Tingjun Yang, Michael Wang, Maria Acosta Flechas, Philip Harris, Burt Holzman, Kevin Pedro, Nhan Tran

    Abstract: We study the performance of a cloud-based GPU-accelerated inference server to speed up event reconstruction in neutrino data batch jobs. Using detector data from the ProtoDUNE experiment and employing the standard DUNE grid job submission tools, we attempt to reprocess the data by running several thousand concurrent grid jobs, a rate we expect to be typical of current and future neutrino physics e…

    Submitted 27 October, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: 13 pages, 9 figures, matches accepted version

    Report number: FERMILAB-PUB-22-944-ND-PPD-SCD

    Journal ref: Comput Softw Big Sci 7, 11 (2023)

  12. arXiv:2212.05081  [pdf, other]

    hep-ex cs.LG physics.comp-ph

    FAIR AI Models in High Energy Physics

    Authors: Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack, Zhizhen Zhao

    Abstract: The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery. Generalizing these principles to research software and other digital products is an active area of research. Machine learning (ML) models -- algorithms that have been trained on data without being explicitly…

    Submitted 29 December, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 34 pages, 9 figures, 10 tables

    Journal ref: Mach. Learn.: Sci. Technol. 4 (2023) 045062

  13. arXiv:2210.08973  [pdf, ps, other]

    cs.CY cs.HC cs.LG hep-ex

    FAIR for AI: An interdisciplinary and international community building perspective

    Authors: E. A. Huerta, Ben Blaiszik, L. Catherine Brinson, Kristofer E. Bouchard, Daniel Diaz, Caterina Doglioni, Javier M. Duarte, Murali Emani, Ian Foster, Geoffrey Fox, Philip Harris, Lukas Heinrich, Shantenu Jha, Daniel S. Katz, Volodymyr Kindratenko, Christine R. Kirkpatrick, Kati Lassila-Perini, Ravi K. Madduri, Mark S. Neubauer, Fotis E. Psomopoulos, Avik Roy, Oliver Rübel, Zhizhen Zhao, Ruike Zhu

    Abstract: A foundational set of findable, accessible, interoperable, and reusable (FAIR) principles were proposed in 2016 as prerequisites for proper data management and stewardship, with the goal of enabling the reusability of scholarly data. The principles were also meant to apply to other digital assets, at a high level, and over time, the FAIR guiding principles have been re-interpreted or extended to i…

    Submitted 1 August, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

    Comments: 10 pages, comments welcome!; v2: 12 pages, accepted to Scientific Data

    ACM Class: I.2.0; E.0

    Journal ref: Scientific Data 10, 487 (2023)

  14. arXiv:2208.05484  [pdf, other]

    hep-ph cs.LG hep-ex

    Neural Embedding: Learning the Embedding of the Manifold of Physics Data

    Authors: Sang Eon Park, Philip Harris, Bryan Ostdiek

    Abstract: In this paper, we present a method of embedding physics data manifolds with metric structure into lower dimensional spaces with simpler metrics, such as Euclidean and Hyperbolic spaces. We then demonstrate that it can be a powerful step in the data analysis pipeline for many applications. Using progressively more realistic simulated collisions at the Large Hadron Collider, we show that this embedd…

    Submitted 14 August, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  15. arXiv:2207.09060  [pdf, other]

    physics.ed-ph cs.LG hep-ex physics.comp-ph

    Data Science and Machine Learning in Education

    Authors: Gabriele Benelli, Thomas Y. Chen, Javier Duarte, Matthew Feickert, Matthew Graham, Lindsey Gray, Dan Hackett, Phil Harris, Shih-Chieh Hsu, Gregor Kasieczka, Elham E. Khoda, Matthias Komm, Mia Liu, Mark S. Neubauer, Scarlet Norberg, Alexx Perloff, Marcel Rieger, Claire Savard, Kazuhiro Terao, Savannah Thais, Avik Roy, Jean-Roch Vlimant, Grigorios Chachamis

    Abstract: The growing role of data science (DS) and machine learning (ML) in high-energy physics (HEP) is well established and pertinent given the complex detectors, large data sets, and sophisticated analyses at the heart of HEP research. Moreover, exploiting symmetries inherent in physics data has inspired physics-informed ML as a vibrant sub-field of computer science research. HEP researchers benefit gr…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Contribution to Snowmass 2021

  16. arXiv:2207.00559  [pdf, other]

    cs.LG hep-ex physics.ins-det stat.ML

    Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml

    Authors: Elham E Khoda, Dylan Rankin, Rafael Teixeira de Lima, Philip Harris, Scott Hauck, Shih-Chieh Hsu, Michael Kagan, Vladimir Loncar, Chaitanya Paikara, Richa Rao, Sioni Summers, Caterina Vernieri, Aaron Wang

    Abstract: Recurrent neural networks have been shown to be effective architectures for many tasks in high energy physics, and thus have been widely adopted. Their use in low-latency environments has, however, been limited as a result of the difficulties of implementing recurrent architectures on field-programmable gate arrays (FPGAs). In this paper we present an implementation of two types of recurrent neura…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: 12 pages, 6 figures, 5 tables

  17. arXiv:2205.07690  [pdf, other]

    cs.CV cs.AR cs.LG physics.ins-det stat.ML

    Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml

    Authors: Nicolò Ghielmetti, Vladimir Loncar, Maurizio Pierini, Marcel Roed, Sioni Summers, Thea Aarrestad, Christoffer Petersson, Hampus Linander, Jennifer Ngadiuba, Kelvin Lin, Philip Harris

    Abstract: In this paper, we investigate how field programmable gate arrays can serve as hardware accelerators for real-time semantic segmentation tasks relevant for autonomous driving. Considering compressed versions of the ENet convolutional neural network architecture, we demonstrate a fully-on-chip deployment with a latency of 4.9 ms per image, using less than 30% of the available resources on a Xilinx Z…

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 11 pages, 6 tables, 5 figures

  18. arXiv:2203.16255  [pdf, other]

    cs.LG gr-qc hep-ex physics.ins-det

    Physics Community Needs, Tools, and Resources for Machine Learning

    Authors: Philip Harris, Erik Katsavounidis, William Patrick McCormack, Dylan Rankin, Yongbin Feng, Abhijith Gandrakota, Christian Herwig, Burt Holzman, Kevin Pedro, Nhan Tran, Tingjun Yang, Jennifer Ngadiuba, Michael Coughlin, Scott Hauck, Shih-Chieh Hsu, Elham E Khoda, Deming Chen, Mark Neubauer, Javier Duarte, Georgia Karagiorgi, Mia Liu

    Abstract: Machine learning (ML) is becoming an increasingly important component of cutting-edge physics research, but its computational requirements present significant challenges. In this white paper, we discuss the needs of the physics community regarding ML across latency and throughput regimes, the tools and resources that offer the possibility of addressing these needs, and how these can be best utiliz…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Contribution to Snowmass 2021, 33 pages, 5 figures

  19. arXiv:2110.13041  [pdf, other]

    cs.LG cs.AR physics.data-an physics.ins-det

    Applications and Techniques for Fast Machine Learning in Science

    Authors: Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci-Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bahr, Jurgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomas E. Muller Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Kyle J Hazelwood, et al. (62 additional authors not shown)

    Abstract: In this community review report, we discuss applications and techniques for fast machine learning (ML) in science -- the concept of integrating powerful ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML ac…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 66 pages, 13 figures, 5 tables

    Report number: FERMILAB-PUB-21-502-AD-E-SCD

    Journal ref: Front. Big Data 5, 787421 (2022)

  20. arXiv:2108.02214  [pdf, other]

    hep-ex cs.AI cs.DB hep-ph

    A FAIR and AI-ready Higgs boson decay dataset

    Authors: Yifan Chen, E. A. Huerta, Javier Duarte, Philip Harris, Daniel S. Katz, Mark S. Neubauer, Daniel Diaz, Farouk Mokhtar, Raghav Kansal, Sang Eon Park, Volodymyr V. Kindratenko, Zhizhen Zhao, Roger Rusack

    Abstract: To enable the reusability of massive scientific datasets by humans and machines, researchers aim to adhere to the principles of findability, accessibility, interoperability, and reusability (FAIR) for data and artificial intelligence (AI) models. This article provides a domain-agnostic, step-by-step assessment guide to evaluate whether or not a given dataset meets these principles. We demonstrate…

    Submitted 16 February, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 13 pages, 3 figures. v2: Accepted to Nature Scientific Data. Learn about the FAIR4HEP project at https://fair4hep.github.io. See our invited Behind the Paper Blog in Springer Nature Research Data Community at https://go.nature.com/3oMVYxo

    ACM Class: I.2; J.2

    Journal ref: Scientific Data volume 9, Article number: 31 (2022)

  21. Robustness of convolutional neural networks to physiological ECG noise

    Authors: J. Venton, P. M. Harris, A. Sundar, N. A. S. Smith, P. J. Aston

    Abstract: The electrocardiogram (ECG) is one of the most widespread diagnostic tools in healthcare and supports the diagnosis of cardiovascular disorders. Deep learning methods are a successful and popular technique to detect indications of disorders from an ECG signal. However, there are open questions around the robustness of these methods to various factors, including physiological ECG noise. In this stu…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 16 pages, 7 figures

  22. arXiv:2105.01683  [pdf, other]

    physics.ins-det cs.LG hep-ex

    A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

    Authors: Giuseppe Di Guglielmo, Farah Fahim, Christian Herwig, Manuel Blanco Valentin, Javier Duarte, Cristian Gingu, Philip Harris, James Hirschauer, Martin Kwok, Vladimir Loncar, Yingyi Luo, Llovizna Miranda, Jennifer Ngadiuba, Daniel Noonan, Seda Ogrenci-Memik, Maurizio Pierini, Sioni Summers, Nhan Tran

    Abstract: Despite advances in the programmable logic capabilities of modern trigger systems, a significant bottleneck remains in the amount of data to be transported from the detector to off-detector logic where trigger decisions are made. We demonstrate that a neural network autoencoder model can be implemented in a radiation tolerant ASIC to perform lossy data compression alleviating the data transmission…

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: 9 pages, 8 figures, 3 tables

    Report number: FERMILAB-PUB-21-217-CMS-E-SCD

    Journal ref: IEEE Trans. Nucl. Sci. 68, 2179 (2021)

  23. arXiv:2103.05579  [pdf, other]

    cs.LG cs.AR physics.ins-det

    hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

    Authors: Farah Fahim, Benjamin Hawks, Christian Herwig, James Hirschauer, Sergo Jindariani, Nhan Tran, Luca P. Carloni, Giuseppe Di Guglielmo, Philip Harris, Jeffrey Krupa, Dylan Rankin, Manuel Blanco Valentin, Josiah Hester, Yingyi Luo, John Mamish, Seda Ogrenci-Memik, Thea Aarrestad, Hamza Javed, Vladimir Loncar, Maurizio Pierini, Adrian Alan Pol, Sioni Summers, Javier Duarte, Scott Hauck, Shih-Chieh Hsu, et al. (5 additional authors not shown)

    Abstract: Accessible machine learning algorithms, software, and diagnostic tools for energy-efficient devices and systems are extremely valuable across a broad range of application domains. In scientific domains, real-time near-sensor processing can drastically improve experimental design and accelerate scientific discoveries. To support domain scientists, we have developed hls4ml, an open-source software-h…

    Submitted 23 March, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: 10 pages, 8 figures, TinyML Research Symposium 2021

    Report number: FERMILAB-CONF-21-080-SCD

  24. arXiv:2103.00560  [pdf, other]

    cs.CV q-bio.QM

    Perspectives on individual animal identification from biology and computer vision

    Authors: Maxime Vidal, Nathan Wolf, Beth Rosenberg, Bradley P. Harris, Alexander Mathis

    Abstract: Identifying individual animals is crucial for many biological investigations. In response to some of the limitations of current identification methods, new automated computer vision approaches have emerged with strong performance. Here, we review current advances of computer vision identification techniques to provide both computer scientists and biologists with an overview of the available tools…

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: 12 pages, 1 figure, 2 boxes and 1 table

    Journal ref: Integr Comp Biol. 2021 Oct 4;61(3):900-916

  25. arXiv:2101.05108  [pdf, other]

    cs.LG cs.CV hep-ex physics.ins-det stat.ML

    Fast convolutional neural networks on FPGAs with hls4ml

    Authors: Thea Aarrestad, Vladimir Loncar, Nicolò Ghielmetti, Maurizio Pierini, Sioni Summers, Jennifer Ngadiuba, Christoffer Petersson, Hampus Linander, Yutaro Iiyama, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Dylan Rankin, Sergo Jindariani, Kevin Pedro, Nhan Tran, Mia Liu, Edward Kreinar, Zhenbin Wu, Duc Hoang

    Abstract: We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with convolutional layers on FPGAs. By extending the hls4ml library, we demonstrate an inference latency of 5 μs using convolutional architectures, targeting microsecond latency applications like those at the CERN Large Hadron Collider. Considering benchmark models trained on the Street View House Num…

    Submitted 29 April, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 18 pages, 18 figures, 4 tables

    Journal ref: Mach. Learn.: Sci. Technol. 2 045015 (2021)

  26. arXiv:2012.01563  [pdf, other]

    physics.ins-det cs.LG hep-ex physics.comp-ph

    Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

    Authors: Aneesh Heintz, Vesal Razavimaleki, Javier Duarte, Gage DeZoort, Isobel Ojalvo, Savannah Thais, Markus Atkinson, Mark Neubauer, Lindsey Gray, Sergo Jindariani, Nhan Tran, Philip Harris, Dylan Rankin, Thea Aarrestad, Vladimir Loncar, Maurizio Pierini, Sioni Summers, Jennifer Ngadiuba, Mia Liu, Edward Kreinar, Zhenbin Wu

    Abstract: We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, an…

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: 8 pages, 4 figures, To appear in Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020)

    Report number: FERMILAB-CONF-20-622-CMS-SCD

  27. arXiv:2010.08556  [pdf, other]

    physics.comp-ph cs.DC hep-ex physics.data-an physics.ins-det

    FPGAs-as-a-Service Toolkit (FaaST)

    Authors: Dylan Sheldon Rankin, Jeffrey Krupa, Philip Harris, Maria Acosta Flechas, Burt Holzman, Thomas Klijnsma, Kevin Pedro, Nhan Tran, Scott Hauck, Shih-Chieh Hsu, Matthew Trahms, Kelvin Lin, Yu Lou, Ta-Wei Ho, Javier Duarte, Mia Liu

    Abstract: Computing needs for high energy physics are already intensive and are expected to increase drastically in the coming years. In this context, heterogeneous computing, specifically as-a-service computing, has the potential for significant gains over traditional computing models. Although previous studies and packages in the field of heterogeneous computing have focused on GPUs as accelerators, FPGAs…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 10 pages, 7 figures, to appear in proceedings of the 2020 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing

    Report number: FERMILAB-CONF-20-426-SCD

    Journal ref: 2020 IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC), 2020, pp. 38-47

  28. arXiv:2009.04509  [pdf, other]

    physics.comp-ph cs.DC hep-ex physics.data-an

    GPU-accelerated machine learning inference as a service for computing in neutrino experiments

    Authors: Michael Wang, Tingjun Yang, Maria Acosta Flechas, Philip Harris, Benjamin Hawks, Burt Holzman, Kyle Knoepfel, Jeffrey Krupa, Kevin Pedro, Nhan Tran

    Abstract: Machine learning algorithms are becoming increasingly prevalent and performant in the reconstruction of events in accelerator-based neutrino experiments. These sophisticated algorithms can be computationally expensive. At the same time, the data volumes of such experiments are rapidly increasing. The demand to process billions of neutrino events with many machine learning algorithm inferences crea…

    Submitted 22 March, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: 15 pages, 7 figures, 2 tables

    Report number: FERMILAB-PUB-20-428-ND-SCD

  29. arXiv:2008.03601  [pdf, other]

    physics.ins-det cs.LG hep-ex

    Distance-Weighted Graph Neural Networks on FPGAs for Real-Time Particle Reconstruction in High Energy Physics

    Authors: Yutaro Iiyama, Gianluca Cerminara, Abhijay Gupta, Jan Kieseler, Vladimir Loncar, Maurizio Pierini, Shah Rukh Qasim, Marcel Rieger, Sioni Summers, Gerrit Van Onsem, Kinga Wozniak, Jennifer Ngadiuba, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Dylan Rankin, Sergo Jindariani, Mia Liu, Kevin Pedro, Nhan Tran, Edward Kreinar, Zhenbin Wu

    Abstract: Graph neural networks have been shown to achieve excellent performance for several crucial tasks in particle physics, such as charged particle tracking, jet tagging, and clustering. An important domain for the application of these networks is the FPGA-based first layer of real-time data filtering at the CERN Large Hadron Collider, which has strict latency and resource constraints. We discuss how t…

    Submitted 3 February, 2021; v1 submitted 8 August, 2020; originally announced August 2020.

    Comments: 15 pages, 4 figures

    Report number: FERMILAB-PUB-20-405-E-SCD

    Journal ref: Frontiers in Big Data 3 (2021) 44

  30. arXiv:2007.10359  [pdf, other]

    physics.comp-ph cs.DC hep-ex physics.data-an physics.ins-det

    GPU coprocessors as a service for deep learning inference in high energy physics

    Authors: Jeffrey Krupa, Kelvin Lin, Maria Acosta Flechas, Jack Dinsmore, Javier Duarte, Philip Harris, Scott Hauck, Burt Holzman, Shih-Chieh Hsu, Thomas Klijnsma, Mia Liu, Kevin Pedro, Dylan Rankin, Natchanon Suaysom, Matt Trahms, Nhan Tran

    Abstract: In the next decade, the demands for computing in large scientific experiments are expected to grow tremendously. During the same time period, CPU performance increases will be limited. At the CERN Large Hadron Collider (LHC), these two issues will confront one another as the collider is upgraded for high luminosity running. Alternative processors such as graphics processing units (GPUs) can resolv…

    Submitted 23 April, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: 26 pages, 7 figures, 2 tables

    Report number: FERMILAB-PUB-20-338-E-SCD

    Journal ref: Mach. Learn.: Sci. Technol. 2 (2021) 035005

  31. arXiv:2004.00606  [pdf, ps, other]

    math.CO cs.DM

    Tipsy cop and drunken robber: a variant of the cop and robber game on graphs

    Authors: Pamela Harris, Erik Insko, Alicia Prieto-Langarica, Rade Stoisavljevic, Shaun Sullivan

    Abstract: Motivated by a biological scenario illustrated in the YouTube video https://www.youtube.com/watch?v=Z_mXDvZQ6dU where a neutrophil chases a bacteria cell moving in random directions, we present a variant of the cop and robber game on graphs called the tipsy cop and drunken robber game. In this game, we place a tipsy cop and a drunken robber at different vertices of a finite connected graph…

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: 18 pages

    MSC Class: 05A05; 05C25; 05C30; 05C78; 05C85

  32. arXiv:2003.06308  [pdf, other]

    cs.LG eess.SP hep-ex

    Compressing deep neural networks on FPGAs to binary and ternary precision with HLS4ML

    Authors: Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Duc Hoang, Sergo Jindariani, Edward Kreinar, Mia Liu, Vladimir Loncar, Jennifer Ngadiuba, Kevin Pedro, Maurizio Pierini, Dylan Rankin, Sheila Sagear, Sioni Summers, Nhan Tran, Zhenbin Wu

    Abstract: We present the implementation of binary and ternary neural networks in the hls4ml library, designed to automatically convert deep neural network models to digital circuits with FPGA firmware. Starting from benchmark models trained with floating point precision, we investigate different strategies to reduce the network's resource consumption by reducing the numerical precision of the network parame…

    Submitted 29 June, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: Update to MLST journal version

    Report number: FERMILAB-PUB-20-167-PPD-SCD

    Journal ref: Mach. Learn.: Sci. Technol. 2, 015001 (2020)

  33. arXiv:2002.02534  [pdf, other]

    physics.comp-ph astro-ph.IM cs.LG hep-ex

    Fast inference of Boosted Decision Trees in FPGAs for particle physics

    Authors: Sioni Summers, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Duc Hoang, Sergo Jindariani, Edward Kreinar, Vladimir Loncar, Jennifer Ngadiuba, Maurizio Pierini, Dylan Rankin, Nhan Tran, Zhenbin Wu

    Abstract: We describe the implementation of Boosted Decision Trees in the hls4ml library, which allows the translation of a trained model into FPGA firmware through an automated conversion process. Thanks to its fully on-chip implementation, hls4ml performs inference of Boosted Decision Tree models with extremely low latency. With a typical latency less than 100 ns, this solution is suitable for FPGA-based…

    Submitted 19 February, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

    Journal ref: JINST 15 P05026 (2020)

  34. arXiv:1911.05796  [pdf, ps, other]

    astro-ph.IM cs.AI physics.soc-ph

    Response to NITRD, NCO, NSF Request for Information on "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan"

    Authors: J. Amundson, J. Annis, C. Avestruz, D. Bowring, J. Caldeira, G. Cerati, C. Chang, S. Dodelson, D. Elvira, A. Farahi, K. Genser, L. Gray, O. Gutsche, P. Harris, J. Kinney, J. B. Kowalkowski, R. Kutschke, S. Mrenna, B. Nord, A. Para, K. Pedro, G. N. Perdue, A. Scheinker, P. Spentzouris, J. St. John, et al. (5 additional authors not shown)

    Abstract: We present a response to the 2018 Request for Information (RFI) from the NITRD, NCO, NSF regarding the "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan." Through this document, we provide a response to the question of whether and how the National Artificial Intelligence Research and Development Strategic Plan (NAIRDSP) should be updated from the perspect…

    Submitted 4 November, 2019; originally announced November 2019.

    Report number: FERMILAB-FN-1092-SCD

  35. arXiv:1804.06913  [pdf, other]

    physics.ins-det cs.CV hep-ex stat.ML

    Fast inference of deep neural networks in FPGAs for particle physics

    Authors: Javier Duarte, Song Han, Philip Harris, Sergo Jindariani, Edward Kreinar, Benjamin Kreis, Jennifer Ngadiuba, Maurizio Pierini, Ryan Rivera, Nhan Tran, Zhenbin Wu

    Abstract: Recent results at the Large Hadron Collider (LHC) have pointed to enhanced physics capabilities through the improvement of the real-time event processing techniques. Machine learning methods are ubiquitous and have proven to be very powerful in LHC physics, and particle physics as a whole. However, exploration of the use of such techniques in low-latency, low-power FPGA hardware has only just begu…

    Submitted 28 June, 2018; v1 submitted 16 April, 2018; originally announced April 2018.

    Comments: 22 pages, 17 figures, 2 tables, JINST revision

    Report number: FERMILAB-PUB-18-089-E

    Journal ref: JINST 13 P07027 (2018)

  36. Achievable Rates and Training Overheads for a Measured LOS Massive MIMO Channel

    Authors: Paul Harris, Wael Boukley Hasan, Liang Liu, Steffen Malkowsky, Mark Beach, Simon Armour, Fredrik Tufvesson, Ove Edfors

    Abstract: This paper presents achievable uplink (UL) sumrate predictions for a measured line-of-sight (LOS) massive multiple-input, multiple-output (MIMO) (MMIMO) scenario and illustrates the trade-off between spatial multiplexing performance and channel de-coherence rate for an increasing number of base station (BS) antennas. In addition, an orthogonal frequency division multiplexing (OFDM) case study is f…

    Submitted 22 February, 2018; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: 4 pages, 5 figures

    Journal ref: IEEE Wireless Communications Letters 2018

  37. An Overview of Massive MIMO Research at the University of Bristol

    Authors: Paul Harris, Wael Boukley Hasan, Henry Brice, Benny Chitambira, Mark Beach, Evangelos Mellios, Andrew Nix, Simon Armour, Angela Doufexi

    Abstract: Massive MIMO has rapidly gained popularity as a technology crucial to the capacity advances required for 5G wireless systems. Since its theoretical conception six years ago, research activity has grown exponentially, and there is now a developing industrial interest to commercialise the technology. For this to happen effectively, we believe it is crucial that further pragmatic research is conducte…

    Submitted 21 May, 2017; originally announced May 2017.

    Comments: Presented at the IET Radio Propagation and Technologies for 5G Conference (2016). 5 pages

  38. arXiv:1703.04723  [pdf, other]

    cs.IT

    Temporal Analysis of Measured LOS Massive MIMO Channels with Mobility

    Authors: Paul Harris, Steffen Malkowsky, Joao Vieira, Fredrik Tufvesson, Wael Boukley Hasan, Liang Liu, Mark Beach, Simon Armour, Ove Edfors

    Abstract: The first measured results for massive multiple-input, multiple-output (MIMO) performance in a line-of-sight (LOS) scenario with moderate mobility are presented, with 8 users served by a 100-antenna base station (BS) at 3.7 GHz. When such a large number of channels dynamically change, the inherent propagation and processing delay has a critical relationship with the rate of change, as the use of o…

    Submitted 14 March, 2017; originally announced March 2017.

    Comments: Accepted for presentation at the 85th IEEE Vehicular Technology Conference in Sydney. 5 Pages. arXiv admin note: substantial text overlap with arXiv:1701.08818

  39. Performance Characterization of a Real-Time Massive MIMO System with LOS Mobile Channels

    Authors: Paul Harris, Steffen Malkowsky, Joao Vieira, Fredrik Tufvesson, Wael Boukley Hassan, Liang Liu, Mark Beach, Simon Armour, Ove Edfors

    Abstract: The first measured results for massive MIMO performance in a line-of-sight (LOS) scenario with moderate mobility are presented, with 8 users served in real-time using a 100-antenna base station (BS) at 3.7 GHz. When such a large number of channels dynamically change, the inherent propagation and processing delay has a critical relationship with the rate of change, as the use of outdated channel in…

    Submitted 19 May, 2017; v1 submitted 30 January, 2017; originally announced January 2017.

    Comments: Submitted to the 2017 IEEE JSAC Special Issue on Deployment Issues and Performance Challenges for 5G, IEEE Journal on Selected Areas in Communications, 2017, vol.PP, no.99, pp.1-1

  40. arXiv:1701.01161  [pdf, other]

    cs.IT

    The World's First Real-Time Testbed for Massive MIMO: Design, Implementation, and Validation

    Authors: Steffen Malkowsky, Joao Vieira, Liang Liu, Paul Harris, Karl Nieman, Nikhil Kundargi, Ian Wong, Fredrik Tufvesson, Viktor Öwall, Ove Edfors

    Abstract: This paper sets up a framework for designing a massive multiple-input multiple-output (MIMO) testbed by investigating hardware (HW) and system-level requirements such as processing complexity, duplexing mode and frame structure. Taking these into account, a generic system and processing partitioning is proposed which allows flexible scaling and processing distribution onto a multitude of physicall…

    Submitted 16 May, 2017; v1 submitted 20 December, 2016; originally announced January 2017.

    Comments: 15 pages, accepted for publication in IEEE Access