Showing 1–21 of 21 results for author: Lončar, V

Searching in archive physics.
  1. arXiv:2405.00645  [pdf, other]

    cs.LG physics.ins-det

    Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip

    Authors: Chang Sun, Thea K. Årrestad, Vladimir Loncar, Jennifer Ngadiuba, Maria Spiropulu

    Abstract: Model size and inference speed at deployment time are major challenges in many deep learning applications. A promising strategy to overcome these challenges is quantization. However, a straightforward uniform quantization to very low precision can result in significant accuracy loss. Mixed-precision quantization, based on the idea that certain parts of the network can accommodate lower precision…

    Submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2402.01876  [pdf, other]

    hep-ex cs.LG physics.ins-det

    Ultrafast jet classification on FPGAs for the HL-LHC

    Authors: Patrick Odagiu, Zhiqiang Que, Javier Duarte, Johannes Haller, Gregor Kasieczka, Artur Lobanov, Vladimir Loncar, Wayne Luk, Jennifer Ngadiuba, Maurizio Pierini, Philipp Rincke, Arpita Seksaria, Sioni Summers, Andre Sznajder, Alexander Tapper, Thea K. Aarrestad

    Abstract: Three machine learning models are used to perform jet origin classification. These models are optimized for deployment on a field-programmable gate array device. In this context, we demonstrate how latency and resource consumption scale with the input size and choice of algorithm. Moreover, the models proposed here are designed to work on the type of data and under the foreseen conditions at the C…

    Submitted 4 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 13 pages, 3 figures, 3 tables. Mach. Learn.: Sci. Technol (2024)

    Report number: FERMILAB-PUB-24-0030-CMS-CSAID-PPD

  3. arXiv:2401.09949  [pdf, other]

    cs.LG hep-ex physics.ins-det

    SymbolNet: Neural Symbolic Regression with Adaptive Dynamic Pruning

    Authors: Ho Fung Tsoi, Vladimir Loncar, Sridhara Dasu, Philip Harris

    Abstract: Contrary to the use of genetic programming, the neural network approach to symbolic regression can scale well with high input dimension and leverage gradient methods for faster equation searching. Common ways of constraining expression complexity have relied on multistage pruning methods with fine-tuning, but these often lead to significant performance loss. In this work, we propose SymbolNet, a n…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 11 pages. Submitted to IEEE TNNLS, under review

  4. arXiv:2305.04099  [pdf, other]

    cs.LG hep-ex physics.ins-det

    Symbolic Regression on FPGAs for Fast Machine Learning Inference

    Authors: Ho Fung Tsoi, Adrian Alan Pol, Vladimir Loncar, Ekaterina Govorkova, Miles Cranmer, Sridhara Dasu, Peter Elmer, Philip Harris, Isobel Ojalvo, Maurizio Pierini

    Abstract: The high-energy physics community is investigating the potential of deploying machine-learning-based solutions on Field-Programmable Gate Arrays (FPGAs) to enhance physics sensitivity while still meeting data processing time constraints. In this contribution, we introduce a novel end-to-end procedure that utilizes a machine learning technique called symbolic regression (SR). It searches the equati…

    Submitted 17 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 9 pages. Accepted to 26th International Conference on Computing in High Energy & Nuclear Physics (CHEP 2023)

    Journal ref: EPJ Web of Conferences 295, 09036 (2024)

  5. arXiv:2207.00559  [pdf, other]

    cs.LG hep-ex physics.ins-det stat.ML

    Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml

    Authors: Elham E Khoda, Dylan Rankin, Rafael Teixeira de Lima, Philip Harris, Scott Hauck, Shih-Chieh Hsu, Michael Kagan, Vladimir Loncar, Chaitanya Paikara, Richa Rao, Sioni Summers, Caterina Vernieri, Aaron Wang

    Abstract: Recurrent neural networks have been shown to be effective architectures for many tasks in high energy physics, and thus have been widely adopted. Their use in low-latency environments has, however, been limited as a result of the difficulties of implementing recurrent architectures on field-programmable gate arrays (FPGAs). In this paper we present an implementation of two types of recurrent neura…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: 12 pages, 6 figures, 5 tables

  6. arXiv:2205.07690  [pdf, other]

    cs.CV cs.AR cs.LG physics.ins-det stat.ML

    Real-time semantic segmentation on FPGAs for autonomous vehicles with hls4ml

    Authors: Nicolò Ghielmetti, Vladimir Loncar, Maurizio Pierini, Marcel Roed, Sioni Summers, Thea Aarrestad, Christoffer Petersson, Hampus Linander, Jennifer Ngadiuba, Kelvin Lin, Philip Harris

    Abstract: In this paper, we investigate how field programmable gate arrays can serve as hardware accelerators for real-time semantic segmentation tasks relevant for autonomous driving. Considering compressed versions of the ENet convolutional neural network architecture, we demonstrate a fully-on-chip deployment with a latency of 4.9 ms per image, using less than 30% of the available resources on a Xilinx Z…

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 11 pages, 6 tables, 5 figures

  7. arXiv:2108.03986  [pdf, other]

    physics.ins-det hep-ex

    Autoencoders on FPGAs for real-time, unsupervised new physics detection at 40 MHz at the Large Hadron Collider

    Authors: Ekaterina Govorkova, Ema Puljak, Thea Aarrestad, Thomas James, Vladimir Loncar, Maurizio Pierini, Adrian Alan Pol, Nicolò Ghielmetti, Maksymilian Graczyk, Sioni Summers, Jennifer Ngadiuba, Thong Q. Nguyen, Javier Duarte, Zhenbin Wu

    Abstract: In this paper, we show how to adapt and deploy anomaly detection algorithms based on deep autoencoders, for the unsupervised detection of new physics signatures in the extremely challenging environment of a real-time event selection system at the Large Hadron Collider (LHC). We demonstrate that new physics signatures can be enhanced by three orders of magnitude, while staying within the strict lat…

    Submitted 12 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Report number: FERMILAB-PUB-21-487-CMS

    Journal ref: Nature Machine Intelligence 4, 154 (2022)

  8. arXiv:2106.14089  [pdf, other]

    cs.LG cs.AR physics.ins-det

    Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

    Authors: Zhiqiang Que, Erwei Wang, Umar Marikar, Eric Moreno, Jennifer Ngadiuba, Hamza Javed, Bartłomiej Borzyszkowski, Thea Aarrestad, Vladimir Loncar, Sioni Summers, Maurizio Pierini, Peter Y Cheung, Wayne Luk

    Abstract: This paper presents novel reconfigurable architectures for reducing the latency of recurrent neural networks (RNNs) that are used for detecting gravitational waves. Gravitational interferometers such as the LIGO detectors capture cosmic events such as black hole mergers which happen at unknown times and of varying durations, producing time-series data. We have developed a new architecture capable…

    Submitted 26 June, 2021; originally announced June 2021.

    Comments: Accepted at the 2021 32nd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP)

  9. arXiv:2105.01683  [pdf, other]

    physics.ins-det cs.LG hep-ex

    A reconfigurable neural network ASIC for detector front-end data compression at the HL-LHC

    Authors: Giuseppe Di Guglielmo, Farah Fahim, Christian Herwig, Manuel Blanco Valentin, Javier Duarte, Cristian Gingu, Philip Harris, James Hirschauer, Martin Kwok, Vladimir Loncar, Yingyi Luo, Llovizna Miranda, Jennifer Ngadiuba, Daniel Noonan, Seda Ogrenci-Memik, Maurizio Pierini, Sioni Summers, Nhan Tran

    Abstract: Despite advances in the programmable logic capabilities of modern trigger systems, a significant bottleneck remains in the amount of data to be transported from the detector to off-detector logic where trigger decisions are made. We demonstrate that a neural network autoencoder model can be implemented in a radiation tolerant ASIC to perform lossy data compression alleviating the data transmission…

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: 9 pages, 8 figures, 3 tables

    Report number: FERMILAB-PUB-21-217-CMS-E-SCD

    Journal ref: IEEE Trans. Nucl. Sci. 68, 2179 (2021)

  10. arXiv:2103.05579  [pdf, other]

    cs.LG cs.AR physics.ins-det

    hls4ml: An Open-Source Codesign Workflow to Empower Scientific Low-Power Machine Learning Devices

    Authors: Farah Fahim, Benjamin Hawks, Christian Herwig, James Hirschauer, Sergo Jindariani, Nhan Tran, Luca P. Carloni, Giuseppe Di Guglielmo, Philip Harris, Jeffrey Krupa, Dylan Rankin, Manuel Blanco Valentin, Josiah Hester, Yingyi Luo, John Mamish, Seda Ogrenci-Memik, Thea Aarrestad, Hamza Javed, Vladimir Loncar, Maurizio Pierini, Adrian Alan Pol, Sioni Summers, Javier Duarte, Scott Hauck, Shih-Chieh Hsu, et al. (5 additional authors not shown)

    Abstract: Accessible machine learning algorithms, software, and diagnostic tools for energy-efficient devices and systems are extremely valuable across a broad range of application domains. In scientific domains, real-time near-sensor processing can drastically improve experimental design and accelerate scientific discoveries. To support domain scientists, we have developed hls4ml, an open-source software-h…

    Submitted 23 March, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: 10 pages, 8 figures, TinyML Research Symposium 2021

    Report number: FERMILAB-CONF-21-080-SCD

  11. arXiv:2101.05108  [pdf, other]

    cs.LG cs.CV hep-ex physics.ins-det stat.ML

    Fast convolutional neural networks on FPGAs with hls4ml

    Authors: Thea Aarrestad, Vladimir Loncar, Nicolò Ghielmetti, Maurizio Pierini, Sioni Summers, Jennifer Ngadiuba, Christoffer Petersson, Hampus Linander, Yutaro Iiyama, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Dylan Rankin, Sergo Jindariani, Kevin Pedro, Nhan Tran, Mia Liu, Edward Kreinar, Zhenbin Wu, Duc Hoang

    Abstract: We introduce an automated tool for deploying ultra low-latency, low-power deep neural networks with convolutional layers on FPGAs. By extending the hls4ml library, we demonstrate an inference latency of $5\,μ$s using convolutional architectures, targeting microsecond latency applications like those at the CERN Large Hadron Collider. Considering benchmark models trained on the Street View House Num…

    Submitted 29 April, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 18 pages, 18 figures, 4 tables

    Journal ref: Mach. Learn.: Sci. Technol. 2 045015 (2021)

  12. arXiv:2012.01563  [pdf, other]

    physics.ins-det cs.LG hep-ex physics.comp-ph

    Accelerated Charged Particle Tracking with Graph Neural Networks on FPGAs

    Authors: Aneesh Heintz, Vesal Razavimaleki, Javier Duarte, Gage DeZoort, Isobel Ojalvo, Savannah Thais, Markus Atkinson, Mark Neubauer, Lindsey Gray, Sergo Jindariani, Nhan Tran, Philip Harris, Dylan Rankin, Thea Aarrestad, Vladimir Loncar, Maurizio Pierini, Sioni Summers, Jennifer Ngadiuba, Mia Liu, Edward Kreinar, Zhenbin Wu

    Abstract: We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks. The two complementary FPGA designs are based on OpenCL, a framework for writing programs that execute across heterogeneous platforms, and hls4ml, a high-level-synthesis-based compiler for neural network to firmware conversion. We evaluate and compare the resource usage, latency, an…

    Submitted 30 November, 2020; originally announced December 2020.

    Comments: 8 pages, 4 figures, To appear in Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020)

    Report number: FERMILAB-CONF-20-622-CMS-SCD

  13. arXiv:2008.13636  [pdf, ps, other]

    physics.comp-ph hep-ex

    HL-LHC Computing Review: Common Tools and Community Software

    Authors: HEP Software Foundation: Thea Aarrestad, Simone Amoroso, Markus Julian Atkinson, Joshua Bendavid, Tommaso Boccali, Andrea Bocci, Andy Buckley, Matteo Cacciari, Paolo Calafiura, Philippe Canal, Federico Carminati, Taylor Childers, Vitaliano Ciulli, Gloria Corti, Davide Costanzo, Justin Gage Dezoort, Caterina Doglioni, Javier Mauricio Duarte, Agnieszka Dziurda, Peter Elmer, Markus Elsing, V. Daniel Elvira, Giulio Eulisse, et al. (85 additional authors not shown)

    Abstract: Common and community software packages, such as ROOT, Geant4 and event generators have been a key part of the LHC's success so far and continued development and optimisation will be critical in the future. The challenges are driven by an ambitious physics programme, notably the LHC accelerator upgrade to high-luminosity, HL-LHC, and the corresponding detector upgrades of ATLAS and CMS. In this doc…

    Submitted 31 August, 2020; originally announced August 2020.

    Comments: 40 pages contribution to Snowmass 2021

    Report number: HSF-DOC-2020-01

  14. arXiv:2008.03601  [pdf, other]

    physics.ins-det cs.LG hep-ex

    Distance-Weighted Graph Neural Networks on FPGAs for Real-Time Particle Reconstruction in High Energy Physics

    Authors: Yutaro Iiyama, Gianluca Cerminara, Abhijay Gupta, Jan Kieseler, Vladimir Loncar, Maurizio Pierini, Shah Rukh Qasim, Marcel Rieger, Sioni Summers, Gerrit Van Onsem, Kinga Wozniak, Jennifer Ngadiuba, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Dylan Rankin, Sergo Jindariani, Mia Liu, Kevin Pedro, Nhan Tran, Edward Kreinar, Zhenbin Wu

    Abstract: Graph neural networks have been shown to achieve excellent performance for several crucial tasks in particle physics, such as charged particle tracking, jet tagging, and clustering. An important domain for the application of these networks is the FPGA-based first layer of real-time data filtering at the CERN Large Hadron Collider, which has strict latency and resource constraints. We discuss how t…

    Submitted 3 February, 2021; v1 submitted 8 August, 2020; originally announced August 2020.

    Comments: 15 pages, 4 figures

    Report number: FERMILAB-PUB-20-405-E-SCD

    Journal ref: Frontiers in Big Data 3 (2021) 44

  15. arXiv:2006.10159  [pdf, other]

    physics.ins-det cs.LG eess.IV eess.SP hep-ex

    Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors

    Authors: Claudionor N. Coelho Jr., Aki Kuusela, Shan Li, Hao Zhuang, Thea Aarrestad, Vladimir Loncar, Jennifer Ngadiuba, Maurizio Pierini, Adrian Alan Pol, Sioni Summers

    Abstract: Although the quest for more accurate solutions is pushing deep learning research towards larger and more complex algorithms, edge devices demand efficient inference and therefore reduction in model size, latency and energy consumption. One technique to limit model size is quantization, which implies using fewer bits to represent weights and biases. Such an approach usually results in a decline in…

    Submitted 21 June, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Journal ref: Nature Machine Intelligence, Volume 3 (2021)

  16. arXiv:2002.02534  [pdf, other]

    physics.comp-ph astro-ph.IM cs.LG hep-ex

    Fast inference of Boosted Decision Trees in FPGAs for particle physics

    Authors: Sioni Summers, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Duc Hoang, Sergo Jindariani, Edward Kreinar, Vladimir Loncar, Jennifer Ngadiuba, Maurizio Pierini, Dylan Rankin, Nhan Tran, Zhenbin Wu

    Abstract: We describe the implementation of Boosted Decision Trees in the hls4ml library, which allows the translation of a trained model into FPGA firmware through an automated conversion process. Thanks to its fully on-chip implementation, hls4ml performs inference of Boosted Decision Tree models with extremely low latency. With a typical latency less than 100 ns, this solution is suitable for FPGA-based…

    Submitted 19 February, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

    Journal ref: JINST 15 P05026 (2020)

  17. arXiv:1906.06327  [pdf, other]

    cond-mat.quant-gas nlin.PS physics.comp-ph quant-ph

    C and Fortran OpenMP programs for rotating Bose-Einstein condensates

    Authors: Ramavarmaraja Kishor Kumar, Vladimir Loncar, Paulsamy Muruganandam, Sadhan K. Adhikari, Antun Balaz

    Abstract: We present OpenMP versions of C and Fortran programs for solving the Gross-Pitaevskii equation for a rotating trapped Bose-Einstein condensate (BEC) in two (2D) and three (3D) spatial dimensions. The programs can be used to generate vortex lattices and study dynamics of rotating BECs. We use the split-step Crank-Nicolson algorithm for imaginary- and real-time propagation to calculate stationary st…

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: 12 pages, 7 figures; to download the programs, click 'Other formats' and download the source

    Journal ref: Comput. Phys. Commun. 240 (2019) 74

  18. arXiv:1904.08986  [pdf, other]

    physics.data-an hep-ex physics.comp-ph physics.ins-det

    FPGA-accelerated machine learning inference as a service for particle physics computing

    Authors: Javier Duarte, Philip Harris, Scott Hauck, Burt Holzman, Shih-Chieh Hsu, Sergo Jindariani, Suffian Khan, Benjamin Kreis, Brian Lee, Mia Liu, Vladimir Lončar, Jennifer Ngadiuba, Kevin Pedro, Brandon Perez, Maurizio Pierini, Dylan Rankin, Nhan Tran, Matthew Trahms, Aristeidis Tsaris, Colin Versteeg, Ted W. Way, Dustin Werran, Zhenbin Wu

    Abstract: New heterogeneous computing paradigms on dedicated hardware with increased parallelization, such as Field Programmable Gate Arrays (FPGAs), offer exciting solutions with large potential gains. The growing applications of machine learning algorithms in particle physics for simulation, reconstruction, and analysis are naturally deployed on such platforms. We demonstrate that the acceleration of mach…

    Submitted 16 October, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: 16 pages, 14 figures, 2 tables

    Report number: FERMILAB-PUB-19-170-CD-CMS-E-ND

    Journal ref: Comput Softw Big Sci (2019) 3: 13

  19. arXiv:1709.04423  [pdf, other]

    physics.comp-ph cond-mat.quant-gas cs.MS nlin.PS

    OpenMP GNU and Intel Fortran programs for solving the time-dependent Gross-Pitaevskii equation

    Authors: Luis E. Young-S., Paulsamy Muruganandam, Sadhan K. Adhikari, Vladimir Loncar, Dusan Vudragovic, Antun Balaz

    Abstract: We present Open Multi-Processing (OpenMP) versions of Fortran 90 programs for solving the Gross-Pitaevskii (GP) equation for a Bose-Einstein condensate in one, two, and three spatial dimensions, optimized for use with GNU and Intel compilers. We use the split-step Crank-Nicolson algorithm for imaginary- and real-time propagation, which enables efficient calculation of stationary and non-stationary…

    Submitted 13 September, 2017; originally announced September 2017.

    Comments: 5 pages, 2 figures; to download the programs, click 'Other formats' and download the source

    Journal ref: Comput. Phys. Commun. 220 (2017) 503

  20. arXiv:1610.05329  [pdf, ps, other]

    cond-mat.quant-gas cs.MS nlin.PS physics.comp-ph

    OpenMP, OpenMP/MPI, and CUDA/MPI C programs for solving the time-dependent dipolar Gross-Pitaevskii equation

    Authors: Vladimir Loncar, Luis E. Young-S., Srdjan Skrbic, Paulsamy Muruganandam, Sadhan K. Adhikari, Antun Balaz

    Abstract: We present new versions of the previously published C and CUDA programs for solving the dipolar Gross-Pitaevskii equation in one, two, and three spatial dimensions, which calculate stationary and non-stationary solutions by propagation in imaginary or real time. The presented programs are improved and parallelized versions of previous programs, divided into three packages according to the type of para…

    Submitted 1 August, 2022; v1 submitted 17 October, 2016; originally announced October 2016.

    Comments: 8 pages, 6 figures; to download the programs, click "Other formats" and download the source

    Journal ref: Comput. Phys. Commun. 209 (2016) 190

  21. arXiv:1601.04640  [pdf, ps, other]

    physics.comp-ph cond-mat.quant-gas nlin.PS quant-ph

    CUDA programs for solving the time-dependent dipolar Gross-Pitaevskii equation in an anisotropic trap

    Authors: Vladimir Loncar, Antun Balaz, Aleksandar Bogojevic, Srdjan Skrbic, Paulsamy Muruganandam, Sadhan K. Adhikari

    Abstract: In this paper we present new versions of previously published numerical programs for solving the dipolar Gross-Pitaevskii (GP) equation including the contact interaction in two and three spatial dimensions in imaginary and in real time, yielding both stationary and non-stationary solutions. New versions of the programs were developed using the CUDA toolkit and can make use of Nvidia GPU devices. The algor…

    Submitted 1 August, 2022; v1 submitted 18 January, 2016; originally announced January 2016.

    Comments: 7 pages, 2 figures; to download the programs, click "Other formats" and download the source

    Journal ref: Comput. Phys. Commun. 200 (2016) 406