Search | arXiv e-print repository

APEIRON: composing smart TDAQ systems for high energy physics experiments

Authors: Roberto Ammendola, Andrea Biagioni, Carlotta Chiarini, Andrea Ciardiello, Paolo Cretaro, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Cristian Rossi, Francesco Simula, Matteo Turisini, Piero Vicini

Abstract: APEIRON is a framework encompassing the general architecture of a distributed heterogeneous processing platform and the corresponding software stack, from the low level device drivers up to the high level programming model. The framework is designed to be efficiently used for studying, prototy** and deploying smart trigger and data acquisition (TDAQ) systems for high energy physics experiments. APEIRON is a framework encompassing the general architecture of a distributed heterogeneous processing platform and the corresponding software stack, from the low level device drivers up to the high level programming model. The framework is designed to be efficiently used for studying, prototy** and deploying smart trigger and data acquisition (TDAQ) systems for high energy physics experiments. △ Less

Submitted 3 July, 2023; originally announced July 2023.

Comments: Under review in Journal of Physics: Conference Series (ACAT 2022)

arXiv:2211.16586 [pdf, other]

HIKE, High Intensity Kaon Experiments at the CERN SPS

Authors: E. Cortina Gil, J. Jerhot, N. Lurkin, T. Numao, B. Velghe, V. W. S. Wong, D. Bryman, L. Bician, Z. Hives, T. Husek, K. Kampf, M. Koval, A. T. Akmete, R. Aliberti, V. Büscher, L. Di Lella, N. Doble, L. Peruzzo, M. Schott, H. Wahl, R. Wanke, B. Döbrich, L. Montalto, D. Rinaldi, F. Dettori , et al. (154 additional authors not shown)

Abstract: A timely and long-term programme of kaon decay measurements at a new level of precision is presented, leveraging the capabilities of the CERN Super Proton Synchrotron (SPS). The proposed programme is firmly anchored on the experience built up studying kaon decays at the SPS over the past four decades, and includes rare processes, CP violation, dark sectors, symmetry tests and other tests of the St… ▽ More A timely and long-term programme of kaon decay measurements at a new level of precision is presented, leveraging the capabilities of the CERN Super Proton Synchrotron (SPS). The proposed programme is firmly anchored on the experience built up studying kaon decays at the SPS over the past four decades, and includes rare processes, CP violation, dark sectors, symmetry tests and other tests of the Standard Model. The experimental programme is based on a staged approach involving experiments with charged and neutral kaon beams, as well as operation in beam-dump mode. The various phases will rely on a common infrastructure and set of detectors. △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: Letter of Intent submitted to CERN SPSC. Address all correspondence to [email protected]

Report number: CERN-SPSC-2022-031/SPSC-I-257

arXiv:2202.03942 [pdf, other]

Progress report on the online processing upgrade at the NA62 experiment

Authors: M. Turisini, R. Ammendola, A. Biagioni, A. Ciardiello, P. Cretaro, O. Frezza, G. Lamanna, F. Lo Cicero, A. Lonardo, M. Martinelli, R. Piandani, D. Soldi, P. Vicini

Abstract: A new FPGA-based low-level trigger processor has been installed at the NA62 experiment. It is intended to extend the features of its predecessor due to a faster interconnection technology and additional logic resources available on the new platform. With the aim of improving trigger selectivity and exploring new architectures for complex trigger computation, a GPU system has been developed and a n… ▽ More A new FPGA-based low-level trigger processor has been installed at the NA62 experiment. It is intended to extend the features of its predecessor due to a faster interconnection technology and additional logic resources available on the new platform. With the aim of improving trigger selectivity and exploring new architectures for complex trigger computation, a GPU system has been developed and a neural network on FPGA is in progress. They both process data streams from the Ring Imaging Cherenkov detector of the experiment to extract in real time high level features for the trigger logic. Description of the systems, latest developments and design flows are reported in this paper. △ Less

Submitted 8 February, 2022; originally announced February 2022.

Comments: 5 pages, 3 figures, 1 table. Submitted to JINST as part of the TWEPP2021 conference proceeding (Topical Workshop on Electronics for Particle Physics, 20 - 24 September, 2021, online)

arXiv:2102.11704 [pdf, other]

doi 10.1038/s41550-021-01308-0

Optical and ultraviolet pulsed emission from an accreting millisecond pulsar

Authors: F. Ambrosino, A. Miraval Zanon, A. Papitto, F. Coti Zelati, S. Campana, P. D'Avanzo, L. Stella, T. Di Salvo, L. Burderi, P. Casella, A. Sanna, D. de Martino, M. Cadelano, A. Ghedina, F. Leone, F. Meddi, P. Cretaro, M. C. Baglio, E. Poretti, R. P. Mignani, D. F. Torres, G. L. Israel, M. Cecconi, D. M. Russell, M. D. Gonzalez Gomez , et al. (6 additional authors not shown)

Abstract: Millisecond spinning, low magnetic field neutron stars are believed to attain their fast rotation in a 0.1-1 Gyr-long phase during which they accrete matter endowed with angular momentum from a low-mass companion star. Despite extensive searches, coherent periodicities originating from accreting neutron star magnetospheres have been detected only at X-ray energies and in ~10% of the presently know… ▽ More Millisecond spinning, low magnetic field neutron stars are believed to attain their fast rotation in a 0.1-1 Gyr-long phase during which they accrete matter endowed with angular momentum from a low-mass companion star. Despite extensive searches, coherent periodicities originating from accreting neutron star magnetospheres have been detected only at X-ray energies and in ~10% of the presently known systems. Here we report the detection of optical and ultraviolet coherent pulsations at the X-ray period of the transient low mass X-ray binary system SAX J1808.4-3658, during an accretion outburst that occurred in August 2019. At the time of the observations, the pulsar was surrounded by an accretion disc, displayed X-ray pulsations and its luminosity was consistent with magnetically funneled accretion onto the neutron star. Current accretion models fail to account for the luminosity of both optical and ultraviolet pulsations; these are instead more likely driven by synchro-curvature radiation in the pulsar magnetosphere or just outside of it. This interpretation would imply that particle acceleration can take place even when mass accretion is going on, and opens up new perspectives in the study of coherent optical/UV pulsations from fast spinning accreting neutron stars in low-mass X-ray binary systems. △ Less

Submitted 23 February, 2021; originally announced February 2021.

Comments: 47 pages, 9 figures. The first two authors contributed equally to this work; Nature Astronomy (2021), published on-line on February 22, 2021; doi:10.1038/s41550-021-01308-0

Journal ref: Nature Astronomy (2021)

arXiv:1812.04974 [pdf, other]

doi 10.1109/EMPDP.2019.8671627

Real-time cortical simulations: energy and interconnect scaling on distributed systems

Authors: Francesco Simula, Elena Pastorelli, Pier Stanislao Paolucci, Michele Martinelli, Alessandro Lonardo, Andrea Biagioni, Cristiano Capone, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Luca Pontisso, Piero Vicini, Roberto Ammendola

Abstract: We profile the impact of computation and inter-processor communication on the energy consumption and on the scaling of cortical simulations approaching the real-time regime on distributed computing platforms. Also, the speed and energy consumption of processor architectures typical of standard HPC and embedded platforms are compared. We demonstrate the importance of the design of low-latency inter… ▽ More We profile the impact of computation and inter-processor communication on the energy consumption and on the scaling of cortical simulations approaching the real-time regime on distributed computing platforms. Also, the speed and energy consumption of processor architectures typical of standard HPC and embedded platforms are compared. We demonstrate the importance of the design of low-latency interconnect for speed and energy consumption. The cost of cortical simulations is quantified using the Joule per synaptic event metric on both architectures. Reaching efficient real-time on large scale cortical simulations is of increasing relevance for both future bio-inspired artificial intelligence applications and for understanding the cognitive functions of the brain, a scientific quest that will require to embed large scale simulations into highly complex virtual or real worlds. This work stands at the crossroads between the WaveScalES experiment in the Human Brain Project (HBP), which includes the objective of large scale thalamo-cortical simulations of brain states and their transitions, and the ExaNeSt and EuroExa projects, that investigate the design of an ARM-based, low-power High Performance Computing (HPC) architecture with a dedicated interconnect scalable to million of cores; simulation of deep sleep Slow Wave Activity (SWA) and Asynchronous aWake (AW) regimes expressed by thalamo-cortical models are among their benchmarks. △ Less

Submitted 26 November, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

Comments: 8 pages, 8 figures, 4 tables, submitted after final publication on PDP2019 proceedings, corrected final DOI. arXiv admin note: text overlap with arXiv:1812.04974, arXiv:1804.03441

Journal ref: 27th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP), Pavia, Italy, February 13-15, 2019, pp. 283-290

arXiv:1804.03893 [pdf, other]

doi 10.3233/978-1-61499-843-3-750

Large Scale Low Power Computing System - Status of Network Design in ExaNeSt and EuroExa Projects

Authors: Roberto Ammendola, Andrea Biagioni, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Elena Pastorelli, Luca Pontisso, Francesco Simula, Piero Vicini

Abstract: The deployment of the next generation computing platform at ExaFlops scale requires to solve new technological challenges mainly related to the impressive number (up to 10^6) of compute elements required. This impacts on system power consumption, in terms of feasibility and costs, and on system scalability and computing efficiency. In this perspective analysis, exploration and evaluation of techno… ▽ More The deployment of the next generation computing platform at ExaFlops scale requires to solve new technological challenges mainly related to the impressive number (up to 10^6) of compute elements required. This impacts on system power consumption, in terms of feasibility and costs, and on system scalability and computing efficiency. In this perspective analysis, exploration and evaluation of technologies characterized by low power, high efficiency and high degree of customization is strongly needed. Among the various European initiative targeting the design of ExaFlops system, ExaNeSt and EuroExa are EU-H2020 funded initiatives leveraging on high end MPSoC FPGAs. Last generation MPSoC FPGAs can be seen as non-mainstream but powerful HPC Exascale enabling components thanks to the integration of embedded multi-core, ARM-based low power CPUs and a huge number of hardware resources usable to co-design application oriented accelerators and to develop a low latency high bandwidth network architecture. In this paper we introduce ExaNet the FPGA-based, scalable, direct network architecture of ExaNeSt system. ExaNet allow us to explore different interconnection topologies, to evaluate advanced routing functions for congestion control and fault tolerance and to design specific hardware components for acceleration of collective operations. After a brief introduction of the motivations and goals of ExaNeSt and EuroExa projects, we will report on the status of network architecture design and its hardware/software testbed adding preliminary bandwidth and latency achievements. △ Less

Submitted 11 April, 2018; originally announced April 2018.

Journal ref: (2018) Advances in Parallel Computing, 32, pp. 750-759

arXiv:1804.03441 [pdf, other]

doi 10.3233/978-1-61499-843-3-760

The Brain on Low Power Architectures - Efficient Simulation of Cortical Slow Waves and Asynchronous States

Authors: Roberto Ammendola, Andrea Biagioni, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Elena Pastorelli, Luca Pontisso, Francesco Simula, Piero Vicini

Abstract: Efficient brain simulation is a scientific grand challenge, a parallel/distributed coding challenge and a source of requirements and suggestions for future computing architectures. Indeed, the human brain includes about 10^15 synapses and 10^11 neurons activated at a mean rate of several Hz. Full brain simulation poses Exascale challenges even if simulated at the highest abstraction level. The Wav… ▽ More Efficient brain simulation is a scientific grand challenge, a parallel/distributed coding challenge and a source of requirements and suggestions for future computing architectures. Indeed, the human brain includes about 10^15 synapses and 10^11 neurons activated at a mean rate of several Hz. Full brain simulation poses Exascale challenges even if simulated at the highest abstraction level. The WaveScalES experiment in the Human Brain Project (HBP) has the goal of matching experimental measures and simulations of slow waves during deep-sleep and anesthesia and the transition to other brain states. The focus is the development of dedicated large-scale parallel/distributed simulation technologies. The ExaNeSt project designs an ARM-based, low-power HPC architecture scalable to million of cores, develo** a dedicated scalable interconnect system, and SWA/AW simulations are included among the driving benchmarks. At the joint between both projects is the INFN proprietary Distributed and Plastic Spiking Neural Networks (DPSNN) simulation engine. DPSNN can be configured to stress either the networking or the computation features available on the execution platforms. The simulation stresses the networking component when the neural net - composed by a relatively low number of neurons, each one projecting thousands of synapses - is distributed over a large number of hardware cores. When growing the number of neurons per core, the computation starts to be the dominating component for short range connections. This paper reports about preliminary performance results obtained on an ARM-based HPC prototype developed in the framework of the ExaNeSt project. Furthermore, a comparison is given of instantaneous power, total energy consumption, execution time and energetic cost per synaptic event of SWA/AW DPSNN simulations when executed on either ARM- or Intel-based server platforms. △ Less

Submitted 10 April, 2018; originally announced April 2018.

Journal ref: (2018) Advances in Parallel Computing, 32, pp. 760-769

arXiv:1803.08833 [pdf, other]

doi 10.1109/PDP2018.2018.00110

Gaussian and exponential lateral connectivity on distributed spiking neural network simulation

Authors: Elena Pastorelli, Pier Stanislao Paolucci, Francesco Simula, Andrea Biagioni, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Luca Pontisso, Piero Vicini, Roberto Ammendola

Abstract: We measured the impact of long-range exponentially decaying intra-areal lateral connectivity on the scaling and memory occupation of a distributed spiking neural network simulator compared to that of short-range Gaussian decays. While previous studies adopted short-range connectivity, recent experimental neurosciences studies are pointing out the role of longer-range intra-areal connectivity with… ▽ More We measured the impact of long-range exponentially decaying intra-areal lateral connectivity on the scaling and memory occupation of a distributed spiking neural network simulator compared to that of short-range Gaussian decays. While previous studies adopted short-range connectivity, recent experimental neurosciences studies are pointing out the role of longer-range intra-areal connectivity with implications on neural simulation platforms. Two-dimensional grids of cortical columns composed by up to 11 M point-like spiking neurons with spike frequency adaption were connected by up to 30 G synapses using short- and long-range connectivity models. The MPI processes composing the distributed simulator were run on up to 1024 hardware cores, hosted on a 64 nodes server platform. The hardware platform was a cluster of IBM NX360 M5 16-core compute nodes, each one containing two Intel Xeon Haswell 8-core E5-2630 v3 processors, with a clock of 2.40 G Hz, interconnected through an InfiniBand network, equipped with 4x QDR switches. △ Less

Submitted 19 February, 2019; v1 submitted 23 March, 2018; originally announced March 2018.

Comments: 9 pages, 9 figures, added reference to final peer reviewed version on conference paper and DOI

arXiv:1709.01946 [pdf, ps, other]

doi 10.1038/s41550-017-0266-2

Optical pulsations from a transitional millisecond pulsar

Authors: F. Ambrosino, A. Papitto, L. Stella, F. Meddi, P. Cretaro, L. Burderi, T. Di Salvo, G. L. Israel, A. Ghedina, L. Di Fabrizio, L. Riverol

Abstract: Weakly magnetic, millisecond spinning neutron stars attain their very fast rotation through a 1E8-1E9 yr long phase during which they undergo disk-accretion of matter from a low mass companion star. They can be detected as accretion-powered millisecond X-ray pulsars if towards the end of this phase their magnetic field is still strong enough to channel the accreting matter towards the magnetic pol… ▽ More Weakly magnetic, millisecond spinning neutron stars attain their very fast rotation through a 1E8-1E9 yr long phase during which they undergo disk-accretion of matter from a low mass companion star. They can be detected as accretion-powered millisecond X-ray pulsars if towards the end of this phase their magnetic field is still strong enough to channel the accreting matter towards the magnetic poles. When mass transfer is much reduced or ceases altogether, pulsed emission generated by particle acceleration in the magnetosphere and powered by the rotation of the neutron star is observed, preferentially in the radio and gamma-ray bands. A few transitional millisecond pulsars that swing between an accretion-powered X-ray pulsar regime and a rotationally-powered radio pulsar regime in response to variations of the mass in-flow rate have been recently identified. Here we report the detection of optical pulsations from a transitional pulsar, the first ever from a millisecond spinning neutron star. The pulsations were observed when the pulsar was surrounded by an accretion disk and originated inside the magnetosphere or within a few hundreds of kilometres from it. Energy arguments rule out reprocessing of accretion-powered X-ray emission and argue against a process related to accretion onto the pulsar polar caps; synchrotron emission of electrons in a rotation-powered pulsar magnetosphere seems more likely. △ Less

Submitted 19 October, 2018; v1 submitted 6 September, 2017; originally announced September 2017.

Comments: 32 pages, 7 figures. The first two authors contributed equally to this work

Journal ref: Nature Astronomy (2017), published on-line on October 2, 2017

arXiv:1606.04099 [pdf, other]

GPU-based Real-time Triggering in the NA62 Experiment

Authors: R. Ammendola, A. Biagioni, P. Cretaro, S. Di Lorenzo, R. Fantechi, M. Fiorini, O. Frezza, G. Lamanna, F. Lo Cicero, A. Lonardo, M. Martinelli, I. Neri, P. S. Paolucci, E. Pastorelli, R. Piandani, L. Pontisso, D. Rossetti, F. Simula, M. Sozzi, P. Vicini

Abstract: Over the last few years the GPGPU (General-Purpose computing on Graphics Processing Units) paradigm represented a remarkable development in the world of computing. Computing for High-Energy Physics is no exception: several works have demonstrated the effectiveness of the integration of GPU-based systems in high level trigger of different experiments. On the other hand the use of GPUs in the low le… ▽ More Over the last few years the GPGPU (General-Purpose computing on Graphics Processing Units) paradigm represented a remarkable development in the world of computing. Computing for High-Energy Physics is no exception: several works have demonstrated the effectiveness of the integration of GPU-based systems in high level trigger of different experiments. On the other hand the use of GPUs in the low level trigger systems, characterized by stringent real-time constraints, such as tight time budget and high throughput, poses several challenges. In this paper we focus on the low level trigger in the CERN NA62 experiment, investigating the use of real-time computing on GPUs in this synchronous system. Our approach aimed at harvesting the GPU computing power to build in real-time refined physics-related trigger primitives for the RICH detector, as the the knowledge of Cerenkov rings parameters allows to build stringent conditions for data selection at trigger level. Latencies of all components of the trigger chain have been analyzed, pointing out that networking is the most critical one. To keep the latency of data transfer task under control, we devised NaNet, an FPGA-based PCIe Network Interface Card (NIC) with GPUDirect capabilities. For the processing task, we developed specific multiple ring trigger algorithms to leverage the parallel architecture of GPUs and increase the processing throughput to keep up with the high event rate. Results obtained during the first months of 2016 NA62 run are presented and discussed. △ Less

Submitted 13 June, 2016; originally announced June 2016.

Showing 1–10 of 10 results for author: Cretaro, P