-
Two-compartment neuronal spiking model expressing brain-state specific apical-amplification, -isolation and -drive regimes
Authors:
Elena Pastorelli,
Alper Yegenoglu,
Nicole Kolodziej,
Willem Wybo,
Francesco Simula,
Sandra Diaz,
Johan Frederik Storm,
Pier Stanislao Paolucci
Abstract:
Mounting experimental evidence suggests that brain-state-specific neural mechanisms, supported by connectomic architectures, play a crucial role in integrating past and contextual knowledge with the current, incoming flow of evidence (e.g., from sensory systems). These mechanisms operate across multiple spatial and temporal scales, necessitating dedicated support at the levels of individual neuron…
▽ More
Mounting experimental evidence suggests that brain-state-specific neural mechanisms, supported by connectomic architectures, play a crucial role in integrating past and contextual knowledge with the current, incoming flow of evidence (e.g., from sensory systems). These mechanisms operate across multiple spatial and temporal scales, necessitating dedicated support at the levels of individual neurons and synapses. A notable feature within the neocortex is the structure of large, deep pyramidal neurons, which exhibit a distinctive separation between an apical dendritic compartment and a basal dendritic/perisomatic compartment. This separation is characterized by distinct patterns of incoming connections and brain-state-specific activation mechanisms, namely, apical amplification, isolation, and drive, which are associated with wakefulness, deeper NREM sleep stages, and REM sleep, respectively. The cognitive roles of apical mechanisms have been demonstrated in behaving animals. In contrast, classical models of learning in spiking networks are based on single-compartment neurons, lacking the ability to describe the integration of apical and basal/somatic information. This work aims to provide the computational community with a two-compartment spiking neuron model that incorporates features essential for supporting brain-state-specific learning. This model includes a piece-wise linear transfer function (ThetaPlanes) at the highest abstraction level, making it suitable for use in large-scale bio-inspired artificial intelligence systems. A machine learning evolutionary algorithm, guided by a set of fitness functions, selected the parameters that define neurons expressing the desired apical mechanisms.
△ Less
Submitted 26 March, 2024; v1 submitted 10 November, 2023;
originally announced November 2023.
-
APEIRON: composing smart TDAQ systems for high energy physics experiments
Authors:
Roberto Ammendola,
Andrea Biagioni,
Carlotta Chiarini,
Andrea Ciardiello,
Paolo Cretaro,
Ottorino Frezza,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Pier Stanislao Paolucci,
Cristian Rossi,
Francesco Simula,
Matteo Turisini,
Piero Vicini
Abstract:
APEIRON is a framework encompassing the general architecture of a distributed heterogeneous processing platform and the corresponding software stack, from the low level device drivers up to the high level programming model. The framework is designed to be efficiently used for studying, prototy** and deploying smart trigger and data acquisition (TDAQ) systems for high energy physics experiments.
APEIRON is a framework encompassing the general architecture of a distributed heterogeneous processing platform and the corresponding software stack, from the low level device drivers up to the high level programming model. The framework is designed to be efficiently used for studying, prototy** and deploying smart trigger and data acquisition (TDAQ) systems for high energy physics experiments.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
Runtime Construction of Large-Scale Spiking Neuronal Network Models on GPU Devices
Authors:
Bruno Golosio,
Jose Villamar,
Gianmarco Tiddia,
Elena Pastorelli,
Jonas Stapmanns,
Viviana Fanti,
Pier Stanislao Paolucci,
Abigail Morrison,
Johanna Senk
Abstract:
Simulation speed matters for neuroscientific research: this includes not only how quickly the simulated model time of a large-scale spiking neuronal network progresses, but also how long it takes to instantiate the network model in computer memory. On the hardware side, acceleration via highly parallel GPUs is being increasingly utilized. On the software side, code generation approaches ensure hig…
▽ More
Simulation speed matters for neuroscientific research: this includes not only how quickly the simulated model time of a large-scale spiking neuronal network progresses, but also how long it takes to instantiate the network model in computer memory. On the hardware side, acceleration via highly parallel GPUs is being increasingly utilized. On the software side, code generation approaches ensure highly optimized code, at the expense of repeated code regeneration and recompilation after modifications to the network model. Aiming for a greater flexibility with respect to iterative model changes, here we propose a new method for creating network connections interactively, dynamically, and directly in GPU memory through a set of commonly used high-level connection rules. We validate the simulation performance with both consumer and data center GPUs on two neuroscientifically relevant models: a cortical microcircuit of about 77,000 leaky-integrate-and-fire neuron models and 300 million static synapses, and a two-population network recurrently connected using a variety of connection rules. With our proposed ad hoc network instantiation, both network construction and simulation times are comparable or shorter than those obtained with other state-of-the-art simulation technologies, while still meeting the flexibility demands of explorative network modeling.
△ Less
Submitted 16 June, 2023;
originally announced June 2023.
-
Characterization of a gas detector prototype based on Thick-GEM for the MAGNEX focal plane detector
Authors:
I. Ciraldo,
G. A. Brischetto,
D. Torresi,
M. Cavallaro,
C. Agodi,
A. Boiano,
S. Calabrese,
F. Cappuzzello,
D. Carbone,
M. Cortesi,
F. Delaunay,
M. Fisichella,
L. Neri,
A. Pandalone,
P. Paolucci,
B. Rossi,
O. Sgouros,
V. Soukeras,
A. Spatafora,
A. Vanzanella,
A. Yildirim
Abstract:
A new gas detector prototype for the upgrade of the focal plane detector of the MAGNEX large-acceptance magnetic spectrometer has been developed and tested in view of the NUMEN project. It has been designed to operate at low gas pressure for detecting medium to heavy ions in the energy range between 15 and 60 AMeV. It is a drift chamber based on Multi-layer Thick-GEM (M-THGEM) as electron multipli…
▽ More
A new gas detector prototype for the upgrade of the focal plane detector of the MAGNEX large-acceptance magnetic spectrometer has been developed and tested in view of the NUMEN project. It has been designed to operate at low gas pressure for detecting medium to heavy ions in the energy range between 15 and 60 AMeV. It is a drift chamber based on Multi-layer Thick-GEM (M-THGEM) as electron multiplication technology. Tests with two different M-THGEM layouts have been performed using both a radioactive $α$-particle source and accelerated heavy-ion beams. The characterization of the detector in terms of measured currents that flow through the electrodes as a function of different parameters, including applied voltages, gas pressure and rate of incident particle, is described. The gain and ion backflow properties have been studied.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
Machine Learning based tool for CMS RPC currents quality monitoring
Authors:
E. Shumka,
A. Samalan,
M. Tytgat,
M. El Sawy,
G. A. Alves,
F. Marujo,
E. A. Coelho,
E. M. Da Costa,
H. Nogima,
A. Santoro,
S. Fonseca De Souza,
D. De Jesus Damiao,
M. Thiel,
K. Mota Amarilo,
M. Barroso Ferreira Filho,
A. Aleksandrov,
R. Hadjiiska,
P. Iaydjiev,
M. Rodozov,
M. Shopova,
G. Soultanov,
A. Dimitrov,
L. Litov,
B. Pavlov,
P. Petkov
, et al. (83 additional authors not shown)
Abstract:
The muon system of the CERN Compact Muon Solenoid (CMS) experiment includes more than a thousand Resistive Plate Chambers (RPC). They are gaseous detectors operated in the hostile environment of the CMS underground cavern on the Large Hadron Collider where pp luminosities of up to $2\times 10^{34}$ $\text{cm}^{-2}\text{s}^{-1}$ are routinely achieved. The CMS RPC system performance is constantly m…
▽ More
The muon system of the CERN Compact Muon Solenoid (CMS) experiment includes more than a thousand Resistive Plate Chambers (RPC). They are gaseous detectors operated in the hostile environment of the CMS underground cavern on the Large Hadron Collider where pp luminosities of up to $2\times 10^{34}$ $\text{cm}^{-2}\text{s}^{-1}$ are routinely achieved. The CMS RPC system performance is constantly monitored and the detector is regularly maintained to ensure stable operation. The main monitorable characteristics are dark current, efficiency for muon detection, noise rate etc. Herein we describe an automated tool for CMS RPC current monitoring which uses Machine Learning techniques. We further elaborate on the dedicated generalized linear model proposed already and add autoencoder models for self-consistent predictions as well as hybrid models to allow for RPC current predictions in a distant future.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
RPC based tracking system at CERN GIF++ facility
Authors:
K. Mota Amarilo,
A. Samalan,
M. Tytgat,
M. El Sawy,
G. A. Alves,
F. Marujo,
E. A. Coelho,
E. M. Da Costa,
H. Nogima,
A. Santoro,
S. Fonseca De Souza,
D. De Jesus Damiao,
M. Thiel,
M. Barroso Ferreira Filho,
A. Aleksandrov,
R. Hadjiiska,
P. Iaydjiev,
M. Rodozov,
M. Shopova,
G. Soultanov,
A. Dimitrov,
L. Litov,
B. Pavlov,
P. Petkov,
A. Petrov
, et al. (83 additional authors not shown)
Abstract:
With the HL-LHC upgrade of the LHC machine, an increase of the instantaneous luminosity by a factor of five is expected and the current detection systems need to be validated for such working conditions to ensure stable data taking. At the CERN Gamma Irradiation Facility (GIF++) many muon detectors undergo such studies, but the high gamma background can pose a challenge to the muon trigger system…
▽ More
With the HL-LHC upgrade of the LHC machine, an increase of the instantaneous luminosity by a factor of five is expected and the current detection systems need to be validated for such working conditions to ensure stable data taking. At the CERN Gamma Irradiation Facility (GIF++) many muon detectors undergo such studies, but the high gamma background can pose a challenge to the muon trigger system which is exposed to many fake hits from the gamma background. A tracking system using RPCs is implemented to clean the fake hits, taking profit of the high muon efficiency of these chambers. This work will present the tracking system configuration, used detector analysis algorithm and results.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Comparing apples to apples -- Using a modular and adaptable analysis pipeline to compare slow cerebral rhythms across heterogeneous datasets
Authors:
Robin Gutzen,
Giulia De Bonis,
Chiara De Luca,
Elena Pastorelli,
Cristiano Capone,
Anna Letizia Allegra Mascaro,
Francesco Resta,
Arnau Manasanch,
Francesco Saverio Pavone,
Maria V. Sanchez-Vives,
Maurizio Mattia,
Sonja Grün,
Pier Stanislao Paolucci,
Michael Denker
Abstract:
Neuroscience is moving towards a more integrative discipline, where understanding brain function requires consolidating the accumulated evidence seen across experiments, species, and measurement techniques. A remaining challenge on that path is integrating such heterogeneous data into analysis workflows such that consistent and comparable conclusions can be distilled as an experimental basis for m…
▽ More
Neuroscience is moving towards a more integrative discipline, where understanding brain function requires consolidating the accumulated evidence seen across experiments, species, and measurement techniques. A remaining challenge on that path is integrating such heterogeneous data into analysis workflows such that consistent and comparable conclusions can be distilled as an experimental basis for models and theories. Here, we propose a solution in the context of slow wave activity ($<1$ Hz), which occurs during unconscious brain states like sleep and general anesthesia, and is observed across diverse experimental approaches. We address the issue of integrating and comparing heterogeneous data by conceptualizing a general pipeline design that is adaptable to a variety of inputs and applications. Furthermore, we present the Collaborative Brain Wave Analysis Pipeline (Cobrawap) as a concrete, reusable software implementation to perform broad, detailed, and rigorous comparisons of slow wave characteristics across multiple, openly available ECoG and calcium imaging datasets.
△ Less
Submitted 7 February, 2023; v1 submitted 15 November, 2022;
originally announced November 2022.
-
NREM and REM: cognitive and energetic gains in thalamo-cortical slee** and awake spiking model
Authors:
Chiara De Luca,
Leonardo Tonielli,
Elena Pastorelli,
Cristiano Capone,
Francesco Simula,
Cosimo Lupo,
Irene Bernava,
Giulia De Bonis,
Gianmarco Tiddia,
Bruno Golosio,
Pier Stanislao Paolucci
Abstract:
Sleep is essential for learning and cognition, but the mechanisms by which it stabilizes learning, supports creativity, and manages the energy consumption of networks engaged in post-sleep task have not been yet modelled. During sleep, the brain cycles between non-rapid eye movement (NREM), a mainly unconscious state characterized by collective oscillations, and rapid eye movement (REM), associate…
▽ More
Sleep is essential for learning and cognition, but the mechanisms by which it stabilizes learning, supports creativity, and manages the energy consumption of networks engaged in post-sleep task have not been yet modelled. During sleep, the brain cycles between non-rapid eye movement (NREM), a mainly unconscious state characterized by collective oscillations, and rapid eye movement (REM), associated with the integrated experience of dreaming. We propose a biologically grounded two-area thalamo-cortical plastic spiking neural network model and investigate the role of NREM - REM cycles on its awake performance. We demonstrate that sleep has a positive effect on energy consumption and cognitive performance during the post-sleep awake classification task of handwritten digits. NREM and REM simulated dynamics modify the synaptic structure into a sharper representation of training experiences. Sleep-induced synaptic modifications reduce firing rates and synaptic activity without reducing cognitive performance. Also, it creates novel multi-area associations. The model leverages the apical amplification, isolation and drive experimentally grounded principles and the combination of contextual and perceptual information. In summary, the main novelty is the proposal of a multi-area plastic model that also expresses REM and integrates information during a plastic dream-like state, with cognitive and energetic benefits during post-sleep awake classification.
△ Less
Submitted 3 January, 2023; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Beyond spiking networks: the computational advantages of dendritic amplification and input segregation
Authors:
Cristiano Capone,
Cosimo Lupo,
Paolo Muratore,
Pier Stanislao Paolucci
Abstract:
The brain can efficiently learn a wide range of tasks, motivating the search for biologically inspired learning rules for improving current artificial intelligence technology. Most biological models are composed of point neurons, and cannot achieve the state-of-the-art performances in machine learning. Recent works have proposed that segregation of dendritic input (neurons receive sensory informat…
▽ More
The brain can efficiently learn a wide range of tasks, motivating the search for biologically inspired learning rules for improving current artificial intelligence technology. Most biological models are composed of point neurons, and cannot achieve the state-of-the-art performances in machine learning. Recent works have proposed that segregation of dendritic input (neurons receive sensory information and higher-order feedback in segregated compartments) and generation of high-frequency bursts of spikes would support error backpropagation in biological neurons. However, these approaches require propagating errors with a fine spatio-temporal structure to the neurons, which is unlikely to be feasible in a biological network.
To relax this assumption, we suggest that bursts and dendritic input segregation provide a natural support for biologically plausible target-based learning, which does not require error propagation. We propose a pyramidal neuron model composed of three separated compartments. A coincidence mechanism between the basal and the apical compartments allows for generating high-frequency bursts of spikes. This architecture allows for a burst-dependent learning rule, based on the comparison between the target bursting activity triggered by the teaching signal and the one caused by the recurrent connections, providing the support for target-based learning. We show that this framework can be used to efficiently solve spatio-temporal tasks, such as the store and recall of 3D trajectories.
Finally, we suggest that this neuronal architecture naturally allows for orchestrating ``hierarchical imitation learning'', enabling the decomposition of challenging long-horizon decision-making tasks into simpler subtasks. This can be implemented in a two-level network, where the high-network acts as a ``manager'' and produces the contextual signal for the low-network, the ``worker''.
△ Less
Submitted 4 November, 2022;
originally announced November 2022.
-
Towards biologically plausible Dreaming and Planning in recurrent spiking networks
Authors:
Cristiano Capone,
Pier Stanislao Paolucci
Abstract:
Humans and animals can learn new skills after practicing for a few hours, while current reinforcement learning algorithms require a large amount of data to achieve good performances. Recent model-based approaches show promising results by reducing the number of necessary interactions with the environment to learn a desirable policy. However, these methods require biological implausible ingredients…
▽ More
Humans and animals can learn new skills after practicing for a few hours, while current reinforcement learning algorithms require a large amount of data to achieve good performances. Recent model-based approaches show promising results by reducing the number of necessary interactions with the environment to learn a desirable policy. However, these methods require biological implausible ingredients, such as the detailed storage of older experiences, and long periods of offline learning. The optimal way to learn and exploit word-models is still an open question. Taking inspiration from biology, we suggest that dreaming might be an efficient expedient to use an inner model. We propose a two-module (agent and model) spiking neural network in which "dreaming" (living new experiences in a model-based simulated environment) significantly boosts learning. We also explore "planning", an online alternative to dreaming, that shows comparable performances. Importantly, our model does not require the detailed storage of experiences, and learns online the world-model and the policy. Moreover, we stress that our network is composed of spiking neurons, further increasing the biological plausibility and implementability in neuromorphic hardware.
△ Less
Submitted 8 June, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
-
Quality Control of Mass-Produced GEM Detectors for the CMS GE1/1 Muon Upgrade
Authors:
M. Abbas,
M. Abbrescia,
H. Abdalla,
A. Abdelalim,
S. AbuZeid,
A. Agapitos,
A. Ahmad,
A. Ahmed,
W. Ahmed,
C. Aimè,
C. Aruta,
I. Asghar,
P. Aspell,
C. Avila,
J. Babbar,
Y. Ban,
R. Band,
S. Bansal,
L. Benussi,
T. Beyrouthy,
V. Bhatnagar,
M. Bianco,
S. Bianco,
K. Black,
L. Borgonovi
, et al. (157 additional authors not shown)
Abstract:
The series of upgrades to the Large Hadron Collider, culminating in the High Luminosity Large Hadron Collider, will enable a significant expansion of the physics program of the CMS experiment. However, the accelerator upgrades will also make the experimental conditions more challenging, with implications for detector operations, triggering, and data analysis. The luminosity of the proton-proton co…
▽ More
The series of upgrades to the Large Hadron Collider, culminating in the High Luminosity Large Hadron Collider, will enable a significant expansion of the physics program of the CMS experiment. However, the accelerator upgrades will also make the experimental conditions more challenging, with implications for detector operations, triggering, and data analysis. The luminosity of the proton-proton collisions is expected to exceed $2-3\times10^{34}$~cm$^{-2}$s$^{-1}$ for Run 3 (starting in 2022), and it will be at least $5\times10^{34}$~cm$^{-2}$s$^{-1}$ when the High Luminosity Large Hadron Collider is completed for Run 4. These conditions will affect muon triggering, identification, and measurement, which are critical capabilities of the experiment. To address these challenges, additional muon detectors are being installed in the CMS endcaps, based on Gas Electron Multiplier technology. For this purpose, 161 large triple-Gas Electron Multiplier detectors have been constructed and tested. Installation of these devices began in 2019 with the GE1/1 station and will be followed by two additional stations, GE2/1 and ME0, to be installed in 2023 and 2026, respectively. The assembly and quality control of the GE1/1 detectors were distributed across several production sites around the world. We motivate and discuss the quality control procedures that were developed to standardize the performance of the detectors, and we present the final results of the production. Out of 161 detectors produced, 156 detectors passed all tests, and 144 detectors are now installed in the CMS experiment. The various visual inspections, gas tightness tests, intrinsic noise rate characterizations, and effective gas gain and response uniformity tests allowed the project to achieve this high success rate.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Burst-dependent plasticity and dendritic amplification support target-based learning and hierarchical imitation learning
Authors:
Cristiano Capone,
Cosimo Lupo,
Paolo Muratore,
Pier Stanislao Paolucci
Abstract:
The brain can learn to solve a wide range of tasks with high temporal and energetic efficiency. However, most biological models are composed of simple single compartment neurons and cannot achieve the state-of-art performances of artificial intelligence. We propose a multi-compartment model of pyramidal neuron, in which bursts and dendritic input segregation give the possibility to plausibly suppo…
▽ More
The brain can learn to solve a wide range of tasks with high temporal and energetic efficiency. However, most biological models are composed of simple single compartment neurons and cannot achieve the state-of-art performances of artificial intelligence. We propose a multi-compartment model of pyramidal neuron, in which bursts and dendritic input segregation give the possibility to plausibly support a biological target-based learning. In target-based learning, the internal solution of a problem (a spatio temporal pattern of bursts in our case) is suggested to the network, bypassing the problems of error backpropagation and credit assignment. Finally, we show that this neuronal architecture naturally support the orchestration of hierarchical imitation learning, enabling the decomposition of challenging long-horizon decision-making tasks into simpler subtasks.
△ Less
Submitted 27 January, 2022;
originally announced January 2022.
-
Architectural improvements and technological enhancements for the APEnet+ interconnect system
Authors:
R. Ammendola,
A. Biagioni,
O. Frezza,
A. Lonardo,
F. Lo Cicero,
M. Martinelli,
P. S. Paolucci,
E. Pastorelli,
D. Rossetti,
F. Simula,
L. Tosoratto,
P. Vicini
Abstract:
The APEnet+ board delivers a point-to-point, low-latency, 3D torus network interface card. In this paper we describe the latest generation of APEnet NIC, APEnet v5, integrated in a PCIe Gen3 board based on a state-of-the-art, 28 nm Altera Stratix V FPGA. The NIC features a network architecture designed following the Remote DMA paradigm and tailored to tightly bind the computing power of modern GPU…
▽ More
The APEnet+ board delivers a point-to-point, low-latency, 3D torus network interface card. In this paper we describe the latest generation of APEnet NIC, APEnet v5, integrated in a PCIe Gen3 board based on a state-of-the-art, 28 nm Altera Stratix V FPGA. The NIC features a network architecture designed following the Remote DMA paradigm and tailored to tightly bind the computing power of modern GPUs to the communication fabric. For the APEnet v5 board we show characterizing figures as achieved bandwidth and BER obtained by exploiting new high performance ALTERA transceivers and PCIe Gen3 compliancy.
△ Less
Submitted 4 January, 2022;
originally announced January 2022.
-
Upgrade of the CMS Resistive Plate Chambers for the High Luminosity LHC
Authors:
A. Samalan,
M. Tytgat,
G. A. Alves,
F. Marujo,
F. Torres Da Silva De Araujo,
E. M. DaCosta,
D. De Jesus Damiao,
H. Nogima,
A. Santoro,
S. Fonseca De Souza,
A. Aleksandrov,
R. Hadjiiska,
P. Iaydjiev,
M. Rodozov,
M. Shopova,
G. Soultanov,
M. Bonchev,
A. Dimitrov,
L. Litov,
B. Pavlov,
P. Petkov,
A. Petrov,
S. J. Qian,
C. Bernal,
A. Cabrera
, et al. (86 additional authors not shown)
Abstract:
During the upcoming High Luminosity phase of the Large Hadron Collider (HL-LHC), the integrated luminosity of the accelerator will increase to 3000 fb$^{-1}$. The expected experimental conditions in that period in terms of background rates, event pileup, and the probable aging of the current detectors present a challenge for all the existing experiments at the LHC, including the Compact Muon Solen…
▽ More
During the upcoming High Luminosity phase of the Large Hadron Collider (HL-LHC), the integrated luminosity of the accelerator will increase to 3000 fb$^{-1}$. The expected experimental conditions in that period in terms of background rates, event pileup, and the probable aging of the current detectors present a challenge for all the existing experiments at the LHC, including the Compact Muon Solenoid (CMS) experiment. To ensure a highly performing muon system for this period, several upgrades of the Resistive Plate Chamber (RPC) system of the CMS are currently being implemented. These include the replacement of the readout system for the present system, and the installation of two new RPC stations with improved chamber and front-end electronics designs. The current overall status of this CMS RPC upgrade project is presented.
△ Less
Submitted 2 November, 2021; v1 submitted 29 September, 2021;
originally announced September 2021.
-
Error-based or target-based? A unifying framework for learning in recurrent spiking networks
Authors:
Cristiano Capone,
Paolo Muratore,
Pier Stanislao Paolucci
Abstract:
Learning in biological or artificial networks means changing the laws governing the network dynamics in order to better behave in a specific situation. In the field of supervised learning, two complementary approaches stand out: error-based and target-based learning. However, there exists no consensus on which is better suited for which task, and what is the most biologically plausible. Here we pr…
▽ More
Learning in biological or artificial networks means changing the laws governing the network dynamics in order to better behave in a specific situation. In the field of supervised learning, two complementary approaches stand out: error-based and target-based learning. However, there exists no consensus on which is better suited for which task, and what is the most biologically plausible. Here we propose a comprehensive theoretical framework that includes these two frameworks as special cases. This novel theoretical formulation offers major insights into the differences between the two approaches. In particular, we show how target-based naturally emerges from error-based when the number of constraints on the target dynamics, and as a consequence on the internal network dynamics, is comparable to the degrees of freedom of the network. Moreover, given the experimental evidences on the relevance that spikes have in biological networks, we investigate the role of coding with specific patterns of spikes by introducing a parameter that defines the tolerance to precise spike timing during learning. Our approach naturally lends itself to Imitation Learning (and Behavioral Cloning in particular) and we apply it to solve relevant closed-loop tasks such as the button-and-food task, and the 2D Bipedal Walker. We show that a high dimensionality feedback structure is extremely important when it is necessary to solve a task that requires retaining memory for a long time (button-and-food). On the other hand, we find that coding with specific patterns of spikes enables optimal performances in a motor task (the 2D Bipedal Walker). Finally, we show that our theoretical formulation suggests protocols to deduce the structure of learning feedback in biological networks.
△ Less
Submitted 8 September, 2021; v1 submitted 2 September, 2021;
originally announced September 2021.
-
Performance of a Triple-GEM Demonstrator in $pp$ Collisions at the CMS Detector
Authors:
M. Abbas,
M. Abbrescia,
H. Abdalla,
A. Abdelalim,
S. AbuZeid,
A. Agapitos,
A. Ahmad,
A. Ahmed,
W. Ahmed,
C. Aimè,
C. Aruta,
I. Asghar,
P. Aspell,
C. Avila,
J. Babbar,
Y. Ban,
R. Band,
S. Bansal,
L. Benussi,
V. Bhatnagar,
M. Bianco,
S. Bianco,
K. Black,
L. Borgonovi,
O. Bouhali
, et al. (156 additional authors not shown)
Abstract:
After the Phase-2 high-luminosity upgrade to the Large Hadron Collider (LHC), the collision rate and therefore the background rate will significantly increase, particularly in the high $η$ region. To improve both the tracking and triggering of muons, the Compact Muon Solenoid (CMS) Collaboration plans to install triple-layer Gas Electron Multiplier (GEM) detectors in the CMS muon endcaps. Demonstr…
▽ More
After the Phase-2 high-luminosity upgrade to the Large Hadron Collider (LHC), the collision rate and therefore the background rate will significantly increase, particularly in the high $η$ region. To improve both the tracking and triggering of muons, the Compact Muon Solenoid (CMS) Collaboration plans to install triple-layer Gas Electron Multiplier (GEM) detectors in the CMS muon endcaps. Demonstrator GEM detectors were installed in CMS during 2017 to gain operational experience and perform a preliminary investigation of detector performance. We present the results of triple-GEM detector performance studies performed in situ during normal CMS and LHC operations in 2018. The distribution of cluster size and the efficiency to reconstruct high $p_T$ muons in proton--proton collisions are presented as well as the measurement of the environmental background rate to produce hits in the GEM detector.
△ Less
Submitted 22 September, 2021; v1 submitted 20 July, 2021;
originally announced July 2021.
-
Modeling the triple-GEM detector response to background particles for the CMS Experiment
Authors:
M. Abbas,
M. Abbrescia,
H. Abdalla,
A. Abdelalim,
S. AbuZeid,
A. Agapitos,
A. Ahmad,
A. Ahmed,
W. Ahmed,
C. Aimè,
C. Aruta,
I. Asghar,
P. Aspell,
C. Avila,
I. Azhgirey,
J. Babbar,
Y. Ban,
R. Band,
S. Bansal,
L. Benussi,
V. Bhatnagar,
M. Bianco,
S. Bianco,
K. Black,
L. Borgonovi
, et al. (164 additional authors not shown)
Abstract:
An estimate of environmental background hit rate on triple-GEM chambers is performed using Monte Carlo (MC) simulation and compared to data taken by test chambers installed in the CMS experiment (GE1/1) during Run-2 at the Large Hadron Collider (LHC). The hit rate is measured using data collected with proton-proton collisions at 13 TeV and a luminosity of 1.5$\times10^{34}$ cm$^{-2}$ s$^{-1}$. The…
▽ More
An estimate of environmental background hit rate on triple-GEM chambers is performed using Monte Carlo (MC) simulation and compared to data taken by test chambers installed in the CMS experiment (GE1/1) during Run-2 at the Large Hadron Collider (LHC). The hit rate is measured using data collected with proton-proton collisions at 13 TeV and a luminosity of 1.5$\times10^{34}$ cm$^{-2}$ s$^{-1}$. The simulation framework uses a combination of the FLUKA and Geant4 packages to obtain the hit rate. FLUKA provides the radiation environment around the GE1/1 chambers, which is comprised of the particle flux with momentum direction and energy spectra ranging from $10^{-11}$ to $10^{4}$ MeV for neutrons, $10^{-3}$ to $10^{4}$ MeV for $γ$'s, $10^{-2}$ to $10^{4}$ MeV for $e^{\pm}$, and $10^{-1}$ to $10^{4}$ MeV for charged hadrons. Geant4 provides an estimate of detector response (sensitivity) based on an accurate description of detector geometry, material composition and interaction of particles with the various detector layers. The MC simulated hit rate is estimated as a function of the perpendicular distance from the beam line and agrees with data within the assigned uncertainties of 10-14.5%. This simulation framework can be used to obtain a reliable estimate of background rates expected at the High Luminosity LHC.
△ Less
Submitted 8 July, 2021;
originally announced July 2021.
-
Simulations Approaching Data: Cortical Slow Waves in Inferred Models of the Whole Hemisphere of Mouse
Authors:
Cristiano Capone,
Chiara De Luca,
Giulia De Bonis,
Robin Gutzen,
Irene Bernava,
Elena Pastorelli,
Francesco Simula,
Cosimo Lupo,
Leonardo Tonielli,
Anna Letizia Allegra Mascaro,
Francesco Resta,
Francesco Pavone,
Micheal Denker,
Pier Stanislao Paolucci
Abstract:
Thanks to novel, powerful brain activity recording techniques, we can create data-driven models from thousands of recording channels and large portions of the cortex, which can improve our understanding of brain-states neuromodulation and the related richness of traveling waves dynamics.
We investigate the inference of data-driven models and the comparison among experiments and simulations, thro…
▽ More
Thanks to novel, powerful brain activity recording techniques, we can create data-driven models from thousands of recording channels and large portions of the cortex, which can improve our understanding of brain-states neuromodulation and the related richness of traveling waves dynamics.
We investigate the inference of data-driven models and the comparison among experiments and simulations, through the characterization of the spatio-temporal features of cortical waves in experimental recordings and simulations. Inference is built in two steps: the inner loop that optimizes by likelihood maximization a mean-field model, and the outer loop that optimizes a periodic neuro-modulation by relying on direct comparison of observables apt for the characterization of cortical slow waves. The model is capable to reproduce most of the features of the non-stationary and non-linear dynamics displayed by the high-resolution recording of the in-vivo mouse brain obtained by wide-field calcium imaging techniques. The proposed approach is of interest for both experimental and computational neuroscientists.
△ Less
Submitted 29 November, 2022; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Interstrip Capacitances of the Readout Board used in Large Triple-GEM Detectors for the CMS Muon Upgrade
Authors:
M. Abbas,
M. Abbrescia,
H. Abdalla,
A. Abdelalim,
S. AbuZeid,
A. Agapitos,
A. Ahmad,
A. Ahmed,
W. Ahmed,
C. Aimè,
C. Aruta,
I. Asghar,
P. Aspell,
C. Avila,
J. Babbar,
Y. Ban,
R. Band,
S. Bansal,
L. Benussi,
V. Bhatnagar,
M. Bianco,
S. Bianco,
K. Black,
L. Borgonovi,
O. Bouhali
, et al. (156 additional authors not shown)
Abstract:
We present analytical calculations, Finite Element Analysis modeling, and physical measurements of the interstrip capacitances for different potential strip geometries and dimensions of the readout boards for the GE2/1 triple-Gas Electron Multiplier detector in the CMS muon system upgrade. The main goal of the study is to find configurations that minimize the interstrip capacitances and consequent…
▽ More
We present analytical calculations, Finite Element Analysis modeling, and physical measurements of the interstrip capacitances for different potential strip geometries and dimensions of the readout boards for the GE2/1 triple-Gas Electron Multiplier detector in the CMS muon system upgrade. The main goal of the study is to find configurations that minimize the interstrip capacitances and consequently maximize the signal-to-noise ratio for the detector. We find agreement at the 1.5--4.8% level between the two methods of calculations and on the average at the 17% level between calculations and measurements. A configuration with halved strip lengths and doubled strip widths results in a measured 27--29% reduction over the original configuration while leaving the total number of strips unchanged. We have now adopted this design modification for all eight module types of the GE2/1 detector and will produce the final detector with this new strip design.
△ Less
Submitted 20 September, 2020;
originally announced September 2020.
-
Fast simulations of highly-connected spiking cortical models using GPUs
Authors:
Bruno Golosio,
Gianmarco Tiddia,
Chiara De Luca,
Elena Pastorelli,
Francesco Simula,
Pier Stanislao Paolucci
Abstract:
Over the past decade there has been a growing interest in the development of parallel hardware systems for simulating large-scale networks of spiking neurons. Compared to other highly-parallel systems, GPU-accelerated solutions have the advantage of a relatively low cost and a great versatility, thanks also to the possibility of using the CUDA-C/C++ programming languages. NeuronGPU is a GPU librar…
▽ More
Over the past decade there has been a growing interest in the development of parallel hardware systems for simulating large-scale networks of spiking neurons. Compared to other highly-parallel systems, GPU-accelerated solutions have the advantage of a relatively low cost and a great versatility, thanks also to the possibility of using the CUDA-C/C++ programming languages. NeuronGPU is a GPU library for large-scale simulations of spiking neural network models, written in the C++ and CUDA-C++ programming languages, based on a novel spike-delivery algorithm. This library includes simple LIF (leaky-integrate-and-fire) neuron models as well as several multisynapse AdEx (adaptive-exponential-integrate-and-fire) neuron models with current or conductance based synapses, user definable models and different devices. The numerical solution of the differential equations of the dynamics of the AdEx models is performed through a parallel implementation, written in CUDA-C++, of the fifth-order Runge-Kutta method with adaptive step-size control. In this work we evaluate the performance of this library on the simulation of a cortical microcircuit model, based on LIF neurons and current-based synapses, and on a balanced network of excitatory and inhibitory neurons, using AdEx neurons and conductance-based synapses. On these models, we will show that the proposed library achieves state-of-the-art performance in terms of simulation time per second of biological activity. In particular, using a single NVIDIA GeForce RTX 2080 Ti GPU board, the full-scale cortical-microcircuit model, which includes about 77,000 neurons and $3 \cdot 10^8$ connections, can be simulated at a speed very close to real time, while the simulation time of a balanced network of 1,000,000 AdEx neurons with 1,000 connections per neuron was about 70 s per second of biological activity.
△ Less
Submitted 9 November, 2020; v1 submitted 28 July, 2020;
originally announced July 2020.
-
CMS RPC Background -- Studies and Measurements
Authors:
R. Hadjiiska,
A. Samalan,
M. Tytgat,
N. Zaganidis,
G. A. Alves,
F. Marujo,
F. Torres Da Silva De Araujo,
E. M. Da Costa,
D. De Jesus Damiao,
H. Nogima,
A. Santoro,
S. Fonseca De Souza,
A. Aleksandrov,
P. Iaydjiev,
M. Rodozov,
M. Shopova,
G. Sultanov,
M. Bonchev,
A. Dimitrov,
L. Litov,
B. Pavlov,
P. Petkov,
A. Petrov,
S. J. Qian,
C. Bernal
, et al. (84 additional authors not shown)
Abstract:
The expected radiation background in the CMS RPC system has been studied using the MC prediction with the CMS FLUKA simulation of the detector and the cavern. The MC geometry used in the analysis describes very accurately the present RPC system but still does not include the complete description of the RPC upgrade region with pseudorapidity $1.9 < \lvert η\rvert < 2.4$. Present results will be upd…
▽ More
The expected radiation background in the CMS RPC system has been studied using the MC prediction with the CMS FLUKA simulation of the detector and the cavern. The MC geometry used in the analysis describes very accurately the present RPC system but still does not include the complete description of the RPC upgrade region with pseudorapidity $1.9 < \lvert η\rvert < 2.4$. Present results will be updated with the final geometry description, once it is available. The radiation background has been studied in terms of expected particle rates, absorbed dose and fluence. Two High Luminosity LHC (HL-LHC) scenarios have been investigated - after collecting $3000$ and $4000$ fb$^{-1}$. Estimations with safety factor of 3 have been considered, as well.
△ Less
Submitted 13 December, 2020; v1 submitted 26 May, 2020;
originally announced May 2020.
-
Thalamo-cortical spiking model of incremental learning combining perception, context and NREM-sleep-mediated noise-resilience
Authors:
Bruno Golosio,
Chiara De Luca,
Cristiano Capone,
Elena Pastorelli,
Giovanni Stegel,
Gianmarco Tiddia,
Giulia De Bonis,
Pier Stanislao Paolucci
Abstract:
The brain exhibits capabilities of fast incremental learning from few noisy examples, as well as the ability to associate similar memories in autonomously-created categories and to combine contextual hints with sensory perceptions. Together with sleep, these mechanisms are thought to be key components of many high-level cognitive functions. Yet, little is known about the underlying processes and t…
▽ More
The brain exhibits capabilities of fast incremental learning from few noisy examples, as well as the ability to associate similar memories in autonomously-created categories and to combine contextual hints with sensory perceptions. Together with sleep, these mechanisms are thought to be key components of many high-level cognitive functions. Yet, little is known about the underlying processes and the specific roles of different brain states. In this work, we exploited the combination of context and perception in a thalamo-cortical model based on a soft winner-take-all circuit of excitatory and inhibitory spiking neurons. After calibrating this model to express awake and deep-sleep states with features comparable with biological measures, we demonstrate the model capability of fast incremental learning from few examples, its resilience when proposed with noisy perceptions and contextual signals, and an improvement in visual classification after sleep due to induced synaptic homeostasis and association of similar memories.
△ Less
Submitted 5 August, 2021; v1 submitted 26 March, 2020;
originally announced March 2020.
-
Target spiking patterns enable efficient and biologically plausible learning for complex temporal tasks
Authors:
Paolo Muratore,
Cristiano Capone,
Pier Stanislao Paolucci
Abstract:
Recurrent spiking neural networks (RSNN) in the human brain learn to perform a wide range of perceptual, cognitive and motor tasks very efficiently in terms of energy consumption and requires very few examples. This motivates the search for biologically inspired learning rules for RSNNs to improve our understanding of brain computation and the efficiency of artificial intelligence. Several spiking…
▽ More
Recurrent spiking neural networks (RSNN) in the human brain learn to perform a wide range of perceptual, cognitive and motor tasks very efficiently in terms of energy consumption and requires very few examples. This motivates the search for biologically inspired learning rules for RSNNs to improve our understanding of brain computation and the efficiency of artificial intelligence. Several spiking models and learning rules have been proposed, but it remains a challenge to design RSNNs whose learning relies on biologically plausible mechanisms and are capable of solving complex temporal tasks. In this paper, we derive a learning rule, local to the synapse, from a simple mathematical principle, the maximization of the likelihood for the network to solve a specific task. We propose a novel target-based learning scheme in which the learning rule derived from likelihood maximization is used to mimic a specific spiking pattern that encodes the solution to complex temporal tasks. This method makes the learning extremely rapid and precise, outperforming state of the art algorithms for RSNNs. We demonstrate the capacity of our model to tackle several problems like learning multidimensional trajectories and solving the classical temporal XOR benchmark. Finally, we show that an online approximation of the gradient ascent, in addition to guaranteeing complete locality in time and space, allows learning after very few presentations of the target output. Our model can be applied to different types of biological neurons. The analytically derived plasticity learning rule is specific to each neuron model and can produce a theoretical prediction for experimental validation.
△ Less
Submitted 19 March, 2021; v1 submitted 13 February, 2020;
originally announced February 2020.
-
Slow Waves Analysis Pipeline for extracting the Features of the Bi-Modality from the Cerebral Cortex of Anesthetized Mice
Authors:
Giulia De Bonis,
Miguel Dasilva,
Antonio Pazienti,
Maria V. Sanchez-Vives,
Maurizio Mattia,
Pier Stanislao Paolucci
Abstract:
Cortical slow oscillations are an emergent property of the cortical network, a hallmark of low complexity brain states like sleep, and represent a default activity pattern. Here, we present a methodological approach for quantifying the spatial and temporal properties of this emergent activity. We improved and enriched a robust analysis procedure that has already been successfully applied to both i…
▽ More
Cortical slow oscillations are an emergent property of the cortical network, a hallmark of low complexity brain states like sleep, and represent a default activity pattern. Here, we present a methodological approach for quantifying the spatial and temporal properties of this emergent activity. We improved and enriched a robust analysis procedure that has already been successfully applied to both in vitro and in vivo data acquisitions. We tested the new tools of the methodology by analyzing the electrocorticography (ECoG) traces recorded from a custom 32-channel multi-electrode array in wild-type isoflurane-anesthetized mice. The enhanced analysis pipeline, named SWAP (Slow Waves Analysis Pipeline), detects Up and Down states, enables the characterization of the spatial dependency of their statistical properties, and supports the comparison of different subjects. The SWAP is implemented in a data-independent way, allowing its application to other data sets (acquired from different subjects, or with different recording tools), as well as to the outcome of numerical simulations. By using SWAP, we report statistically significant differences in the observed slow oscillations (SO) across cortical areas and cortical sites. Computing cortical maps by interpolating the features of SO acquired at the electrode positions, we give evidence of gradients at the global scale along an oblique axis directed from fronto-lateral towards occipito-medial regions, further highlighting some heterogeneity within cortical areas. The results obtained on spatial characterization of slow oscillations will be essential for producing data-driven brain simulations and for triggering a discussion on the role of, and the interplay between, the different regions in the cortex, improving our understanding of the mechanisms of generation and propagation of delta rhythms and, more generally, of cortical properties.
△ Less
Submitted 8 March, 2019; v1 submitted 22 February, 2019;
originally announced February 2019.
-
Scaling of a large-scale simulation of synchronous slow-wave and asynchronous awake-like activity of a cortical model with long-range interconnections
Authors:
Elena Pastorelli,
Cristiano Capone,
Francesco Simula,
Maria V. Sanchez-Vives,
Paolo Del Giudice,
Maurizio Mattia,
Pier Stanislao Paolucci
Abstract:
Cortical synapse organization supports a range of dynamic states on multiple spatial and temporal scales, from synchronous slow wave activity (SWA), characteristic of deep sleep or anesthesia, to fluctuating, asynchronous activity during wakefulness (AW). Such dynamic diversity poses a challenge for producing efficient large-scale simulations that embody realistic metaphors of short- and long-rang…
▽ More
Cortical synapse organization supports a range of dynamic states on multiple spatial and temporal scales, from synchronous slow wave activity (SWA), characteristic of deep sleep or anesthesia, to fluctuating, asynchronous activity during wakefulness (AW). Such dynamic diversity poses a challenge for producing efficient large-scale simulations that embody realistic metaphors of short- and long-range synaptic connectivity. In fact, during SWA and AW different spatial extents of the cortical tissue are active in a given timespan and at different firing rates, which implies a wide variety of loads of local computation and communication. A balanced evaluation of simulation performance and robustness should therefore include tests of a variety of cortical dynamic states. Here, we demonstrate performance scaling of our proprietary Distributed and Plastic Spiking Neural Networks (DPSNN) simulation engine in both SWA and AW for bidimensional grids of neural populations, which reflects the modular organization of the cortex. We explored networks up to 192x192 modules, each composed of 1250 integrate-and-fire neurons with spike-frequency adaptation, and exponentially decaying inter-modular synaptic connectivity with varying spatial decay constant. For the largest networks the total number of synapses was over 70 billion. The execution platform included up to 64 dual-socket nodes, each socket mounting 8 Intel Xeon Haswell processor cores @ 2.40GHz clock rates. Network initialization time, memory usage, and execution time showed good scaling performances from 1 to 1024 processes, implemented using the standard Message Passing Interface (MPI) protocol. We achieved simulation speeds of between 2.3x10^9 and 4.1x10^9 synaptic events per second for both cortical states in the explored range of inter-modular interconnections.
△ Less
Submitted 26 November, 2019; v1 submitted 22 February, 2019;
originally announced February 2019.
-
Real-time cortical simulations: energy and interconnect scaling on distributed systems
Authors:
Francesco Simula,
Elena Pastorelli,
Pier Stanislao Paolucci,
Michele Martinelli,
Alessandro Lonardo,
Andrea Biagioni,
Cristiano Capone,
Fabrizio Capuani,
Paolo Cretaro,
Giulia De Bonis,
Francesca Lo Cicero,
Luca Pontisso,
Piero Vicini,
Roberto Ammendola
Abstract:
We profile the impact of computation and inter-processor communication on the energy consumption and on the scaling of cortical simulations approaching the real-time regime on distributed computing platforms. Also, the speed and energy consumption of processor architectures typical of standard HPC and embedded platforms are compared. We demonstrate the importance of the design of low-latency inter…
▽ More
We profile the impact of computation and inter-processor communication on the energy consumption and on the scaling of cortical simulations approaching the real-time regime on distributed computing platforms. Also, the speed and energy consumption of processor architectures typical of standard HPC and embedded platforms are compared. We demonstrate the importance of the design of low-latency interconnect for speed and energy consumption. The cost of cortical simulations is quantified using the Joule per synaptic event metric on both architectures. Reaching efficient real-time on large scale cortical simulations is of increasing relevance for both future bio-inspired artificial intelligence applications and for understanding the cognitive functions of the brain, a scientific quest that will require to embed large scale simulations into highly complex virtual or real worlds. This work stands at the crossroads between the WaveScalES experiment in the Human Brain Project (HBP), which includes the objective of large scale thalamo-cortical simulations of brain states and their transitions, and the ExaNeSt and EuroExa projects, that investigate the design of an ARM-based, low-power High Performance Computing (HPC) architecture with a dedicated interconnect scalable to million of cores; simulation of deep sleep Slow Wave Activity (SWA) and Asynchronous aWake (AW) regimes expressed by thalamo-cortical models are among their benchmarks.
△ Less
Submitted 26 November, 2019; v1 submitted 12 December, 2018;
originally announced December 2018.
-
Analysis and Model of Cortical Slow Waves Acquired with Optical Techniques
Authors:
Marco Celotto,
Chiara De Luca,
Paolo Muratore,
Francesco Resta,
Anna Letizia Allegra Mascaro,
Francesco Saverio Pavone,
Giulia De Bonis,
Pier Stanislao Paolucci
Abstract:
Slow waves (SWs) are spatio-temporal patterns of cortical activity that occur both during natural sleep and anesthesia and are preserved across species. Even though electrophysiological recordings have been largely used to characterize brain states, they are limited in the spatial resolution and cannot target specific neuronal population. Recently, large-scale optical imaging techniques coupled wi…
▽ More
Slow waves (SWs) are spatio-temporal patterns of cortical activity that occur both during natural sleep and anesthesia and are preserved across species. Even though electrophysiological recordings have been largely used to characterize brain states, they are limited in the spatial resolution and cannot target specific neuronal population. Recently, large-scale optical imaging techniques coupled with functional indicators overcame these restrictions, and new pipelines of analysis and novel approaches of SWs modelling are needed to extract relevant features of the spatio-temporal dynamics of SWs from these highly spatially resolved data-sets. Here we combined wide-field fluorescence microscopy and a transgenic mouse model expressing a calcium indicator (GCaMP6f) in excitatory neurons to study SW propagation over the meso-scale under ketamine anesthesia. We developed a versatile analysis pipeline to identify and quantify the spatio-temporal propagation of the SWs. Moreover, we designed a computational simulator based on a simple theoretical model, which takes into account the statistics of neuronal activity, the response of fluorescence proteins and the slow waves dynamics. The simulator was capable of synthesizing artificial signals that could reliably reproduce several features of the SWs observed in vivo, thus enabling a calibration tool for the analysis pipeline. Comparison of experimental and simulated data shows the robustness of the analysis tools and its potential to uncover mechanistic insights of the Slow Wave Activity (SWA).
△ Less
Submitted 31 January, 2020; v1 submitted 28 November, 2018;
originally announced November 2018.
-
Sleep-like slow oscillations improve visual classification through synaptic homeostasis and memory association in a thalamo-cortical model
Authors:
Cristiano Capone,
Elena Pastorelli,
Bruno Golosio,
Pier Stanislao Paolucci
Abstract:
The occurrence of sleep passed through the evolutionary sieve and is widespread in animal species. Sleep is known to be beneficial to cognitive and mnemonic tasks, while chronic sleep deprivation is detrimental. Despite the importance of the phenomenon, a complete understanding of its functions and underlying mechanisms is still lacking. In this paper, we show interesting effects of deep-sleep-lik…
▽ More
The occurrence of sleep passed through the evolutionary sieve and is widespread in animal species. Sleep is known to be beneficial to cognitive and mnemonic tasks, while chronic sleep deprivation is detrimental. Despite the importance of the phenomenon, a complete understanding of its functions and underlying mechanisms is still lacking. In this paper, we show interesting effects of deep-sleep-like slow oscillation activity on a simplified thalamo-cortical model which is trained to encode, retrieve and classify images of handwritten digits. During slow oscillations, spike-timing-dependent-plasticity (STDP) produces a differential homeostatic process. It is characterized by both a specific unsupervised enhancement of connections among groups of neurons associated to instances of the same class (digit) and a simultaneous down-regulation of stronger synapses created by the training. This hierarchical organization of post-sleep internal representations favours higher performances in retrieval and classification tasks. The mechanism is based on the interaction between top-down cortico-thalamic predictions and bottom-up thalamo-cortical projections during deep-sleep-like slow oscillations. Indeed, when learned patterns are replayed during sleep, cortico-thalamo-cortical connections favour the activation of other neurons coding for similar thalamic inputs, promoting their association. Such mechanism hints at possible applications to artificial learning systems.
△ Less
Submitted 18 November, 2019; v1 submitted 24 October, 2018;
originally announced October 2018.
-
High Rate RPC detector for LHC
Authors:
F. Lagarde,
A. Fagot,
M. Gul,
C. Roskas,
M. Tytgat,
N. Zaganidis,
S. Fonseca De Souza,
A. Santoro,
F. Torres Da Silva De Araujo,
A. Aleksandrov,
R. Hadjiiska,
P. Iaydjiev,
M. Rodozov,
M. Shopova,
G. Sultanov,
A. Dimitrov,
L. Litov,
B. Pavlov,
P. Petkov,
A. Petrov,
S. J. Qian,
D. Han,
W. Yi,
C. Avila,
A. Cabrera
, et al. (77 additional authors not shown)
Abstract:
The High Luminosity LHC (HL-LHC) phase is designed to increase by an order of magnitude the amount of data to be collected by the LHC experiments. The foreseen gradual increase of the instantaneous luminosity of up to more than twice its nominal value of $10\times10^{34}\
{\rm cm}^{-1}{\rm s}^{-2}$ during Phase I and Phase II of the LHC running, presents special challenges for the experiments. The…
▽ More
The High Luminosity LHC (HL-LHC) phase is designed to increase by an order of magnitude the amount of data to be collected by the LHC experiments. The foreseen gradual increase of the instantaneous luminosity of up to more than twice its nominal value of $10\times10^{34}\
{\rm cm}^{-1}{\rm s}^{-2}$ during Phase I and Phase II of the LHC running, presents special challenges for the experiments. The region with high pseudo rapidity ($η$) region of the forward muon spectrometer ($2.4 > |η| > 1.9$) is not equipped with RPC stations. The increase of the expected particles rate up to 2 kHz cm$^{-1}$ ( including a safety factor 3 ) motivates the installation of RPC chambers to guarantee redundancy with the CSC chambers already present. The current CMS RPC technology cannot sustain the expected background level. A new generation of Glass-RPC (GRPC) using low-resistivity glass was proposed to equip the two most far away of the four high $η$ muon stations of CMS. In their single-gap version they can stand rates of few kHz cm$^{-1}$. Their time precision of about 1 ns can allow to reduce the noise contribution leading to an improvement of the trigger rate. The proposed design for large size chambers is examined and some preliminary results obtained during beam tests at Gamma Irradiation Facility (GIF++) and Super Proton Synchrotron (SPS) at CERN are shown. They were performed to validate the capability of such detectors to support high irradiation environment with limited consequence on their efficiency.
△ Less
Submitted 16 July, 2018;
originally announced July 2018.
-
Large Scale Low Power Computing System - Status of Network Design in ExaNeSt and EuroExa Projects
Authors:
Roberto Ammendola,
Andrea Biagioni,
Fabrizio Capuani,
Paolo Cretaro,
Giulia De Bonis,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Pier Stanislao Paolucci,
Elena Pastorelli,
Luca Pontisso,
Francesco Simula,
Piero Vicini
Abstract:
The deployment of the next generation computing platform at ExaFlops scale requires to solve new technological challenges mainly related to the impressive number (up to 10^6) of compute elements required. This impacts on system power consumption, in terms of feasibility and costs, and on system scalability and computing efficiency. In this perspective analysis, exploration and evaluation of techno…
▽ More
The deployment of the next generation computing platform at ExaFlops scale requires to solve new technological challenges mainly related to the impressive number (up to 10^6) of compute elements required. This impacts on system power consumption, in terms of feasibility and costs, and on system scalability and computing efficiency. In this perspective analysis, exploration and evaluation of technologies characterized by low power, high efficiency and high degree of customization is strongly needed. Among the various European initiative targeting the design of ExaFlops system, ExaNeSt and EuroExa are EU-H2020 funded initiatives leveraging on high end MPSoC FPGAs. Last generation MPSoC FPGAs can be seen as non-mainstream but powerful HPC Exascale enabling components thanks to the integration of embedded multi-core, ARM-based low power CPUs and a huge number of hardware resources usable to co-design application oriented accelerators and to develop a low latency high bandwidth network architecture. In this paper we introduce ExaNet the FPGA-based, scalable, direct network architecture of ExaNeSt system. ExaNet allow us to explore different interconnection topologies, to evaluate advanced routing functions for congestion control and fault tolerance and to design specific hardware components for acceleration of collective operations. After a brief introduction of the motivations and goals of ExaNeSt and EuroExa projects, we will report on the status of network architecture design and its hardware/software testbed adding preliminary bandwidth and latency achievements.
△ Less
Submitted 11 April, 2018;
originally announced April 2018.
-
The Brain on Low Power Architectures - Efficient Simulation of Cortical Slow Waves and Asynchronous States
Authors:
Roberto Ammendola,
Andrea Biagioni,
Fabrizio Capuani,
Paolo Cretaro,
Giulia De Bonis,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Pier Stanislao Paolucci,
Elena Pastorelli,
Luca Pontisso,
Francesco Simula,
Piero Vicini
Abstract:
Efficient brain simulation is a scientific grand challenge, a parallel/distributed coding challenge and a source of requirements and suggestions for future computing architectures. Indeed, the human brain includes about 10^15 synapses and 10^11 neurons activated at a mean rate of several Hz. Full brain simulation poses Exascale challenges even if simulated at the highest abstraction level. The Wav…
▽ More
Efficient brain simulation is a scientific grand challenge, a parallel/distributed coding challenge and a source of requirements and suggestions for future computing architectures. Indeed, the human brain includes about 10^15 synapses and 10^11 neurons activated at a mean rate of several Hz. Full brain simulation poses Exascale challenges even if simulated at the highest abstraction level. The WaveScalES experiment in the Human Brain Project (HBP) has the goal of matching experimental measures and simulations of slow waves during deep-sleep and anesthesia and the transition to other brain states. The focus is the development of dedicated large-scale parallel/distributed simulation technologies. The ExaNeSt project designs an ARM-based, low-power HPC architecture scalable to million of cores, develo** a dedicated scalable interconnect system, and SWA/AW simulations are included among the driving benchmarks. At the joint between both projects is the INFN proprietary Distributed and Plastic Spiking Neural Networks (DPSNN) simulation engine. DPSNN can be configured to stress either the networking or the computation features available on the execution platforms. The simulation stresses the networking component when the neural net - composed by a relatively low number of neurons, each one projecting thousands of synapses - is distributed over a large number of hardware cores. When growing the number of neurons per core, the computation starts to be the dominating component for short range connections. This paper reports about preliminary performance results obtained on an ARM-based HPC prototype developed in the framework of the ExaNeSt project. Furthermore, a comparison is given of instantaneous power, total energy consumption, execution time and energetic cost per synaptic event of SWA/AW DPSNN simulations when executed on either ARM- or Intel-based server platforms.
△ Less
Submitted 10 April, 2018;
originally announced April 2018.
-
Gaussian and exponential lateral connectivity on distributed spiking neural network simulation
Authors:
Elena Pastorelli,
Pier Stanislao Paolucci,
Francesco Simula,
Andrea Biagioni,
Fabrizio Capuani,
Paolo Cretaro,
Giulia De Bonis,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Luca Pontisso,
Piero Vicini,
Roberto Ammendola
Abstract:
We measured the impact of long-range exponentially decaying intra-areal lateral connectivity on the scaling and memory occupation of a distributed spiking neural network simulator compared to that of short-range Gaussian decays. While previous studies adopted short-range connectivity, recent experimental neurosciences studies are pointing out the role of longer-range intra-areal connectivity with…
▽ More
We measured the impact of long-range exponentially decaying intra-areal lateral connectivity on the scaling and memory occupation of a distributed spiking neural network simulator compared to that of short-range Gaussian decays. While previous studies adopted short-range connectivity, recent experimental neurosciences studies are pointing out the role of longer-range intra-areal connectivity with implications on neural simulation platforms. Two-dimensional grids of cortical columns composed by up to 11 M point-like spiking neurons with spike frequency adaption were connected by up to 30 G synapses using short- and long-range connectivity models. The MPI processes composing the distributed simulator were run on up to 1024 hardware cores, hosted on a 64 nodes server platform. The hardware platform was a cluster of IBM NX360 M5 16-core compute nodes, each one containing two Intel Xeon Haswell 8-core E5-2630 v3 processors, with a clock of 2.40 G Hz, interconnected through an InfiniBand network, equipped with 4x QDR switches.
△ Less
Submitted 19 February, 2019; v1 submitted 23 March, 2018;
originally announced March 2018.
-
R&D towards the CMS RPC Phase-2 upgrade
Authors:
A. Fagot,
A. Cimmino,
S. Crucy,
M. Gul,
A. A. O. Rios,
M. Tytgat,
N. Zaganidis,
S. Aly,
Y. Assran,
A. Radi,
A. Sayed,
G. Singh,
M. Abbrescia,
G. Iaselli,
M. Maggi,
G. Pugliese,
P. Verwilligen,
W. Van Doninck,
S. Colafranceschi,
A. Sharma,
L. Benussi,
S. Bianco,
D. Piccolo,
F. Primavera,
V. Bhatnagar
, et al. (71 additional authors not shown)
Abstract:
The high pseudo-rapidity region of the CMS muon system is covered by Cathode Strip Chambers (CSC) only and lacks redundant coverage despite the fact that it is a challenging region for muons in terms of backgrounds and momentum resolution. In order to maintain good efficiency for the muon trigger in this region additional RPCs are planned to be installed in the two outermost stations at low angle…
▽ More
The high pseudo-rapidity region of the CMS muon system is covered by Cathode Strip Chambers (CSC) only and lacks redundant coverage despite the fact that it is a challenging region for muons in terms of backgrounds and momentum resolution. In order to maintain good efficiency for the muon trigger in this region additional RPCs are planned to be installed in the two outermost stations at low angle named RE3/1 and RE4/1. These stations will use RPCs with finer granularity and good timing resolution to mitigate background effects and to increase the redundancy of the system.
△ Less
Submitted 14 June, 2016;
originally announced June 2016.
-
GPU-based Real-time Triggering in the NA62 Experiment
Authors:
R. Ammendola,
A. Biagioni,
P. Cretaro,
S. Di Lorenzo,
R. Fantechi,
M. Fiorini,
O. Frezza,
G. Lamanna,
F. Lo Cicero,
A. Lonardo,
M. Martinelli,
I. Neri,
P. S. Paolucci,
E. Pastorelli,
R. Piandani,
L. Pontisso,
D. Rossetti,
F. Simula,
M. Sozzi,
P. Vicini
Abstract:
Over the last few years the GPGPU (General-Purpose computing on Graphics Processing Units) paradigm represented a remarkable development in the world of computing. Computing for High-Energy Physics is no exception: several works have demonstrated the effectiveness of the integration of GPU-based systems in high level trigger of different experiments. On the other hand the use of GPUs in the low le…
▽ More
Over the last few years the GPGPU (General-Purpose computing on Graphics Processing Units) paradigm represented a remarkable development in the world of computing. Computing for High-Energy Physics is no exception: several works have demonstrated the effectiveness of the integration of GPU-based systems in high level trigger of different experiments. On the other hand the use of GPUs in the low level trigger systems, characterized by stringent real-time constraints, such as tight time budget and high throughput, poses several challenges. In this paper we focus on the low level trigger in the CERN NA62 experiment, investigating the use of real-time computing on GPUs in this synchronous system. Our approach aimed at harvesting the GPU computing power to build in real-time refined physics-related trigger primitives for the RICH detector, as the the knowledge of Cerenkov rings parameters allows to build stringent conditions for data selection at trigger level. Latencies of all components of the trigger chain have been analyzed, pointing out that networking is the most critical one. To keep the latency of data transfer task under control, we devised NaNet, an FPGA-based PCIe Network Interface Card (NIC) with GPUDirect capabilities. For the processing task, we developed specific multiple ring trigger algorithms to leverage the parallel architecture of GPUs and increase the processing throughput to keep up with the high event rate. Results obtained during the first months of 2016 NA62 run are presented and discussed.
△ Less
Submitted 13 June, 2016;
originally announced June 2016.
-
High rate, fast timing Glass RPC for the high η CMS muon detectors
Authors:
F. Lagarde,
M. Gouzevitch,
I. Laktineh,
V. Buridon,
X. Chen,
C. Combaret,
A. Eynard,
L. Germani,
G. Grenier,
H. Mathez,
L. Mirabito,
A. Petrukhin,
A. Steen,
W. Tromeuraa,
Y. Wang,
A. Gongab,
N. Moreau,
C. de la Taille,
F. Dulucqac,
A. Cimmino,
S. Crucy,
A. Fagot,
M. Gul,
A. A. O. Rios,
M. Tytgat
, et al. (86 additional authors not shown)
Abstract:
The HL-LHC phase is designed to increase by an order of magnitude the amount of data to be collected by the LHC experiments. To achieve this goal in a reasonable time scale the instantaneous luminosity would also increase by an order of magnitude up to $6.10^{34} cm^{-2} s^{-1}$ . The region of the forward muon spectrometer ($|η| > 1.6$) is not equipped with RPC stations. The increase of the expec…
▽ More
The HL-LHC phase is designed to increase by an order of magnitude the amount of data to be collected by the LHC experiments. To achieve this goal in a reasonable time scale the instantaneous luminosity would also increase by an order of magnitude up to $6.10^{34} cm^{-2} s^{-1}$ . The region of the forward muon spectrometer ($|η| > 1.6$) is not equipped with RPC stations. The increase of the expected particles rate up to $2 kHz/cm^{2}$ (including a safety factor 3) motivates the installation of RPC chambers to guarantee redundancy with the CSC chambers already present. The actual RPC technology of CMS cannot sustain the expected background level. The new technology that will be chosen should have a high rate capability and provides a good spatial and timing resolution. A new generation of Glass-RPC (GRPC) using low-resistivity (LR) glass is proposed to equip at least the two most far away of the four high $η$ muon stations of CMS. First the design of small size prototypes and studies of their performance in high-rate particles flux is presented. Then the proposed designs for large size chambers and their fast-timing electronic readout are examined and preliminary results are provided.
△ Less
Submitted 22 July, 2016; v1 submitted 4 June, 2016;
originally announced June 2016.
-
Performance of Resistive Plate Chambers installed during the first long shutdown of the CMS experiment
Authors:
M. Shopova,
A. Aleksandrov,
R. Hadjiiska,
P. Iaydjiev,
G. Sultanov,
M. Rodozov,
S. Stoykova,
Y. Assran,
A. Sayed,
A. Radi,
S. Aly,
G. Singh,
M. Abbrescia,
G. Iaselli,
M. Maggi,
G. Pugliese,
P. Verwilligen,
W. Van Doninck,
S. Colafranceschi,
A. Sharma,
L. Benussi,
S. Bianco,
D. Piccolo,
F. Primavera,
A. Cimmino
, et al. (71 additional authors not shown)
Abstract:
The CMS experiment, located at the CERN Large Hadron Collider, has a redundant muon system composed by three different detector technologies: Cathode Strip Chambers (in the forward regions), Drift Tubes (in the central region) and Resistive Plate Chambers (both its central and forward regions). All three are used for muon reconstruction and triggering. During the first long shutdown (LS1) of the L…
▽ More
The CMS experiment, located at the CERN Large Hadron Collider, has a redundant muon system composed by three different detector technologies: Cathode Strip Chambers (in the forward regions), Drift Tubes (in the central region) and Resistive Plate Chambers (both its central and forward regions). All three are used for muon reconstruction and triggering. During the first long shutdown (LS1) of the LHC (2013-2014) the CMS muon system has been upgraded with 144 newly installed RPCs on the forth forward stations. The new chambers ensure and enhance the muon trigger efficiency in the high luminosity conditions of the LHC Run2. The chambers have been successfully installed and commissioned. The system has been run successfully and experimental data has been collected and analyzed. The performance results of the newly installed RPCs will be presented.
△ Less
Submitted 22 May, 2016;
originally announced May 2016.
-
Radiation Tests of Real-Sized Prototype RPCs for the Future CMS RPC Upscope
Authors:
K. S. Lee,
S. Choi,
B. S. Hong,
M. Jo,
J. W. Kang,
M. Kang,
H. Kim,
K. Lee,
S. K. Parka,
A. Cimmino,
S. Crucy,
A. Fagot,
M. Gul,
A. A. O. Rios,
M. Tytgat,
N. Zaganidis,
S. Ali,
Y. Assran,
A. Radi,
A. Sayed,
G. Singh,
M. Abbrescia,
G. Iaselli,
M. Maggi,
G. Pugliese
, et al. (71 additional authors not shown)
Abstract:
We report on a systematic study of double-gap and four-gap phenolic resistive plate chambers (RPCs) for future high-η RPC triggers in the CMS. In the present study, we constructed real-sized double-gap and four-gap RPCs with gap thicknesses of 1.6 and 0.8 mm, respectively, with 2-mm-thick phenolic high-pressure-laminated (HPL) plates. We examined the prototype RPCs for cosmic rays and 100 GeV muon…
▽ More
We report on a systematic study of double-gap and four-gap phenolic resistive plate chambers (RPCs) for future high-η RPC triggers in the CMS. In the present study, we constructed real-sized double-gap and four-gap RPCs with gap thicknesses of 1.6 and 0.8 mm, respectively, with 2-mm-thick phenolic high-pressure-laminated (HPL) plates. We examined the prototype RPCs for cosmic rays and 100 GeV muons provided by the SPS H4 beam line at CERN. We applied maximum gamma rates of 1.5 kHz cm-2 provided by 137Cs sources at Korea University and the GIF++ irradiation facility installed at the SPS H4 beam line to examine the rate capabilities of the prototype RPCs. In contrast to the case of the four-gap RPCs, we found the relatively high threshold was conducive to effectively suppressing the rapid increase of strip cluster sizes of muon hits with high voltage, especially when measuring the narrow-pitch strips. The gamma-induced currents drawn in the four-gap RPC were about one-fourth of those drawn in the double-gap RPC. The rate capabilities of both RPC types, proven through the present testing using gamma-ray sources, far exceeded the maximum rate expected in the new high-η endcap RPCs planned for future phase-II LHC runs.
△ Less
Submitted 4 May, 2016; v1 submitted 2 May, 2016;
originally announced May 2016.
-
A novel application of Fiber Bragg Grating (FBG) sensors in MPGD
Authors:
D. Abbaneo,
M. Abbas,
M. Abbrescia,
A. A. Abdelalim,
M. Abi Akl,
O. Aboamer,
D. Acosta,
A. Ahmad,
W. Ahmed,
W. Ahmed,
A. Aleksandrov,
R. Aly,
P. Altieri,
C. Asawatangtrakuldee,
P. Aspell,
Y. Assran,
I. Awan,
S. Bally,
Y. Ban,
S. Banerjee,
V. Barashko,
P. Barria,
G. Bencze,
N. Beni,
L. Benussi
, et al. (133 additional authors not shown)
Abstract:
We present a novel application of Fiber Bragg Grating (FBG) sensors in the construction and characterisation of Micro Pattern Gaseous Detector (MPGD), with particular attention to the realisation of the largest triple (Gas electron Multiplier) GEM chambers so far operated, the GE1/1 chambers of the CMS experiment at LHC. The GE1/1 CMS project consists of 144 GEM chambers of about 0.5 m2 active are…
▽ More
We present a novel application of Fiber Bragg Grating (FBG) sensors in the construction and characterisation of Micro Pattern Gaseous Detector (MPGD), with particular attention to the realisation of the largest triple (Gas electron Multiplier) GEM chambers so far operated, the GE1/1 chambers of the CMS experiment at LHC. The GE1/1 CMS project consists of 144 GEM chambers of about 0.5 m2 active area each, employing three GEM foils per chamber, to be installed in the forward region of the CMS endcap during the long shutdown of LHC in 2108-2019. The large active area of each GE1/1 chamber consists of GEM foils that are mechanically stretched in order to secure their flatness and the consequent uniform performance of the GE1/1 chamber across its whole active surface. So far FBGs have been used in high energy physics mainly as high precision positioning and re-positioning sensors and as low cost, easy to mount, low space consuming temperature sensors. FBGs are also commonly used for very precise strain measurements in material studies. In this work we present a novel use of FBGs as flatness and mechanical tensioning sensors applied to the wide GEM foils of the GE1/1 chambers. A network of FBG sensors have been used to determine the optimal mechanical tension applied and to characterise the mechanical tension that should be applied to the foils. We discuss the results of the test done on a full-sized GE1/1 final prototype, the studies done to fully characterise the GEM material, how this information was used to define a standard assembly procedure and possible future developments.
△ Less
Submitted 28 December, 2015;
originally announced December 2015.
-
Fiber Bragg Grating (FBG) sensors as flatness and mechanical stretching sensors
Authors:
D. Abbaneo,
M. Abbas,
M. Abbrescia,
A. A. Abdelalim,
M. Abi Akl,
O. Aboamer,
D. Acosta,
A. Ahmad,
W. Ahmed,
W. Ahmed,
A. Aleksandrov,
R. Aly,
P. Altieri,
C. Asawatangtrakuldee,
P. Aspell,
Y. Assran,
I. Awan,
S. Bally,
Y. Ban,
S. Banerjee,
V. Barashko,
P. Barria,
G. Bencze,
N. Beni,
L. Benussi
, et al. (133 additional authors not shown)
Abstract:
A novel approach which uses Fibre Bragg Grating (FBG) sensors has been utilised to assess and monitor the flatness of Gaseous Electron Multipliers (GEM) foils. The setup layout and preliminary results are presented.
A novel approach which uses Fibre Bragg Grating (FBG) sensors has been utilised to assess and monitor the flatness of Gaseous Electron Multipliers (GEM) foils. The setup layout and preliminary results are presented.
△ Less
Submitted 28 December, 2015;
originally announced December 2015.
-
Impact of exponential long range and Gaussian short range lateral connectivity on the distributed simulation of neural networks including up to 30 billion synapses
Authors:
Elena Pastorelli,
Pier Stanislao Paolucci,
Roberto Ammendola,
Andrea Biagioni,
Ottorino Frezza,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Francesco Simula,
Piero Vicini
Abstract:
Recent experimental neuroscience studies are pointing out the role of long-range intra-areal connectivity that can be modeled by a distance dependent exponential decay of the synaptic probability distribution. This short report provides a preliminary measure of the impact of exponentially decaying lateral connectivity compared to that of shorter-range Gaussian decays on the scaling behaviour and m…
▽ More
Recent experimental neuroscience studies are pointing out the role of long-range intra-areal connectivity that can be modeled by a distance dependent exponential decay of the synaptic probability distribution. This short report provides a preliminary measure of the impact of exponentially decaying lateral connectivity compared to that of shorter-range Gaussian decays on the scaling behaviour and memory occupation of a distributed spiking neural network simulator (DPSNN). Two-dimensional grids of cortical columns composed by point-like spiking neurons have been connected by up to 30 billion synapses using exponential and Gaussian connectivity models. Up to 1024 hardware cores, hosted on a 64 nodes server platform, executed the MPI processes composing the distributed simulator. The hardware platform was a cluster of IBM NX360 M5 16-core compute nodes, each one containing two Intel Xeon Haswell 8-core E5-2630 v3 processors, with a clock of 2.40GHz, interconnected through an InfiniBand network. This study is conducted in the framework of the CORTICONIC FET project, also in view of the next -to-start activities foreseen as part of the Human Brain Project (HBP), SubProject 3 Cognitive and Systems Neuroscience, WaveScalES work-package.
△ Less
Submitted 16 December, 2015;
originally announced December 2015.
-
Scaling to 1024 software processes and hardware cores of the distributed simulation of a spiking neural network including up to 20G synapses
Authors:
Elena Pastorelli,
Pier Stanislao Paolucci,
Roberto Ammendola,
Andrea Biagioni,
Ottorino Frezza,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Francesco Simula,
Piero Vicini
Abstract:
This short report describes the scaling, up to 1024 software processes and hardware cores, of a distributed simulator of plastic spiking neural networks. A previous report demonstrated good scalability of the simulator up to 128 processes. Herein we extend the speed-up measurements and strong and weak scaling analysis of the simulator to the range between 1 and 1024 software processes and hardware…
▽ More
This short report describes the scaling, up to 1024 software processes and hardware cores, of a distributed simulator of plastic spiking neural networks. A previous report demonstrated good scalability of the simulator up to 128 processes. Herein we extend the speed-up measurements and strong and weak scaling analysis of the simulator to the range between 1 and 1024 software processes and hardware cores. We simulated two-dimensional grids of cortical columns including up to ~20G synapses connecting ~11M neurons. The neural network was distributed over a set of MPI processes and the simulations were run on a server platform composed of up to 64 dual-socket nodes, each socket equipped with Intel Haswell E5-2630 v3 processors (8 cores @ 2.4 GHz clock). All nodes are interconned through an InfiniBand network. The DPSNN simulator has been developed by INFN in the framework of EURETILE and CORTICONIC European FET Project and will be used by the WaveScalEW tem in the framework of the Human Brain Project (HBP), SubProject 2 - Cognitive and Systems Neuroscience. This report lays the groundwork for a more thorough comparison with the neural simulation tool NEST.
△ Less
Submitted 30 November, 2015;
originally announced November 2015.
-
Power, Energy and Speed of Embedded and Server Multi-Cores applied to Distributed Simulation of Spiking Neural Networks: ARM in NVIDIA Tegra vs Intel Xeon quad-cores
Authors:
Pier Stanislao Paolucci,
Roberto Ammendola,
Andrea Biagioni,
Ottorino Frezza,
Francesca Lo Cicero,
Alessandro Lonardo,
Michele Martinelli,
Elena Pastorelli,
Francesco Simula,
Piero Vicini
Abstract:
This short note regards a comparison of instantaneous power, total energy consumption, execution time and energetic cost per synaptic event of a spiking neural network simulator (DPSNN-STDP) distributed on MPI processes when executed either on an embedded platform (based on a dual socket quad-core ARM platform) or a server platform (INTEL-based quad-core dual socket platform). We also compare the…
▽ More
This short note regards a comparison of instantaneous power, total energy consumption, execution time and energetic cost per synaptic event of a spiking neural network simulator (DPSNN-STDP) distributed on MPI processes when executed either on an embedded platform (based on a dual socket quad-core ARM platform) or a server platform (INTEL-based quad-core dual socket platform). We also compare the measure with those reported by leading custom and semi-custom designs: TrueNorth and SpiNNaker. In summary, we observed that: 1- we spent 2.2 micro-Joule per simulated event on the "embedded platform", approx. 4.4 times lower than what was spent by the "server platform"; 2- the instantaneous power consumption of the "embedded platform" was 14.4 times better than the "server" one; 3- the server platform is a factor 3.3 faster. The "embedded platform" is made of NVIDIA Jetson TK1 boards, interconnected by Ethernet, each mounting a Tegra K1 chip including a quad-core ARM Cortex-A15 at 2.3GHz. The "server platform" is based on dual-socket quad-core Intel Xeon CPUs (E5620 at 2.4GHz). The measures were obtained with the DPSNN-STDP simulator (Distributed Simulator of Polychronous Spiking Neural Network with synaptic Spike Timing Dependent Plasticity) developed by INFN, that already proved its efficient scalability and execution speed-up on hundreds of similar "server" cores and MPI processes, applied to neural nets composed of several billions of synapses.
△ Less
Submitted 12 May, 2015;
originally announced May 2015.
-
Performance of a Large-Area GEM Detector Prototype for the Upgrade of the CMS Muon Endcap System
Authors:
D. Abbaneo,
M. Abbas,
M. Abbrescia,
A. A. Abdelalim,
M. Abi Akl,
W. Ahmed,
W. Ahmed,
P. Altieri,
R. Aly,
C. Asawatangtrakuldee,
A. Ashfaq,
P. Aspell,
Y. Assran,
I. Awan,
S. Bally,
Y. Ban,
S. Banerjee,
P. Barria,
L. Benussi,
V. Bhopatkar,
S. Bianco,
J. Bos,
O. Bouhali,
S. Braibant,
S. Buontempo
, et al. (113 additional authors not shown)
Abstract:
Gas Electron Multiplier (GEM) technology is being considered for the forward muon upgrade of the CMS experiment in Phase 2 of the CERN LHC. Its first implementation is planned for the GE1/1 system in the $1.5 < \midη\mid < 2.2$ region of the muon endcap mainly to control muon level-1 trigger rates after the second long LHC shutdown. A GE1/1 triple-GEM detector is read out by 3,072 radial strips wi…
▽ More
Gas Electron Multiplier (GEM) technology is being considered for the forward muon upgrade of the CMS experiment in Phase 2 of the CERN LHC. Its first implementation is planned for the GE1/1 system in the $1.5 < \midη\mid < 2.2$ region of the muon endcap mainly to control muon level-1 trigger rates after the second long LHC shutdown. A GE1/1 triple-GEM detector is read out by 3,072 radial strips with 455 $μ$rad pitch arranged in eight $η$-sectors. We assembled a full-size GE1/1 prototype of 1m length at Florida Tech and tested it in 20-120 GeV hadron beams at Fermilab using Ar/CO$_{2}$ 70:30 and the RD51 scalable readout system. Four small GEM detectors with 2-D readout and an average measured azimuthal resolution of 36 $μ$rad provided precise reference tracks. Construction of this largest GEM detector built to-date is described. Strip cluster parameters, detection efficiency, and spatial resolution are studied with position and high voltage scans. The plateau detection efficiency is [97.1 $\pm$ 0.2 (stat)]\%. The azimuthal resolution is found to be [123.5 $\pm$ 1.6 (stat)] $μ$rad when operating in the center of the efficiency plateau and using full pulse height information. The resolution can be slightly improved by $\sim$ 10 $μ$rad when correcting for the bias due to discrete readout strips. The CMS upgrade design calls for readout electronics with binary hit output. When strip clusters are formed correspondingly without charge-weighting and with fixed hit thresholds, a position resolution of [136.8 $\pm$ 2.5 stat] $μ$rad is measured, consistent with the expected resolution of strip-pitch/$\sqrt{12}$ = 131.3 $μ$rad. Other $η$-sectors of the detector show similar response and performance.
△ Less
Submitted 8 December, 2014; v1 submitted 30 November, 2014;
originally announced December 2014.
-
Observation of the rare $B^0_s\toμ^+μ^-$ decay from the combined analysis of CMS and LHCb data
Authors:
The CMS,
LHCb Collaborations,
:,
V. Khachatryan,
A. M. Sirunyan,
A. Tumasyan,
W. Adam,
T. Bergauer,
M. Dragicevic,
J. Erö,
M. Friedl,
R. Frühwirth,
V. M. Ghete,
C. Hartl,
N. Hörmann,
J. Hrubec,
M. Jeitler,
W. Kiesenhofer,
V. Knünz,
M. Krammer,
I. Krätschmer,
D. Liko,
I. Mikulec,
D. Rabady,
B. Rahbaran
, et al. (2807 additional authors not shown)
Abstract:
A joint measurement is presented of the branching fractions $B^0_s\toμ^+μ^-$ and $B^0\toμ^+μ^-$ in proton-proton collisions at the LHC by the CMS and LHCb experiments. The data samples were collected in 2011 at a centre-of-mass energy of 7 TeV, and in 2012 at 8 TeV. The combined analysis produces the first observation of the $B^0_s\toμ^+μ^-$ decay, with a statistical significance exceeding six sta…
▽ More
A joint measurement is presented of the branching fractions $B^0_s\toμ^+μ^-$ and $B^0\toμ^+μ^-$ in proton-proton collisions at the LHC by the CMS and LHCb experiments. The data samples were collected in 2011 at a centre-of-mass energy of 7 TeV, and in 2012 at 8 TeV. The combined analysis produces the first observation of the $B^0_s\toμ^+μ^-$ decay, with a statistical significance exceeding six standard deviations, and the best measurement of its branching fraction so far. Furthermore, evidence for the $B^0\toμ^+μ^-$ decay is obtained with a statistical significance of three standard deviations. The branching fraction measurements are statistically compatible with SM predictions and impose stringent constraints on several theories beyond the SM.
△ Less
Submitted 17 August, 2015; v1 submitted 17 November, 2014;
originally announced November 2014.
-
EURETILE D7.3 - Dynamic DAL benchmark coding, measurements on MPI version of DPSNN-STDP (distributed plastic spiking neural net) and improvements to other DAL codes
Authors:
Pier Stanislao Paolucci,
Iuliana Bacivarov,
Devendra Rai,
Lars Schor,
Lothar Thiele,
Hoeseok Yang,
Elena Pastorelli,
Roberto Ammendola,
Andrea Biagioni,
Ottorino Frezza,
Francesca Lo Cicero,
Alessandro Lonardo,
Francesco Simula,
Laura Tosoratto,
Piero Vicini
Abstract:
The EURETILE project required the selection and coding of a set of dedicated benchmarks. The project is about the software and hardware architecture of future many-tile distributed fault-tolerant systems. We focus on dynamic workloads characterised by heavy numerical processing requirements. The ambition is to identify common techniques that could be applied to both the Embedded Systems and HPC do…
▽ More
The EURETILE project required the selection and coding of a set of dedicated benchmarks. The project is about the software and hardware architecture of future many-tile distributed fault-tolerant systems. We focus on dynamic workloads characterised by heavy numerical processing requirements. The ambition is to identify common techniques that could be applied to both the Embedded Systems and HPC domains. This document is the first public deliverable of Work Package 7: Challenging Tiled Applications.
△ Less
Submitted 20 August, 2014;
originally announced August 2014.
-
The Physics of the B Factories
Authors:
A. J. Bevan,
B. Golob,
Th. Mannel,
S. Prell,
B. D. Yabsley,
K. Abe,
H. Aihara,
F. Anulli,
N. Arnaud,
T. Aushev,
M. Beneke,
J. Beringer,
F. Bianchi,
I. I. Bigi,
M. Bona,
N. Brambilla,
J. B rodzicka,
P. Chang,
M. J. Charles,
C. H. Cheng,
H. -Y. Cheng,
R. Chistov,
P. Colangelo,
J. P. Coleman,
A. Drutskoy
, et al. (2009 additional authors not shown)
Abstract:
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary…
▽ More
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary version of the Physics of the B Factories book. This uses the notation alpha, beta, gamma for the angles of the Unitarity Triangle. The nominal version uses the notation phi_1, phi_2 and phi_3. Please cite this work as Eur. Phys. J. C74 (2014) 3026.
△ Less
Submitted 31 October, 2015; v1 submitted 24 June, 2014;
originally announced June 2014.
-
NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features
Authors:
A. Lonardo,
F. Ameli,
R. Ammendola,
A. Biagioni,
O. Frezza,
G. Lamanna,
F. Lo Cicero,
M. Martinelli,
P. S. Paolucci,
E. Pastorelli,
L. Pontisso,
D. Rossetti,
F. Simeone,
F. Simula,
M. Sozzi,
L. Tosoratto,
P. Vicini
Abstract:
While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages.
Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as data is available in their internal memories, assessment of real-time features of a standard GPGPU s…
▽ More
While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages.
Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as data is available in their internal memories, assessment of real-time features of a standard GPGPU system needs careful characterization of all subsystems along data stream path.
The networking subsystem results in being the most critical one in terms of absolute value and fluctuations of its response latency.
Our envisioned solution to this issue is NaNet, a FPGA-based PCIe Network Interface Card (NIC) design featuring a configurable and extensible set of network channels with direct access through GPUDirect to NVIDIA Fermi/Kepler GPU memories.
NaNet design currently supports both standard - GbE (1000BASE-T) and 10GbE (10Base-R) - and custom - 34~Gbps APElink and 2.5~Gbps deterministic latency KM3link - channels, but its modularity allows for a straightforward inclusion of other link technologies.
To avoid host OS intervention on data stream and remove a possible source of jitter, the design includes a network/transport layer offload module with cycle-accurate, upper-bound latency, supporting UDP, KM3link Time Division Multiplexing and APElink protocols.
After NaNet architecture description and its latency/bandwidth characterization for all supported links, two real world use cases will be presented: the GPU-based low level trigger for the RICH detector in the NA62 experiment at CERN and the on-/off-shore data link for KM3 underwater neutrino telescope.
△ Less
Submitted 13 June, 2014;
originally announced June 2014.
-
NaNet: a flexible and configurable low-latency NIC for real-time trigger systems based on GPUs
Authors:
R. Ammendola,
A. Biagioni,
O. Frezza,
G. Lamanna,
A. Lonardo,
F. Lo Cicero,
P. S. Paolucci,
F. Pantaleo,
D. Rossetti,
F. Simula,
M. Sozzi,
L. Tosoratto,
P. Vicini
Abstract:
NaNet is an FPGA-based PCIe X8 Gen2 NIC supporting 1/10 GbE links and the custom 34 Gbps APElink channel. The design has GPUDirect RDMA capabilities and features a network stack protocol offloading module, making it suitable for building low-latency, real-time GPU-based computing systems. We provide a detailed description of the NaNet hardware modular architecture. Benchmarks for latency and bandw…
▽ More
NaNet is an FPGA-based PCIe X8 Gen2 NIC supporting 1/10 GbE links and the custom 34 Gbps APElink channel. The design has GPUDirect RDMA capabilities and features a network stack protocol offloading module, making it suitable for building low-latency, real-time GPU-based computing systems. We provide a detailed description of the NaNet hardware modular architecture. Benchmarks for latency and bandwidth for GbE and APElink channels are presented, followed by a performance analysis on the case study of the GPU-based low level trigger for the RICH detector in the NA62 CERN experiment, using either the NaNet GbE and APElink channels. Finally, we give an outline of project future activities.
△ Less
Submitted 9 January, 2014; v1 submitted 15 November, 2013;
originally announced November 2013.
-
Architectural improvements and 28 nm FPGA implementation of the APEnet+ 3D Torus network for hybrid HPC systems
Authors:
Roberto Ammendola,
Andrea Biagioni,
Ottorino Frezza,
Francesca Lo Cicero,
Pier Stanislao Paolucci,
Alessandro Lonardo,
Davide Rossetti,
Francesco Simula,
Laura Tosoratto,
Piero Vicini
Abstract:
Modern Graphics Processing Units (GPUs) are now considered accelerators for general purpose computation. A tight interaction between the GPU and the interconnection network is the strategy to express the full potential on capability computing of a multi-GPU system on large HPC clusters; that is the reason why an efficient and scalable interconnect is a key technology to finally deliver GPUs for sc…
▽ More
Modern Graphics Processing Units (GPUs) are now considered accelerators for general purpose computation. A tight interaction between the GPU and the interconnection network is the strategy to express the full potential on capability computing of a multi-GPU system on large HPC clusters; that is the reason why an efficient and scalable interconnect is a key technology to finally deliver GPUs for scientific HPC. In this paper we show the latest architectural and performance improvement of the APEnet+ network fabric, a FPGA-based PCIe board with 6 fully bidirectional off-board links with 34 Gbps of raw bandwidth per direction, and X8 Gen2 bandwidth towards the host PC. The board implements a Remote Direct Memory Access (RDMA) protocol that leverages upon peer-to-peer (P2P) capabilities of Fermi- and Kepler-class NVIDIA GPUs to obtain real zero-copy, low-latency GPU-to-GPU transfers. Finally, we report on the development activities for 2013 focusing on the adoption of the latest generation 28 nm FPGAs and the preliminary tests performed on this new platform.
△ Less
Submitted 14 November, 2013; v1 submitted 7 November, 2013;
originally announced November 2013.
-
NaNet:a low-latency NIC enabling GPU-based, real-time low level trigger systems
Authors:
Roberto Ammendola,
Andrea Biagioni,
Riccardo Fantechi,
Ottorino Frezza,
Gianluca Lamanna,
Francesca Lo Cicero,
Alessandro Lonardo,
Pier Stanislao Paolucci,
Felice Pantaleo,
Roberto Piandani,
Luca Pontisso,
Davide Rossetti,
Francesco Simula,
Marco Sozzi,
Laura Tosoratto,
Piero Vicini
Abstract:
We implemented the NaNet FPGA-based PCI2 Gen2 GbE/APElink NIC, featuring GPUDirect RDMA capabilities and UDP protocol management offloading. NaNet is able to receive a UDP input data stream from its GbE interface and redirect it, without any intermediate buffering or CPU intervention, to the memory of a Fermi/Kepler GPU hosted on the same PCIe bus, provided that the two devices share the same upst…
▽ More
We implemented the NaNet FPGA-based PCI2 Gen2 GbE/APElink NIC, featuring GPUDirect RDMA capabilities and UDP protocol management offloading. NaNet is able to receive a UDP input data stream from its GbE interface and redirect it, without any intermediate buffering or CPU intervention, to the memory of a Fermi/Kepler GPU hosted on the same PCIe bus, provided that the two devices share the same upstream root complex. Synthetic benchmarks for latency and bandwidth are presented. We describe how NaNet can be employed in the prototype of the GPU-based RICH low-level trigger processor of the NA62 CERN experiment, to implement the data link between the TEL62 readout boards and the low level trigger processor. Results for the throughput and latency of the integrated system are presented and discussed.
△ Less
Submitted 22 November, 2013; v1 submitted 5 November, 2013;
originally announced November 2013.