Skip to main content

Showing 1–45 of 45 results for author: Paolucci, P S

.
  1. arXiv:2311.06074  [pdf, other

    q-bio.NC cs.NE

    Two-compartment neuronal spiking model expressing brain-state specific apical-amplification, -isolation and -drive regimes

    Authors: Elena Pastorelli, Alper Yegenoglu, Nicole Kolodziej, Willem Wybo, Francesco Simula, Sandra Diaz, Johan Frederik Storm, Pier Stanislao Paolucci

    Abstract: Mounting experimental evidence suggests that brain-state-specific neural mechanisms, supported by connectomic architectures, play a crucial role in integrating past and contextual knowledge with the current, incoming flow of evidence (e.g., from sensory systems). These mechanisms operate across multiple spatial and temporal scales, necessitating dedicated support at the levels of individual neuron… ▽ More

    Submitted 26 March, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: 23 pages, 9 figures (29 single images), 4 tables, paper

  2. arXiv:2307.01009  [pdf, other

    cs.DC physics.ins-det

    APEIRON: composing smart TDAQ systems for high energy physics experiments

    Authors: Roberto Ammendola, Andrea Biagioni, Carlotta Chiarini, Andrea Ciardiello, Paolo Cretaro, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Cristian Rossi, Francesco Simula, Matteo Turisini, Piero Vicini

    Abstract: APEIRON is a framework encompassing the general architecture of a distributed heterogeneous processing platform and the corresponding software stack, from the low level device drivers up to the high level programming model. The framework is designed to be efficiently used for studying, prototy** and deploying smart trigger and data acquisition (TDAQ) systems for high energy physics experiments.

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Under review in Journal of Physics: Conference Series (ACAT 2022)

  3. arXiv:2306.09855  [pdf, other

    q-bio.NC cs.NE

    Runtime Construction of Large-Scale Spiking Neuronal Network Models on GPU Devices

    Authors: Bruno Golosio, Jose Villamar, Gianmarco Tiddia, Elena Pastorelli, Jonas Stapmanns, Viviana Fanti, Pier Stanislao Paolucci, Abigail Morrison, Johanna Senk

    Abstract: Simulation speed matters for neuroscientific research: this includes not only how quickly the simulated model time of a large-scale spiking neuronal network progresses, but also how long it takes to instantiate the network model in computer memory. On the hardware side, acceleration via highly parallel GPUs is being increasingly utilized. On the software side, code generation approaches ensure hig… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 29 pages, 9 figures

    Journal ref: Appl. Sci. 2023, 13(17), 9598

  4. Comparing apples to apples -- Using a modular and adaptable analysis pipeline to compare slow cerebral rhythms across heterogeneous datasets

    Authors: Robin Gutzen, Giulia De Bonis, Chiara De Luca, Elena Pastorelli, Cristiano Capone, Anna Letizia Allegra Mascaro, Francesco Resta, Arnau Manasanch, Francesco Saverio Pavone, Maria V. Sanchez-Vives, Maurizio Mattia, Sonja Grün, Pier Stanislao Paolucci, Michael Denker

    Abstract: Neuroscience is moving towards a more integrative discipline, where understanding brain function requires consolidating the accumulated evidence seen across experiments, species, and measurement techniques. A remaining challenge on that path is integrating such heterogeneous data into analysis workflows such that consistent and comparable conclusions can be distilled as an experimental basis for m… ▽ More

    Submitted 7 February, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  5. arXiv:2211.06889  [pdf, other

    q-bio.NC cs.DC

    NREM and REM: cognitive and energetic gains in thalamo-cortical slee** and awake spiking model

    Authors: Chiara De Luca, Leonardo Tonielli, Elena Pastorelli, Cristiano Capone, Francesco Simula, Cosimo Lupo, Irene Bernava, Giulia De Bonis, Gianmarco Tiddia, Bruno Golosio, Pier Stanislao Paolucci

    Abstract: Sleep is essential for learning and cognition, but the mechanisms by which it stabilizes learning, supports creativity, and manages the energy consumption of networks engaged in post-sleep task have not been yet modelled. During sleep, the brain cycles between non-rapid eye movement (NREM), a mainly unconscious state characterized by collective oscillations, and rapid eye movement (REM), associate… ▽ More

    Submitted 3 January, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: 22 pages, 9 figures

  6. arXiv:2211.02553  [pdf, other

    q-bio.NC

    Beyond spiking networks: the computational advantages of dendritic amplification and input segregation

    Authors: Cristiano Capone, Cosimo Lupo, Paolo Muratore, Pier Stanislao Paolucci

    Abstract: The brain can efficiently learn a wide range of tasks, motivating the search for biologically inspired learning rules for improving current artificial intelligence technology. Most biological models are composed of point neurons, and cannot achieve the state-of-the-art performances in machine learning. Recent works have proposed that segregation of dendritic input (neurons receive sensory informat… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.11717

  7. arXiv:2205.10044  [pdf, other

    cs.LG q-bio.NC

    Towards biologically plausible Dreaming and Planning in recurrent spiking networks

    Authors: Cristiano Capone, Pier Stanislao Paolucci

    Abstract: Humans and animals can learn new skills after practicing for a few hours, while current reinforcement learning algorithms require a large amount of data to achieve good performances. Recent model-based approaches show promising results by reducing the number of necessary interactions with the environment to learn a desirable policy. However, these methods require biological implausible ingredients… ▽ More

    Submitted 8 June, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  8. arXiv:2201.11717  [pdf, other

    q-bio.NC

    Burst-dependent plasticity and dendritic amplification support target-based learning and hierarchical imitation learning

    Authors: Cristiano Capone, Cosimo Lupo, Paolo Muratore, Pier Stanislao Paolucci

    Abstract: The brain can learn to solve a wide range of tasks with high temporal and energetic efficiency. However, most biological models are composed of simple single compartment neurons and cannot achieve the state-of-art performances of artificial intelligence. We propose a multi-compartment model of pyramidal neuron, in which bursts and dendritic input segregation give the possibility to plausibly suppo… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: 9 pages, 3 figures

  9. arXiv:2201.01088  [pdf, other

    physics.comp-ph cs.AR

    Architectural improvements and technological enhancements for the APEnet+ interconnect system

    Authors: R. Ammendola, A. Biagioni, O. Frezza, A. Lonardo, F. Lo Cicero, M. Martinelli, P. S. Paolucci, E. Pastorelli, D. Rossetti, F. Simula, L. Tosoratto, P. Vicini

    Abstract: The APEnet+ board delivers a point-to-point, low-latency, 3D torus network interface card. In this paper we describe the latest generation of APEnet NIC, APEnet v5, integrated in a PCIe Gen3 board based on a state-of-the-art, 28 nm Altera Stratix V FPGA. The NIC features a network architecture designed following the Remote DMA paradigm and tailored to tightly bind the computing power of modern GPU… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

    Journal ref: **st February 3, 2015

  10. Error-based or target-based? A unifying framework for learning in recurrent spiking networks

    Authors: Cristiano Capone, Paolo Muratore, Pier Stanislao Paolucci

    Abstract: Learning in biological or artificial networks means changing the laws governing the network dynamics in order to better behave in a specific situation. In the field of supervised learning, two complementary approaches stand out: error-based and target-based learning. However, there exists no consensus on which is better suited for which task, and what is the most biologically plausible. Here we pr… ▽ More

    Submitted 8 September, 2021; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: Main text: 14 pages, 5 figures Suppl. Mat.: 12 pages, 3 figures

  11. arXiv:2104.07445  [pdf, other

    q-bio.NC math.DS

    Simulations Approaching Data: Cortical Slow Waves in Inferred Models of the Whole Hemisphere of Mouse

    Authors: Cristiano Capone, Chiara De Luca, Giulia De Bonis, Robin Gutzen, Irene Bernava, Elena Pastorelli, Francesco Simula, Cosimo Lupo, Leonardo Tonielli, Anna Letizia Allegra Mascaro, Francesco Resta, Francesco Pavone, Micheal Denker, Pier Stanislao Paolucci

    Abstract: Thanks to novel, powerful brain activity recording techniques, we can create data-driven models from thousands of recording channels and large portions of the cortex, which can improve our understanding of brain-states neuromodulation and the related richness of traveling waves dynamics. We investigate the inference of data-driven models and the comparison among experiments and simulations, thro… ▽ More

    Submitted 29 November, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

  12. arXiv:2007.14236  [pdf, other

    q-bio.NC cs.DC cs.NE

    Fast simulations of highly-connected spiking cortical models using GPUs

    Authors: Bruno Golosio, Gianmarco Tiddia, Chiara De Luca, Elena Pastorelli, Francesco Simula, Pier Stanislao Paolucci

    Abstract: Over the past decade there has been a growing interest in the development of parallel hardware systems for simulating large-scale networks of spiking neurons. Compared to other highly-parallel systems, GPU-accelerated solutions have the advantage of a relatively low cost and a great versatility, thanks also to the possibility of using the CUDA-C/C++ programming languages. NeuronGPU is a GPU librar… ▽ More

    Submitted 9 November, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

    Journal ref: Front. Comput. Neurosci. 15:627620 2021

  13. Thalamo-cortical spiking model of incremental learning combining perception, context and NREM-sleep-mediated noise-resilience

    Authors: Bruno Golosio, Chiara De Luca, Cristiano Capone, Elena Pastorelli, Giovanni Stegel, Gianmarco Tiddia, Giulia De Bonis, Pier Stanislao Paolucci

    Abstract: The brain exhibits capabilities of fast incremental learning from few noisy examples, as well as the ability to associate similar memories in autonomously-created categories and to combine contextual hints with sensory perceptions. Together with sleep, these mechanisms are thought to be key components of many high-level cognitive functions. Yet, little is known about the underlying processes and t… ▽ More

    Submitted 5 August, 2021; v1 submitted 26 March, 2020; originally announced March 2020.

    Journal ref: PLOS Computational Biology 17(6): e1009045 (2021)

  14. Target spiking patterns enable efficient and biologically plausible learning for complex temporal tasks

    Authors: Paolo Muratore, Cristiano Capone, Pier Stanislao Paolucci

    Abstract: Recurrent spiking neural networks (RSNN) in the human brain learn to perform a wide range of perceptual, cognitive and motor tasks very efficiently in terms of energy consumption and requires very few examples. This motivates the search for biologically inspired learning rules for RSNNs to improve our understanding of brain computation and the efficiency of artificial intelligence. Several spiking… ▽ More

    Submitted 19 March, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: 22 pages, 5 figures. original research

    Journal ref: PLOS ONE 16(2): e0247014 (2021)

  15. Slow Waves Analysis Pipeline for extracting the Features of the Bi-Modality from the Cerebral Cortex of Anesthetized Mice

    Authors: Giulia De Bonis, Miguel Dasilva, Antonio Pazienti, Maria V. Sanchez-Vives, Maurizio Mattia, Pier Stanislao Paolucci

    Abstract: Cortical slow oscillations are an emergent property of the cortical network, a hallmark of low complexity brain states like sleep, and represent a default activity pattern. Here, we present a methodological approach for quantifying the spatial and temporal properties of this emergent activity. We improved and enriched a robust analysis procedure that has already been successfully applied to both i… ▽ More

    Submitted 8 March, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: 18 pages, 10 figures, 1 table

  16. Scaling of a large-scale simulation of synchronous slow-wave and asynchronous awake-like activity of a cortical model with long-range interconnections

    Authors: Elena Pastorelli, Cristiano Capone, Francesco Simula, Maria V. Sanchez-Vives, Paolo Del Giudice, Maurizio Mattia, Pier Stanislao Paolucci

    Abstract: Cortical synapse organization supports a range of dynamic states on multiple spatial and temporal scales, from synchronous slow wave activity (SWA), characteristic of deep sleep or anesthesia, to fluctuating, asynchronous activity during wakefulness (AW). Such dynamic diversity poses a challenge for producing efficient large-scale simulations that embody realistic metaphors of short- and long-rang… ▽ More

    Submitted 26 November, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: 22 pages, 9 figures, 4 tables

    Journal ref: Front Syst Neurosci. 2019;13:33. Published 2019 Jul 23

  17. arXiv:1812.04974  [pdf, other

    cs.DC cs.NE q-bio.NC

    Real-time cortical simulations: energy and interconnect scaling on distributed systems

    Authors: Francesco Simula, Elena Pastorelli, Pier Stanislao Paolucci, Michele Martinelli, Alessandro Lonardo, Andrea Biagioni, Cristiano Capone, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Luca Pontisso, Piero Vicini, Roberto Ammendola

    Abstract: We profile the impact of computation and inter-processor communication on the energy consumption and on the scaling of cortical simulations approaching the real-time regime on distributed computing platforms. Also, the speed and energy consumption of processor architectures typical of standard HPC and embedded platforms are compared. We demonstrate the importance of the design of low-latency inter… ▽ More

    Submitted 26 November, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: 8 pages, 8 figures, 4 tables, submitted after final publication on PDP2019 proceedings, corrected final DOI. arXiv admin note: text overlap with arXiv:1812.04974, arXiv:1804.03441

    Journal ref: 27th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP), Pavia, Italy, February 13-15, 2019, pp. 283-290

  18. Analysis and Model of Cortical Slow Waves Acquired with Optical Techniques

    Authors: Marco Celotto, Chiara De Luca, Paolo Muratore, Francesco Resta, Anna Letizia Allegra Mascaro, Francesco Saverio Pavone, Giulia De Bonis, Pier Stanislao Paolucci

    Abstract: Slow waves (SWs) are spatio-temporal patterns of cortical activity that occur both during natural sleep and anesthesia and are preserved across species. Even though electrophysiological recordings have been largely used to characterize brain states, they are limited in the spatial resolution and cannot target specific neuronal population. Recently, large-scale optical imaging techniques coupled wi… ▽ More

    Submitted 31 January, 2020; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: 26 pages, 19 figures, 1 table

    Journal ref: Methods Protoc. 2020, 3(1), 14

  19. arXiv:1810.10498  [pdf

    q-bio.NC cs.AI cs.DC

    Sleep-like slow oscillations improve visual classification through synaptic homeostasis and memory association in a thalamo-cortical model

    Authors: Cristiano Capone, Elena Pastorelli, Bruno Golosio, Pier Stanislao Paolucci

    Abstract: The occurrence of sleep passed through the evolutionary sieve and is widespread in animal species. Sleep is known to be beneficial to cognitive and mnemonic tasks, while chronic sleep deprivation is detrimental. Despite the importance of the phenomenon, a complete understanding of its functions and underlying mechanisms is still lacking. In this paper, we show interesting effects of deep-sleep-lik… ▽ More

    Submitted 18 November, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: 11 pages, 5 figures, v5 is the final version published on Scientific Reports journal

    Journal ref: Sci Rep 9, 8990 (2019)

  20. Large Scale Low Power Computing System - Status of Network Design in ExaNeSt and EuroExa Projects

    Authors: Roberto Ammendola, Andrea Biagioni, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Elena Pastorelli, Luca Pontisso, Francesco Simula, Piero Vicini

    Abstract: The deployment of the next generation computing platform at ExaFlops scale requires to solve new technological challenges mainly related to the impressive number (up to 10^6) of compute elements required. This impacts on system power consumption, in terms of feasibility and costs, and on system scalability and computing efficiency. In this perspective analysis, exploration and evaluation of techno… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Journal ref: (2018) Advances in Parallel Computing, 32, pp. 750-759

  21. arXiv:1804.03441  [pdf, other

    cs.DC cs.NE q-bio.NC

    The Brain on Low Power Architectures - Efficient Simulation of Cortical Slow Waves and Asynchronous States

    Authors: Roberto Ammendola, Andrea Biagioni, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci, Elena Pastorelli, Luca Pontisso, Francesco Simula, Piero Vicini

    Abstract: Efficient brain simulation is a scientific grand challenge, a parallel/distributed coding challenge and a source of requirements and suggestions for future computing architectures. Indeed, the human brain includes about 10^15 synapses and 10^11 neurons activated at a mean rate of several Hz. Full brain simulation poses Exascale challenges even if simulated at the highest abstraction level. The Wav… ▽ More

    Submitted 10 April, 2018; originally announced April 2018.

    Journal ref: (2018) Advances in Parallel Computing, 32, pp. 760-769

  22. arXiv:1803.08833  [pdf, other

    cs.DC cs.NE q-bio.NC

    Gaussian and exponential lateral connectivity on distributed spiking neural network simulation

    Authors: Elena Pastorelli, Pier Stanislao Paolucci, Francesco Simula, Andrea Biagioni, Fabrizio Capuani, Paolo Cretaro, Giulia De Bonis, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Luca Pontisso, Piero Vicini, Roberto Ammendola

    Abstract: We measured the impact of long-range exponentially decaying intra-areal lateral connectivity on the scaling and memory occupation of a distributed spiking neural network simulator compared to that of short-range Gaussian decays. While previous studies adopted short-range connectivity, recent experimental neurosciences studies are pointing out the role of longer-range intra-areal connectivity with… ▽ More

    Submitted 19 February, 2019; v1 submitted 23 March, 2018; originally announced March 2018.

    Comments: 9 pages, 9 figures, added reference to final peer reviewed version on conference paper and DOI

  23. arXiv:1606.04099  [pdf, other

    physics.ins-det hep-ex

    GPU-based Real-time Triggering in the NA62 Experiment

    Authors: R. Ammendola, A. Biagioni, P. Cretaro, S. Di Lorenzo, R. Fantechi, M. Fiorini, O. Frezza, G. Lamanna, F. Lo Cicero, A. Lonardo, M. Martinelli, I. Neri, P. S. Paolucci, E. Pastorelli, R. Piandani, L. Pontisso, D. Rossetti, F. Simula, M. Sozzi, P. Vicini

    Abstract: Over the last few years the GPGPU (General-Purpose computing on Graphics Processing Units) paradigm represented a remarkable development in the world of computing. Computing for High-Energy Physics is no exception: several works have demonstrated the effectiveness of the integration of GPU-based systems in high level trigger of different experiments. On the other hand the use of GPUs in the low le… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

  24. arXiv:1512.05264  [pdf

    cs.DC q-bio.NC

    Impact of exponential long range and Gaussian short range lateral connectivity on the distributed simulation of neural networks including up to 30 billion synapses

    Authors: Elena Pastorelli, Pier Stanislao Paolucci, Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Francesco Simula, Piero Vicini

    Abstract: Recent experimental neuroscience studies are pointing out the role of long-range intra-areal connectivity that can be modeled by a distance dependent exponential decay of the synaptic probability distribution. This short report provides a preliminary measure of the impact of exponentially decaying lateral connectivity compared to that of shorter-range Gaussian decays on the scaling behaviour and m… ▽ More

    Submitted 16 December, 2015; originally announced December 2015.

    Comments: 6 pages, 4 figures, 1 table

    ACM Class: C.2.4; C.1.4

  25. arXiv:1511.09325  [pdf

    cs.DC q-bio.NC

    Scaling to 1024 software processes and hardware cores of the distributed simulation of a spiking neural network including up to 20G synapses

    Authors: Elena Pastorelli, Pier Stanislao Paolucci, Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Francesco Simula, Piero Vicini

    Abstract: This short report describes the scaling, up to 1024 software processes and hardware cores, of a distributed simulator of plastic spiking neural networks. A previous report demonstrated good scalability of the simulator up to 128 processes. Herein we extend the speed-up measurements and strong and weak scaling analysis of the simulator to the range between 1 and 1024 software processes and hardware… ▽ More

    Submitted 30 November, 2015; originally announced November 2015.

    Comments: 6 pages, 4 figures, 1 table

    ACM Class: C.2.4; C.1.4

  26. arXiv:1505.03015  [pdf

    cs.DC q-bio.NC

    Power, Energy and Speed of Embedded and Server Multi-Cores applied to Distributed Simulation of Spiking Neural Networks: ARM in NVIDIA Tegra vs Intel Xeon quad-cores

    Authors: Pier Stanislao Paolucci, Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Elena Pastorelli, Francesco Simula, Piero Vicini

    Abstract: This short note regards a comparison of instantaneous power, total energy consumption, execution time and energetic cost per synaptic event of a spiking neural network simulator (DPSNN-STDP) distributed on MPI processes when executed either on an embedded platform (based on a dual socket quad-core ARM platform) or a server platform (INTEL-based quad-core dual socket platform). We also compare the… ▽ More

    Submitted 12 May, 2015; originally announced May 2015.

    Comments: 4 pages, 1 table

    ACM Class: C.2.4; C.1.4

  27. arXiv:1408.4587  [pdf

    cs.DC cs.CE cs.MS cs.NE q-bio.NC

    EURETILE D7.3 - Dynamic DAL benchmark coding, measurements on MPI version of DPSNN-STDP (distributed plastic spiking neural net) and improvements to other DAL codes

    Authors: Pier Stanislao Paolucci, Iuliana Bacivarov, Devendra Rai, Lars Schor, Lothar Thiele, Hoeseok Yang, Elena Pastorelli, Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: The EURETILE project required the selection and coding of a set of dedicated benchmarks. The project is about the software and hardware architecture of future many-tile distributed fault-tolerant systems. We focus on dynamic workloads characterised by heavy numerical processing requirements. The ambition is to identify common techniques that could be applied to both the Embedded Systems and HPC do… ▽ More

    Submitted 20 August, 2014; originally announced August 2014.

    Comments: 34 pages. arXiv admin note: substantial text overlap with arXiv:1310.8478

  28. arXiv:1406.3568  [pdf, other

    physics.ins-det cs.AR

    NaNet: a Low-Latency, Real-Time, Multi-Standard Network Interface Card with GPUDirect Features

    Authors: A. Lonardo, F. Ameli, R. Ammendola, A. Biagioni, O. Frezza, G. Lamanna, F. Lo Cicero, M. Martinelli, P. S. Paolucci, E. Pastorelli, L. Pontisso, D. Rossetti, F. Simeone, F. Simula, M. Sozzi, L. Tosoratto, P. Vicini

    Abstract: While the GPGPU paradigm is widely recognized as an effective approach to high performance computing, its adoption in low-latency, real-time systems is still in its early stages. Although GPUs typically show deterministic behaviour in terms of latency in executing computational kernels as soon as data is available in their internal memories, assessment of real-time features of a standard GPGPU s… ▽ More

    Submitted 13 June, 2014; originally announced June 2014.

  29. arXiv:1311.4007  [pdf, other

    physics.ins-det cs.DC

    NaNet: a flexible and configurable low-latency NIC for real-time trigger systems based on GPUs

    Authors: R. Ammendola, A. Biagioni, O. Frezza, G. Lamanna, A. Lonardo, F. Lo Cicero, P. S. Paolucci, F. Pantaleo, D. Rossetti, F. Simula, M. Sozzi, L. Tosoratto, P. Vicini

    Abstract: NaNet is an FPGA-based PCIe X8 Gen2 NIC supporting 1/10 GbE links and the custom 34 Gbps APElink channel. The design has GPUDirect RDMA capabilities and features a network stack protocol offloading module, making it suitable for building low-latency, real-time GPU-based computing systems. We provide a detailed description of the NaNet hardware modular architecture. Benchmarks for latency and bandw… ▽ More

    Submitted 9 January, 2014; v1 submitted 15 November, 2013; originally announced November 2013.

    Comments: Proceedings for the TWEPP 2013 - Topical Workshop on Electronics for Particle Physics workshop

  30. arXiv:1311.1741  [pdf, other

    cs.AR cs.DC physics.comp-ph

    Architectural improvements and 28 nm FPGA implementation of the APEnet+ 3D Torus network for hybrid HPC systems

    Authors: Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Pier Stanislao Paolucci, Alessandro Lonardo, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: Modern Graphics Processing Units (GPUs) are now considered accelerators for general purpose computation. A tight interaction between the GPU and the interconnection network is the strategy to express the full potential on capability computing of a multi-GPU system on large HPC clusters; that is the reason why an efficient and scalable interconnect is a key technology to finally deliver GPUs for sc… ▽ More

    Submitted 14 November, 2013; v1 submitted 7 November, 2013; originally announced November 2013.

    Comments: Proceedings for the 20th International Conference on Computing in High Energy and Nuclear Physics (CHEP)

  31. NaNet:a low-latency NIC enabling GPU-based, real-time low level trigger systems

    Authors: Roberto Ammendola, Andrea Biagioni, Riccardo Fantechi, Ottorino Frezza, Gianluca Lamanna, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Felice Pantaleo, Roberto Piandani, Luca Pontisso, Davide Rossetti, Francesco Simula, Marco Sozzi, Laura Tosoratto, Piero Vicini

    Abstract: We implemented the NaNet FPGA-based PCI2 Gen2 GbE/APElink NIC, featuring GPUDirect RDMA capabilities and UDP protocol management offloading. NaNet is able to receive a UDP input data stream from its GbE interface and redirect it, without any intermediate buffering or CPU intervention, to the memory of a Fermi/Kepler GPU hosted on the same PCIe bus, provided that the two devices share the same upst… ▽ More

    Submitted 22 November, 2013; v1 submitted 5 November, 2013; originally announced November 2013.

    Comments: Proceedings for the 20th International Conference on Computing in High Energy and Nuclear Physics (CHEP)

  32. arXiv:1310.8478  [pdf

    cs.DC q-bio.NC

    Distributed simulation of polychronous and plastic spiking neural networks: strong and weak scaling of a representative mini-application benchmark executed on a small-scale commodity cluster

    Authors: Pier Stanislao Paolucci, Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Elena Pastorelli, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: We introduce a natively distributed mini-application benchmark representative of plastic spiking neural network simulators. It can be used to measure performances of existing computing platforms and to drive the development of future parallel/distributed computing systems dedicated to the simulation of plastic spiking networks. The mini-application is designed to generate spiking behaviors and syn… ▽ More

    Submitted 14 April, 2014; v1 submitted 31 October, 2013; originally announced October 2013.

    Comments: Added detailed profiling of computational and communication components. Improved speed and size of simulated networks. 15 pages, 5 figures, 3 tables

    ACM Class: C.2.4; C.1.4

  33. arXiv:1307.8276  [pdf, other

    physics.comp-ph cs.DC

    GPU peer-to-peer techniques applied to a cluster interconnect

    Authors: Roberto Ammendola, Massimo Bernaschi, Andrea Biagioni, Mauro Bisson, Massimiliano Fatica, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Enrico Mastrostefano, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: Modern GPUs support special protocols to exchange data directly across the PCI Express bus. While these protocols could be used to reduce GPU data transmission times, basically by avoiding staging to host memory, they require specific hardware features which are not available on current generation network adapters. In this paper we describe the architectural modifications required to implement pee… ▽ More

    Submitted 31 July, 2013; originally announced July 2013.

    Comments: paper accepted to CASS 2013

  34. arXiv:1307.1270  [pdf, other

    cs.DC

    A heterogeneous many-core platform for experiments on scalable custom interconnects and management of fault and critical events, applied to many-process applications: Vol. II, 2012 technical report

    Authors: Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Werner Geurts, Gert Goossens, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: This is the second of a planned collection of four yearly volumes describing the deployment of a heterogeneous many-core platform for experiments on scalable custom interconnects and management of fault and critical events, applied to many-process applications. This volume covers several topics, among which: 1- a system for awareness of faults and critical events (named LO|FA|MO) on experimental h… ▽ More

    Submitted 4 July, 2013; originally announced July 2013.

    Comments: 119 pages

    MSC Class: 68M10; 68M14; 68M15 ACM Class: B.8.1; C.1.4; C.3; C.4; C.5.1

  35. arXiv:1307.0433  [pdf, other

    cs.DC cs.NI

    'Mutual Watch-dog Networking': Distributed Awareness of Faults and Critical Events in Petascale/Exascale systems

    Authors: Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: Many tile systems require techniques to be applied to increase components resilience and control the FIT (Failures In Time) rate. When scaling to peta- exa-scale systems the FIT rate may become unacceptable due to component numerosity, requiring more systemic countermeasures. Thus, the ability to be fault aware, i.e. to detect and collect information about fault and critical events, is a necessary… ▽ More

    Submitted 2 July, 2013; v1 submitted 1 July, 2013; originally announced July 2013.

    Comments: Technical Report, Preprint

  36. arXiv:1305.1459  [pdf

    cs.DC cs.AR cs.NE cs.OS cs.PL

    EURETILE 2010-2012 summary: first three years of activity of the European Reference Tiled Experiment

    Authors: Pier Stanislao Paolucci, Iuliana Bacivarov, Gert Goossens, Rainer Leupers, Frédéric Rousseau, Christoph Schumacher, Lothar Thiele, Piero Vicini

    Abstract: This is the summary of first three years of activity of the EURETILE FP7 project 247846. EURETILE investigates and implements brain-inspired and fault-tolerant foundational innovations to the system architecture of massively parallel tiled computer architectures and the corresponding programming paradigm. The execution targets are a many-tile HW platform, and a many-tile simulator. A set of SW pro… ▽ More

    Submitted 7 May, 2013; originally announced May 2013.

    Comments: 56 pages

    ACM Class: C.1.4; C.3; B.7.2; F.2.2

  37. arXiv:1203.1536  [pdf, other

    cs.AR cs.NI

    The Distributed Network Processor: a novel off-chip and on-chip interconnection network architecture

    Authors: Andrea Biagioni, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Mersia Perra, Davide Rossetti, Carlo Sidore, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: One of the most demanding challenges for the designers of parallel computing architectures is to deliver an efficient network infrastructure providing low latency, high bandwidth communications while preserving scalability. Besides off-chip communications between processors, recent multi-tile (i.e. multi-core) architectures face the challenge for an efficient on-chip interconnection network betwee… ▽ More

    Submitted 7 March, 2012; originally announced March 2012.

    Comments: 8 pages, 11 figures, submitted to Hot Interconnect 2009

  38. arXiv:1103.0128  [pdf, other

    physics.ins-det physics.comp-ph

    High-speed data transfer with FPGAs and QSFP+ modules

    Authors: R. Ammendola, A. Biagioni, G. Chiodi, O. Frezza, F. Lo Cicero, A. Lonardo, R. Lunadei, P. S. Paolucci, D. Rossetti, A. Salamon, G. Salina, F. Simula, L. Tosoratto, P. Vicini

    Abstract: We present test results and characterization of a data transmission system based on a last generation FPGA and a commercial QSFP+ (Quad Small Form Pluggable +) module. QSFP+ standard defines a hot-pluggable transceiver available in copper or optical cable assemblies for an aggregated bandwidth of up to 40 Gbps. We implemented a complete testbench based on a commercial development card mounting an… ▽ More

    Submitted 1 March, 2011; originally announced March 2011.

    Comments: 5 pages, 3 figures, Published on JINST Journal of Instrumentation proceedings of Topical Workshop on Electronics for Particle Physics 2010, 20-24 September 2010, Aachen, Germany(R Ammendola et al 2010 JINST 5 C12019)

    Journal ref: JINST 5:C12019,2010

  39. arXiv:1102.3796  [pdf, other

    physics.comp-ph cs.AR

    APEnet+: high bandwidth 3D torus direct network for petaflops scale commodity clusters

    Authors: Roberto Ammendola, Andrea Biagioni, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Pier Stanislao Paolucci, Davide Rossetti, Andrea Salamon, Gaetano Salina, Francesco Simula, Laura Tosoratto, Piero Vicini

    Abstract: We describe herein the APElink+ board, a PCIe interconnect adapter featuring the latest advances in wire speed and interface technology plus hardware support for a RDMA programming model and experimental acceleration of GPU networking; this design allows us to build a low latency, high bandwidth PC cluster, the APEnet+ network, the new generation of our cost-effective, tens-of-thousands-scalable c… ▽ More

    Submitted 18 February, 2011; originally announced February 2011.

    Comments: 6 pages, 7 figures, proceeding of CHEP 2010, Taiwan, October 18-22

  40. Progress and status of APEmille

    Authors: APE collaboration, A. Bartoloni, S. Cabasino, N. Cabibbo, M. Cosimi, P. De Riso, W. Errico, S. Giovannetti, F. Laico, H. Leich, A. Lonardo, G. Magazzu, A. Michelotti, E. Panizzi, P. S. Paolucci, D. Rossetti, U. Schwendicke, H. Simma, K. H. Sulanke, M. Torelli, R. Tripiccione, P. Vicini

    Abstract: We report on the progress and status of the APEmille project: a SIMD parallel computer with a peak performance in the TeraFlops range which is now in an advanced development phase. We discuss the hardware and software architecture, and present some performance estimates for Lattice Gauge Theory (LGT) applications.

    Submitted 1 October, 1997; originally announced October 1997.

    Comments: Talk presented at LATTICE97, 3 pages, Latex

    Journal ref: Nucl.Phys.Proc.Suppl. 63 (1998) 991-993

  41. Quenched $B_K$-parameter with the Wilson and Clover actions at $β= 6.0$

    Authors: M. Crisafulli, A. Donini, V. Lubicz, G. Martinelli, F. Rapuano, C. Ungarelli, A. Vladikas, the APE Collaboration, :, A. Bartoloni, C. Battista, S. Cabasino, N. Cabibbo, F. Marzano, E. Panizzi, P. S. Paolucci, R. Sarno, G. M. Todesco, M. Torelli, P. Vicini

    Abstract: We present results for the Kaon $B$ parameter from a sample of $200$ configurations using the Wilson action and $460$ configurations using the Clover action, on a $18^3 \times 64$ lattice at $β=6.0$. A slight improvement of the chiral behaviour of $B_K$ is observed due to the Clover action. We have also compared the results for $B_K$ obtained from two different procedures for the boosting of the… ▽ More

    Submitted 25 May, 1995; originally announced May 1995.

    Comments: 3 pages, Latex, Postscript file with figures available at ftp://hpteo.roma1.infn.it/pub/preprints/lat94/donini ; to appear in Lattice '94, Nucl. Phys. (Proc.Suppl.)

    Journal ref: Nucl.Phys.Proc.Suppl. 42 (1995) 397-399

  42. APE Results of Hadron Masses in Full QCD Simulations

    Authors: S. Antonelli, A. Bartoloni, C. Battista, M. Bellacci, S. Cabasino, N. Cabibbo, L. A. Fernandez, E. Panizzi, P. S. Paolucci, A. Munoz-Sudupe, J. J. Ruiz-Lorenzo, R. Sarno, A. Tarancon, G. M. Todesco, M. Torelli, P. Vicini

    Abstract: We present numerical results obtained in full QCD with 2 flavors of Wilson fermions. We discuss the relation between the phase of Polyakov loops and the {\bf sea} quarks boundary conditions. We report preliminary results about the HMC autocorrelation of the hadronic masses, on a $16^3 \times 32$ lattice volume, at $β=5.55$ with $k_{sea}=0.1570$.

    Submitted 30 November, 1994; originally announced November 1994.

    Comments: 3 pages, compressed ps-file (uufiles), Contribution to Lattice 94

    Journal ref: Nucl.Phys.Proc.Suppl. 42 (1995) 300-302

  43. Lattice Calculation of D- and B-meson Semileptonic Decays, using the Clover Action at beta=6.0 on APE

    Authors: C. R. Allton, M. Crisafulli, V. Lubicz, G. Martinelli, F. Rapuano, N. Stella, A. Vladikas, A. Bartoloni, C. Battista, S. Cabasino, N. Cabibbo, E. Panizzi, P. S. Paolucci, R. Sarno, G. M. Todesco, M. Torelli, P. Vicini

    Abstract: We present the results of a high statistics lattice calculation of hadronic form factors relevant for $D-$ and $B-$meson semi-leptonic decays into light pseudoscalar and vector mesons. The results have been obtained by averaging over 170 gauge field configurations, generated in the quenched approximation, at $β=6.0$, on a $18^3 \times 64$ lattice, using the $O(a)$-improved SW-Clover action.From… ▽ More

    Submitted 7 November, 1994; originally announced November 1994.

    Comments: LaTeX, 15 pages, postscript figures attached uuencoded

    Report number: BU-HEP 94-29, CERN-TH.7484/94, ROME prep. 94/1050

    Journal ref: Phys.Lett. B345 (1995) 513-523

  44. Polyakov Loops and Finite-Size Effects of Hadron Masses in Lattice Full Q.C.D

    Authors: S. Antonelli, M. Bellacci, L. A. Fernández, A. Muñoz-Sudupe, J. J. Ruiz-Lorenzo, R. Sarno, A. Tarancón, A. Bartoloni, C. Battista, N. Cabibbo, S. Cabasino, E. Panizzi, P. S. Paolucci, G. M. Todesco, M. Torelli, R. Tripiccione, P. Vicini

    Abstract: The polarization of Polyakov type loops is responsible for the difference between quenched and unquenched finite size effects on the QCD mass spectrum. With a numerical simulation, using different sea quarks boundary conditions, we show that we can align the spatial Polyakov loops in a predefined direction. Starting from these results, we propose a procedure to partially remove the Polyakov type… ▽ More

    Submitted 12 May, 1994; originally announced May 1994.

    Comments: (6 pages latex). 4 postscript figures availables via anonymous ftp to 141.108.5.193 . Preprint Rome 94/1013

    Journal ref: Phys.Lett.B345:49-54,1995

  45. A High Statistics Lattice Calculation of $f^{static}_B$ at $β=6.2$ Using the Clover Action

    Authors: C. R. Allton, M. Crisafulli, V. Lubicz, G. Salina, G. Martinelli, A. Vladikas, A. Bartoloni, C. Battista, S. Cabasino, N. Cabibbo, F. Marzano, P. S. Paolucci, J. Pech, F. Rapuano, R. Sarno, G. M. Todesco, M. Torelli, W. Tross, P. Vicini, The APE Collaboration

    Abstract: We present a calculation of $f_B$ in the static limit, obtained by numerical simulation of quenched QCD, at $β=6.2$ on a $18^3 \times 64$ lattice, using the SW-Clover quark action. The decay constant has been extracted by studying heavy(static)-light correlation functions of different smeared operators, on a sample of 220 gauge field configurations. We have obtained… ▽ More

    Submitted 23 February, 1994; originally announced February 1994.

    Comments: 12 pages, LaTex, 3 figs. (figures not included; available upon request from [email protected]) ROME prep. 94/981, 18 February 1994

    Journal ref: Phys.Lett. B326 (1994) 295-302