-
Robustness of Decentralised Learning to Nodes and Data Disruption
Authors:
Luigi Palmieri,
Chiara Boldrini,
Lorenzo Valerio,
Andrea Passarella,
Marco Conti,
János Kertész
Abstract:
In the vibrant landscape of AI research, decentralised learning is gaining momentum. Decentralised learning allows individual nodes to keep data locally where they are generated and to share knowledge extracted from local data among themselves through an interactive process of collaborative refinement. This paradigm supports scenarios where data cannot leave local nodes due to privacy or sovereign…
▽ More
In the vibrant landscape of AI research, decentralised learning is gaining momentum. Decentralised learning allows individual nodes to keep data locally where they are generated and to share knowledge extracted from local data among themselves through an interactive process of collaborative refinement. This paradigm supports scenarios where data cannot leave local nodes due to privacy or sovereignty reasons or real-time constraints imposing proximity of models to locations where inference has to be carried out. The distributed nature of decentralised learning implies significant new research challenges with respect to centralised learning. Among them, in this paper, we focus on robustness issues. Specifically, we study the effect of nodes' disruption on the collective learning process. Assuming a given percentage of "central" nodes disappear from the network, we focus on different cases, characterised by (i) different distributions of data across nodes and (ii) different times when disruption occurs with respect to the start of the collaborative learning task. Through these configurations, we are able to show the non-trivial interplay between the properties of the network connecting nodes, the persistence of knowledge acquired collectively before disruption or lack thereof, and the effect of data availability pre- and post-disruption. Our results show that decentralised learning processes are remarkably robust to network disruption. As long as even minimum amounts of data remain available somewhere in the network, the learning process is able to recover from disruptions and achieve significant classification accuracy. This clearly varies depending on the remaining connectivity after disruption, but we show that even nodes that remain completely isolated can retain significant knowledge acquired before the disruption.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Initialisation and Topology Effects in Decentralised Federated Learning
Authors:
Arash Badie-Modiri,
Chiara Boldrini,
Lorenzo Valerio,
János Kertész,
Márton Karsai
Abstract:
Fully decentralised federated learning enables collaborative training of individual machine learning models on distributed devices on a communication network while kee** the training data localised. This approach enhances data privacy and eliminates both the single point of failure and the necessity for central coordination. Our research highlights that the effectiveness of decentralised federat…
▽ More
Fully decentralised federated learning enables collaborative training of individual machine learning models on distributed devices on a communication network while kee** the training data localised. This approach enhances data privacy and eliminates both the single point of failure and the necessity for central coordination. Our research highlights that the effectiveness of decentralised federated learning is significantly influenced by the network topology of connected devices. We propose a strategy for uncoordinated initialisation of the artificial neural networks, which leverages the distribution of eigenvector centralities of the nodes of the underlying communication network, leading to a radically improved training efficiency. Additionally, our study explores the scaling behaviour and choice of environmental parameters under our proposed initialisation strategy. This work paves the way for more efficient and scalable artificial neural network training in a distributed and uncoordinated environment, offering a deeper understanding of the intertwining roles of network structure and learning dynamics.
△ Less
Submitted 22 May, 2024; v1 submitted 23 March, 2024;
originally announced March 2024.
-
Impact of network topology on the performance of Decentralized Federated Learning
Authors:
Luigi Palmieri,
Chiara Boldrini,
Lorenzo Valerio,
Andrea Passarella,
Marco Conti
Abstract:
Fully decentralized learning is gaining momentum for training AI models at the Internet's edge, addressing infrastructure challenges and privacy concerns. In a decentralized machine learning system, data is distributed across multiple nodes, with each node training a local model based on its respective dataset. The local models are then shared and combined to form a global model capable of making…
▽ More
Fully decentralized learning is gaining momentum for training AI models at the Internet's edge, addressing infrastructure challenges and privacy concerns. In a decentralized machine learning system, data is distributed across multiple nodes, with each node training a local model based on its respective dataset. The local models are then shared and combined to form a global model capable of making accurate predictions on new data. Our exploration focuses on how different types of network structures influence the spreading of knowledge - the process by which nodes incorporate insights gained from learning patterns in data available on other nodes across the network. Specifically, this study investigates the intricate interplay between network structure and learning performance using three network topologies and six data distribution methods. These methods consider different vertex properties, including degree centrality, betweenness centrality, and clustering coefficient, along with whether nodes exhibit high or low values of these metrics. Our findings underscore the significance of global centrality metrics (degree, betweenness) in correlating with learning performance, while local clustering proves less predictive. We highlight the challenges in transferring knowledge from peripheral to central nodes, attributed to a dilution effect during model aggregation. Additionally, we observe that central nodes exert a pull effect, facilitating the spread of knowledge. In examining degree distribution, hubs in Barabasi-Albert networks positively impact learning for central nodes but exacerbate dilution when knowledge originates from peripheral nodes. Finally, we demonstrate the formidable challenge of knowledge circulation outside of segregated communities.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Coordination-free Decentralised Federated Learning on Complex Networks: Overcoming Heterogeneity
Authors:
Lorenzo Valerio,
Chiara Boldrini,
Andrea Passarella,
János Kertész,
Márton Karsai,
Gerardo Iñiguez
Abstract:
Federated Learning (FL) is a well-known framework for successfully performing a learning task in an edge computing scenario where the devices involved have limited resources and incomplete data representation. The basic assumption of FL is that the devices communicate directly or indirectly with a parameter server that centrally coordinates the whole process, overcoming several challenges associat…
▽ More
Federated Learning (FL) is a well-known framework for successfully performing a learning task in an edge computing scenario where the devices involved have limited resources and incomplete data representation. The basic assumption of FL is that the devices communicate directly or indirectly with a parameter server that centrally coordinates the whole process, overcoming several challenges associated with it. However, in highly pervasive edge scenarios, the presence of a central controller that oversees the process cannot always be guaranteed, and the interactions (i.e., the connectivity graph) between devices might not be predetermined, resulting in a complex network structure. Moreover, the heterogeneity of data and devices further complicates the learning process. This poses new challenges from a learning standpoint that we address by proposing a communication-efficient Decentralised Federated Learning (DFL) algorithm able to cope with them. Our solution allows devices communicating only with their direct neighbours to train an accurate model, overcoming the heterogeneity induced by data and different training histories. Our results show that the resulting local models generalise better than those trained with competing approaches, and do so in a more communication-efficient way.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Exploring the Impact of Disrupted Peer-to-Peer Communications on Fully Decentralized Learning in Disaster Scenarios
Authors:
Luigi Palmieri,
Chiara Boldrini,
Lorenzo Valerio,
Andrea Passarella,
Marco Conti
Abstract:
Fully decentralized learning enables the distribution of learning resources and decision-making capabilities across multiple user devices or nodes, and is rapidly gaining popularity due to its privacy-preserving and decentralized nature. Importantly, this crowdsourcing of the learning process allows the system to continue functioning even if some nodes are affected or disconnected. In a disaster s…
▽ More
Fully decentralized learning enables the distribution of learning resources and decision-making capabilities across multiple user devices or nodes, and is rapidly gaining popularity due to its privacy-preserving and decentralized nature. Importantly, this crowdsourcing of the learning process allows the system to continue functioning even if some nodes are affected or disconnected. In a disaster scenario, communication infrastructure and centralized systems may be disrupted or completely unavailable, hindering the possibility of carrying out standard centralized learning tasks in these settings. Thus, fully decentralized learning can help in this case. However, transitioning from centralized to peer-to-peer communications introduces a dependency between the learning process and the topology of the communication graph among nodes. In a disaster scenario, even peer-to-peer communications are susceptible to abrupt changes, such as devices running out of battery or getting disconnected from others due to their position. In this study, we investigate the effects of various disruptions to peer-to-peer communications on decentralized learning in a disaster setting. We examine the resilience of a decentralized learning process when a subset of devices drop from the process abruptly. To this end, we analyze the difference between losing devices holding data, i.e., potential knowledge, vs. devices contributing only to the graph connectivity, i.e., with no data. Our findings on a Barabasi-Albert graph topology, where training data is distributed across nodes in an IID fashion, indicate that the accuracy of the learning process is more affected by a loss of connectivity than by a loss of data. Nevertheless, the network remains relatively robust, and the learning process can achieve a good level of accuracy.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
The effect of network topologies on fully decentralized learning: a preliminary investigation
Authors:
Luigi Palmieri,
Lorenzo Valerio,
Chiara Boldrini,
Andrea Passarella
Abstract:
In a decentralized machine learning system, data is typically partitioned among multiple devices or nodes, each of which trains a local model using its own data. These local models are then shared and combined to create a global model that can make accurate predictions on new data. In this paper, we start exploring the role of the network topology connecting nodes on the performance of a Machine L…
▽ More
In a decentralized machine learning system, data is typically partitioned among multiple devices or nodes, each of which trains a local model using its own data. These local models are then shared and combined to create a global model that can make accurate predictions on new data. In this paper, we start exploring the role of the network topology connecting nodes on the performance of a Machine Learning model trained through direct collaboration between nodes. We investigate how different types of topologies impact the "spreading of knowledge", i.e., the ability of nodes to incorporate in their local model the knowledge derived by learning patterns in data available in other nodes across the networks. Specifically, we highlight the different roles in this process of more or less connected nodes (hubs and leaves), as well as that of macroscopic network properties (primarily, degree distribution and modularity). Among others, we show that, while it is known that even weak connectivity among network components is sufficient for information spread, it may not be sufficient for knowledge spread. More intuitively, we also find that hubs have a more significant role than leaves in spreading knowledge, although this manifests itself not only for heavy-tailed distributions but also when "hubs" have only moderately more connections than leaves. Finally, we show that tightly knit communities severely hinder knowledge spread.
△ Less
Submitted 29 July, 2023;
originally announced July 2023.
-
Anomaly Detection through Unsupervised Federated Learning
Authors:
Mirko Nardi,
Lorenzo Valerio,
Andrea Passarella
Abstract:
Federated learning (FL) is proving to be one of the most promising paradigms for leveraging distributed resources, enabling a set of clients to collaboratively train a machine learning model while kee** the data decentralized. The explosive growth of interest in the topic has led to rapid advancements in several core aspects like communication efficiency, handling non-IID data, privacy, and secu…
▽ More
Federated learning (FL) is proving to be one of the most promising paradigms for leveraging distributed resources, enabling a set of clients to collaboratively train a machine learning model while kee** the data decentralized. The explosive growth of interest in the topic has led to rapid advancements in several core aspects like communication efficiency, handling non-IID data, privacy, and security capabilities. However, the majority of FL works only deal with supervised tasks, assuming that clients' training sets are labeled. To leverage the enormous unlabeled data on distributed edge devices, in this paper, we aim to extend the FL paradigm to unsupervised tasks by addressing the problem of anomaly detection in decentralized settings. In particular, we propose a novel method in which, through a preprocessing phase, clients are grouped into communities, each having similar majority (i.e., inlier) patterns. Subsequently, each community of clients trains the same anomaly detection model (i.e., autoencoders) in a federated fashion. The resulting model is then shared and used to detect anomalies within the clients of the same community that joined the corresponding federated process. Experiments show that our method is robust, and it can detect communities consistent with the ideal partitioning in which groups of clients having the same inlier patterns are known. Furthermore, the performance is significantly better than those in which clients train models exclusively on local data and comparable with federated models of ideal communities' partition.
△ Less
Submitted 9 September, 2022;
originally announced September 2022.
-
Design and evaluation of a cognitive approach for disseminating semantic knowledge and content in opportunistic networks
Authors:
Matteo Mordacchini,
Lorenzo Valerio,
Marco Conti,
Andrea Passarella
Abstract:
In cyber-physical convergence scenarios information flows seamlessly between the physical and the cyber worlds. Here, users' mobile devices represent a natural bridge through which users process acquired information and perform actions. The sheer amount of data available in this context calls for novel, autonomous and lightweight data-filtering solutions, where only relevant information is finally…
▽ More
In cyber-physical convergence scenarios information flows seamlessly between the physical and the cyber worlds. Here, users' mobile devices represent a natural bridge through which users process acquired information and perform actions. The sheer amount of data available in this context calls for novel, autonomous and lightweight data-filtering solutions, where only relevant information is finally presented to users. Moreover, in many real-world scenarios data is not categorised in predefined topics, but it is generally accompanied by semantic descriptions possibly describing users' interests. In these complex conditions, user devices should autonomously become aware not only of the existence of data in the network, but also of their semantic descriptions and correlations between them. To tackle these issues, we present a set of algorithms for knowledge and data dissemination in opportunistic networks, based on simple and very effective models (called cognitive heuristics) coming from cognitive sciences. We show how to exploit them to disseminate both semantic data and the corresponding data items. We provide a thorough performance analysis, under various different conditions comparing our results against non-cognitive solutions. Simulation results demonstrate the superior performance of our solution towards a more effective semantic knowledge acquisition and representation, and a more tailored content acquisition.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Federated Semi-Supervised Classification of Multimedia Flows for 3D Networks
Authors:
Saira Bano,
Achilles Machumilane,
Lorenzo Valerio,
Pietro Cassarà,
Alberto Gotta
Abstract:
Automatic traffic classification is increasingly becoming important in traffic engineering, as the current trend of encrypting transport information (e.g., behind HTTP-encrypted tunnels) prevents intermediate nodes from accessing end-to-end packet headers. However, this information is crucial for traffic sha**, network slicing, and Quality of Service (QoS) management, for preventing network intr…
▽ More
Automatic traffic classification is increasingly becoming important in traffic engineering, as the current trend of encrypting transport information (e.g., behind HTTP-encrypted tunnels) prevents intermediate nodes from accessing end-to-end packet headers. However, this information is crucial for traffic sha**, network slicing, and Quality of Service (QoS) management, for preventing network intrusion, and for anomaly detection. 3D networks offer multiple routes that can guarantee different levels of QoS. Therefore, service classification and separation are essential to guarantee the required QoS level to each traffic sub-flow through the appropriate network trunk. In this paper, a federated feature selection and feature reduction learning scheme is proposed to classify network traffic in a semi-supervised cooperative manner. The federated gateways of 3D network help to enhance the global knowledge of network traffic to improve the accuracy of anomaly and intrusion detection and service identification of a new traffic flow.
△ Less
Submitted 1 May, 2022;
originally announced May 2022.
-
Cellular traffic offloading via Opportunistic Networking with Reinforcement Learning
Authors:
Lorenzo Valerio,
Raffaele Bruno,
Andrea Passarella
Abstract:
The widespread diffusion of mobile phones is triggering an exponential growth of mobile data traffic that is likely to cause, in the near future, considerable traffic overload issues even in last-generation cellular networks. Offloading part of the traffic to other networks is considered a very promising approach and, in particular, in this paper, we consider offloading through opportunistic netwo…
▽ More
The widespread diffusion of mobile phones is triggering an exponential growth of mobile data traffic that is likely to cause, in the near future, considerable traffic overload issues even in last-generation cellular networks. Offloading part of the traffic to other networks is considered a very promising approach and, in particular, in this paper, we consider offloading through opportunistic networks of users' devices. However, the performance of this solution strongly depends on the pattern of encounters between mobile nodes, which should therefore be taken into account when designing offloading control algorithms. In this paper, we propose an adaptive offloading solution based on the Reinforcement Learning framework and we evaluate and compare the performance of two well-known learning algorithms: Actor-Critic and Q-Learning. More precisely, in our solution the controller of the dissemination process, once trained, is able to select a proper number of content replicas to be injected into the opportunistic network to guarantee the timely delivery of contents to all interested users. We show that our system based on Reinforcement Learning is able to automatically learn a very efficient strategy to reduce the traffic on the cellular network, without relying on any additional context information about the opportunistic network. Our solution achieves a higher level of offloading with respect to other state-of-the-art approaches, in a range of different mobility settings. Moreover, we show that a more refined learning solution, based on the Actor-Critic algorithm, is significantly more efficient than a simpler solution based on Q-learning.
△ Less
Submitted 1 October, 2021;
originally announced October 2021.
-
A communication efficient distributed learning framework for smart environments
Authors:
Lorenzo Valerio,
Andrea Passarella,
Marco Conti
Abstract:
Due to the pervasive diffusion of personal mobile and IoT devices, many ``smart environments'' (e.g., smart cities and smart factories) will be, among others, generators of huge amounts of data. Currently, this is typically achieved through centralised cloud-based data analytics services. However, according to many studies, this approach may present significant issues from the standpoint of data o…
▽ More
Due to the pervasive diffusion of personal mobile and IoT devices, many ``smart environments'' (e.g., smart cities and smart factories) will be, among others, generators of huge amounts of data. Currently, this is typically achieved through centralised cloud-based data analytics services. However, according to many studies, this approach may present significant issues from the standpoint of data ownership, and even wireless network capacity. One possibility to cope with these shortcomings is to move data analytics closer to where data is generated. In this paper, we tackle this issue by proposing and analyzing a distributed learning framework, whereby data analytics are performed at the edge of the network, i.e., on locations very close to where data is generated. Specifically, in our framework, partial data analytics are performed directly on the nodes that generate the data, or on nodes close by (e.g., some of the data generators can take this role on behalf of subsets of other nodes nearby). Then, nodes exchange partial models and refine them accordingly. Our framework is general enough to host different analytics services. In the specific case analysed in the paper, we focus on a learning task, considering two distributed learning algorithms. Using an activity recognition and a pattern recognition task, both on reference datasets, we compare the two learning algorithms between each other and with a central cloud solution (i.e., one that has access to the complete datasets). Our results show that using distributed machine learning techniques, it is possible to drastically reduce the network overhead, while obtaining performance comparable to the cloud solution in terms of learning accuracy. The analysis also shows when each distributed learning approach is preferable, based on the specific distribution of the data on the nodes.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Energy efficient distributed analytics at the edge of the network for IoT environments
Authors:
Lorenzo Valerio,
Marco Conti,
Andrea Passarella
Abstract:
Due to the pervasive diffusion of personal mobile and IoT devices, many "smart environments" (e.g., smart cities and smart factories) will be, generators of huge amounts of data. Currently, analysis of this data is typically achieved through centralised cloud-based services. However, according to many studies, this approach may present significant issues from the standpoint of data ownership, as w…
▽ More
Due to the pervasive diffusion of personal mobile and IoT devices, many "smart environments" (e.g., smart cities and smart factories) will be, generators of huge amounts of data. Currently, analysis of this data is typically achieved through centralised cloud-based services. However, according to many studies, this approach may present significant issues from the standpoint of data ownership, as well as wireless network capacity. In this paper, we exploit the fog computing paradigm to move computation close to where data is produced. We exploit a well-known distributed machine learning framework (Hypothesis Transfer Learning), and perform data analytics on mobile nodes passing by IoT devices, in addition to fog gateways at the edge of the network infrastructure. We analyse the performance of different configurations of the distributed learning framework, in terms of (i) accuracy obtained in the learning task and (ii) energy spent to send data between the involved nodes. Specifically, we consider reference wireless technologies for communication between the different types of nodes we consider, e.g. LTE, Nb-IoT, 802.15.4, 802.11, etc. Our results show that collecting data through the mobile nodes and executing the distributed analytics using short-range communication technologies, such as 802.15.4 and 802.11, allows to strongly reduce the energy consumption of the system up to $94\%$ with a loss in accuracy w.r.t. a centralised cloud solution up to $2\%$.
△ Less
Submitted 23 September, 2021;
originally announced September 2021.
-
Federated Feature Selection for Cyber-Physical Systems of Systems
Authors:
Pietro Cassarà,
Alberto Gotta,
Lorenzo Valerio
Abstract:
Autonomous vehicles (AVs) generate a massive amount of multi-modal data that once collected and processed through Machine Learning algorithms, enable AI-based services at the Edge. In fact, not all these data contain valuable, and informative content but only a subset of the relative attributes should be exploited at the Edge. Therefore, enabling AVs to locally extract such a subset is of utmost i…
▽ More
Autonomous vehicles (AVs) generate a massive amount of multi-modal data that once collected and processed through Machine Learning algorithms, enable AI-based services at the Edge. In fact, not all these data contain valuable, and informative content but only a subset of the relative attributes should be exploited at the Edge. Therefore, enabling AVs to locally extract such a subset is of utmost importance to limit computation and communication workloads. Achieving a consistent subset of data in a distributed manner imposes the AVs to cooperate in finding an agreement on what attributes should be sent to the Edge. In this work, we address such a problem by proposing a federated feature selection algorithm where all the AVs collaborate to filter out, iteratively, the redundant or irrelevant attributes in a distributed manner, without any exchange of raw data. This solution builds on two components: a Mutual-Information-based feature selection algorithm run by the AVs and a novel aggregation function based on the Bayes theorem executed on the Edge. Our federated feature selection algorithm provably converges to a solution in a finite number of steps. Such an algorithm has been tested on two reference datasets: MAV with images and inertial measurements of a monitored vehicle, WESAD with a collection of samples from biophysical sensors to monitor a relative passenger. The numerical results show that the fleet finds a consensus with both the datasets on the minimum achievable subset of features, i.e., 24 out of 2166 (99\%) in MAV and 4 out of 8 (50\%) in WESAD, preserving the informative content of data.
△ Less
Submitted 2 May, 2022; v1 submitted 23 September, 2021;
originally announced September 2021.
-
Matter-wave Atomic Gradiometer Interferometric Sensor (MAGIS-100)
Authors:
Mahiro Abe,
Philip Adamson,
Marcel Borcean,
Daniela Bortoletto,
Kieran Bridges,
Samuel P. Carman,
Swapan Chattopadhyay,
Jonathon Coleman,
Noah M. Curfman,
Kenneth DeRose,
Tejas Deshpande,
Savas Dimopoulos,
Christopher J. Foot,
Josef C. Frisch,
Benjamin E. Garber,
Steve Geer,
Valerie Gibson,
Jonah Glick,
Peter W. Graham,
Steve R. Hahn,
Roni Harnik,
Leonie Hawkins,
Sam Hindley,
Jason M. Hogan,
Yijun Jiang
, et al. (23 additional authors not shown)
Abstract:
MAGIS-100 is a next-generation quantum sensor under construction at Fermilab that aims to explore fundamental physics with atom interferometry over a 100-meter baseline. This novel detector will search for ultralight dark matter, test quantum mechanics in new regimes, and serve as a technology pathfinder for future gravitational wave detectors in a previously unexplored frequency band. It combines…
▽ More
MAGIS-100 is a next-generation quantum sensor under construction at Fermilab that aims to explore fundamental physics with atom interferometry over a 100-meter baseline. This novel detector will search for ultralight dark matter, test quantum mechanics in new regimes, and serve as a technology pathfinder for future gravitational wave detectors in a previously unexplored frequency band. It combines techniques demonstrated in state-of-the-art 10-meter-scale atom interferometers with the latest technological advances of the world's best atomic clocks. MAGIS-100 will provide a development platform for a future kilometer-scale detector that would be sufficiently sensitive to detect gravitational waves from known sources. Here we present the science case for the MAGIS concept, review the operating principles of the detector, describe the instrument design, and study the detector systematics.
△ Less
Submitted 6 April, 2021;
originally announced April 2021.
-
Optimising cost vs accuracy of decentralised analytics in fog computing environments
Authors:
Lorenzo Valerio,
Andrea Passarella,
Marco Conti
Abstract:
The exponential growth of devices and data at the edges of the Internet is rising scalability and privacy concerns on approaches based exclusively on remote cloud platforms. Data gravity, a fundamental concept in Fog Computing, points towards decentralisation of computation for data analysis, as a viable alternative to address those concerns. Decentralising AI tasks on several cooperative devices…
▽ More
The exponential growth of devices and data at the edges of the Internet is rising scalability and privacy concerns on approaches based exclusively on remote cloud platforms. Data gravity, a fundamental concept in Fog Computing, points towards decentralisation of computation for data analysis, as a viable alternative to address those concerns. Decentralising AI tasks on several cooperative devices means identifying the optimal set of locations or Collection Points (CP for short) to use, in the continuum between full centralisation (i.e., all data on a single device) and full decentralisation (i.e., data on source locations). We propose an analytical framework able to find the optimal operating point in this continuum, linking the accuracy of the learning task with the corresponding network and computational cost for moving data and running the distributed training at the CPs. We show through simulations that the model accurately predicts the optimal trade-off, quite often an intermediate point between full centralisation and full decentralisation, showing also a significant cost saving w.r.t. both of them. Finally, the analytical model admits closed-form or numeric solutions, making it not only a performance evaluation instrument but also a design tool to configure a given distributed learning task optimally before its deployment.
△ Less
Submitted 2 August, 2021; v1 submitted 9 December, 2020;
originally announced December 2020.
-
Dynamic Hard Pruning of Neural Networks at the Edge of the Internet
Authors:
Lorenzo Valerio,
Franco Maria Nardini,
Andrea Passarella,
Raffaele Perego
Abstract:
Neural Networks (NN), although successfully applied to several Artificial Intelligence tasks, are often unnecessarily over-parametrised. In edge/fog computing, this might make their training prohibitive on resource-constrained devices, contrasting with the current trend of decentralising intelligence from remote data centres to local constrained devices. Therefore, we investigate the problem of tr…
▽ More
Neural Networks (NN), although successfully applied to several Artificial Intelligence tasks, are often unnecessarily over-parametrised. In edge/fog computing, this might make their training prohibitive on resource-constrained devices, contrasting with the current trend of decentralising intelligence from remote data centres to local constrained devices. Therefore, we investigate the problem of training effective NN models on constrained devices having a fixed, potentially small, memory budget. We target techniques that are both resource-efficient and performance effective while enabling significant network compression. Our Dynamic Hard Pruning (DynHP) technique incrementally prunes the network during training, identifying neurons that marginally contribute to the model accuracy. DynHP enables a tunable size reduction of the final neural network and reduces the NN memory occupancy during training. Freed memory is reused by a \emph{dynamic batch sizing} approach to counterbalance the accuracy degradation caused by the hard pruning strategy, improving its convergence and effectiveness. We assess the performance of DynHP through reproducible experiments on three public datasets, comparing them against reference competitors. Results show that DynHP compresses a NN up to $10$ times without significant performance drops (up to $3.5\%$ additional error w.r.t. the competitors), reducing up to $80\%$ the training memory occupancy.
△ Less
Submitted 22 October, 2021; v1 submitted 17 November, 2020;
originally announced November 2020.
-
The DUNE Far Detector Interim Design Report, Volume 3: Dual-Phase Module
Authors:
DUNE Collaboration,
B. Abi,
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
D. Adams,
P. Adamson,
M. Adinolfi,
Z. Ahmad,
C. H. Albright,
L. Aliaga Soplin,
T. Alion,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. P. Andrews,
R. A. Andrews,
A. Ankowski,
J. Anthony,
M. Antonello,
M. Antonova
, et al. (1076 additional authors not shown)
Abstract:
The DUNE IDR describes the proposed physics program and technical designs of the DUNE far detector modules in preparation for the full TDR to be published in 2019. It is intended as an intermediate milestone on the path to a full TDR, justifying the technical choices that flow down from the high-level physics goals through requirements at all levels of the Project. These design choices will enable…
▽ More
The DUNE IDR describes the proposed physics program and technical designs of the DUNE far detector modules in preparation for the full TDR to be published in 2019. It is intended as an intermediate milestone on the path to a full TDR, justifying the technical choices that flow down from the high-level physics goals through requirements at all levels of the Project. These design choices will enable the DUNE experiment to make the ground-breaking discoveries that will help to answer fundamental physics questions. Volume 3 describes the dual-phase module's subsystems, the technical coordination required for its design, construction, installation, and integration, and its organizational structure.
△ Less
Submitted 26 July, 2018;
originally announced July 2018.
-
The DUNE Far Detector Interim Design Report Volume 1: Physics, Technology and Strategies
Authors:
DUNE Collaboration,
B. Abi,
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
D. Adams,
P. Adamson,
M. Adinolfi,
Z. Ahmad,
C. H. Albright,
L. Aliaga Soplin,
T. Alion,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. P. Andrews,
R. A. Andrews,
A. Ankowski,
J. Anthony,
M. Antonello,
M. Antonova
, et al. (1076 additional authors not shown)
Abstract:
The DUNE IDR describes the proposed physics program and technical designs of the DUNE Far Detector modules in preparation for the full TDR to be published in 2019. It is intended as an intermediate milestone on the path to a full TDR, justifying the technical choices that flow down from the high-level physics goals through requirements at all levels of the Project. These design choices will enable…
▽ More
The DUNE IDR describes the proposed physics program and technical designs of the DUNE Far Detector modules in preparation for the full TDR to be published in 2019. It is intended as an intermediate milestone on the path to a full TDR, justifying the technical choices that flow down from the high-level physics goals through requirements at all levels of the Project. These design choices will enable the DUNE experiment to make the ground-breaking discoveries that will help to answer fundamental physics questions. Volume 1 contains an executive summary that describes the general aims of this document. The remainder of this first volume provides a more detailed description of the DUNE physics program that drives the choice of detector technologies. It also includes concise outlines of two overarching systems that have not yet evolved to consortium structures: computing and calibration. Volumes 2 and 3 of this IDR describe, for the single-phase and dual-phase technologies, respectively, each detector module's subsystems, the technical coordination required for its design, construction, installation, and integration, and its organizational structure.
△ Less
Submitted 26 July, 2018;
originally announced July 2018.
-
The DUNE Far Detector Interim Design Report, Volume 2: Single-Phase Module
Authors:
DUNE Collaboration,
B. Abi,
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
D. Adams,
P. Adamson,
M. Adinolfi,
Z. Ahmad,
C. H. Albright,
L. Aliaga Soplin,
T. Alion,
S. Alonso Monsalve,
M. Alrashed,
C. Alt,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. P. Andrews,
R. A. Andrews,
A. Ankowski,
J. Anthony,
M. Antonello,
M. Antonova
, et al. (1076 additional authors not shown)
Abstract:
The DUNE IDR describes the proposed physics program and technical designs of the DUNE far detector modules in preparation for the full TDR to be published in 2019. It is intended as an intermediate milestone on the path to a full TDR, justifying the technical choices that flow down from the high-level physics goals through requirements at all levels of the Project. These design choices will enable…
▽ More
The DUNE IDR describes the proposed physics program and technical designs of the DUNE far detector modules in preparation for the full TDR to be published in 2019. It is intended as an intermediate milestone on the path to a full TDR, justifying the technical choices that flow down from the high-level physics goals through requirements at all levels of the Project. These design choices will enable the DUNE experiment to make the ground-breaking discoveries that will help to answer fundamental physics questions. Volume 2 describes the single-phase module's subsystems, the technical coordination required for its design, construction, installation, and integration, and its organizational structure.
△ Less
Submitted 26 July, 2018;
originally announced July 2018.
-
The Single-Phase ProtoDUNE Technical Design Report
Authors:
B. Abi,
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
D. L. Adams,
P. Adamson,
M. Adinolfi,
Z. Ahmad,
C. H. Albright,
T. Alion,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. P. Andrews,
R. A. Andrews,
J. dos Anjos,
A. Ankowski,
J. Anthony,
M. Antonello,
A. Aranda Fernandez,
A. Ariga,
T. Ariga,
E. Arrieta Diaz,
J. Asaadi
, et al. (806 additional authors not shown)
Abstract:
ProtoDUNE-SP is the single-phase DUNE Far Detector prototype that is under construction and will be operated at the CERN Neutrino Platform (NP) starting in 2018. ProtoDUNE-SP, a crucial part of the DUNE effort towards the construction of the first DUNE 10-kt fiducial mass far detector module (17 kt total LAr mass), is a significant experiment in its own right. With a total liquid argon (LAr) mass…
▽ More
ProtoDUNE-SP is the single-phase DUNE Far Detector prototype that is under construction and will be operated at the CERN Neutrino Platform (NP) starting in 2018. ProtoDUNE-SP, a crucial part of the DUNE effort towards the construction of the first DUNE 10-kt fiducial mass far detector module (17 kt total LAr mass), is a significant experiment in its own right. With a total liquid argon (LAr) mass of 0.77 kt, it represents the largest monolithic single-phase LArTPC detector to be built to date. It's technical design is given in this report.
△ Less
Submitted 27 July, 2017; v1 submitted 21 June, 2017;
originally announced June 2017.
-
Design Of The LBNF Beamline
Authors:
V. Papadimitriou,
K. Ammigan,
J. Anderson Jr.,
K. E. Anderson,
R. Andrews,
V. Bocean,
C. F. Crowley,
N. Eddy,
B. D. Hartsell,
S. Hays,
P. Hurh,
J. Hylen,
J. A. Johnstone,
P. Kasper,
T. Kobilarcik,
G. E. Krafczyk,
B. Lundberg,
A. Marchionni,
N. V. Mokhov,
C. D. Moore,
D. Pushka,
I. Rakhno,
S. D. Reitzner,
P. Schlabach,
V. Sidorov
, et al. (9 additional authors not shown)
Abstract:
The Long Baseline Neutrino Facility (LBNF) will utilize a beamline located at Fermilab to provide and aim a neutrino beam of sufficient intensity and appropriate energy range toward the Deep Underground Neutrino Experiment (DUNE) detectors, placed deep underground at the SURF Facility in Lead, South Dakota. The primary proton beam (60-120 GeV) will be extracted from the MI-10 section of Fermilab's…
▽ More
The Long Baseline Neutrino Facility (LBNF) will utilize a beamline located at Fermilab to provide and aim a neutrino beam of sufficient intensity and appropriate energy range toward the Deep Underground Neutrino Experiment (DUNE) detectors, placed deep underground at the SURF Facility in Lead, South Dakota. The primary proton beam (60-120 GeV) will be extracted from the MI-10 section of Fermilab's Main Injector. Neutrinos will be produced when the protons interact with a solid target to produce mesons which will be subsequently focused by magnetic horns into a 194m long decay pipe where they decay into muons and neutrinos. The parameters of the facility were determined taking into account the physics goals, spatial and radiological constraints, and the experience gained by operating the NuMI facility at Fermilab. The Beamline facility is designed for initial operation at a proton-beam power of 1.2 MW, with the capability to support an upgrade to 2.4 MW. LBNF/DUNE obtained CD-1 approval in November 2015. We discuss here the design status and the associated challenges as well as the R&D and plans for improvements before baselining the facility.
△ Less
Submitted 14 April, 2017;
originally announced April 2017.
-
Long-Baseline Neutrino Facility (LBNF) and Deep Underground Neutrino Experiment (DUNE) Conceptual Design Report Volume 1: The LBNF and DUNE Projects
Authors:
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
P. Adamson,
S. Adhikari,
Z. Ahmad,
C. H. Albright,
T. Alion,
E. Amador,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. Andrews,
R. Andrews,
I. Anghel,
J. d. Anjos,
A. Ankowski,
M. Antonello,
A. ArandaFernandez,
A. Ariga,
T. Ariga,
D. Aristizabal,
E. Arrieta-Diaz,
K. Aryal
, et al. (780 additional authors not shown)
Abstract:
This document presents the Conceptual Design Report (CDR) put forward by an international neutrino community to pursue the Deep Underground Neutrino Experiment at the Long-Baseline Neutrino Facility (LBNF/DUNE), a groundbreaking science experiment for long-baseline neutrino oscillation studies and for neutrino astrophysics and nucleon decay searches. The DUNE far detector will be a very large modu…
▽ More
This document presents the Conceptual Design Report (CDR) put forward by an international neutrino community to pursue the Deep Underground Neutrino Experiment at the Long-Baseline Neutrino Facility (LBNF/DUNE), a groundbreaking science experiment for long-baseline neutrino oscillation studies and for neutrino astrophysics and nucleon decay searches. The DUNE far detector will be a very large modular liquid argon time-projection chamber (LArTPC) located deep underground, coupled to the LBNF multi-megawatt wide-band neutrino beam. DUNE will also have a high-resolution and high-precision near detector.
△ Less
Submitted 20 January, 2016;
originally announced January 2016.
-
First measurement of muon-neutrino disappearance in NOvA
Authors:
P. Adamson,
C. Ader,
M. Andrews,
N. Anfimov,
I. Anghel,
K. Arms,
E. Arrieta-Diaz,
A. Aurisano,
D. Ayres,
C. Backhouse,
M. Baird,
B. A. Bambah,
K. Bays,
R. Bernstein,
M. Betancourt,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
K. Biery,
T. Blackburn,
V. Bocean,
D. Bogert,
A. Bolshakova,
M. Bowden,
C. Bower
, et al. (235 additional authors not shown)
Abstract:
This paper reports the first measurement using the NOvA detectors of $ν_μ$ disappearance in a $ν_μ$ beam. The analysis uses a 14 kton-equivalent exposure of $2.74 \times 10^{20}$ protons-on-target from the Fermilab NuMI beam. Assuming the normal neutrino mass hierarchy, we measure $Δm^{2}_{32}=(2.52^{+0.20}_{-0.18})\times 10^{-3}$ eV$^{2}$ and $\sin^2θ_{23}$ in the range 0.38-0.65, both at the 68%…
▽ More
This paper reports the first measurement using the NOvA detectors of $ν_μ$ disappearance in a $ν_μ$ beam. The analysis uses a 14 kton-equivalent exposure of $2.74 \times 10^{20}$ protons-on-target from the Fermilab NuMI beam. Assuming the normal neutrino mass hierarchy, we measure $Δm^{2}_{32}=(2.52^{+0.20}_{-0.18})\times 10^{-3}$ eV$^{2}$ and $\sin^2θ_{23}$ in the range 0.38-0.65, both at the 68% confidence level, with two statistically-degenerate best fit points at $\sin^2θ_{23} = $ 0.43 and 0.60. Results for the inverted mass hierarchy are also presented.
△ Less
Submitted 20 January, 2016; v1 submitted 19 January, 2016;
originally announced January 2016.
-
First measurement of electron neutrino appearance in NOvA
Authors:
P. Adamson,
C. Ader,
M. Andrews,
N. Anfimov,
I. Anghel,
K. Arms,
E. Arrieta-Diaz,
A. Aurisano,
D. S. Ayres,
C. Backhouse,
M. Baird,
B. A. Bambah,
K. Bays,
R. Bernstein,
M. Betancourt,
V. Bhatnagar,
B. Bhuyan,
J. Bian,
K. Biery,
T. Blackburn,
V. Bocean,
D. Bogert,
A. Bolshakova,
M. Bowden,
C. Bower
, et al. (235 additional authors not shown)
Abstract:
We report results from the first search for $ν_μ\toν_e$ transitions by the NOvA experiment. In an exposure equivalent to $2.74\times10^{20}$ protons-on-target in the upgraded NuMI beam at Fermilab, we observe 6 events in the Far Detector, compared to a background expectation of $0.99\pm0.11$ (syst.) events based on the Near Detector measurement. A secondary analysis observes 11 events with a backg…
▽ More
We report results from the first search for $ν_μ\toν_e$ transitions by the NOvA experiment. In an exposure equivalent to $2.74\times10^{20}$ protons-on-target in the upgraded NuMI beam at Fermilab, we observe 6 events in the Far Detector, compared to a background expectation of $0.99\pm0.11$ (syst.) events based on the Near Detector measurement. A secondary analysis observes 11 events with a background of $1.07\pm0.14$ (syst.). The $3.3σ$ excess of events observed in the primary analysis disfavors $0.1π< δ_{CP} < 0.5π$ in the inverted mass hierarchy at the 90% C.L.
△ Less
Submitted 2 May, 2016; v1 submitted 19 January, 2016;
originally announced January 2016.
-
Long-Baseline Neutrino Facility (LBNF) and Deep Underground Neutrino Experiment (DUNE) Conceptual Design Report, Volume 4 The DUNE Detectors at LBNF
Authors:
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
P. Adamson,
S. Adhikari,
Z. Ahmad,
C. H. Albright,
T. Alion,
E. Amador,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. Andrews,
R. Andrews,
I. Anghel,
J. d. Anjos,
A. Ankowski,
M. Antonello,
A. ArandaFernandez,
A. Ariga,
T. Ariga,
D. Aristizabal,
E. Arrieta-Diaz,
K. Aryal
, et al. (779 additional authors not shown)
Abstract:
A description of the proposed detector(s) for DUNE at LBNF
A description of the proposed detector(s) for DUNE at LBNF
△ Less
Submitted 12 January, 2016;
originally announced January 2016.
-
Long-Baseline Neutrino Facility (LBNF) and Deep Underground Neutrino Experiment (DUNE) Conceptual Design Report Volume 2: The Physics Program for DUNE at LBNF
Authors:
DUNE Collaboration,
R. Acciarri,
M. A. Acero,
M. Adamowski,
C. Adams,
P. Adamson,
S. Adhikari,
Z. Ahmad,
C. H. Albright,
T. Alion,
E. Amador,
J. Anderson,
K. Anderson,
C. Andreopoulos,
M. Andrews,
R. Andrews,
I. Anghel,
J. d. Anjos,
A. Ankowski,
M. Antonello,
A. ArandaFernandez,
A. Ariga,
T. Ariga,
D. Aristizabal,
E. Arrieta-Diaz
, et al. (780 additional authors not shown)
Abstract:
The Physics Program for the Deep Underground Neutrino Experiment (DUNE) at the Fermilab Long-Baseline Neutrino Facility (LBNF) is described.
The Physics Program for the Deep Underground Neutrino Experiment (DUNE) at the Fermilab Long-Baseline Neutrino Facility (LBNF) is described.
△ Less
Submitted 22 January, 2016; v1 submitted 18 December, 2015;
originally announced December 2015.
-
Application of a cognitive-inspired algorithm for detecting communities in mobility networks
Authors:
Emanuele Massaro,
Lorenzo Valerio,
Andrea Guazzini,
Andrea Passarella,
Franco Bagnoli
Abstract:
The emergence and the global adaptation of mobile devices has influenced human interactions at the individual, community, and social levels leading to the so called Cyber-Physical World (CPW) convergence scenario [1]. One of the most important features of CPW is the possibility of exploiting information about the structure of the social communities of users, revealed by joint movement patterns and…
▽ More
The emergence and the global adaptation of mobile devices has influenced human interactions at the individual, community, and social levels leading to the so called Cyber-Physical World (CPW) convergence scenario [1]. One of the most important features of CPW is the possibility of exploiting information about the structure of the social communities of users, revealed by joint movement patterns and frequency of physical co-location. Mobile devices of users that belong to the same social community are likely to "see" each other (and thus be able to communicate through ad-hoc networking techniques) more frequently and regularly than devices outside the community. In mobile opportunistic networks, this fact can be exploited, for example, to optimize networking operations such as forwarding and dissemination of messages. In this paper we present the application of a cognitive-inspired algorithm [2,3,4] for revealing the structure of these dynamic social networks (simulated by the HCMM model [5]) using information about physical encounters logged by the users' mobile devices. The main features of our algorithm are: (i) the capacity of detecting social communities induced by physical co-location of users through distributed algorithms; (ii) the capacity to detect users belonging to more communities (thus acting as bridges across them), and (iii) the capacity to detect the time evolution of communities.
△ Less
Submitted 6 August, 2013; v1 submitted 11 July, 2013;
originally announced July 2013.
-
Operational aspects of the Main Injector large aperture quadrupole (WQB)
Authors:
W. Chou,
L. Bartelson,
B. Brown,
D. Capista,
J. Crisp,
J. DiMarco,
J. Fitzgerald,
H. Glass,
D. Harding,
D. Johnson,
V. Kashikhin,
I. Kourbanis,
P. Prieto,
W. Robotham,
T. Sager,
M. Tartaglia,
L. Valerio,
R. Webber,
M. Wendt,
D. Wolff,
M. Yang
Abstract:
A two-year Large Aperture Quadrupole (WQB) Project was completed in the summer of 2006 at Fermilab. Nine WQBs were designed, fabricated and bench-tested by the Technical Division. Seven of them were installed in the Main Injector and the other two for spares. They perform well. The aperture increase meets the design goal and the perturbation to the lattice is minimal. The machine acceptance in t…
▽ More
A two-year Large Aperture Quadrupole (WQB) Project was completed in the summer of 2006 at Fermilab. Nine WQBs were designed, fabricated and bench-tested by the Technical Division. Seven of them were installed in the Main Injector and the other two for spares. They perform well. The aperture increase meets the design goal and the perturbation to the lattice is minimal. The machine acceptance in the injection and extraction regions is increased from 40pi to 60pi mm-mrad. This paper gives a brief report of the operation and performance of these magnets.
△ Less
Submitted 26 February, 2008;
originally announced February 2008.