-
The Online Observation Quality System Implementation for the ASTRI Mini-Array Project
Authors:
L. Castaldini,
N. Parmiggiani,
A. Bulgarelli,
L. Baroncelli,
V. Fioretti,
A. Di Piano,
I. Abu,
M. Capalbi,
O. Catalano,
V. Conforti,
M. Fiori,
F. Gianotti,
F. Lucarelli,
M. C. Maccarone,
T. Mineo,
S. Lombardi,
V. Pastore,
F. Russo,
P. Sangiorgi,
S. Scuderi,
G. Tosti,
M. Trifoglio,
L. Zampieri
Abstract:
The ASTRI Mini-Array project, led by the Italian National Institute for Astrophysics, aims to construct and operate nine Imaging Atmospheric Cherenkov Telescopes for high-energy gamma-ray source study and stellar intensity interferometry. Located at the Teide Astronomical Observatory in Tenerife, the project's software is essential for remote operation, emphasizing the need for prompt feedback on observations. This contribution introduces the Online Observation Quality System (OOQS) as part of the Supervisory Control And Data Acquisition (SCADA) software. OOQS performs real-time data quality checks on data from Cherenkov cameras and Intensity Interferometry instruments. It provides feedback to SCADA and operators, highlighting abnormal conditions and ensuring quick corrective actions for optimal observations. Results are archived for operator visualization and further analysis. The OOQS data quality pipeline prototype utilizes a distributed application with three main components to handle the maximum array data rate of 1.15 Gb/s. The first is a Kafka consumer that manages the data stream from the Array Data Acquisition System through Apache Kafka, handling the data serialization and deserialization involved in the transmission. The data stream is divided into batches of data written in files. The second component monitors new files and conducts analyses using the Slurm workload scheduler, leveraging its parallel processing capabilities and scalability. Finally, the process results are collected by the last component and stored in the Quality Archive.
Submitted 2 April, 2024;
originally announced April 2024.
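The abstract above says the first OOQS component divides the incoming data stream into batches written to files. A minimal sketch of that batching step follows, assuming the records are already available as an iterable of byte strings; in the real system they would arrive through a Kafka consumer. The function names, the `batch_size` parameter, and the `batch_00000.bin` file naming are hypothetical and not taken from the paper.

```python
from itertools import islice
from pathlib import Path


def iter_batches(records, batch_size):
    """Yield consecutive lists of at most batch_size records."""
    iterator = iter(records)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def write_batches(records, out_dir, batch_size):
    """Write each batch to its own file and return the file paths in order.

    Hypothetical layout: one binary file per batch, records concatenated.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    paths = []
    for index, batch in enumerate(iter_batches(records, batch_size)):
        path = out_dir / f"batch_{index:05d}.bin"
        path.write_bytes(b"".join(batch))
        paths.append(path)
    return paths
```

A downstream component could then watch `out_dir` for new files and submit one analysis job per file, which is the role the abstract assigns to the Slurm-based second component.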
-
Is in-domain data beneficial in transfer learning for landmarks detection in x-ray images?
Authors:
Roberto Di Via,
Matteo Santacesaria,
Francesca Odone,
Vito Paolo Pastore
Abstract:
In recent years, deep learning has emerged as a promising technique for medical image analysis. However, this application domain is likely to suffer from a limited availability of large public datasets and annotations. A common solution to these challenges in deep learning is the usage of a transfer learning framework, typically with a fine-tuning protocol, where a large-scale source dataset is used to pre-train a model, further fine-tuned on the target dataset. In this paper, we present a systematic study analyzing whether the usage of small-scale in-domain x-ray image datasets may provide any improvement for landmark detection over models pre-trained on large natural image datasets only. We focus on the multi-landmark localization task for three datasets, including chest, head, and hand x-ray images. Our results show that using in-domain source datasets brings marginal or no benefit with respect to an ImageNet out-of-domain pre-training. Our findings can provide an indication for the development of robust landmark detection systems in medical images when no large annotated dataset is available.
Submitted 3 March, 2024;
originally announced March 2024.
-
The Online Observation Quality System Software Architecture for the ASTRI Mini-Array Project
Authors:
N. Parmiggiani,
A. Bulgarelli,
L. Baroncelli,
A. Addis,
V. Fioretti,
A. Di Piano,
M. Capalbi,
O. Catalano,
V. Conforti,
M. Fiori,
F. Gianotti,
S. Iovenitti,
F. Lucarelli,
M. C. Maccarone,
T. Mineo,
S. Lombardi,
V. Pastore,
F. Russo,
P. Sangiorgi,
S. Scuderi,
G. Tosti,
M. Trifoglio,
L. Zampieri,
the ASTRI Project
Abstract:
The ASTRI Mini-Array is an international collaboration led by the Italian National Institute for Astrophysics. This project aims to construct and operate an array of nine Imaging Atmospheric Cherenkov Telescopes to study gamma-ray sources at very high energy (TeV) and perform stellar intensity interferometry observations. We describe the software architecture and the technologies used to implement the Online Observation Quality System (OOQS) for the ASTRI Mini-Array project. The OOQS aims to execute data quality checks on the data acquired in real-time by the Cherenkov cameras and intensity interferometry instruments, and provides feedback to both the Central Control System and the Operator about abnormal conditions detected. The OOQS can notify other sub-systems, triggering their reaction to promptly correct anomalies. The results from the data quality analyses (e.g. camera plots, histograms, tables, and more) are stored in the Quality Archive for further investigation and they are summarised in reports available to the Operator. Once the OOQS results are stored, the operator can visualize them using the Human Machine Interface. The OOQS is designed to manage the high data rate generated by the instruments (up to 4.5 GB/s) and received from the Array Data Acquisition System through the Kafka service. The data are serialized and deserialized during the transmission using the Avro framework. The Slurm workload scheduler executes the analyses exploiting key features such as parallel analyses and scalability.
Submitted 27 February, 2023;
originally announced February 2023.
-
Top-Tuning: a study on transfer learning for an efficient alternative to fine tuning for image classification with fast kernel methods
Authors:
Paolo Didier Alfano,
Vito Paolo Pastore,
Lorenzo Rosasco,
Francesca Odone
Abstract:
The impressive performance of deep learning architectures is associated with a massive increase in model complexity. Millions of parameters need to be tuned, with training and inference time scaling accordingly, together with energy consumption. But is massive fine-tuning always necessary? In this paper, focusing on image classification, we consider a simple transfer learning approach exploiting pre-trained convolutional features as input for a fast-to-train kernel method. We refer to this approach as top-tuning since only the kernel classifier is trained on the target dataset. In our study, we perform more than 3000 training processes focusing on 32 small to medium-sized target datasets, a typical situation where transfer learning is necessary. We show that the top-tuning approach provides accuracy comparable to fine-tuning, with a training time between one and two orders of magnitude smaller. These results suggest that top-tuning is an effective alternative to fine-tuning on small and medium-sized datasets, being especially useful when training time efficiency and savings in computational resources are crucial.
Submitted 9 November, 2023; v1 submitted 16 September, 2022;
originally announced September 2022.
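The top-tuning idea described in the abstract above (keep the pre-trained feature extractor frozen and train only a kernel classifier on its outputs) can be illustrated with a small sketch. Everything here is an assumption for illustration: the feature vectors are hand-written stand-ins for pre-trained convolutional features, and plain Gaussian-kernel ridge regression replaces the fast kernel method used in the paper. All function names and parameters are hypothetical.

```python
import math


def gaussian_kernel(x, y, sigma=1.0):
    """Gaussian (RBF) kernel between two feature vectors."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-squared_distance / (2 * sigma ** 2))


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(matrix)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def fit_kernel_ridge(features, labels, lam=1e-3, sigma=1.0):
    """Train only the kernel classifier: solve (K + lam * I) alpha = labels."""
    n = len(features)
    gram = [
        [gaussian_kernel(features[i], features[j], sigma) + (lam if i == j else 0.0)
         for j in range(n)]
        for i in range(n)
    ]
    return solve(gram, labels)


def predict(alpha, train_features, x, sigma=1.0):
    """Binary prediction (+1 or -1) from the sign of the kernel expansion."""
    score = sum(a * gaussian_kernel(f, x, sigma) for a, f in zip(alpha, train_features))
    return 1 if score >= 0 else -1
```

Because the feature extractor is never updated, the only training cost in this sketch is one linear solve over the target dataset, which is the source of the training-time saving the abstract reports for the approach.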
-
Efficient Unsupervised Learning for Plankton Images
Authors:
Paolo Didier Alfano,
Marco Rando,
Marco Letizia,
Francesca Odone,
Lorenzo Rosasco,
Vito Paolo Pastore
Abstract:
Monitoring plankton populations in situ is fundamental to preserving the aquatic ecosystem. Plankton microorganisms are in fact susceptible to minor environmental perturbations, which can be reflected in morphological and dynamical modifications. Nowadays, the availability of advanced automatic or semi-automatic acquisition systems allows the production of an increasingly large amount of plankton image data. The adoption of machine learning algorithms to classify such data may be affected by the significant cost of manual annotation, due to both the huge quantity of acquired data and the large number of plankton species. To address these challenges, we propose an efficient unsupervised learning pipeline to provide accurate classification of plankton microorganisms. We build a set of image descriptors exploiting a two-step procedure. First, a Variational Autoencoder (VAE) is trained on features extracted by a pre-trained neural network. We then use the learnt latent space as an image descriptor for clustering. We compare our method with state-of-the-art unsupervised approaches, where a set of pre-defined hand-crafted features is used for clustering of plankton images. The proposed pipeline outperforms the benchmark algorithms for all the plankton datasets included in our analysis, providing better image embedding properties.
Submitted 14 September, 2022;
originally announced September 2022.
-
The Monitoring Logging and Alarm System of the ASTRI Mini-Array gamma-ray air-Cherenkov experiment at the Observatorio del Teide
Authors:
Federico Incardona,
Alessandro Costa,
Kevin Munari,
Salvatore Gambadoro,
Stefano Germani,
Pietro Bruno,
Andrea Bulgarelli,
Vito Conforti,
Fulvio Gianotti,
Alessandro Grillo,
Valerio Pastore,
Federico Russo,
Joseph Schwarz,
Gino Tosti,
Salvatore Cavalieri,
for the ASTRI Project
Abstract:
The ASTRI Mini-Array is a project for Cherenkov astronomy in the TeV energy range. The ASTRI Mini-Array consists of nine Imaging Atmospheric Cherenkov telescopes located at the Teide Observatory (Canary Islands). Large volumes of monitoring and logging data result from the operation of a large-scale astrophysical observatory. In the last few years, several "Big Data" technologies have been developed to deal with such volumes of data, especially in the Internet of Things (IoT) framework. We present the Monitoring, Logging, and Alarm (MLA) system for the ASTRI Mini-Array aimed at supporting the analysis of scientific data and improving the operational activities of the telescope facility. The MLA system was designed and built considering the latest software tools and concepts coming from Big Data and IoT to respond to the challenges posed by the operation of the array. Particular attention has been given to satisfying the reliability, availability, and maintainability requirements for all the array sub-systems and auxiliary devices. The system architecture has been designed to scale up with the number of devices to be monitored and with the number of software components to be considered in the distributed logging system.
Submitted 13 July, 2022;
originally announced July 2022.
-
Embedded System to Detect, Track and Classify Plankton Using a Lensless Video Microscope
Authors:
Thomas G. Zimmerman,
Vito P. Pastore,
Sujoy K. Biswas,
Simone Bianco
Abstract:
Plankton provide the foundation for life on Earth. To advance our understanding of the marine ecosystem, for scientific, commercial and survival purposes, more in situ continuous monitoring and analysis of plankton is required. Cost, complexity, power and data communication demands are barriers to widespread deployment of in situ plankton microscopes. We address these barriers by building and characterizing a lensless microscope with a data pipeline optimized for the Raspberry Pi 3. The pipeline records 1080p video of multiple plankton swimming in a sample well while simultaneously detecting, tracking and selecting salient cropped images for classification at 5.1 frames per second. Thirteen machine learning classifiers and combinations of nine sets of features are evaluated on nine plankton classes, optimized for speed (F1 = 0.74 at 1 ms per image prediction) and accuracy (F1 = 0.81 at 0.80 s). System performance results confirm that performing the entire data pipeline from image capture to classification is possible on a low-cost open-source embedded computer.
Submitted 26 May, 2020;
originally announced May 2020.