-
Universal energy and magnetisation distributions in the Blume-Capel and Baxter-Wu models
Authors:
A. R. S. Macedo,
J. A. Plascak,
A. Vasilopoulos,
N. G. Fytas,
M. Akritidis,
M. Weigel
Abstract:
We analyse the probability distribution functions of the energy and magnetisation of the two-dimensional Blume-Capel and Baxter-Wu models with spin values $S \in \{1/2,1, 3/2\}$ in the presence of a crystal field $Δ$. By employing extensive single-spin flip Monte Carlo simulations and a recently developed method of studying the zeros of the energy probability distribution we are able to probe, wit…
▽ More
We analyse the probability distribution functions of the energy and magnetisation of the two-dimensional Blume-Capel and Baxter-Wu models with spin values $S \in \{1/2,1, 3/2\}$ in the presence of a crystal field $Δ$. By employing extensive single-spin flip Monte Carlo simulations and a recently developed method of studying the zeros of the energy probability distribution we are able to probe, with a good numerical accuracy, several critical characteristics of the transitions. Additionally, the universal aspects of these transitions are scrutinised by computing the corresponding probability distribution functions. The energy distribution has been underutilised in the literature when compared to that of the magnetisation. Somewhat surprisingly, however, the former appears to be more robust in characterising the universality class for both models upon varying the crystal field $Δ$ than the latter. Finally, our analysis suggests that in contrast to the Blume-Capel ferromagnet, the Baxter-Wu model appears to suffer from strong finite-size effects, especially upon increasing $Δ$ and $S$, that obscure the application of traditional finite-size scaling approaches.
△ Less
Submitted 7 July, 2024;
originally announced July 2024.
-
Learning-to-learn enables rapid learning with phase-change memory-based in-memory computing
Authors:
Thomas Ortner,
Horst Petschenig,
Athanasios Vasilopoulos,
Roland Renner,
Špela Brglez,
Thomas Limbacher,
Enrique Piñero,
Alejandro Linares Barranco,
Angeliki Pantazi,
Robert Legenstein
Abstract:
There is a growing demand for low-power, autonomously learning artificial intelligence (AI) systems that can be applied at the edge and rapidly adapt to the specific situation at deployment site. However, current AI models struggle in such scenarios, often requiring extensive fine-tuning, computational resources, and data. In contrast, humans can effortlessly adjust to new tasks by transferring kn…
▽ More
There is a growing demand for low-power, autonomously learning artificial intelligence (AI) systems that can be applied at the edge and rapidly adapt to the specific situation at deployment site. However, current AI models struggle in such scenarios, often requiring extensive fine-tuning, computational resources, and data. In contrast, humans can effortlessly adjust to new tasks by transferring knowledge from related ones. The concept of learning-to-learn (L2L) mimics this process and enables AI models to rapidly adapt with only little computational effort and data. In-memory computing neuromorphic hardware (NMHW) is inspired by the brain's operating principles and mimics its physical co-location of memory and compute. In this work, we pair L2L with in-memory computing NMHW based on phase-change memory devices to build efficient AI models that can rapidly adapt to new tasks. We demonstrate the versatility of our approach in two scenarios: a convolutional neural network performing image classification and a biologically-inspired spiking neural network generating motor commands for a real robotic arm. Both models rapidly learn with few parameter updates. Deployed on the NMHW, they perform on-par with their software equivalents. Moreover, meta-training of these models can be performed in software with high-precision, alleviating the need for accurate hardware models.
△ Less
Submitted 22 April, 2024;
originally announced May 2024.
-
A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing
Authors:
Elena Ferro,
Athanasios Vasilopoulos,
Corey Lammie,
Manuel Le Gallo,
Luca Benini,
Irem Boybat,
Abu Sebastian
Abstract:
Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with circuit mismatches and non-idealities associated with the memory devices. Efficient near-memory digital logic is critical to retain the high area/energy efficiency and low latency of AIMC. Existing syst…
▽ More
Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with circuit mismatches and non-idealities associated with the memory devices. Efficient near-memory digital logic is critical to retain the high area/energy efficiency and low latency of AIMC. Existing systems adopt Floating Point 16 (FP16) arithmetic with limited parallelization capability and high latency. To overcome these limitations, we propose a Near-Memory digital Processing Unit (NMPU) based on fixed-point arithmetic. It achieves competitive accuracy and higher computing throughput than previous approaches while minimizing the area overhead. Moreover, the NMPU supports standard DL activation steps, such as ReLU and Batch Normalization. We perform a physical implementation of the NMPU design in a 14 nm CMOS technology and provide detailed performance, power, and area assessments. We validate the efficacy of the NMPU by using data from an AIMC chip and demonstrate that a simulated AIMC system with the proposed NMPU outperforms existing FP16-based implementations, providing 139$\times$ speed-up, 7.8$\times$ smaller area, and a competitive power consumption. Additionally, our approach achieves an inference accuracy of 86.65 %/65.06 %, with an accuracy drop of just 0.12 %/0.4 % compared to the FP16 baseline when benchmarked with ResNet9/ResNet32 networks trained on the CIFAR10/CIFAR100 datasets, respectively.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Improving the Accuracy of Analog-Based In-Memory Computing Accelerators Post-Training
Authors:
Corey Lammie,
Athanasios Vasilopoulos,
Julian Büchel,
Giacomo Camposampiero,
Manuel Le Gallo,
Malte Rasch,
Abu Sebastian
Abstract:
Analog-Based In-Memory Computing (AIMC) inference accelerators can be used to efficiently execute Deep Neural Network (DNN) inference workloads. However, to mitigate accuracy losses, due to circuit and device non-idealities, Hardware-Aware (HWA) training methodologies must be employed. These typically require significant information about the underlying hardware. In this paper, we propose two Post…
▽ More
Analog-Based In-Memory Computing (AIMC) inference accelerators can be used to efficiently execute Deep Neural Network (DNN) inference workloads. However, to mitigate accuracy losses, due to circuit and device non-idealities, Hardware-Aware (HWA) training methodologies must be employed. These typically require significant information about the underlying hardware. In this paper, we propose two Post-Training (PT) optimization methods to improve accuracy after training is performed. For each crossbar, the first optimizes the conductance range of each column, and the second optimizes the input, i.e, Digital-to-Analog Converter (DAC), range. It is demonstrated that, when these methods are employed, the complexity during training, and the amount of information about the underlying hardware can be reduced, with no notable change in accuracy ($\leq$0.1%) when finetuning the pretrained RoBERTa transformer model for all General Language Understanding Evaluation (GLUE) benchmark tasks. Additionally, it is demonstrated that further optimizing learned parameters PT improves accuracy.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
Gradient descent-based programming of analog in-memory computing cores
Authors:
Julian Büchel,
Athanasios Vasilopoulos,
Benedikt Kersting,
Frederic Odermatt,
Kevin Brew,
Injo Ok,
Sam Choi,
Iqbal Saraf,
Victor Chan,
Timothy Philip,
Nicole Saulnier,
Vijay Narayanan,
Manuel Le Gallo,
Abu Sebastian
Abstract:
The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell…
▽ More
The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell by unit-cell iterative programming. It also eliminates the need for high-resolution analog-to-digital converters (ADCs) to read the small unit-cell conductance during programming. Our method improves the experimental inference accuracy of ResNet-9 implemented on two phase-change memory (PCM)-based AIMC cores by 1.26%.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
Two-dimensional dilute Baxter-Wu model: Transition order and universality
Authors:
A. R. S. Macedo,
A. Vasilopoulos,
M. Akritidis,
J. A. Plascak,
N. G. Fytas,
M. Weigel
Abstract:
We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal-field coupling $Δ$ with the goal of determining the universality class of transitions along the second-order part of the transition line as one approaches the putative location of the multicritical point. We employ extensive Monte Carlo simulations using two different methodologies: (i…
▽ More
We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal-field coupling $Δ$ with the goal of determining the universality class of transitions along the second-order part of the transition line as one approaches the putative location of the multicritical point. We employ extensive Monte Carlo simulations using two different methodologies: (i) a study of the zeros of the energy probability distribution, closely related to the Fisher zeros of the partition function, and (ii) the well-established multicanonical approach employed to study the probability distribution of the crystal-field energy. A detailed finite-size scaling analysis in the regime of second-order phase transitions in the $(Δ, T)$ phase diagram supports previous claims that the transition belongs to the universality class of the $4$-state Potts model. For positive values of $Δ$, we observe the presence of strong finite-size effects, indicative of crossover effects due to the proximity of the first-order part of the transition line. Finally, we demonstrate how a combination of cluster and heat-bath updates allows one to equilibrate larger systems, and we demonstrate the potential of this approach for resolving the ambiguities observed in the regime of $Δ\gtrsim 0$.
△ Less
Submitted 7 August, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Predicting Survival of Tongue Cancer Patients by Machine Learning Models
Authors:
Angelos Vasilopoulos,
Nan Miles Xi
Abstract:
Tongue cancer is a common oral cavity malignancy that originates in the mouth and throat. Much effort has been invested in improving its diagnosis, treatment, and management. Surgical removal, chemotherapy, and radiation therapy remain the major treatment for tongue cancer. The survival of patients determines the treatment effect. Previous studies have identified certain survival and risk factors…
▽ More
Tongue cancer is a common oral cavity malignancy that originates in the mouth and throat. Much effort has been invested in improving its diagnosis, treatment, and management. Surgical removal, chemotherapy, and radiation therapy remain the major treatment for tongue cancer. The survival of patients determines the treatment effect. Previous studies have identified certain survival and risk factors based on descriptive statistics, ignoring the complex, nonlinear relationship among clinical and demographic variables. In this study, we utilize five cutting-edge machine learning models and clinical data to predict the survival of tongue cancer patients after treatment. Five-fold cross-validation, bootstrap analysis, and permutation feature importance are applied to estimate and interpret model performance. The prognostic factors identified by our method are consistent with previous clinical studies. Our method is accurate, interpretable, and thus useable as additional evidence in tongue cancer treatment and management.
△ Less
Submitted 22 December, 2022;
originally announced December 2022.
-
A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference
Authors:
Manuel Le Gallo,
Riduan Khaddam-Aljameh,
Milos Stanisavljevic,
Athanasios Vasilopoulos,
Benedikt Kersting,
Martino Dazzi,
Geethan Karunaratne,
Matthias Braendli,
Abhairaj Singh,
Silvia M. Mueller,
Julian Buechel,
Xavier Timoneda,
Vinay Joshi,
Urs Egger,
Angelo Garofalo,
Anastasios Petropoulos,
Theodore Antonakopoulos,
Kevin Brew,
Samuel Choi,
Injo Ok,
Timothy Philip,
Victor Chan,
Claire Silvestre,
Ishtiaq Ahsan,
Nicole Saulnier
, et al. (4 additional authors not shown)
Abstract:
The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly withi…
▽ More
The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly within the network weights stored on a chip to execute an inference workload. However, to achieve end-to-end improvements in latency and energy consumption, AIMC must be combined with on-chip digital operations and communication to move towards configurations in which a full inference workload is realized entirely on-chip. Moreover, it is highly desirable to achieve high MVM and inference accuracy without application-wise re-tuning of the chip. Here, we present a multi-core AIMC chip designed and fabricated in 14-nm complementary metal-oxide-semiconductor (CMOS) technology with backend-integrated phase-change memory (PCM). The fully-integrated chip features 64 256x256 AIMC cores interconnected via an on-chip communication network. It also implements the digital activation functions and processing involved in ResNet convolutional neural networks and long short-term memory (LSTM) networks. We demonstrate near software-equivalent inference accuracy with ResNet and LSTM networks while implementing all the computations associated with the weight layers and the activation functions on-chip. The chip can achieve a maximal throughput of 63.1 TOPS at an energy efficiency of 9.76 TOPS/W for 8-bit input/output matrix-vector multiplications.
△ Less
Submitted 6 December, 2022;
originally announced December 2022.
-
Tuning hyperparameters of doublet-detection methods for single-cell RNA sequencing data
Authors:
Nan Miles Xi,
Angelos Vasilopoulos
Abstract:
The existence of doublets in single-cell RNA sequencing (scRNA-seq) data poses a great challenge in downstream data analysis. Computational doublet-detection methods have been developed to remove doublets from scRNA-seq data. Yet, the default hyperparameter settings of those methods may not provide optimal performance. Here, we propose a strategy to tune hyperparameters for a cutting-edge doublet-…
▽ More
The existence of doublets in single-cell RNA sequencing (scRNA-seq) data poses a great challenge in downstream data analysis. Computational doublet-detection methods have been developed to remove doublets from scRNA-seq data. Yet, the default hyperparameter settings of those methods may not provide optimal performance. Here, we propose a strategy to tune hyperparameters for a cutting-edge doublet-detection method. We utilize a full factorial design to explore the relationship between hyperparameters and detection accuracy on 16 real scRNA-seq datasets. The optimal hyperparameters are obtained by a response surface model and convex optimization. We show that the optimal hyperparameters provide top performance across scRNA-seq datasets under various biological conditions. Our tuning strategy can be applied to other computational doublet-detection methods. It also offers insights into hyperparameter tuning for broader computational methods in scRNA-seq data analysis.
△ Less
Submitted 5 February, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
Universality in the two-dimensional dilute Baxter-Wu model
Authors:
Alexandros Vasilopoulos,
Nikolaos G. Fytas,
Erol Vatansever,
Anastasios Malakis,
Martin Weigel
Abstract:
We study the question of universality in the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal field $Δ$. We employ extensive numerical simulations of two types, providing us with complementary results: Wang-Landau sampling at fixed values of $Δ$ and a parallelized variant of the multicanonical approach performed at constant temperature $T$. A detailed finite-size scaling analy…
▽ More
We study the question of universality in the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal field $Δ$. We employ extensive numerical simulations of two types, providing us with complementary results: Wang-Landau sampling at fixed values of $Δ$ and a parallelized variant of the multicanonical approach performed at constant temperature $T$. A detailed finite-size scaling analysis in the regime of second-order phase transitions in the $(Δ, T)$ phase diagram indicates that the transition belongs to the universality class of the $4$-state Potts model. Previous controversies with respect to the nature of the transition are discussed and possibly attributed to the presence of strong finite-size effects, especially as one approaches the pentacritical point of the model.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
Multicanonical simulations of the 2D spin-$1$ Baxter-Wu model in a crystal field
Authors:
Nikolaos G. Fytas,
Alexandros Vasilopoulos,
Erol Vatansever,
Anastasios Malakis,
Martin Weigel
Abstract:
We investigate aspects of universality in the two-dimensional (2D) spin-$1$ Baxter-Wu model in a crystal field $Δ$ using a parallel version of the multicanonical algorithm employed at constant temperature $T$. A detailed finite-size scaling analysis in the continuous regime of the $Δ-T$ phase diagram of the model indicates that the transition belongs to the universality class of the $4$-state Pott…
▽ More
We investigate aspects of universality in the two-dimensional (2D) spin-$1$ Baxter-Wu model in a crystal field $Δ$ using a parallel version of the multicanonical algorithm employed at constant temperature $T$. A detailed finite-size scaling analysis in the continuous regime of the $Δ-T$ phase diagram of the model indicates that the transition belongs to the universality class of the $4$-state Potts model. The presence of first-order-like finite-size effects that become more pronounced as one approaches the pentacritical point of the model is highlighted and discussed.
△ Less
Submitted 31 March, 2022;
originally announced April 2022.
-
Monte Carlo study of the two-dimensional kinetic Blume-Capel model in a quenched random crystal field
Authors:
Alexandros Vasilopoulos,
Zeynep Demir Vatansever,
Erol Vatansever,
Nikolaos G. Fytas
Abstract:
We investigate by means of Monte Carlo simulations the dynamic phase transition of the two-dimensional kinetic Blume-Capel model under a periodically oscillating magnetic field in the presence of a quenched random crystal-field coupling. We analyze the universality principles of this dynamic transition for various values of the crystal-field coupling at the originally second-order regime of the co…
▽ More
We investigate by means of Monte Carlo simulations the dynamic phase transition of the two-dimensional kinetic Blume-Capel model under a periodically oscillating magnetic field in the presence of a quenched random crystal-field coupling. We analyze the universality principles of this dynamic transition for various values of the crystal-field coupling at the originally second-order regime of the corresponding equilibrium phase diagram of the model. A detailed finite-size scaling analysis indicates that the observed nonequilibrium phase transition belongs to the universality class of the equilibrium Ising ferromagnet with additional logarithmic corrections in the scaling behavior of the heat capacity. Our results are in agreement with earlier works on kinetic Ising models.
△ Less
Submitted 19 July, 2021;
originally announced July 2021.