Search | arXiv e-print repository

Universal energy and magnetisation distributions in the Blume-Capel and Baxter-Wu models

Authors: A. R. S. Macedo, J. A. Plascak, A. Vasilopoulos, N. G. Fytas, M. Akritidis, M. Weigel

Abstract: We analyse the probability distribution functions of the energy and magnetisation of the two-dimensional Blume-Capel and Baxter-Wu models with spin values $S \in \{1/2,1, 3/2\}$ in the presence of a crystal field $Δ$. By employing extensive single-spin flip Monte Carlo simulations and a recently developed method of studying the zeros of the energy probability distribution we are able to probe, wit… ▽ More We analyse the probability distribution functions of the energy and magnetisation of the two-dimensional Blume-Capel and Baxter-Wu models with spin values $S \in \{1/2,1, 3/2\}$ in the presence of a crystal field $Δ$. By employing extensive single-spin flip Monte Carlo simulations and a recently developed method of studying the zeros of the energy probability distribution we are able to probe, with a good numerical accuracy, several critical characteristics of the transitions. Additionally, the universal aspects of these transitions are scrutinised by computing the corresponding probability distribution functions. The energy distribution has been underutilised in the literature when compared to that of the magnetisation. Somewhat surprisingly, however, the former appears to be more robust in characterising the universality class for both models upon varying the crystal field $Δ$ than the latter. Finally, our analysis suggests that in contrast to the Blume-Capel ferromagnet, the Baxter-Wu model appears to suffer from strong finite-size effects, especially upon increasing $Δ$ and $S$, that obscure the application of traditional finite-size scaling approaches. △ Less

Submitted 7 July, 2024; originally announced July 2024.

Comments: 24 pages, 9 figures, 1 table, submitted to JSTAT

arXiv:2405.05141 [pdf, other]

Learning-to-learn enables rapid learning with phase-change memory-based in-memory computing

Authors: Thomas Ortner, Horst Petschenig, Athanasios Vasilopoulos, Roland Renner, Špela Brglez, Thomas Limbacher, Enrique Piñero, Alejandro Linares Barranco, Angeliki Pantazi, Robert Legenstein

Abstract: There is a growing demand for low-power, autonomously learning artificial intelligence (AI) systems that can be applied at the edge and rapidly adapt to the specific situation at deployment site. However, current AI models struggle in such scenarios, often requiring extensive fine-tuning, computational resources, and data. In contrast, humans can effortlessly adjust to new tasks by transferring kn… ▽ More There is a growing demand for low-power, autonomously learning artificial intelligence (AI) systems that can be applied at the edge and rapidly adapt to the specific situation at deployment site. However, current AI models struggle in such scenarios, often requiring extensive fine-tuning, computational resources, and data. In contrast, humans can effortlessly adjust to new tasks by transferring knowledge from related ones. The concept of learning-to-learn (L2L) mimics this process and enables AI models to rapidly adapt with only little computational effort and data. In-memory computing neuromorphic hardware (NMHW) is inspired by the brain's operating principles and mimics its physical co-location of memory and compute. In this work, we pair L2L with in-memory computing NMHW based on phase-change memory devices to build efficient AI models that can rapidly adapt to new tasks. We demonstrate the versatility of our approach in two scenarios: a convolutional neural network performing image classification and a biologically-inspired spiking neural network generating motor commands for a real robotic arm. Both models rapidly learn with few parameter updates. Deployed on the NMHW, they perform on-par with their software equivalents. Moreover, meta-training of these models can be performed in software with high-precision, alleviating the need for accurate hardware models. △ Less

Submitted 22 April, 2024; originally announced May 2024.

Comments: 16 pages and 3 appendix pages; Preprint currently under review

arXiv:2402.07549 [pdf, other]

doi 10.1109/ISCAS58744.2024.10558286

A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing

Authors: Elena Ferro, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Luca Benini, Irem Boybat, Abu Sebastian

Abstract: Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with circuit mismatches and non-idealities associated with the memory devices. Efficient near-memory digital logic is critical to retain the high area/energy efficiency and low latency of AIMC. Existing syst… ▽ More Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with circuit mismatches and non-idealities associated with the memory devices. Efficient near-memory digital logic is critical to retain the high area/energy efficiency and low latency of AIMC. Existing systems adopt Floating Point 16 (FP16) arithmetic with limited parallelization capability and high latency. To overcome these limitations, we propose a Near-Memory digital Processing Unit (NMPU) based on fixed-point arithmetic. It achieves competitive accuracy and higher computing throughput than previous approaches while minimizing the area overhead. Moreover, the NMPU supports standard DL activation steps, such as ReLU and Batch Normalization. We perform a physical implementation of the NMPU design in a 14 nm CMOS technology and provide detailed performance, power, and area assessments. We validate the efficacy of the NMPU by using data from an AIMC chip and demonstrate that a simulated AIMC system with the proposed NMPU outperforms existing FP16-based implementations, providing 139$\times$ speed-up, 7.8$\times$ smaller area, and a competitive power consumption. Additionally, our approach achieves an inference accuracy of 86.65 %/65.06 %, with an accuracy drop of just 0.12 %/0.4 % compared to the FP16 baseline when benchmarked with ResNet9/ResNet32 networks trained on the CIFAR10/CIFAR100 datasets, respectively. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: Accepted at ISCAS2024

arXiv:2401.09859 [pdf, other]

doi 10.1109/ISCAS58744.2024.10558540

Improving the Accuracy of Analog-Based In-Memory Computing Accelerators Post-Training

Authors: Corey Lammie, Athanasios Vasilopoulos, Julian Büchel, Giacomo Camposampiero, Manuel Le Gallo, Malte Rasch, Abu Sebastian

Abstract: Analog-Based In-Memory Computing (AIMC) inference accelerators can be used to efficiently execute Deep Neural Network (DNN) inference workloads. However, to mitigate accuracy losses, due to circuit and device non-idealities, Hardware-Aware (HWA) training methodologies must be employed. These typically require significant information about the underlying hardware. In this paper, we propose two Post… ▽ More Analog-Based In-Memory Computing (AIMC) inference accelerators can be used to efficiently execute Deep Neural Network (DNN) inference workloads. However, to mitigate accuracy losses, due to circuit and device non-idealities, Hardware-Aware (HWA) training methodologies must be employed. These typically require significant information about the underlying hardware. In this paper, we propose two Post-Training (PT) optimization methods to improve accuracy after training is performed. For each crossbar, the first optimizes the conductance range of each column, and the second optimizes the input, i.e, Digital-to-Analog Converter (DAC), range. It is demonstrated that, when these methods are employed, the complexity during training, and the amount of information about the underlying hardware can be reduced, with no notable change in accuracy ($\leq$0.1%) when finetuning the pretrained RoBERTa transformer model for all General Language Understanding Evaluation (GLUE) benchmark tasks. Additionally, it is demonstrated that further optimizing learned parameters PT improves accuracy. △ Less

Submitted 18 January, 2024; originally announced January 2024.

Comments: Accepted at 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

arXiv:2305.16647 [pdf]

doi 10.1109/IEDM45625.2022.10019486

Gradient descent-based programming of analog in-memory computing cores

Authors: Julian Büchel, Athanasios Vasilopoulos, Benedikt Kersting, Frederic Odermatt, Kevin Brew, Injo Ok, Sam Choi, Iqbal Saraf, Victor Chan, Timothy Philip, Nicole Saulnier, Vijay Narayanan, Manuel Le Gallo, Abu Sebastian

Abstract: The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell… ▽ More The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell by unit-cell iterative programming. It also eliminates the need for high-resolution analog-to-digital converters (ADCs) to read the small unit-cell conductance during programming. Our method improves the experimental inference accuracy of ResNet-9 implemented on two phase-change memory (PCM)-based AIMC cores by 1.26%. △ Less

Submitted 26 May, 2023; originally announced May 2023.

Journal ref: 2022 International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 2022, pp. 33.1.1-33.1.4

arXiv:2304.12379 [pdf, other]

doi 10.1103/PhysRevE.108.024140

Two-dimensional dilute Baxter-Wu model: Transition order and universality

Authors: A. R. S. Macedo, A. Vasilopoulos, M. Akritidis, J. A. Plascak, N. G. Fytas, M. Weigel

Abstract: We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal-field coupling $Δ$ with the goal of determining the universality class of transitions along the second-order part of the transition line as one approaches the putative location of the multicritical point. We employ extensive Monte Carlo simulations using two different methodologies: (i… ▽ More We investigate the critical behavior of the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal-field coupling $Δ$ with the goal of determining the universality class of transitions along the second-order part of the transition line as one approaches the putative location of the multicritical point. We employ extensive Monte Carlo simulations using two different methodologies: (i) a study of the zeros of the energy probability distribution, closely related to the Fisher zeros of the partition function, and (ii) the well-established multicanonical approach employed to study the probability distribution of the crystal-field energy. A detailed finite-size scaling analysis in the regime of second-order phase transitions in the $(Δ, T)$ phase diagram supports previous claims that the transition belongs to the universality class of the $4$-state Potts model. For positive values of $Δ$, we observe the presence of strong finite-size effects, indicative of crossover effects due to the proximity of the first-order part of the transition line. Finally, we demonstrate how a combination of cluster and heat-bath updates allows one to equilibrate larger systems, and we demonstrate the potential of this approach for resolving the ambiguities observed in the regime of $Δ\gtrsim 0$. △ Less

Submitted 7 August, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: 11 pages, 11 figures, 2 tables, new updated titled, version to be published in Phys Rev E

Journal ref: Phys. Rev. E 108, 024140 (2023)

arXiv:2212.12114 [pdf]

Predicting Survival of Tongue Cancer Patients by Machine Learning Models

Authors: Angelos Vasilopoulos, Nan Miles Xi

Abstract: Tongue cancer is a common oral cavity malignancy that originates in the mouth and throat. Much effort has been invested in improving its diagnosis, treatment, and management. Surgical removal, chemotherapy, and radiation therapy remain the major treatment for tongue cancer. The survival of patients determines the treatment effect. Previous studies have identified certain survival and risk factors… ▽ More Tongue cancer is a common oral cavity malignancy that originates in the mouth and throat. Much effort has been invested in improving its diagnosis, treatment, and management. Surgical removal, chemotherapy, and radiation therapy remain the major treatment for tongue cancer. The survival of patients determines the treatment effect. Previous studies have identified certain survival and risk factors based on descriptive statistics, ignoring the complex, nonlinear relationship among clinical and demographic variables. In this study, we utilize five cutting-edge machine learning models and clinical data to predict the survival of tongue cancer patients after treatment. Five-fold cross-validation, bootstrap analysis, and permutation feature importance are applied to estimate and interpret model performance. The prognostic factors identified by our method are consistent with previous clinical studies. Our method is accurate, interpretable, and thus useable as additional evidence in tongue cancer treatment and management. △ Less

Submitted 22 December, 2022; originally announced December 2022.

arXiv:2212.02872 [pdf, other]

doi 10.1038/s41928-023-01010-1

A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference

Authors: Manuel Le Gallo, Riduan Khaddam-Aljameh, Milos Stanisavljevic, Athanasios Vasilopoulos, Benedikt Kersting, Martino Dazzi, Geethan Karunaratne, Matthias Braendli, Abhairaj Singh, Silvia M. Mueller, Julian Buechel, Xavier Timoneda, Vinay Joshi, Urs Egger, Angelo Garofalo, Anastasios Petropoulos, Theodore Antonakopoulos, Kevin Brew, Samuel Choi, Injo Ok, Timothy Philip, Victor Chan, Claire Silvestre, Ishtiaq Ahsan, Nicole Saulnier , et al. (4 additional authors not shown)

Abstract: The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly withi… ▽ More The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly within the network weights stored on a chip to execute an inference workload. However, to achieve end-to-end improvements in latency and energy consumption, AIMC must be combined with on-chip digital operations and communication to move towards configurations in which a full inference workload is realized entirely on-chip. Moreover, it is highly desirable to achieve high MVM and inference accuracy without application-wise re-tuning of the chip. Here, we present a multi-core AIMC chip designed and fabricated in 14-nm complementary metal-oxide-semiconductor (CMOS) technology with backend-integrated phase-change memory (PCM). The fully-integrated chip features 64 256x256 AIMC cores interconnected via an on-chip communication network. It also implements the digital activation functions and processing involved in ResNet convolutional neural networks and long short-term memory (LSTM) networks. We demonstrate near software-equivalent inference accuracy with ResNet and LSTM networks while implementing all the computations associated with the weight layers and the activation functions on-chip. The chip can achieve a maximal throughput of 63.1 TOPS at an energy efficiency of 9.76 TOPS/W for 8-bit input/output matrix-vector multiplications. △ Less

Submitted 6 December, 2022; originally announced December 2022.

Journal ref: Nature Electronics 6, 680-693 (2023)

arXiv:2211.00772 [pdf]

Tuning hyperparameters of doublet-detection methods for single-cell RNA sequencing data

Authors: Nan Miles Xi, Angelos Vasilopoulos

Abstract: The existence of doublets in single-cell RNA sequencing (scRNA-seq) data poses a great challenge in downstream data analysis. Computational doublet-detection methods have been developed to remove doublets from scRNA-seq data. Yet, the default hyperparameter settings of those methods may not provide optimal performance. Here, we propose a strategy to tune hyperparameters for a cutting-edge doublet-… ▽ More The existence of doublets in single-cell RNA sequencing (scRNA-seq) data poses a great challenge in downstream data analysis. Computational doublet-detection methods have been developed to remove doublets from scRNA-seq data. Yet, the default hyperparameter settings of those methods may not provide optimal performance. Here, we propose a strategy to tune hyperparameters for a cutting-edge doublet-detection method. We utilize a full factorial design to explore the relationship between hyperparameters and detection accuracy on 16 real scRNA-seq datasets. The optimal hyperparameters are obtained by a response surface model and convex optimization. We show that the optimal hyperparameters provide top performance across scRNA-seq datasets under various biological conditions. Our tuning strategy can be applied to other computational doublet-detection methods. It also offers insights into hyperparameter tuning for broader computational methods in scRNA-seq data analysis. △ Less

Submitted 5 February, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

arXiv:2205.01494 [pdf, other]

doi 10.1103/PhysRevE.105.054143

Universality in the two-dimensional dilute Baxter-Wu model

Authors: Alexandros Vasilopoulos, Nikolaos G. Fytas, Erol Vatansever, Anastasios Malakis, Martin Weigel

Abstract: We study the question of universality in the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal field $Δ$. We employ extensive numerical simulations of two types, providing us with complementary results: Wang-Landau sampling at fixed values of $Δ$ and a parallelized variant of the multicanonical approach performed at constant temperature $T$. A detailed finite-size scaling analy… ▽ More We study the question of universality in the two-dimensional spin-$1$ Baxter-Wu model in the presence of a crystal field $Δ$. We employ extensive numerical simulations of two types, providing us with complementary results: Wang-Landau sampling at fixed values of $Δ$ and a parallelized variant of the multicanonical approach performed at constant temperature $T$. A detailed finite-size scaling analysis in the regime of second-order phase transitions in the $(Δ, T)$ phase diagram indicates that the transition belongs to the universality class of the $4$-state Potts model. Previous controversies with respect to the nature of the transition are discussed and possibly attributed to the presence of strong finite-size effects, especially as one approaches the pentacritical point of the model. △ Less

Submitted 3 May, 2022; originally announced May 2022.

Comments: 10 pages, 12 figures, 1 table, version to be published in Phys. Rev. E

Journal ref: Phys. Rev. E 105, 054143 (2022)

arXiv:2204.00047 [pdf, other]

doi 10.1088/1742-6596/2207/1/012008

Multicanonical simulations of the 2D spin-$1$ Baxter-Wu model in a crystal field

Authors: Nikolaos G. Fytas, Alexandros Vasilopoulos, Erol Vatansever, Anastasios Malakis, Martin Weigel

Abstract: We investigate aspects of universality in the two-dimensional (2D) spin-$1$ Baxter-Wu model in a crystal field $Δ$ using a parallel version of the multicanonical algorithm employed at constant temperature $T$. A detailed finite-size scaling analysis in the continuous regime of the $Δ-T$ phase diagram of the model indicates that the transition belongs to the universality class of the $4$-state Pott… ▽ More We investigate aspects of universality in the two-dimensional (2D) spin-$1$ Baxter-Wu model in a crystal field $Δ$ using a parallel version of the multicanonical algorithm employed at constant temperature $T$. A detailed finite-size scaling analysis in the continuous regime of the $Δ-T$ phase diagram of the model indicates that the transition belongs to the universality class of the $4$-state Potts model. The presence of first-order-like finite-size effects that become more pronounced as one approaches the pentacritical point of the model is highlighted and discussed. △ Less

Submitted 31 March, 2022; originally announced April 2022.

Comments: 6 pages, 6 figures, XXXII IUPAP Conference on Computational Physics

Journal ref: J. Phys.: Conf. Ser. 2207, 012008 (2022)

arXiv:2107.08632 [pdf, other]

doi 10.1103/PhysRevE.104.024108

Monte Carlo study of the two-dimensional kinetic Blume-Capel model in a quenched random crystal field

Authors: Alexandros Vasilopoulos, Zeynep Demir Vatansever, Erol Vatansever, Nikolaos G. Fytas

Abstract: We investigate by means of Monte Carlo simulations the dynamic phase transition of the two-dimensional kinetic Blume-Capel model under a periodically oscillating magnetic field in the presence of a quenched random crystal-field coupling. We analyze the universality principles of this dynamic transition for various values of the crystal-field coupling at the originally second-order regime of the co… ▽ More We investigate by means of Monte Carlo simulations the dynamic phase transition of the two-dimensional kinetic Blume-Capel model under a periodically oscillating magnetic field in the presence of a quenched random crystal-field coupling. We analyze the universality principles of this dynamic transition for various values of the crystal-field coupling at the originally second-order regime of the corresponding equilibrium phase diagram of the model. A detailed finite-size scaling analysis indicates that the observed nonequilibrium phase transition belongs to the universality class of the equilibrium Ising ferromagnet with additional logarithmic corrections in the scaling behavior of the heat capacity. Our results are in agreement with earlier works on kinetic Ising models. △ Less

Submitted 19 July, 2021; originally announced July 2021.

Comments: 25 pages (APS preprint style), 13 figures, 1 table

Journal ref: Phys. Rev. E 104, 024108 (2021)

Showing 1–12 of 12 results for author: Vasilopoulos, A