-
Nuclear mass predictions using machine learning models
Authors:
Esra Yüksel,
Derya Soydaner,
Hüseyin Bahtiyar
Abstract:
The exploration of nuclear mass or binding energy, a fundamental property of atomic nuclei, remains at the forefront of nuclear physics research due to limitations in experimental studies and uncertainties in model calculations, particularly when moving away from the stability line. In this work, we employ two machine learning (ML) models, Support Vector Regression (SVR) and Gaussian Process Regre…
▽ More
The exploration of nuclear mass or binding energy, a fundamental property of atomic nuclei, remains at the forefront of nuclear physics research due to limitations in experimental studies and uncertainties in model calculations, particularly when moving away from the stability line. In this work, we employ two machine learning (ML) models, Support Vector Regression (SVR) and Gaussian Process Regression (GPR), to assess their performance in predicting nuclear mass excesses using available experimental data and a physics-based feature space. We also examine the extrapolation capabilities of these models using newly measured nuclei from AME2020 and by extending our calculations beyond the training and test set regions. Our results indicate that both SVR and GPR models perform quite well within the training and test regions when informed with a physics-based feature space. Furthermore, these ML models demonstrate the ability to make reasonable predictions away from the available experimental data, offering results comparable to the model calculations. Through further refinement, these models can be used as reliable and efficient ML tools for studying nuclear properties in the future.
△ Less
Submitted 25 June, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
Unveiling The Factors of Aesthetic Preferences with Explainable AI
Authors:
Derya Soydaner,
Johan Wagemans
Abstract:
The allure of aesthetic appeal in images captivates our senses, yet the underlying intricacies of aesthetic preferences remain elusive. In this study, we pioneer a novel perspective by utilizing several different machine learning (ML) models that focus on aesthetic attributes known to influence preferences. Our models process these attributes as inputs to predict the aesthetic scores of images. Mo…
▽ More
The allure of aesthetic appeal in images captivates our senses, yet the underlying intricacies of aesthetic preferences remain elusive. In this study, we pioneer a novel perspective by utilizing several different machine learning (ML) models that focus on aesthetic attributes known to influence preferences. Our models process these attributes as inputs to predict the aesthetic scores of images. Moreover, to delve deeper and obtain interpretable explanations regarding the factors driving aesthetic preferences, we utilize the popular Explainable AI (XAI) technique known as SHapley Additive exPlanations (SHAP). Our methodology compares the performance of various ML models, including Random Forest, XGBoost, Support Vector Regression, and Multilayer Perceptron, in accurately predicting aesthetic scores, and consistently observing results in conjunction with SHAP. We conduct experiments on three image aesthetic benchmarks, namely Aesthetics with Attributes Database (AADB), Explainable Visual Aesthetics (EVA), and Personalized image Aesthetics database with Rich Attributes (PARA), providing insights into the roles of attributes and their interactions. Finally, our study presents ML models for aesthetics research, alongside the introduction of XAI. Our aim is to shed light on the complex nature of aesthetic preferences in images through ML and to provide a deeper understanding of the attributes that influence aesthetic judgements.
△ Less
Submitted 28 May, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Multi-task convolutional neural network for image aesthetic assessment
Authors:
Derya Soydaner,
Johan Wagemans
Abstract:
As people's aesthetic preferences for images are far from understood, image aesthetic assessment is a challenging artificial intelligence task. The range of factors underlying this task is almost unlimited, but we know that some aesthetic attributes affect those preferences. In this study, we present a multi-task convolutional neural network that takes into account these attributes. The proposed n…
▽ More
As people's aesthetic preferences for images are far from understood, image aesthetic assessment is a challenging artificial intelligence task. The range of factors underlying this task is almost unlimited, but we know that some aesthetic attributes affect those preferences. In this study, we present a multi-task convolutional neural network that takes into account these attributes. The proposed neural network jointly learns the attributes along with the overall aesthetic scores of images. This multi-task learning framework allows for effective generalization through the utilization of shared representations. Our experiments demonstrate that the proposed method outperforms the state-of-the-art approaches in predicting overall aesthetic scores for images in one benchmark of image aesthetics. We achieve near-human performance in terms of overall aesthetic scores when considering the Spearman's rank correlations. Moreover, our model pioneers the application of multi-tasking in another benchmark, serving as a new baseline for future research. Notably, our approach achieves this performance while using fewer parameters compared to existing multi-task neural networks in the literature, and consequently makes our method more efficient in terms of computational complexity.
△ Less
Submitted 15 January, 2024; v1 submitted 16 May, 2023;
originally announced May 2023.
-
From paintbrush to pixel: A review of deep neural networks in AI-generated art
Authors:
Anne-Sofie Maerten,
Derya Soydaner
Abstract:
This paper delves into the fascinating field of AI-generated art and explores the various deep neural network architectures and models that have been utilized to create it. From the classic convolutional networks to the cutting-edge diffusion models, we examine the key players in the field. We explain the general structures and working principles of these neural networks. Then, we showcase example…
▽ More
This paper delves into the fascinating field of AI-generated art and explores the various deep neural network architectures and models that have been utilized to create it. From the classic convolutional networks to the cutting-edge diffusion models, we examine the key players in the field. We explain the general structures and working principles of these neural networks. Then, we showcase examples of milestones, starting with the dreamy landscapes of DeepDream and moving on to the most recent developments, including Stable Diffusion and DALL-E 2, which produce mesmerizing images. A detailed comparison of these models is provided, highlighting their strengths and limitations. Thus, we examine the remarkable progress that deep neural networks have made so far in a short period of time. With a unique blend of technical explanations and insights into the current state of AI-generated art, this paper exemplifies how art and computer science interact.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.
-
Application of multilayer perceptron with data augmentation in nuclear physics
Authors:
Hüseyin Bahtiyar,
Derya Soydaner,
Esra Yüksel
Abstract:
Neural networks have become popular in many fields of science since they serve as promising, reliable and powerful tools. In this work, we study the effect of data augmentation on the predictive power of neural network models for nuclear physics data. We present two different data augmentation techniques, and we conduct a detailed analysis in terms of different depths, optimizers, activation funct…
▽ More
Neural networks have become popular in many fields of science since they serve as promising, reliable and powerful tools. In this work, we study the effect of data augmentation on the predictive power of neural network models for nuclear physics data. We present two different data augmentation techniques, and we conduct a detailed analysis in terms of different depths, optimizers, activation functions and random seed values to show the success and robustness of the model. Using the experimental uncertainties for data augmentation for the first time, the size of the training data set is artificially boosted and the changes in the root-mean-square error between the model predictions on the test set and the experimental data are investigated. Our results show that the data augmentation decreases the prediction errors, stabilizes the model and prevents overfitting. The extrapolation capabilities of the MLP models are also tested for newly measured nuclei in AME2020 mass table, and it is shown that the predictions are significantly improved by using data augmentation.
△ Less
Submitted 5 July, 2022; v1 submitted 16 May, 2022;
originally announced May 2022.
-
Attention Mechanism in Neural Networks: Where it Comes and Where it Goes
Authors:
Derya Soydaner
Abstract:
A long time ago in the machine learning literature, the idea of incorporating a mechanism inspired by the human visual system into neural networks was introduced. This idea is named the attention mechanism, and it has gone through a long development period. Today, many works have been devoted to this idea in a variety of tasks. Remarkable performance has recently been demonstrated. The goal of thi…
▽ More
A long time ago in the machine learning literature, the idea of incorporating a mechanism inspired by the human visual system into neural networks was introduced. This idea is named the attention mechanism, and it has gone through a long development period. Today, many works have been devoted to this idea in a variety of tasks. Remarkable performance has recently been demonstrated. The goal of this paper is to provide an overview from the early work on searching for ways to implement attention idea with neural networks until the recent trends. This review emphasizes the important milestones during this progress regarding different tasks. By this way, this study aims to provide a road map for researchers to explore the current development and get inspired for novel approaches beyond the attention.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
Nuclear binding energy predictions using neural networks: Application of the multilayer perceptron
Authors:
Esra Yüksel,
Derya Soydaner,
Hüseyin Bahtiyar
Abstract:
In recent years, artificial neural networks and their applications for large data sets have became a crucial part of scientific research. In this work, we implement the Multilayer Perceptron (MLP), which is a class of feedforward artificial neural network (ANN), to predict ground-state binding energies of atomic nuclei. Two different MLP architectures with three and four hidden layers are used to…
▽ More
In recent years, artificial neural networks and their applications for large data sets have became a crucial part of scientific research. In this work, we implement the Multilayer Perceptron (MLP), which is a class of feedforward artificial neural network (ANN), to predict ground-state binding energies of atomic nuclei. Two different MLP architectures with three and four hidden layers are used to study their effects on the predictions. To train the MLP architectures, two different inputs are used along with the latest atomic mass table and changes in binding energy predictions are also analyzed in terms of the changes in the input channel. It is seen that using appropriate MLP architectures and putting more physical information in the input channels, MLP can make fast and reliable predictions for binding energies of atomic nuclei, which is also comparable to the microscopic energy density functionals.
△ Less
Submitted 7 May, 2021; v1 submitted 28 January, 2021;
originally announced January 2021.
-
A Comparison of Optimization Algorithms for Deep Learning
Authors:
Derya Soydaner
Abstract:
In recent years, we have witnessed the rise of deep learning. Deep neural networks have proved their success in many areas. However, the optimization of these networks has become more difficult as neural networks going deeper and datasets becoming bigger. Therefore, more advanced optimization algorithms have been proposed over the past years. In this study, widely used optimization algorithms for…
▽ More
In recent years, we have witnessed the rise of deep learning. Deep neural networks have proved their success in many areas. However, the optimization of these networks has become more difficult as neural networks going deeper and datasets becoming bigger. Therefore, more advanced optimization algorithms have been proposed over the past years. In this study, widely used optimization algorithms for deep learning are examined in detail. To this end, these algorithms called adaptive gradient methods are implemented for both supervised and unsupervised tasks. The behaviour of the algorithms during training and results on four image datasets, namely, MNIST, CIFAR-10, Kaggle Flowers and Labeled Faces in the Wild are compared by pointing out their differences against basic optimization algorithms.
△ Less
Submitted 28 July, 2020;
originally announced July 2020.