-
On the use of quantality in nuclei and many-body systems
Authors:
J. -P. Ebran,
L. Heitz,
E. Khan
Abstract:
The use of quantality is discussed in the case of nuclei and other many-body systems such as atomic electrons. This dimensionless quantity is known to indicate when a many-body system behaves like a crystal or a quantum liquid. Its role is further analyzed, showing the emergence of a fundamental lengthscale, the limit radius, which corresponds to the hard-core of the nucleon-nucleon interaction in…
▽ More
The use of quantality is discussed in the case of nuclei and other many-body systems such as atomic electrons. This dimensionless quantity is known to indicate when a many-body system behaves like a crystal or a quantum liquid. Its role is further analyzed, showing the emergence of a fundamental lengthscale, the limit radius, which corresponds to the hard-core of the nucleon-nucleon interaction in the case of nucleons, and to a value close to the Bohr radius in the case of atomic electrons. The occurrence of a cluster phase in nuclei is analyzed using the quantality through its relation to the localization parameter, allowing for the identification of both the number of nucleons and the density as control parameters for the occurrence of this phase. The relation of the quantality to the magnitude of the interaction also exhibits a third dimensionless parameter, monitoring the magnitude of the spin-orbit effect in finite systems, through the realization of the pseudo-spin symmetry. The impact of quantality on the spin-orbit effect is compared in various many-body systems. The role of quantality in the relative effect of the binding energy and the shell one is also analyzed in nuclei. Finally, additional dimensionless quantities are proposed from the generalization of the quantality. Nuclei are found to be exceptional systems because all their dimensionless quantities are close to the order of unity, at variance with other many-body systems.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Conformal Prediction via Regression-as-Classification
Authors:
Etash Guha,
Shlok Natarajan,
Thomas Möllenhoff,
Mohammad Emtiyaz Khan,
Eugene Ndiaye
Abstract:
Conformal prediction (CP) for regression can be challenging, especially when the output distribution is heteroscedastic, multimodal, or skewed. Some of the issues can be addressed by estimating a distribution over the output, but in reality, such approaches can be sensitive to estimation error and yield unstable intervals.~Here, we circumvent the challenges by converting regression to a classifica…
▽ More
Conformal prediction (CP) for regression can be challenging, especially when the output distribution is heteroscedastic, multimodal, or skewed. Some of the issues can be addressed by estimating a distribution over the output, but in reality, such approaches can be sensitive to estimation error and yield unstable intervals.~Here, we circumvent the challenges by converting regression to a classification problem and then use CP for classification to obtain CP sets for regression.~To preserve the ordering of the continuous-output space, we design a new loss function and make necessary modifications to the CP classification techniques.~Empirical results on many benchmarks shows that this simple approach gives surprisingly good results on many practical problems.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Space Physiology and Technology: Musculoskeletal Adaptations, Countermeasures, and the Opportunity for Wearable Robotics
Authors:
Shamas Ul Ebad Khan,
Re** John Varghese,
Panagiotis Kassanos,
Dario Farina,
Etienne Burdet
Abstract:
Space poses significant challenges for human physiology, leading to physiological adaptations in response to an environment vastly different from Earth. While these adaptations can be beneficial, they may not fully counteract the adverse impact of space-related stressors. A comprehensive understanding of these physiological adaptations is needed to devise effective countermeasures to support human…
▽ More
Space poses significant challenges for human physiology, leading to physiological adaptations in response to an environment vastly different from Earth. While these adaptations can be beneficial, they may not fully counteract the adverse impact of space-related stressors. A comprehensive understanding of these physiological adaptations is needed to devise effective countermeasures to support human life in space. This review focuses on the impact of the environment in space on the musculoskeletal system. It highlights the complex interplay between bone and muscle adaptation, the underlying physiological mechanisms, and their implications on astronaut health. Furthermore, the review delves into the deployed and current advances in countermeasures and proposes, as a perspective for future developments, wearable sensing and robotic technologies, such as exoskeletons, as a fitting alternative.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Effects of Finite Temperature and Pairing Correlations in Multi-$Λ$ Hypernuclei
Authors:
Bahruz Suleymanli,
Kutsal Bozkurt,
Elias Khan,
Haşim Güven,
Jérôme Margueron
Abstract:
The influence of finite temperatures and pairing correlations on the ground state properties of multi $Λ$- Ca, Sn and Pb hypernuclei is explored using finite temperature Hartree Fock Bogoliubov approach and contact pairing interaction. A critical temperature is predicted and is in agreement with the Bardeen Cooper Schrieffer relationship $k_B T_C^Λ\approx 0.5 Δ^{T=0}_Λ$, beyond which pairing corre…
▽ More
The influence of finite temperatures and pairing correlations on the ground state properties of multi $Λ$- Ca, Sn and Pb hypernuclei is explored using finite temperature Hartree Fock Bogoliubov approach and contact pairing interaction. A critical temperature is predicted and is in agreement with the Bardeen Cooper Schrieffer relationship $k_B T_C^Λ\approx 0.5 Δ^{T=0}_Λ$, beyond which pairing correlations drop to zero. Particle densities, $Λ$ single particle energies, and nuclear radii are weakly impacted by pairing as well as by finite temperatures. However, other nuclear properties which are more sensitive to pairing correlations, such as $Λ$ pairing gaps, condensation energies, and abnormal densities are also more impacted by finite temperature, especially around the critical temperature. Furthermore, calculations show the occurrence of the pairing re-entrance effect in the $^{280}_{70Λ}$Pb hyperon drip line hypernucleus. Our study provides insight into the thermal evolution of $Λ$ pairing, i.e. the emergence and vanishing of pairing correlations in multi $Λ$ hypernuclei as a function of temperature.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
SDXL Finetuned with LoRA for Coloring Therapy: Generating Graphic Templates Inspired by United Arab Emirates Culture
Authors:
Abdulla Alfalasi,
Esrat Khan,
Mohamed Alhashmi,
Raed Aldweik,
Davor Svetinovic
Abstract:
A transformative approach to mental health therapy lies at the crossroads of cultural heritage and advanced technology. This paper introduces an innovative method that fuses machine learning techniques with traditional Emirati motifs, focusing on the United Arab Emirates (UAE). We utilize the Stable Diffusion XL (SDXL) model, enhanced with Low-Rank Adaptation (LoRA), to create culturally significa…
▽ More
A transformative approach to mental health therapy lies at the crossroads of cultural heritage and advanced technology. This paper introduces an innovative method that fuses machine learning techniques with traditional Emirati motifs, focusing on the United Arab Emirates (UAE). We utilize the Stable Diffusion XL (SDXL) model, enhanced with Low-Rank Adaptation (LoRA), to create culturally significant coloring templates featuring Al-Sadu weaving patterns. This novel approach leverages coloring therapy for its recognized stress-relieving benefits and embeds deep cultural resonance, making it a potent tool for therapeutic intervention and cultural preservation. Specifically targeting Generalized Anxiety Disorder (GAD), our method demonstrates significant potential in reducing associated symptoms. Additionally, the paper delves into the broader implications of color and music therapy, emphasizing the importance of culturally tailored content. The technical aspects of the SDXL model and its LoRA fine-tuning showcase its capability to generate high-quality, culturally specific images. This research stands at the forefront of integrating mental wellness practices with cultural heritage, providing a groundbreaking perspective on the synergy between technology, culture, and healthcare. In future work, we aim to employ biosignals to assess the level of engagement and effectiveness of color therapy. A key focus will be to examine the impact of the Emirati heritage Al Sadu art on Emirati individuals and compare their responses with those of other nationalities. This will provide deeper insights into the cultural specificity of therapeutic interventions and further the understanding of the unique interplay between cultural identity and mental health therapy.
△ Less
Submitted 20 February, 2024;
originally announced March 2024.
-
Variational Learning is Effective for Large Deep Networks
Authors:
Yuesong Shen,
Nico Daheim,
Bai Cong,
Peter Nickl,
Gian Maria Marconi,
Clement Bazan,
Rio Yokota,
Iryna Gurevych,
Daniel Cremers,
Mohammad Emtiyaz Khan,
Thomas Möllenhoff
Abstract:
We give extensive empirical evidence against the common belief that variational learning is ineffective for large neural networks. We show that an optimizer called Improved Variational Online Newton (IVON) consistently matches or outperforms Adam for training large networks such as GPT-2 and ResNets from scratch. IVON's computational costs are nearly identical to Adam but its predictive uncertaint…
▽ More
We give extensive empirical evidence against the common belief that variational learning is ineffective for large neural networks. We show that an optimizer called Improved Variational Online Newton (IVON) consistently matches or outperforms Adam for training large networks such as GPT-2 and ResNets from scratch. IVON's computational costs are nearly identical to Adam but its predictive uncertainty is better. We show several new use cases of IVON where we improve finetuning and model merging in Large Language Models, accurately predict generalization error, and faithfully estimate sensitivity to data. We find overwhelming evidence that variational learning is effective.
△ Less
Submitted 6 June, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI
Authors:
Theodore Papamarkou,
Maria Skoularidou,
Konstantina Palla,
Laurence Aitchison,
Julyan Arbel,
David Dunson,
Maurizio Filippone,
Vincent Fortuin,
Philipp Hennig,
José Miguel Hernández-Lobato,
Aliaksandr Hubin,
Alexander Immer,
Theofanis Karaletsos,
Mohammad Emtiyaz Khan,
Agustinus Kristiadi,
Yingzhen Li,
Stephan Mandt,
Christopher Nemeth,
Michael A. Osborne,
Tim G. J. Rudner,
David Rügamer,
Yee Whye Teh,
Max Welling,
Andrew Gordon Wilson,
Ruqi Zhang
Abstract:
In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni…
▽ More
In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learning (BDL) constitutes a promising avenue, offering advantages across these diverse settings. This paper posits that BDL can elevate the capabilities of deep learning. It revisits the strengths of BDL, acknowledges existing challenges, and highlights some exciting research avenues aimed at addressing these obstacles. Looking ahead, the discussion focuses on possible ways to combine large-scale foundation models with BDL to unlock their full potential.
△ Less
Submitted 2 June, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
Shadow: A Novel Loss Function for Efficient Training in Siamese Networks
Authors:
Alif Elham Khan,
Mohammad Junayed Hasan,
Humayra Anjum,
Nabeel Mohammed
Abstract:
Despite significant recent advances in similarity detection tasks, existing approaches pose substantial challenges under memory constraints. One of the primary reasons for this is the use of computationally expensive metric learning loss functions such as Triplet Loss in Siamese networks. In this paper, we present a novel loss function called Shadow Loss that compresses the dimensions of an embedd…
▽ More
Despite significant recent advances in similarity detection tasks, existing approaches pose substantial challenges under memory constraints. One of the primary reasons for this is the use of computationally expensive metric learning loss functions such as Triplet Loss in Siamese networks. In this paper, we present a novel loss function called Shadow Loss that compresses the dimensions of an embedding space during loss calculation without loss of performance. The distance between the projections of the embeddings is learned from inputs on a compact projection space where distances directly correspond to a measure of class similarity. Projecting on a lower-dimension projection space, our loss function converges faster, and the resulting classified image clusters have higher inter-class and smaller intra-class distances. Shadow Loss not only reduces embedding dimensions favoring memory constraint devices but also consistently performs better than the state-of-the-art Triplet Margin Loss by an accuracy of 5\%-10\% across diverse datasets. The proposed loss function is also model agnostic, upholding its performance across several tested models. Its effectiveness and robustness across balanced, imbalanced, medical, and non-medical image datasets suggests that it is not specific to a particular model or dataset but demonstrates superior performance consistently while using less memory and computation.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
The Memory Perturbation Equation: Understanding Model's Sensitivity to Data
Authors:
Peter Nickl,
Lu Xu,
Dharmesh Tailor,
Thomas Möllenhoff,
Mohammad Emtiyaz Khan
Abstract:
Understanding model's sensitivity to its training data is crucial but can also be challenging and costly, especially during training. To simplify such issues, we present the Memory-Perturbation Equation (MPE) which relates model's sensitivity to perturbation in its training data. Derived using Bayesian principles, the MPE unifies existing sensitivity measures, generalizes them to a wide-variety of…
▽ More
Understanding model's sensitivity to its training data is crucial but can also be challenging and costly, especially during training. To simplify such issues, we present the Memory-Perturbation Equation (MPE) which relates model's sensitivity to perturbation in its training data. Derived using Bayesian principles, the MPE unifies existing sensitivity measures, generalizes them to a wide-variety of models and algorithms, and unravels useful properties regarding sensitivities. Our empirical results show that sensitivity estimates obtained during training can be used to faithfully predict generalization on unseen test data. The proposed equation is expected to be useful for future research on robust and adaptive learning.
△ Less
Submitted 16 January, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Model Merging by Uncertainty-Based Gradient Matching
Authors:
Nico Daheim,
Thomas Möllenhoff,
Edoardo Maria Ponti,
Iryna Gurevych,
Mohammad Emtiyaz Khan
Abstract:
Models trained on different datasets can be merged by a weighted-averaging of their parameters, but why does it work and when can it fail? Here, we connect the inaccuracy of weighted-averaging to mismatches in the gradients and propose a new uncertainty-based scheme to improve the performance by reducing the mismatch. The connection also reveals implicit assumptions in other schemes such as averag…
▽ More
Models trained on different datasets can be merged by a weighted-averaging of their parameters, but why does it work and when can it fail? Here, we connect the inaccuracy of weighted-averaging to mismatches in the gradients and propose a new uncertainty-based scheme to improve the performance by reducing the mismatch. The connection also reveals implicit assumptions in other schemes such as averaging, task arithmetic, and Fisher-weighted averaging. Our new method gives consistent improvements for large language models and vision transformers, both in terms of performance and robustness to hyperparameters.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Exploiting Inferential Structure in Neural Processes
Authors:
Dharmesh Tailor,
Mohammad Emtiyaz Khan,
Eric Nalisnick
Abstract:
Neural Processes (NPs) are appealing due to their ability to perform fast adaptation based on a context set. This set is encoded by a latent variable, which is often assumed to follow a simple distribution. However, in real-word settings, the context set may be drawn from richer distributions having multiple modes, heavy tails, etc. In this work, we provide a framework that allows NPs' latent vari…
▽ More
Neural Processes (NPs) are appealing due to their ability to perform fast adaptation based on a context set. This set is encoded by a latent variable, which is often assumed to follow a simple distribution. However, in real-word settings, the context set may be drawn from richer distributions having multiple modes, heavy tails, etc. In this work, we provide a framework that allows NPs' latent variable to be given a rich prior defined by a graphical model. These distributional assumptions directly translate into an appropriate aggregation strategy for the context set. Moreover, we describe a message-passing procedure that still allows for end-to-end optimization with stochastic gradients. We demonstrate the generality of our framework by using mixture and Student-t assumptions that yield improvements in function modelling and test-time robustness.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Memory-Based Dual Gaussian Processes for Sequential Learning
Authors:
Paul E. Chang,
Prakhar Verma,
S. T. John,
Arno Solin,
Mohammad Emtiyaz Khan
Abstract:
Sequential learning with Gaussian processes (GPs) is challenging when access to past data is limited, for example, in continual and active learning. In such cases, errors can accumulate over time due to inaccuracies in the posterior, hyperparameters, and inducing points, making accurate learning challenging. Here, we present a method to keep all such errors in check using the recently proposed dua…
▽ More
Sequential learning with Gaussian processes (GPs) is challenging when access to past data is limited, for example, in continual and active learning. In such cases, errors can accumulate over time due to inaccuracies in the posterior, hyperparameters, and inducing points, making accurate learning challenging. Here, we present a method to keep all such errors in check using the recently proposed dual sparse variational GP. Our method enables accurate inference for generic likelihoods and improves learning by actively building and updating a memory of past data. We demonstrate its effectiveness in several applications involving Bayesian optimization, active learning, and continual learning.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Behavioral Forensics in Social Networks: Identifying Misinformation, Disinformation and Refutation Spreaders Using Machine Learning
Authors:
Euna Mehnaz Khan,
Ayush Ram,
Bhavtosh Rath,
Emily Vraga,
Jaideep Srivastava
Abstract:
With the ever-increasing spread of misinformation on online social networks, it has become very important to identify the spreaders of misinformation (unintentional), disinformation (intentional), and misinformation refutation. It can help in educating the first, stop** the second, and soliciting the help of the third category, respectively, in the overall effort to counter misinformation spread…
▽ More
With the ever-increasing spread of misinformation on online social networks, it has become very important to identify the spreaders of misinformation (unintentional), disinformation (intentional), and misinformation refutation. It can help in educating the first, stop** the second, and soliciting the help of the third category, respectively, in the overall effort to counter misinformation spread. Existing research to identify spreaders is limited to binary classification (true vs false information spreaders). However, people's intention (whether naive or malicious) behind sharing misinformation can only be understood after observing their behavior after exposure to both the misinformation and its refutation which the existing literature lacks to consider. In this paper, we propose a labeling mechanism to label people as one of the five defined categories based on the behavioral actions they exhibit when exposed to misinformation and its refutation. However, everyone does not show behavioral actions but is part of a network. Therefore, we use their network features, extracted through deep learning-based graph embedding models, to train a machine learning model for the prediction of the classes. We name our approach behavioral forensics since it is an evidence-based investigation of suspicious behavior which is spreading misinformation and disinformation in our case. After evaluating our proposed model on a real-world Twitter dataset, we achieved 77.45% precision and 75.80% recall in detecting the malicious actors, who shared the misinformation even after receiving its refutation. Such behavior shows intention, and hence these actors can rightfully be called agents of disinformation spread.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
Variational Bayes Made Easy
Authors:
Mohammad Emtiyaz Khan
Abstract:
Variational Bayes is a popular method for approximate inference but its derivation can be cumbersome. To simplify the process, we give a 3-step recipe to identify the posterior form by explicitly looking for linearity with respect to expectations of well-known distributions. We can then directly write the update by simply ``reading-off'' the terms in front of those expectations. The recipe makes t…
▽ More
Variational Bayes is a popular method for approximate inference but its derivation can be cumbersome. To simplify the process, we give a 3-step recipe to identify the posterior form by explicitly looking for linearity with respect to expectations of well-known distributions. We can then directly write the update by simply ``reading-off'' the terms in front of those expectations. The recipe makes the derivation easier, faster, shorter, and more general.
△ Less
Submitted 10 July, 2023; v1 submitted 27 April, 2023;
originally announced April 2023.
-
On the nature of compact stars determined by gravitational waves, radio-astronomy, x-ray emission and nuclear physics
Authors:
H. Güven,
J. Margueron,
K. Bozkurt,
E. Khan
Abstract:
We investigate the question of the nature of compact stars, considering they may be neutron stars or hybrid stars containing a quark core, within the present constraints given by gravitational waves, radio-astronomy, X-ray emissions from millisecond pulsars and nuclear physics. A Bayesian framework is used to combine together all these constraints and to predict tidal deformabilities and radii for…
▽ More
We investigate the question of the nature of compact stars, considering they may be neutron stars or hybrid stars containing a quark core, within the present constraints given by gravitational waves, radio-astronomy, X-ray emissions from millisecond pulsars and nuclear physics. A Bayesian framework is used to combine together all these constraints and to predict tidal deformabilities and radii for a 1.4~M$_\odot$ compact star. We find that present gravitation wave and radio-astronomy data favors stiff nucleonic EoS compatible with nuclear physics and that GW170817 waveform is best described for binary hybrid stars. Binary neutron stars with soft EoS could however not be totally excluded. In all cases, these %In addition, this data favor stiff quark matter, independently of the nuclear EoS, with a low value for the transition density ($n_\mathrm{tr}\in[0.18,0.35]~\mathrm{fm}^{-3}$). Combining these results with constraints from X-ray observation supports the existence $1.4$~M$_\odot$ mass hybrid star, with a radius predicted to be about $R_{1.4}=12.22(45)$~km.
△ Less
Submitted 11 June, 2024; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Interpretable histopathology-based prediction of disease relevant features in Inflammatory Bowel Disease biopsies using weakly-supervised deep learning
Authors:
Ricardo Mokhtari,
Azam Hamidinekoo,
Daniel Sutton,
Arthur Lewis,
Bastian Angermann,
Ulf Gehrmann,
Pal Lundin,
Hibret Adissu,
Junmei Cairns,
Jessica Neisen,
Emon Khan,
Daniel Marks,
Nia Khachapuridze,
Talha Qaiser,
Nikolay Burlutskiy
Abstract:
Crohn's Disease (CD) and Ulcerative Colitis (UC) are the two main Inflammatory Bowel Disease (IBD) types. We developed deep learning models to identify histological disease features for both CD and UC using only endoscopic labels. We explored fine-tuning and end-to-end training of two state-of-the-art self-supervised models for predicting three different endoscopic categories (i) CD vs UC (AUC=0.8…
▽ More
Crohn's Disease (CD) and Ulcerative Colitis (UC) are the two main Inflammatory Bowel Disease (IBD) types. We developed deep learning models to identify histological disease features for both CD and UC using only endoscopic labels. We explored fine-tuning and end-to-end training of two state-of-the-art self-supervised models for predicting three different endoscopic categories (i) CD vs UC (AUC=0.87), (ii) normal vs lesional (AUC=0.81), (iii) low vs high disease severity score (AUC=0.80). We produced visual attention maps to interpret what the models learned and validated them with the support of a pathologist, where we observed a strong association between the models' predictions and histopathological inflammatory features of the disease. Additionally, we identified several cases where the model incorrectly predicted normal samples as lesional but were correct on the microscopic level when reviewed by the pathologist. This tendency of histological presentation to be more severe than endoscopic presentation was previously published in the literature. In parallel, we utilised a model trained on the Colon Nuclei Identification and Counting (CoNIC) dataset to predict and explore 6 cell populations. We observed correlation between areas enriched with the predicted immune cells in biopsies and the pathologist's feedback on the attention maps. Finally, we identified several cell level features indicative of disease severity in CD and UC. These models can enhance our understanding about the pathology behind IBD and can shape our strategies for patient stratification in clinical trials.
△ Less
Submitted 16 May, 2023; v1 submitted 20 March, 2023;
originally announced March 2023.
-
The Lie-Group Bayesian Learning Rule
Authors:
Eren Mehmet Kıral,
Thomas Möllenhoff,
Mohammad Emtiyaz Khan
Abstract:
The Bayesian Learning Rule provides a framework for generic algorithm design but can be difficult to use for three reasons. First, it requires a specific parameterization of exponential family. Second, it uses gradients which can be difficult to compute. Third, its update may not always stay on the manifold. We address these difficulties by proposing an extension based on Lie-groups where posterio…
▽ More
The Bayesian Learning Rule provides a framework for generic algorithm design but can be difficult to use for three reasons. First, it requires a specific parameterization of exponential family. Second, it uses gradients which can be difficult to compute. Third, its update may not always stay on the manifold. We address these difficulties by proposing an extension based on Lie-groups where posteriors are parametrized through transformations of an arbitrary base distribution and updated via the group's exponential map. This simplifies all three difficulties for many cases, providing flexible parametrizations through group's action, simple gradient computation through reparameterization, and updates that always stay on the manifold. We use the new learning rule to derive a new algorithm for deep learning with desirable biologically-plausible attributes to learn sparse features. Our work opens a new frontier for the design of new algorithms by exploiting Lie-group structures.
△ Less
Submitted 8 March, 2023;
originally announced March 2023.
-
Frequency-domain Blind Quality Assessment of Blurred and Blocking-artefact Images using Gaussian Process Regression model
Authors:
Maryam Viqar,
Athar A. Moinuddin,
Ekram Khan,
M. Ghanbari
Abstract:
Most of the standard image and video codecs are block-based and depending upon the compression ratio the compressed images/videos suffer from different distortions. At low ratios, blurriness is observed and as compression increases blocking artifacts occur. Generally, in order to reduce blockiness, images are low-pass filtered which leads to more blurriness. Also, in bokeh mode images they are com…
▽ More
Most of the standard image and video codecs are block-based and depending upon the compression ratio the compressed images/videos suffer from different distortions. At low ratios, blurriness is observed and as compression increases blocking artifacts occur. Generally, in order to reduce blockiness, images are low-pass filtered which leads to more blurriness. Also, in bokeh mode images they are commonly seen: blurriness as a result of intentional blurred background while blocking artifact and global blurriness arising due to compression. Therefore, such visual media suffer from both blockiness and blurriness distortions. Along with this, noise is also commonly encountered distortion. Most of the existing works on quality assessment quantify these distortions individually. This paper proposes a methodology to blindly measure overall quality of an image suffering from these distortions, individually as well as jointly. This is achieved by considering the sum of absolute values of low and high-frequency Discrete Frequency Transform (DFT) coefficients defined as sum magnitudes. The number of blocks lying in specific ranges of sum magnitudes including zero-valued AC coefficients and mean of 100 maximum and 100 minimum values of these sum magnitudes are used as feature vectors. These features are then fed to the Machine Learning (ML) based Gaussian Process Regression (GPR) model, which quantifies the image quality. The simulation results show that the proposed method can estimate the quality of images distorted with the blockiness, blurriness, noise and their combinations. It is relatively fast compared to many state-of-art methods, and therefore is suitable for real-time quality monitoring applications.
△ Less
Submitted 5 March, 2023;
originally announced March 2023.
-
Simplifying Momentum-based Positive-definite Submanifold Optimization with Applications to Deep Learning
Authors:
Wu Lin,
Valentin Duruisseaux,
Melvin Leok,
Frank Nielsen,
Mohammad Emtiyaz Khan,
Mark Schmidt
Abstract:
Riemannian submanifold optimization with momentum is computationally challenging because, to ensure that the iterates remain on the submanifold, we often need to solve difficult differential equations. Here, we simplify such difficulties for a class of sparse or structured symmetric positive-definite matrices with the affine-invariant metric. We do so by proposing a generalized version of the Riem…
▽ More
Riemannian submanifold optimization with momentum is computationally challenging because, to ensure that the iterates remain on the submanifold, we often need to solve difficult differential equations. Here, we simplify such difficulties for a class of sparse or structured symmetric positive-definite matrices with the affine-invariant metric. We do so by proposing a generalized version of the Riemannian normal coordinates that dynamically orthonormalizes the metric and locally converts the problem into an unconstrained problem in the Euclidean space. We use our approach to simplify existing approaches for structured covariances and develop matrix-inverse-free $2^\text{nd}$-order optimizers for deep learning with low precision by using only matrix multiplications. Code: https://github.com/yorkerlin/StructuredNGD-DL
△ Less
Submitted 16 March, 2024; v1 submitted 19 February, 2023;
originally announced February 2023.
-
H-LPS: a hybrid approach for user's location privacy in location-based services
Authors:
Sonia Sabir,
Inayat Ali,
Eraj Khan
Abstract:
Applications providing location-based services (LBS) have gained much attention and importance with the notion of the internet of things (IoT). Users are utilizing LBS by providing their location information to third-party service providers. However, location data is very sensitive that can reveal user's private life to adversaries. The passive and pervasive data collection in IoT upsurges serious…
▽ More
Applications providing location-based services (LBS) have gained much attention and importance with the notion of the internet of things (IoT). Users are utilizing LBS by providing their location information to third-party service providers. However, location data is very sensitive that can reveal user's private life to adversaries. The passive and pervasive data collection in IoT upsurges serious issues of location privacy. Privacy-preserving location-based services are a hot research topic. Many anonymization and obfuscation techniques have been proposed to overcome location privacy issues. In this paper, we have proposed a hybrid location privacy scheme (H-LPS), a hybrid scheme mainly based on obfuscation and collaboration for protecting users' location privacy while using location-based services. Obfuscation naturally degrades the quality of service but provides more privacy as compared to anonymization. Our proposed scheme, H-LPS, provides a very high-level of privacy yet provides good accuracy for most of the users. The privacy level and service accuracy of H-LPS are compared with state-of-the-art location privacy schemes and it is shown that H-LPS could be a candidate solution for preserving user location privacy in location-based services.
△ Less
Submitted 15 December, 2022;
originally announced December 2022.
-
Microscopic description of $α$, $2α$, and cluster decays of $^{216-220}$Rn and $^{220-224}$Ra
Authors:
J. Zhao,
J. -P. Ebran,
L. Heitz,
E. Khan,
F. Mercier,
T. Niksic,
D. Vretenar
Abstract:
Alpha and cluster decays are analyzed for heavy nuclei located above $^{208}$Pb on the chart of nuclides: $^{216-220}$Rn and $^{220-224}$Ra, that are also candidates for observing the $2 α$ decay mode. A microscopic theoretical approach based on relativistic Energy Density Functionals (EDF), is used to compute axially-symmetric deformation energy surfaces as functions of quadrupole, octupole and h…
▽ More
Alpha and cluster decays are analyzed for heavy nuclei located above $^{208}$Pb on the chart of nuclides: $^{216-220}$Rn and $^{220-224}$Ra, that are also candidates for observing the $2 α$ decay mode. A microscopic theoretical approach based on relativistic Energy Density Functionals (EDF), is used to compute axially-symmetric deformation energy surfaces as functions of quadrupole, octupole and hexadecupole collective coordinates. Dynamical least-action paths for specific decay modes are calculated on the corresponding potential energy surfaces. The effective collective inertia is determined using the perturbative cranking approximation, and zero-point and rotational energy corrections are included in the model. The predicted half-lives for $α$-decay are within one order of magnitude of the experimental values. In the case of single $α$ emission, the nuclei considered in the present study exhibit least-action paths that differ significantly up to the scission point. The differences in alpha-decay lifetimes are not only driven by Q values, but also by variances of the least-action paths prior to scission. In contrast, the $2 α$ decay mode presents very similar paths from equilibrium to scission, and the differences in lifetimes are mainly driven by the corresponding Q values. The predicted $^{14}$C cluster decay half-lives are within three orders of magnitudes of the empirical values, and point to a much more complex pattern compared to the alpha-decay mode.
△ Less
Submitted 6 March, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
PANDORA project: photo-nuclear reactions below $A=60$
Authors:
A. Tamii,
L. Pellegri,
P. -A. Söderström,
D. Allard,
S. Goriely,
T. Inakura,
E. Khan,
E. Kido,
M. Kimura,
E. Litvinova,
S. Nagataki,
P. von Neumann-Cosel,
N. Pietralla,
N. Shimizu,
N. Tsoneva,
Y. Utsuno,
S. Adachi,
P. Adsley,
A. Bahini,
D. Balabanski,
B. Baret,
J. A. C. Bekker,
S. D. Binda,
E. Boicu,
A. Bracco
, et al. (56 additional authors not shown)
Abstract:
Photo-nuclear reactions of light nuclei below a mass of $A=60$ are studied experimentally and theoretically by the PANDORA (Photo-Absorption of Nuclei and Decay Observation for Reactions in Astrophysics) project. Two experimental methods, virtual-photon excitation by proton scattering and real-photo absorption by a high-brilliance gamma-ray beam produced by laser Compton scattering, will be applie…
▽ More
Photo-nuclear reactions of light nuclei below a mass of $A=60$ are studied experimentally and theoretically by the PANDORA (Photo-Absorption of Nuclei and Decay Observation for Reactions in Astrophysics) project. Two experimental methods, virtual-photon excitation by proton scattering and real-photo absorption by a high-brilliance gamma-ray beam produced by laser Compton scattering, will be applied to measure the photo-absorption cross sections and the decay branching ratio of each decay channel as a function of the photon energy. Several nuclear models, e.g. anti-symmetrized molecular dynamics, mean-field type models, a large-scale shell model, and ab initio models, will be employed to predict the photo-nuclear reactions. The uncertainty in the model predictions will be evaluated from the discrepancies between the model predictions and the experimental data. The data and the predictions will be implemented in a general reaction calculation code TALYS . The results will be applied to the simulation of the photo-disintegration process of ultra-high-energy cosmic rays in inter-galactic propagation.
△ Less
Submitted 18 November, 2022; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Bridging the Gap Between Target Networks and Functional Regularization
Authors:
Alexandre Piche,
Valentin Thomas,
Joseph Marino,
Rafael Pardinas,
Gian Maria Marconi,
Christopher Pal,
Mohammad Emtiyaz Khan
Abstract:
Bootstrap** is behind much of the successes of Deep Reinforcement Learning. However, learning the value function via bootstrap** often leads to unstable training due to fast-changing target values. Target Networks are employed to stabilize training by using an additional set of lagging parameters to estimate the target values. Despite the popularity of Target Networks, their effect on the opti…
▽ More
Bootstrap** is behind much of the successes of Deep Reinforcement Learning. However, learning the value function via bootstrap** often leads to unstable training due to fast-changing target values. Target Networks are employed to stabilize training by using an additional set of lagging parameters to estimate the target values. Despite the popularity of Target Networks, their effect on the optimization is still misunderstood. In this work, we show that they act as an implicit regularizer. This regularizer has disadvantages such as being inflexible and non convex. To overcome these issues, we propose an explicit Functional Regularization that is a convex regularizer in function space and can easily be tuned. We analyze the convergence of our method theoretically and empirically demonstrate that replacing Target Networks with the more theoretically grounded Functional Regularization approach leads to better sample efficiency and performance improvements.
△ Less
Submitted 3 January, 2024; v1 submitted 21 October, 2022;
originally announced October 2022.
-
Covariant energy density functionals with and without tensor couplings at the Hartree-Bogoliubov level
Authors:
F. Mercier,
J. -P. Ebran,
E. Khan
Abstract:
Background: The study of additional terms in functionals is relevant to better describe nuclear structure phenomenology. Among these terms, the tensor one is known to impact nuclear structure properties, especially in neutron-rich nuclei. However, its effect has not been studied on the whole nuclear chart yet.
Purpose: The impact of terms corresponding to the tensor at the Hartree level, is stud…
▽ More
Background: The study of additional terms in functionals is relevant to better describe nuclear structure phenomenology. Among these terms, the tensor one is known to impact nuclear structure properties, especially in neutron-rich nuclei. However, its effect has not been studied on the whole nuclear chart yet.
Purpose: The impact of terms corresponding to the tensor at the Hartree level, is studied for infinite nuclear matter as well as deformed nuclei, by develo** new density-dependent functionals including these terms. In particular, we study in details the improvement such a term can bring to the description of specific nuclear observables.
Methods: The framework of covariant energy density functional is used at the Hartree-Bogoliubov level. The free parameters of covariant functionals are optimized by combining Markov-Chain-Monte-Carlo and simplex algorithms.
Results: An improvement of the RMS binding energies, spin-orbit splittings and gaps is obtained over the nuclear chart, including axially deformed ones, when including tensors terms. Small modifications of the potential energy surface and densities are also found. In infinite matter, the Dirac mass is shifted to a larger value, in better agreement with experiments.
Conclusions: Taking into account additional terms corresponding to the tensor terms in the vector-isoscalar channel at the Hartree level, improves the description of nuclear properties, both in nuclei and in nuclear matter.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
Can Calibration Improve Sample Prioritization?
Authors:
Ganesh Tata,
Gautham Krishna Gudur,
Gopinath Chennupati,
Mohammad Emtiyaz Khan
Abstract:
Calibration can reduce overconfident predictions of deep neural networks, but can calibration also accelerate training? In this paper, we show that it can when used to prioritize some examples for performing subset selection. We study the effect of popular calibration techniques in selecting better subsets of samples during training (also called sample prioritization) and observe that calibration…
▽ More
Calibration can reduce overconfident predictions of deep neural networks, but can calibration also accelerate training? In this paper, we show that it can when used to prioritize some examples for performing subset selection. We study the effect of popular calibration techniques in selecting better subsets of samples during training (also called sample prioritization) and observe that calibration can improve the quality of subsets, reduce the number of examples per epoch (by at least 70%), and can thereby speed up the overall training process. We further study the effect of using calibrated pre-trained models coupled with calibration during training to guide sample prioritization, which again seems to improve the quality of samples selected.
△ Less
Submitted 15 November, 2022; v1 submitted 12 October, 2022;
originally announced October 2022.
-
SAM as an Optimal Relaxation of Bayes
Authors:
Thomas Möllenhoff,
Mohammad Emtiyaz Khan
Abstract:
Sharpness-aware minimization (SAM) and related adversarial deep-learning methods can drastically improve generalization, but their underlying mechanisms are not yet fully understood. Here, we establish SAM as a relaxation of the Bayes objective where the expected negative-loss is replaced by the optimal convex lower bound, obtained by using the so-called Fenchel biconjugate. The connection enables…
▽ More
Sharpness-aware minimization (SAM) and related adversarial deep-learning methods can drastically improve generalization, but their underlying mechanisms are not yet fully understood. Here, we establish SAM as a relaxation of the Bayes objective where the expected negative-loss is replaced by the optimal convex lower bound, obtained by using the so-called Fenchel biconjugate. The connection enables a new Adam-like extension of SAM to automatically obtain reasonable uncertainty estimates, while sometimes also improving its accuracy. By connecting adversarial and Bayesian methods, our work opens a new path to robustness.
△ Less
Submitted 10 December, 2023; v1 submitted 4 October, 2022;
originally announced October 2022.
-
Alpha-particle formation and clustering in nuclei
Authors:
E. Khan,
L. Heitz,
F. Mercier,
J. -P. Ebran
Abstract:
The nucleonic localization function has been used for a decade to study the formation of alpha-particles in nuclei, by providing a measure of having nucleons of a given spin in a single place. However, differences in interpretation remain, compared to the nucleonic density of the nucleus. In order to better understand the respective role of the nucleonic localization function and the densities in…
▽ More
The nucleonic localization function has been used for a decade to study the formation of alpha-particles in nuclei, by providing a measure of having nucleons of a given spin in a single place. However, differences in interpretation remain, compared to the nucleonic density of the nucleus. In order to better understand the respective role of the nucleonic localization function and the densities in the alpha-particle formation in cluster states or in alpha-decay mechanism, both an analytic approximation and microscopic calculations, using energy density functionals, are undertaken. The nucleonic localization function is shown to measure the anti-centrifugal effect, and is not sensitive to the level of compactness of the alpha-particle itself. It probes the purity of the spatial overlap of four nucleons in the four possible (spin, isospin) states. The density provides, in addition, information on the compactness of an alpha-particle cluster. The respective roles of the nucleonic localization function and the density are also analyzed in the case of alpha-particle emission. More generally, criteria to assess the prediction of alpha-cluster in nuclear states are provided.
△ Less
Submitted 24 November, 2022; v1 submitted 19 September, 2022;
originally announced September 2022.
-
Clustering in nuclei at finite temperature
Authors:
Esra Yüksel,
Florian Mercier,
Jean-Paul Ebran,
Elias Khan
Abstract:
We investigate the localization and clustering features in $^{20}$Ne ($N=Z$) and neutron-rich $^{32}$Ne nuclei at zero and finite temperatures. The finite temperature Hartree-Bogoliubov theory is used with the relativistic density-dependent meson-nucleon coupling functional DD-ME2. It is shown that clustering features gradually weaken with increasing temperature and disappear when the shape phase…
▽ More
We investigate the localization and clustering features in $^{20}$Ne ($N=Z$) and neutron-rich $^{32}$Ne nuclei at zero and finite temperatures. The finite temperature Hartree-Bogoliubov theory is used with the relativistic density-dependent meson-nucleon coupling functional DD-ME2. It is shown that clustering features gradually weaken with increasing temperature and disappear when the shape phase transition occurs. Considering thermal fluctuations in the density profiles, the clustering features vanish at lower temperatures, compared to the case without thermal fluctuations. The effect of the pairing correlations on the nucleon localization and the formation of cluster structures are also studied at finite temperatures. Due to the inclusion of pairing in the calculations, cluster structures are preserved until the critical temperatures for the shape phase transition are reached. Above the critical temperature of the shape phase transition, the clustering features suddenly disappear, which differs from the results without pairing.
△ Less
Submitted 8 November, 2022; v1 submitted 27 July, 2022;
originally announced July 2022.
-
Nuclear incompressibility and sound speed in uniform matter and finite nuclei
Authors:
Guilherme Grams,
Rahul Somasundaram,
Jerome Margueron,
Elias Khan
Abstract:
We have extended the compressible liquid-drop model (CLDM) with a density-dependent surface term (eCLDM), which allows for a unified description of both the nuclear ground state energies and the incompressibility modulus in finite nuclei $K_A$. We analyse the role of the nuclear empirical parameters, e.g., $K_{sat}$, $Q_{sat}$, $L_{sym}$ and $K_{sym}$, which contribute to the bulk properties, as w…
▽ More
We have extended the compressible liquid-drop model (CLDM) with a density-dependent surface term (eCLDM), which allows for a unified description of both the nuclear ground state energies and the incompressibility modulus in finite nuclei $K_A$. We analyse the role of the nuclear empirical parameters, e.g., $K_{sat}$, $Q_{sat}$, $L_{sym}$ and $K_{sym}$, which contribute to the bulk properties, as well as the role of the finite size contributions. For the bulk properties, the density and isospin dependencies of the nuclear incompressibility in infinite matter are characterized by introducing new empirical parameters, and two new constraints for the value of $K_{sym}$ are suggested. For finite nuclei, we employ a Bayesian approach coupled to a Markov-Chain Monte-Carlo (MCMC) exploration of the parameter space to confront the model predictions of $K_A$ in Zr, Sn and Pb isotopes to the experimental data. We show that $Q_{sat}\approx -950\pm 200$~MeV describes the experimental measurements of $K_A$ in these isotopes. This value is different from the ones deduced from phenomenological nuclear energy density functionals, suggesting a possible explanation of their difficulty to accurately describe Zr, Sn and Pb data all together. In addition we explore the impact of a fictitious measurement of the Giant Monopole Resonance energy in $^{132}$Sn. We show that this measurement, provided it is accurate enough, will allow to better determine $K_{sym}$ and $K_τ$. Finally we explore the properties of the sound speed around saturation density and show the important role of finite size terms in finite nuclei since they reduce the sound speed to approximately half compared to nuclear matter.
△ Less
Submitted 5 July, 2022;
originally announced July 2022.
-
Quantum hybridization negative differential resistance from non-toxic halide perovskite nanowire heterojunctions and its strain control
Authors:
Juho Lee,
Muhammad Ejaz Khan,
Yong-Hoon Kim
Abstract:
While low-dimensional organometal halide perovskites are expected to open up new opportunities for a diverse range of device applications, like in their bulk counterparts, the toxicity of Pb-based halide perovskite materials is a significant concern that hinders their practical use. We recently predicted that lead triiodide (PbI$_3$) columns de-rived from trimethylsulfonium (TMS) lead triiodide (C…
▽ More
While low-dimensional organometal halide perovskites are expected to open up new opportunities for a diverse range of device applications, like in their bulk counterparts, the toxicity of Pb-based halide perovskite materials is a significant concern that hinders their practical use. We recently predicted that lead triiodide (PbI$_3$) columns de-rived from trimethylsulfonium (TMS) lead triiodide (CH$_3$)$_3$SPbI$_3$ (TMSPbI$_3$) by strip** off TMS ligands should be semimetallic, and additionally ultrahigh negative differential resistance (NDR) can arise from the heterojunction composed of a TMSPbI$_3$ channel sandwiched by PbI$_3$ electrodes. Herein, we computationally explore whether similar material and device characteristics can be obtained from other one-dimensional halide perovskites based on non-Pb metal elements, and in doing so deepen the understanding of their mechanistic origins. First, scanning through several candidate metal halide inorganic frameworks as well as their parental form halide perovskites, we find that the germanium triiodide (GeI$_3$) column also assumes a semimetallic character by avoiding the Peierls distortion. Next, adopting the bundled nanowire GeI$_3$-TMSGeI$_3$-GeI$_3$ junction configuration, we obtain a drastically high peak current density and ultrahigh NDR at room temperature. Furthermore, the robustness and controllability of NDR signals under strain are revealed, establishing its potential for flexible electronics applications. It will be emphasized that, despite the performance metrics notably enhanced over those from the PbI$_3$-TMSPbI$_3$-PbI$_3$ case, these device characteristics still arise from the identical quantum hybridization NDR mechanism.
△ Less
Submitted 23 May, 2022;
originally announced May 2022.
-
Ghost sensing: the rise and role of exceptional points in planar geometry
Authors:
Emroz Khan,
Evgenii E. Narimanov
Abstract:
We show the recently discovered ghost waves - a special class of non-uniform electromagnetic waves in biaxial anisotropic media - can be used for optical sensing based on exceptional points. In addition to showing high sensitivity and precision, the proposed sensor employs simple planar geometry and is robust against noise.
We show the recently discovered ghost waves - a special class of non-uniform electromagnetic waves in biaxial anisotropic media - can be used for optical sensing based on exceptional points. In addition to showing high sensitivity and precision, the proposed sensor employs simple planar geometry and is robust against noise.
△ Less
Submitted 8 March, 2022;
originally announced March 2022.
-
Dual Parameterization of Sparse Variational Gaussian Processes
Authors:
Vincent Adam,
Paul E. Chang,
Mohammad Emtiyaz Khan,
Arno Solin
Abstract:
Sparse variational Gaussian process (SVGP) methods are a common choice for non-conjugate Gaussian process inference because of their computational benefits. In this paper, we improve their computational efficiency by using a dual parameterization where each data example is assigned dual parameters, similarly to site parameters used in expectation propagation. Our dual parameterization speeds-up in…
▽ More
Sparse variational Gaussian process (SVGP) methods are a common choice for non-conjugate Gaussian process inference because of their computational benefits. In this paper, we improve their computational efficiency by using a dual parameterization where each data example is assigned dual parameters, similarly to site parameters used in expectation propagation. Our dual parameterization speeds-up inference using natural gradient descent, and provides a tighter evidence lower bound for hyperparameter learning. The approach has the same memory cost as the current SVGP methods, but it is faster and more accurate.
△ Less
Submitted 19 January, 2022; v1 submitted 5 November, 2021;
originally announced November 2021.
-
Derivation of the M$_n$/M$_p$ ratio in exotic nuclei
Authors:
E. Khan
Abstract:
A generalized formula is provided, to calculate the M$_n$/M$_p$ ratio of the multipole transition matrix elements, in the framework of the so-called phenomenological analysis. It takes into account the possible difference between the neutron and proton radii and diffuseness, which can occur, especially in exotic nuclei. The validity domain of the original Bernstein formula is discussed, in the cas…
▽ More
A generalized formula is provided, to calculate the M$_n$/M$_p$ ratio of the multipole transition matrix elements, in the framework of the so-called phenomenological analysis. It takes into account the possible difference between the neutron and proton radii and diffuseness, which can occur, especially in exotic nuclei. The validity domain of the original Bernstein formula is discussed, in the case of the proton scattering probe at few tens of MeV. The largest discrepancies are obtained for very neutron-rich nuclei (N/Z$\gtrsim$1.6) or when the electromagnetic deformation parameter is larger than the proton scattering one. The reduction of the statistical error bars, and the study of very neutron-rich nuclei at facilities of exotic beams of new generation, should favor the use of the generalized Bernstein formula.
△ Less
Submitted 4 October, 2021;
originally announced October 2021.
-
Systematical studies of the E1 photon strength functions combining Skyrme-HFB+QRPA model and experimental giant dipole resonance properties
Authors:
Y. Xu,
S. Goriely,
E. Khan
Abstract:
Valuable theoretical predictions of nuclear dipole excitations in the whole nuclear chart are of great interest for different applications, including in particular nuclear astrophysics. We present here the systematic study of the electric dipole (E1) photon strength functions (PSFs) combining the microscopic Hartree-Fock-Bogoliubov plus Quasiparticle Random Phase Approximation (HFB+QRPA) model and…
▽ More
Valuable theoretical predictions of nuclear dipole excitations in the whole nuclear chart are of great interest for different applications, including in particular nuclear astrophysics. We present here the systematic study of the electric dipole (E1) photon strength functions (PSFs) combining the microscopic Hartree-Fock-Bogoliubov plus Quasiparticle Random Phase Approximation (HFB+QRPA) model and the parametrizations constrained by the available experimental giant dipole resonance (GDR) data. For about 10000 nuclei with 8<Z<124 lying between the proton and the neutron drip-lines on nuclear chart, the particle-hole strength distributions are computed using the HFB+QRPA model under the assumption of spherical symmetry and making use of the BSk27 Skyrme effective interaction derived from the most accurate HFB mass model (HFB-27) so far achieved. Large-scale calculations of the BSk27+QRPA E1 PSFs are performed in the framework of a specific folding procedure, in which three phenomenological improvements are considered. First, two interference factors are introduced and adjusted to reproduce at best the available experimental GDR data. Second, an empirical expression accounting for the deformation effect is applied to describe the peak splitting of the strength function. Third, the width of the strength function is corrected by a temperature-dependent term, which effectively increases the de-excitation photon strength function at low-energy. The E1 PSFs as well as the extracted GDR peaks and widths are compared with available experimental data. A relatively good agreement with data indicates the reliability of the calculations. Eventually, the astrophysical (n,g) rates for all the 10000 nuclei with 8<Z<124 are estimated using the present E1 PSFs. The resulting reaction rates are compared with previous BSk7+QRPA results and Gogny-HFB+QRPA predictions based on the D1M interaction.
△ Less
Submitted 28 September, 2021;
originally announced September 2021.
-
Low-energy monopole strength in spherical and deformed nuclei : cluster and soft modes
Authors:
F. Mercier,
J. -P. Ebran,
E. Khan
Abstract:
Background : Several recent experiments report significant low-energy isoscalar monopole strength, below the giant resonance, in various nuclei. In light $α$-conjugate nuclei, these low-energy resonances were recently interpreted as cluster vibration modes. However, the nature of these excitations in neutron-rich nuclei remain elusive.
Purpose : The present work provides a systematic analysis of…
▽ More
Background : Several recent experiments report significant low-energy isoscalar monopole strength, below the giant resonance, in various nuclei. In light $α$-conjugate nuclei, these low-energy resonances were recently interpreted as cluster vibration modes. However, the nature of these excitations in neutron-rich nuclei remain elusive.
Purpose : The present work provides a systematic analysis of the low-energy monopole strength in isotopic chains, from Neon to Germanium, in order to monitor and understand its nature and conditions of emergence.
Methods : We perform covariant quasiparticle random phase approximation (QRPA) calculations, formulated within the finite amplitude method (FAM), on top of constrained relativistic Hartree-Bogoliubov (RHB) reference states.
Results : Neutron excess leads to the appearance of low-energy excitations according to a systematic pattern reflecting the single-particle features of the underlying RHB reference state. With the onset of deformation, these low-energy resonances get split and give rise to more complex patterns, with possible mixing with the giant resonance. At lower energy, cluster-like excitations found in $N=Z$ systems survive in neutron-rich nuclei, with valence neutrons arranging in molecular-like orbitals. Finally, at very low energy, pair excitations are also found in superfluid nuclei, but remain negligible in most of the cases.
Conclusions : The low-energy part of the monopole strength exhibits various modes, from cluster vibrations ($\sim$ 5-10 MeV) to components of the giant resonance downshifted by the onset of deformation, including soft modes ($\sim$ 10-15 MeV) as well as pair excitation ($<$ 5 MeV), with possible mixing, depending on neutron-excess, deformation, and pairing energy.
△ Less
Submitted 6 September, 2021;
originally announced September 2021.
-
Structured second-order methods via natural gradient descent
Authors:
Wu Lin,
Frank Nielsen,
Mohammad Emtiyaz Khan,
Mark Schmidt
Abstract:
In this paper, we propose new structured second-order methods and structured adaptive-gradient methods obtained by performing natural-gradient descent on structured parameter spaces. Natural-gradient descent is an attractive approach to design new algorithms in many settings such as gradient-free, adaptive-gradient, and second-order methods. Our structured methods not only enjoy a structural invar…
▽ More
In this paper, we propose new structured second-order methods and structured adaptive-gradient methods obtained by performing natural-gradient descent on structured parameter spaces. Natural-gradient descent is an attractive approach to design new algorithms in many settings such as gradient-free, adaptive-gradient, and second-order methods. Our structured methods not only enjoy a structural invariance but also admit a simple expression. Finally, we test the efficiency of our proposed methods on both deterministic non-convex problems and deep learning problems.
△ Less
Submitted 19 February, 2022; v1 submitted 22 July, 2021;
originally announced July 2021.
-
Subset-of-Data Variational Inference for Deep Gaussian-Processes Regression
Authors:
Ayush Jain,
P. K. Srijith,
Mohammad Emtiyaz Khan
Abstract:
Deep Gaussian Processes (DGPs) are multi-layer, flexible extensions of Gaussian processes but their training remains challenging. Sparse approximations simplify the training but often require optimization over a large number of inducing inputs and their locations across layers. In this paper, we simplify the training by setting the locations to a fixed subset of data and sampling the inducing inpu…
▽ More
Deep Gaussian Processes (DGPs) are multi-layer, flexible extensions of Gaussian processes but their training remains challenging. Sparse approximations simplify the training but often require optimization over a large number of inducing inputs and their locations across layers. In this paper, we simplify the training by setting the locations to a fixed subset of data and sampling the inducing inputs from a variational distribution. This reduces the trainable parameters and computation cost without significant performance degradations, as demonstrated by our empirical results on regression problems. Our modifications simplify and stabilize DGP training while making it amenable to sampling schemes for setting the inducing inputs.
△ Less
Submitted 17 July, 2021;
originally announced July 2021.
-
The Bayesian Learning Rule
Authors:
Mohammad Emtiyaz Khan,
Håvard Rue
Abstract:
We show that many machine-learning algorithms are specific instances of a single algorithm called the \emph{Bayesian learning rule}. The rule, derived from Bayesian principles, yields a wide-range of algorithms from fields such as optimization, deep learning, and graphical models. This includes classical algorithms such as ridge regression, Newton's method, and Kalman filter, as well as modern dee…
▽ More
We show that many machine-learning algorithms are specific instances of a single algorithm called the \emph{Bayesian learning rule}. The rule, derived from Bayesian principles, yields a wide-range of algorithms from fields such as optimization, deep learning, and graphical models. This includes classical algorithms such as ridge regression, Newton's method, and Kalman filter, as well as modern deep-learning algorithms such as stochastic-gradient descent, RMSprop, and Dropout. The key idea in deriving such algorithms is to approximate the posterior using candidate distributions estimated by using natural gradients. Different candidate distributions result in different algorithms and further approximations to natural gradients give rise to variants of those algorithms. Our work not only unifies, generalizes, and improves existing algorithms, but also helps us design new ones.
△ Less
Submitted 8 June, 2024; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Knowledge-Adaptation Priors
Authors:
Mohammad Emtiyaz Khan,
Siddharth Swaroop
Abstract:
Humans and animals have a natural ability to quickly adapt to their surroundings, but machine-learning models, when subjected to changes, often require a complete retraining from scratch. We present Knowledge-adaptation priors (K-priors) to reduce the cost of retraining by enabling quick and accurate adaptation for a wide-variety of tasks and models. This is made possible by a combination of weigh…
▽ More
Humans and animals have a natural ability to quickly adapt to their surroundings, but machine-learning models, when subjected to changes, often require a complete retraining from scratch. We present Knowledge-adaptation priors (K-priors) to reduce the cost of retraining by enabling quick and accurate adaptation for a wide-variety of tasks and models. This is made possible by a combination of weight and function-space priors to reconstruct the gradients of the past, which recovers and generalizes many existing, but seemingly-unrelated, adaptation strategies. Training with simple first-order gradient methods can often recover the exact retrained model to an arbitrary accuracy by choosing a sufficiently large memory of the past data. Empirical results show that adaptation with K-priors achieves performance similar to full retraining, but only requires training on a handful of past examples.
△ Less
Submitted 27 October, 2021; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Ground State Properties of Charmed Hypernuclei with Mean Field Approach
Authors:
H. Güven,
K. Bozkurt,
E. Khan,
J. Margueron
Abstract:
Closed shell charmed hypernuclei $^5_{Λ_c}$Li, $^{17}_{Λ_c}$F, $^{41}_{Λ_c}$Sc, $^{57}_{Λ_c}$Cu, $^{133}_{Λ_c}$Sb and $^{209}_{Λ_c}$Bi are calculated within Hartree-Fock approach by using three different force sets derived from microscopic Brueckner-Hartree-Fock calculations of $Λ$ hypernuclei. Ground state properties (binding energies, $Λ_c$ separation energies, $Λ_c$ single particle energies and…
▽ More
Closed shell charmed hypernuclei $^5_{Λ_c}$Li, $^{17}_{Λ_c}$F, $^{41}_{Λ_c}$Sc, $^{57}_{Λ_c}$Cu, $^{133}_{Λ_c}$Sb and $^{209}_{Λ_c}$Bi are calculated within Hartree-Fock approach by using three different force sets derived from microscopic Brueckner-Hartree-Fock calculations of $Λ$ hypernuclei. Ground state properties (binding energies, $Λ_c$ separation energies, $Λ_c$ single particle energies and $Λ_c$ densities) of charmed nuclei are examined. Due to the Coulomb repulsion between protons and the $Λ_c$ baryon, charmed hypernuclei are most bound for $16\leq$A$\leq 41$, where $^{17}_{Λ_c}$F can be considered as an excellent candidate to measure charmed hypernuclei. The competition between the attractive nucleon-$Λ_c$ interaction and the Coulomb repulsion is discussed, and we compare $Λ$ and $Λ_c$ hypernuclei properties.
△ Less
Submitted 22 June, 2021; v1 submitted 8 June, 2021;
originally announced June 2021.
-
Bridging the Gap Between Target Networks and Functional Regularization
Authors:
Alexandre Piché,
Valentin Thomas,
Rafael Pardinas,
Joseph Marino,
Gian Maria Marconi,
Christopher Pal,
Mohammad Emtiyaz Khan
Abstract:
Bootstrap** is behind much of the successes of deep Reinforcement Learning. However, learning the value function via bootstrap** often leads to unstable training due to fast-changing target values. Target Networks are employed to stabilize training by using an additional set of lagging parameters to estimate the target values. Despite the popularity of Target Networks, their effect on the opti…
▽ More
Bootstrap** is behind much of the successes of deep Reinforcement Learning. However, learning the value function via bootstrap** often leads to unstable training due to fast-changing target values. Target Networks are employed to stabilize training by using an additional set of lagging parameters to estimate the target values. Despite the popularity of Target Networks, their effect on the optimization is still misunderstood. In this work, we show that they act as an implicit regularizer which can be beneficial in some cases, but also have disadvantages such as being inflexible and can result in instabilities, even when vanilla TD(0) converges. To overcome these issues, we propose an explicit Functional Regularization alternative that is flexible and a convex regularizer in function space and we theoretically study its convergence. We conduct an experimental study across a range of environments, discount factors, and off-policiness data collections to investigate the effectiveness of the regularization induced by Target Networks and Functional Regularization in terms of performance, accuracy, and stability. Our findings emphasize that Functional Regularization can be used as a drop-in replacement for Target Networks and result in performance improvement. Furthermore, adjusting both the regularization weight and the network update period in Functional Regularization can result in further performance improvements compared to solely adjusting the network update period as typically done with Target Networks. Our approach also enhances the ability to networks to recover accurate $Q$-values.
△ Less
Submitted 7 September, 2023; v1 submitted 4 June, 2021;
originally announced June 2021.
-
A Novel Falling-Ball Algorithm for Image Segmentation
Authors:
Asra Aslam,
Ekram Khan,
Mohammad Samar Ansari,
M. M. Sufyan Beg
Abstract:
Image segmentation refers to the separation of objects from the background, and has been one of the most challenging aspects of digital image processing. Practically it is impossible to design a segmentation algorithm which has 100% accuracy, and therefore numerous segmentation techniques have been proposed in the literature, each with certain limitations. In this paper, a novel Falling-Ball algor…
▽ More
Image segmentation refers to the separation of objects from the background, and has been one of the most challenging aspects of digital image processing. Practically it is impossible to design a segmentation algorithm which has 100% accuracy, and therefore numerous segmentation techniques have been proposed in the literature, each with certain limitations. In this paper, a novel Falling-Ball algorithm is presented, which is a region-based segmentation algorithm, and an alternative to watershed transform (based on waterfall model). The proposed algorithm detects the catchment basins by assuming that a ball falling from hilly terrains will stop in a catchment basin. Once catchment basins are identified, the association of each pixel with one of the catchment basin is obtained using multi-criterion fuzzy logic. Edges are constructed by dividing image into different catchment basins with the help of a membership function. Finally closed contour algorithm is applied to find closed regions and objects within closed regions are segmented using intensity information. The performance of the proposed algorithm is evaluated both objectively as well as subjectively. Simulation results show that the proposed algorithms gives superior performance over conventional Sobel edge detection methods and the watershed segmentation algorithm. For comparative analysis, various comparison methods are used for demonstrating the superiority of proposed methods over existing segmentation methods.
△ Less
Submitted 12 May, 2021; v1 submitted 6 May, 2021;
originally announced May 2021.
-
Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning
Authors:
Alexander Immer,
Matthias Bauer,
Vincent Fortuin,
Gunnar Rätsch,
Mohammad Emtiyaz Khan
Abstract:
Marginal-likelihood based model-selection, even though promising, is rarely used in deep learning due to estimation difficulties. Instead, most approaches rely on validation data, which may not be readily available. In this work, we present a scalable marginal-likelihood estimation method to select both hyperparameters and network architectures, based on the training data alone. Some hyperparamete…
▽ More
Marginal-likelihood based model-selection, even though promising, is rarely used in deep learning due to estimation difficulties. Instead, most approaches rely on validation data, which may not be readily available. In this work, we present a scalable marginal-likelihood estimation method to select both hyperparameters and network architectures, based on the training data alone. Some hyperparameters can be estimated online during training, simplifying the procedure. Our marginal-likelihood estimate is based on Laplace's method and Gauss-Newton approximations to the Hessian, and it outperforms cross-validation and manual-tuning on standard regression and image classification datasets, especially in terms of calibration and out-of-distribution detection. Our work shows that marginal likelihoods can improve generalization and be useful when validation data is unavailable (e.g., in nonstationary settings).
△ Less
Submitted 15 June, 2021; v1 submitted 11 April, 2021;
originally announced April 2021.
-
Tractable structured natural gradient descent using local parameterizations
Authors:
Wu Lin,
Frank Nielsen,
Mohammad Emtiyaz Khan,
Mark Schmidt
Abstract:
Natural-gradient descent (NGD) on structured parameter spaces (e.g., low-rank covariances) is computationally challenging due to difficult Fisher-matrix computations. We address this issue by using \emph{local-parameter coordinates} to obtain a flexible and efficient NGD method that works well for a wide-variety of structured parameterizations. We show four applications where our method (1) genera…
▽ More
Natural-gradient descent (NGD) on structured parameter spaces (e.g., low-rank covariances) is computationally challenging due to difficult Fisher-matrix computations. We address this issue by using \emph{local-parameter coordinates} to obtain a flexible and efficient NGD method that works well for a wide-variety of structured parameterizations. We show four applications where our method (1) generalizes the exponential natural evolutionary strategy, (2) recovers existing Newton-like algorithms, (3) yields new structured second-order algorithms via matrix groups, and (4) gives new algorithms to learn covariances of Gaussian and Wishart-based distributions. We show results on a range of problems from deep learning, variational inference, and evolution strategies. Our work opens a new direction for scalable structured geometric methods.
△ Less
Submitted 17 January, 2022; v1 submitted 15 February, 2021;
originally announced February 2021.
-
Strain-induced metallization and defect suppression at zipper-like interdigitated atomically thin interfaces enabling high-efficiency halide perovskite solar cells
Authors:
Nikolai Tsvetkov,
Byeong Cheul Moon,
Jeung Ku Kang,
Muhammad Ejaz Khan,
Yong-Hoon Kim
Abstract:
Halide perovskite light absorbers have great advantages for photovoltaics such as efficient solar energy absorption, but charge accumulation and recombination at the interface with an electron transport layer (ETL) remains a major challenge in realizing their full potential. Here we report the experimental realization of a zipper-like interdigitated interface between a Pb-based halide perovskite l…
▽ More
Halide perovskite light absorbers have great advantages for photovoltaics such as efficient solar energy absorption, but charge accumulation and recombination at the interface with an electron transport layer (ETL) remains a major challenge in realizing their full potential. Here we report the experimental realization of a zipper-like interdigitated interface between a Pb-based halide perovskite light absorber and an oxide ETL by the PbO cap** of the ETL surface, which produces an atomically thin two-dimensional metallic layer that can significantly enhance the perovskite/ETL charge extraction process. As the atomistic origin of the emergent two-dimensional interfacial metallicity, first-principles calculations performed on the representative MAPbI$_3$/TiO$_2$ interface identify the interfacial strain induced by the simultaneous formation of stretched I-substitutional Pb bonds (and thus Pb-I-Pb bonds bridging MAPbI$_3$ and TiO$_2$) and contracted substitutional Pb-O bonds. Direct and indirect experimental evidences for the presence of interfacial metallic states are provided, and a non-conventional defect-passivating nature of the strained interdigitated perovskite/ETL interface is emphasized. It is experimentally demonstrated that the PbO cap** method is generally applicable to other ETL materials including ZnO and SrTiO$_3$, and that the zipper-like interdigitated metallic interface leads to about two-fold increase in charge extraction rate. Finally, in terms of the photovoltaic efficiency, we observe a volcano-type behavior with the highest performance achieved at the monolayer-level PbO cap**. The method established here might prove to be a general interface engineering approach to realize high-performance perovskite solar cells.
△ Less
Submitted 17 December, 2020;
originally announced December 2020.
-
Signature of a possible $α$-cluster state in $N=Z$ doubly-magic $^{56}$Ni
Authors:
S. Bagchi,
H. Akimune,
J. Gibelin,
M. N. Harakeh,
N. Kalantar-Nayestanaki,
N. L. Achouri,
B. Bastin,
K. Boretzky,
H. Bouzomita,
M. Caamaño,
L. Càceres,
S. Damoy,
F. Delaunay,
B. Fernández-Domínguez,
M. Fujiwara,
U. Garg,
G. F. Grinyer,
O. Kamalou,
E. Khan,
A. Krasznahorkay,
G. Lhoutellier,
J. F. Libin,
S. Lukyanov,
K. Mazurek,
M. A. Najafi
, et al. (14 additional authors not shown)
Abstract:
An inelastic $α$-scattering experiment on the unstable $N=Z$, doubly-magic $^{56}$Ni nucleus was performed in inverse kinematics at an incident energy of 50 A.MeV at GANIL. High multiplicity for $α$-particle emission was observed within the limited phase-space of the experimental setup. This observation cannot be explained by means of the statistical-decay model. The ideal classical gas model at…
▽ More
An inelastic $α$-scattering experiment on the unstable $N=Z$, doubly-magic $^{56}$Ni nucleus was performed in inverse kinematics at an incident energy of 50 A.MeV at GANIL. High multiplicity for $α$-particle emission was observed within the limited phase-space of the experimental setup. This observation cannot be explained by means of the statistical-decay model. The ideal classical gas model at $kT$ = 0.4 MeV reproduces fairly well the experimental momentum distribution and the observed multiplicity of $α$ particles corresponds to an excitation energy around 96 MeV. The method of distributed $mα$-decay ensembles is in agreement with the experimental results if we assume that the $α$-gas state in $^{56}$Ni exists at around $113^{+15}_{-17}$ MeV. These results suggest that there may exist an exotic state consisting of many $α$ particles at the excitation energy of $113^{+15}_{-17}$ MeV.
△ Less
Submitted 29 October, 2020;
originally announced October 2020.
-
The Structure of $^{33}$Si and the magicity of the N=20 gap at Z=14
Authors:
S. Jongile,
A. Lemasson,
O. Sorlin,
M. Wiedeking,
P. Papka,
D. Bazin,
C. Borcea,
R. Borcea,
A. Gade,
H. Iwasaki,
E. Khan,
A. Lepailleur,
A. Mutschler,
F. Nowacki,
F. Recchia,
T. Roger,
F. Rotaru,
M. Stanoiu,
S. R. Stroberg,
J. A. Tostevin,
M. Vandebrouck,
D. Weisshaar,
K. Wimmer
Abstract:
The structure of $^{33}$Si was studied by a one-neutron knockout reaction from a $^{34}$Si beam at 98.5 MeV/u incident on a $^{9}$Be target. The prompt $γ$-rays following the de-excitation of $^{33}$Si were detected using the GRETINA $γ$-ray tracking array while the reaction residues were identified on an event-by-event basis in the focal plane of the S800 spectrometer at NSCL (National Supercondu…
▽ More
The structure of $^{33}$Si was studied by a one-neutron knockout reaction from a $^{34}$Si beam at 98.5 MeV/u incident on a $^{9}$Be target. The prompt $γ$-rays following the de-excitation of $^{33}$Si were detected using the GRETINA $γ$-ray tracking array while the reaction residues were identified on an event-by-event basis in the focal plane of the S800 spectrometer at NSCL (National Superconducting Cyclotron Laboratory). The presently derived spectroscopic factor values, $C^2S$, for the 3/2$^+$ and 1/2$^+$ states, corresponding to a neutron removal from the $0d_{3/2}$ and $1s_{1/2}$ orbitals, agree with shell model calculations and point to a strong $N=20$ shell closure. Three states arising from the more bound $0d_{5/2}$ orbital are proposed, one of which is unbound by about 930 keV. The sensitivity of this experiment has also confirmed a weak population of 9/2$^-$ and 11/2$_{1,2}^-$ final states, which originate from a higher-order process. This mechanism may also have populated, to some fraction, the 3/2$^-$ and 7/2$^-$ negative-parity states, which hinders a determination of the $C^2S$ values for knockout from the normally unoccupied $1p_{3/2}$ and $0f_{7/2}$ orbits.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Impact of 150keV and 590keV proton irradiation on monolayer MoS2
Authors:
Burcu Ozden,
Ethan Khan,
Sunil Uprety,
Tianyi Zhang,
Joseph Razon,
Ke Wang,
Tamara Isaacs-Smith,
Minseo Park,
Mauricio Terrones
Abstract:
We present a comprehensive study on the effects of proton irradiation at different energies (150 and 590 keV) with the fluence of 1x 1012 proton/cm2 on monolayer MoS2. This study not only improves our understanding of the influence of high-energy proton beams on MoS2 but also has implications for radiation-induced changes in device processing and engineering of devices from multilayer MoS2 startin…
▽ More
We present a comprehensive study on the effects of proton irradiation at different energies (150 and 590 keV) with the fluence of 1x 1012 proton/cm2 on monolayer MoS2. This study not only improves our understanding of the influence of high-energy proton beams on MoS2 but also has implications for radiation-induced changes in device processing and engineering of devices from multilayer MoS2 starting material. Increasing defect density with decreasing proton irradiation energy was observed from photoluminescence spectroscopy study. These defects are attributed to sulfur vacancies observed through x-ray photoelectron spectroscopy analysis and confirmed by transmission electron microscope imaging. Scanning electron microscopy images showed the creation of grain boundaries after proton irradiation. A higher degree of surface deformation was detected with lower irradiation energies through atomic force microscopy. Inter-defect distance is increased with the increase in proton energy irradiation as estimated by transmission electron microscopy imaging. Raman spectroscopy reveals negligible structural changes in the crystal quality after the irradiation. These deformation damages due to proton irradiation are insignificant at the MoS2 layer. Based on the overall influence of low energy proton irradiation on the material characteristics, ML-MoS2 materials can be considered robust and reliable building blocks for 2D material based devices for space applications.
△ Less
Submitted 11 August, 2020;
originally announced August 2020.
-
Low-energy cluster vibrations in N = Z nuclei
Authors:
F. Mercier,
A. Bjelčić,
T. Nikšić,
J. -P. Ebran,
E. Khan,
D. Vretenar
Abstract:
Significant transition strength in light $α$-conjugate nuclei at low energy, typically below 10 MeV, has been observed in many experiments. In this work the isoscalar low-energy response of N=Z nuclei is explored using the Finite Amplitude Method (FAM) based on the microscopic framework of nuclear energy density functionals. Depending on the multipolarity of the excitation and the equilibrium defo…
▽ More
Significant transition strength in light $α$-conjugate nuclei at low energy, typically below 10 MeV, has been observed in many experiments. In this work the isoscalar low-energy response of N=Z nuclei is explored using the Finite Amplitude Method (FAM) based on the microscopic framework of nuclear energy density functionals. Depending on the multipolarity of the excitation and the equilibrium deformation of a particular isotope, the low-energy strength functions display prominent peaks that can be attributed to vibration of cluster structures: $α$+$^{12}$C+$α$ and $α$+$^{16}$O in $^{20}$Ne, $^{12}$C+$^{12}$C in $^{24}$Mg, 4$α$+$^{12}$C in $^{28}$Si, etc. Such cluster excitations are favored in light nuclei with large deformation.
△ Less
Submitted 27 July, 2020;
originally announced July 2020.
-
Fast Variational Learning in State-Space Gaussian Process Models
Authors:
Paul E. Chang,
William J. Wilkinson,
Mohammad Emtiyaz Khan,
Arno Solin
Abstract:
Gaussian process (GP) regression with 1D inputs can often be performed in linear time via a stochastic differential equation formulation. However, for non-Gaussian likelihoods, this requires application of approximate inference methods which can make the implementation difficult, e.g., expectation propagation can be numerically unstable and variational inference can be computationally inefficient.…
▽ More
Gaussian process (GP) regression with 1D inputs can often be performed in linear time via a stochastic differential equation formulation. However, for non-Gaussian likelihoods, this requires application of approximate inference methods which can make the implementation difficult, e.g., expectation propagation can be numerically unstable and variational inference can be computationally inefficient. In this paper, we propose a new method that removes such difficulties. Building upon an existing method called conjugate-computation variational inference, our approach enables linear-time inference via Kalman recursions while avoiding numerical instabilities and convergence issues. We provide an efficient JAX implementation which exploits just-in-time compilation and allows for fast automatic differentiation through large for-loops. Overall, our approach leads to fast and stable variational inference in state-space GP models that can be scaled to time series with millions of data points.
△ Less
Submitted 17 July, 2020; v1 submitted 9 July, 2020;
originally announced July 2020.