-
Analyzing the Correlation Between Thermal and Kinematic Parameters in Various Multiplicity Classes within 7 and 13 TeV pp Collisions
Authors:
Muhammad Waqas,
Wolfgang Bietenholz,
Mohamed Bouzidi,
Muhammad Ajaz,
Abd Al Karim Haj Ismail,
Taoufik Saidani
Abstract:
We investigate the transverse momentum spectra of identified particles at 7 TeV and 13 TeV in pp collisions in the framework of the blast wave model with Tsallis statistics (TBW). Based on experimental data by ALICE Collaboration, we observe that the model describes the $p_T$ spectra well with the common Tsallis temperature (T) and flow velocity (β_T) but separate non-extensive parameters (q) for…
▽ More
We investigate the transverse momentum spectra of identified particles at 7 TeV and 13 TeV in pp collisions in the framework of the blast wave model with Tsallis statistics (TBW). Based on experimental data by ALICE Collaboration, we observe that the model describes the $p_T$ spectra well with the common Tsallis temperature (T) and flow velocity (β_T) but separate non-extensive parameters (q) for baryons and mesons. The parameter dependence on multiplicity as well as on collision energy is investigated, and a strong dependence on the former while a weak dependence on the latter is reported. The extracted parameters in this work consist of the initial temperature (T_i), the average transverse momentum (<p_T>), the T, β_T, and the q. These parameters are found to increase a little with increasing energy, however, they (except the parameter q) decrease significantly with decreasing multiplicity. We observe that $β_T$ drops to zero after the multiplicity class VII, while, $T$ and $q$ do not change their behavior. Furthermore, our analysis explore the correlations among different parameters, including associations with the charged particle multiplicity per unit pseudorapidity (dN_{ch}/dη). The correlation between T and beta_T, T and dN_{ch}/dη, β_T and dN_{ch}/dη, T_i and <p_T> and T_i and dN_{ch}/dηdemonstrates a positive relationship, while, the correlation between T and q-1, and q-1 and dN_{ch}/dηis negative. Finally, we implement an extra flow correction on the T parameter. Our findings reveal that the Doppler-corrected temperature parameter aligns closely with the T in scenarios with lower multiplicities. However, as the multiplicity increases, a noticeable divergence emerges between these parameters, indicating a widening separation between them.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Investigation of the freeze-out parameters in B-B, O-O, Ca-Ca and Au-Au collisions at 39 GeV
Authors:
Muhammad Waqas,
Guang Xiong Peng,
Fu-Hu Liu,
Muhammad Ajaz,
Abd Al Karim Haj Ismail
Abstract:
We analyzed the transverse momentum spectra of proton, deuteron and triton in Boron-Boron (B-B), Oxygen-Oxygen (O-O), and Calcium-Calcium (Ca-Ca) central collisions, as well as in several centrality bins in Gold-Gold (Au-Au) collisions at 39 GeV by using the blast wave model with Tsallis statistics. The bulk properties in terms of kinetic freeze-out temperature, transverse flow velocity and kineti…
▽ More
We analyzed the transverse momentum spectra of proton, deuteron and triton in Boron-Boron (B-B), Oxygen-Oxygen (O-O), and Calcium-Calcium (Ca-Ca) central collisions, as well as in several centrality bins in Gold-Gold (Au-Au) collisions at 39 GeV by using the blast wave model with Tsallis statistics. The bulk properties in terms of kinetic freeze-out temperature, transverse flow velocity and kinetic freeze-out volume are extracted from the model by the least square method. We observed that with increasing the rest mass of the particle, the kinetic freeze-out temperature becomes larger, while transverse flow velocity and the kinetic freeze-out volume reduces. These parameters are also found to depend on the size of the system. Larger the size of the system, the larger they are. Furthermore, the kinetic freeze-out temperature in peripheral Au-Au collisions is close to the central O-O collisions. We also observed that the above parameters depend on the centrality, and they decrease from central to peripheral collisions. Besides, we also extracted the entropy-index parameter $q$, and the parameter $N_0$ which shows the multiplicity. Both of them depend on the size of interacting the system, rest mass of the particle and centrality. Both $q$ and $N_0$ are larger for lighter particles, and the former is smaller for large systems while the latter is larger, and the former decrease with increasing centrality while the latter increase.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
Particle species and energy dependencies of freeze-out parameters in high-energy proton-proton collisions
Authors:
Muhammad Waqas,
Guang Xiong Peng,
Fu-Hu Liu,
Muhammad Ajaz,
Abd Al Karim Haj Ismail,
Khusniddin K. Olimov,
Abdel Nasser Tawfik
Abstract:
We used blast wave model with Tsallis statistics to analyze the experimental data measured by ALICE Collaboration in proton-proton collisions at Large Hadron Collider and extracted the related parameters (kinetic freeze-out temperature, transverse flow velocity and kinetic freeze-out volume of emission source) from transverse momentum spectra of the particles. We found that the kinetic freeze-out…
▽ More
We used blast wave model with Tsallis statistics to analyze the experimental data measured by ALICE Collaboration in proton-proton collisions at Large Hadron Collider and extracted the related parameters (kinetic freeze-out temperature, transverse flow velocity and kinetic freeze-out volume of emission source) from transverse momentum spectra of the particles. We found that the kinetic freeze-out temperature and kinetic freeze-out volume are mass dependent. The former increase while the latter decrease with the particle mass which is the evidence of a mass as well as volume differential kinetic freeze-out scenario. Furthermore we extracted the mean transverse momentum and initial temperature by an indirect method and observed that they increase with mass of the particles. All the above discussed parameters are observed to increase with energy. Triton ($t$), hyper-triton (${^3_{\barΛ} H}$) and helion (${^3 He}$) and their anti-matter are observed to freeze-out at the same time due to isospin symmetry.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
Interpretable Mixture of Experts
Authors:
Aya Abdelsalam Ismail,
Sercan Ö. Arik,
**sung Yoon,
Ankur Taly,
Soheil Feizi,
Tomas Pfister
Abstract:
The need for reliable model explanations is prominent for many machine learning applications, particularly for tabular and time-series data as their use cases often involve high-stakes decision making. Towards this goal, we introduce a novel interpretable modeling framework, Interpretable Mixture of Experts (IME), that yields high accuracy, comparable to `black-box' Deep Neural Networks (DNNs) in…
▽ More
The need for reliable model explanations is prominent for many machine learning applications, particularly for tabular and time-series data as their use cases often involve high-stakes decision making. Towards this goal, we introduce a novel interpretable modeling framework, Interpretable Mixture of Experts (IME), that yields high accuracy, comparable to `black-box' Deep Neural Networks (DNNs) in many cases, along with useful interpretability capabilities. IME consists of an assignment module and a mixture of experts, with each sample being assigned to a single expert for prediction. We introduce multiple options for IME based on the assignment and experts being interpretable. When the experts are chosen to be interpretable such as linear models, IME yields an inherently-interpretable architecture where the explanations produced by IME are the exact descriptions of how the prediction is computed. In addition to constituting a standalone inherently-interpretable architecture, IME has the premise of being integrated with existing DNNs to offer interpretability to a subset of samples while maintaining the accuracy of the DNNs. Through extensive experiments on 15 tabular and time-series datasets, IME is demonstrated to be more accurate than single interpretable models and perform comparably with existing state-of-the-art DNNs in accuracy. On most datasets, IME even outperforms DNNs, while providing faithful explanations. Lastly, IME's explanations are compared to commonly-used post-hoc explanations methods through a user study -- participants are able to better predict the model behavior when given IME explanations, while finding IME's explanations more faithful and trustworthy.
△ Less
Submitted 25 May, 2023; v1 submitted 5 June, 2022;
originally announced June 2022.
-
Extraction of freezeout parameters and their dependence on collision energy and collision cross-section
Authors:
Muhammad Waqas,
Guang-Xiong Peng,
Muhammad Ajaz,
Abd Al Karim Haj Ismail,
Pei-Pin Yang,
Zafar Wazir
Abstract:
We used the Blast wave model with Boltzmann Gibbs statistics and analyzed the experimental data of transverse momentum spectra ($p_T$) measured by NA61/SHINE and NA 49 Collaborations in inelastic (INEL) proton-proton, and the most central Beryllium-Beryllium (Be-Be), Argon-Scandium (Ar-Sc) and Lead-Lead (Pb-Pb) collisions. The model results fit the experimental data of NA61/SHINE and NA 49 Collabo…
▽ More
We used the Blast wave model with Boltzmann Gibbs statistics and analyzed the experimental data of transverse momentum spectra ($p_T$) measured by NA61/SHINE and NA 49 Collaborations in inelastic (INEL) proton-proton, and the most central Beryllium-Beryllium (Be-Be), Argon-Scandium (Ar-Sc) and Lead-Lead (Pb-Pb) collisions. The model results fit the experimental data of NA61/SHINE and NA 49 Collaborations very well. We extracted kinetic freezeout temperature, transverse flow velocity and kinetic freezeout volume directly from the spectra. We also calculated mean transverse momentum and initial temperature from the fit function. It is observed that the kinetic freezeout temperature increases with increasing the collision energy as well as collision cross-section (size of the colliding system). Furthermore, the transverse flow remains unchanged with increasing the collision energy, while it changes randomly with the collision cross-section. Similarly, with the increase in collision energy or the collision cross-section, the freeze-out volume and the average $p_T$ increase. The initial temperature is also observed to be an increasing function of the collision cross-section.
△ Less
Submitted 1 December, 2021;
originally announced December 2021.
-
Improving Deep Learning Interpretability by Saliency Guided Training
Authors:
Aya Abdelsalam Ismail,
Héctor Corrada Bravo,
Soheil Feizi
Abstract:
Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a {\it saliency guided training}procedure for neural networks to reduce noisy gra…
▽ More
Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in unfaithful feature attributions. In this paper, we tackle this issue and introduce a {\it saliency guided training}procedure for neural networks to reduce noisy gradients used in predictions while retaining the predictive performance of the model. Our saliency guided training procedure iteratively masks features with small and potentially noisy gradients while maximizing the similarity of model outputs for both masked and unmasked inputs. We apply the saliency guided training procedure to various synthetic and real data sets from computer vision, natural language processing, and time series across diverse neural architectures, including Recurrent Neural Networks, Convolutional Networks, and Transformers. Through qualitative and quantitative evaluations, we show that saliency guided training procedure significantly improves model interpretability across various domains while preserving its predictive performance.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
Freezeout properties of different light nuclei at the RHIC Beam Energy Scan
Authors:
M. Waqas,
G. X. Peng,
Rui-Qin Wang,
Muhammad Ajaz,
Abd Al Karim Haj Ismail
Abstract:
We study the transverse momentum spectra of light nuclei (deuteron, anti-deuteron and triton) produced in Gold-Gold (Au-Au) collisions in different centrality bins by the blast wave model with Tsallis statistics. The model results are in agreement with the experimental data measured by STAR Collaboration in special transverse momentum ranges. We extracted the kinetic freezeout temperature, transve…
▽ More
We study the transverse momentum spectra of light nuclei (deuteron, anti-deuteron and triton) produced in Gold-Gold (Au-Au) collisions in different centrality bins by the blast wave model with Tsallis statistics. The model results are in agreement with the experimental data measured by STAR Collaboration in special transverse momentum ranges. We extracted the kinetic freezeout temperature, transverse flow velocity and kinetic freezeout volume. It is observed that kinetic freezeout temperature and transverse flow velocity increases initially, and then saturates from 14.5-39 GeV, while the kinetic freezeout volume increase initially up to 19.6 GeV but saturates from 19.6-39 GeV. This may indicate that the phase transition starts in part volume that ends in the whole volume at 39 GeV and the critical point may exists somewhere in the energy range of 14.5-39 GeV. The present work observed that the kinetic freezeout temperature, transverse flow velocity and kinetic freezeout volume has a decreasing trend from central to peripheral collisions. We found the freezeout volume of triton is smaller than those of deuteron and anti-deuteron, which shows that triton freezeout earlier than that of deuteron and anti-deuteron.
△ Less
Submitted 18 October, 2021;
originally announced October 2021.
-
Study of kinetic freeze-out parameters as function of rapidity in pp collisions at CERN SPS energies
Authors:
Muhammad Waqas,
H. M. Chen,
Guang Xiong Pen,
Abd Al Karim Haj Ismail,
Muhammad Ajaz,
Zafar Wazir,
Ramoona Shehzadi,
Sabiha Jamal,
Atef AbdelKader
Abstract:
We used the blast wave model with Boltzmann Gibbs statistics and analyzed the experimental data measured by NA61/SHINE Collaboration in inelastic (INEL) proton-proton collisions at different rapidity slices at different center-of-mass energies. The particles used in this study are pion, kaon, proton and anti-proton. We extracted kinetic freeze-out temperature, transverse flow velocity and kinetic…
▽ More
We used the blast wave model with Boltzmann Gibbs statistics and analyzed the experimental data measured by NA61/SHINE Collaboration in inelastic (INEL) proton-proton collisions at different rapidity slices at different center-of-mass energies. The particles used in this study are pion, kaon, proton and anti-proton. We extracted kinetic freeze-out temperature, transverse flow velocity and kinetic freeze-out volume from the transverse momentum spectra of the particles. We observed that the kinetic freeze-out temperature is rapidity and energy dependent, while transverse flow velocity does not depend on them. Furthermore, we observed that the kinetic freeze-out volume is energy dependent but it remains constant with changing the rapidity. We also observed that all these three parameters are mass dependent. In addition, with the increase of mass, the kinetic freeze-out temperature increases, and the transverse flow velocity as well as kinetic freeze-out volume decreases.
△ Less
Submitted 13 September, 2021;
originally announced September 2021.
-
Improving Multimodal Accuracy Through Modality Pre-training and Attention
Authors:
Aya Abdelsalam Ismail,
Mahmudul Hasan,
Faisal Ishtiaq
Abstract:
Training a multimodal network is challenging and it requires complex architectures to achieve reasonable performance. We show that one reason for this phenomena is the difference between the convergence rate of various modalities. We address this by pre-training modality-specific sub-networks in multimodal architectures independently before end-to-end training of the entire network. Furthermore, w…
▽ More
Training a multimodal network is challenging and it requires complex architectures to achieve reasonable performance. We show that one reason for this phenomena is the difference between the convergence rate of various modalities. We address this by pre-training modality-specific sub-networks in multimodal architectures independently before end-to-end training of the entire network. Furthermore, we show that the addition of an attention mechanism between sub-networks after pre-training helps identify the most important modality during ambiguous scenarios boosting the performance. We demonstrate that by performing these two tricks a simple network can achieve similar performance to a complicated architecture that is significantly more expensive to train on multiple tasks including sentiment analysis, emotion recognition, and speaker trait recognition.
△ Less
Submitted 11 November, 2020;
originally announced November 2020.
-
Benchmarking Deep Learning Interpretability in Time Series Predictions
Authors:
Aya Abdelsalam Ismail,
Mohamed Gunady,
Héctor Corrada Bravo,
Soheil Feizi
Abstract:
Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their applications to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Re…
▽ More
Saliency methods are used extensively to highlight the importance of input features in model predictions. These methods are mostly used in vision and language tasks, and their applications to time series data is relatively unexplored. In this paper, we set out to extensively compare the performance of various saliency-based interpretability methods across diverse neural architectures, including Recurrent Neural Network, Temporal Convolutional Networks, and Transformers in a new benchmark of synthetic time series data. We propose and report multiple metrics to empirically evaluate the performance of saliency methods for detecting feature importance over time using both precision (i.e., whether identified features contain meaningful signals) and recall (i.e., the number of features with signal identified as important). Through several experiments, we show that (i) in general, network architectures and saliency methods fail to reliably and accurately identify feature importance over time in time series data, (ii) this failure is mainly due to the conflation of time and feature domains, and (iii) the quality of saliency maps can be improved substantially by using our proposed two-step temporal saliency rescaling (TSR) approach that first calculates the importance of each time step before calculating the importance of each feature at a time step.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
QC-Automator: Deep Learning-based Automated Quality Control for Diffusion MR Images
Authors:
Zahra Riahi Samani,
Jacob Antony Alappatt,
Drew Parker,
Abdol Aziz Ould Ismail,
Ragini Verma
Abstract:
Quality assessment of diffusion MRI (dMRI) data is essential prior to any analysis, so that appropriate pre-processing can be used to improve data quality and ensure that the presence of MRI artifacts do not affect the results of subsequent image analysis. Manual quality assessment of the data is subjective, possibly error-prone, and infeasible, especially considering the growing number of consort…
▽ More
Quality assessment of diffusion MRI (dMRI) data is essential prior to any analysis, so that appropriate pre-processing can be used to improve data quality and ensure that the presence of MRI artifacts do not affect the results of subsequent image analysis. Manual quality assessment of the data is subjective, possibly error-prone, and infeasible, especially considering the growing number of consortium-like studies, underlining the need for automation of the process. In this paper, we have developed a deep-learning-based automated quality control (QC) tool, QC-Automator, for dMRI data, that can handle a variety of artifacts such as motion, multiband interleaving, ghosting, susceptibility, herringbone and chemical shifts. QC-Automator uses convolutional neural networks along with transfer learning to train the automated artifact detection on a labeled dataset of ~332000 slices of dMRI data, from 155 unique subjects and 5 scanners with different dMRI acquisitions, achieving a 98% accuracy in detecting artifacts. The method is fast and paves the way for efficient and effective artifact detection in large datasets. It is also demonstrated to be replicable on other datasets with different acquisition parameters.
△ Less
Submitted 15 November, 2019;
originally announced November 2019.
-
Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks
Authors:
Aya Abdelsalam Ismail,
Mohamed Gunady,
Luiz Pessoa,
Héctor Corrada Bravo,
Soheil Feizi
Abstract:
Recent efforts to improve the interpretability of deep neural networks use saliency to characterize the importance of input features to predictions made by models. Work on interpretability using saliency-based methods on Recurrent Neural Networks (RNNs) has mostly targeted language tasks, and their applicability to time series data is less understood. In this work we analyze saliency-based methods…
▽ More
Recent efforts to improve the interpretability of deep neural networks use saliency to characterize the importance of input features to predictions made by models. Work on interpretability using saliency-based methods on Recurrent Neural Networks (RNNs) has mostly targeted language tasks, and their applicability to time series data is less understood. In this work we analyze saliency-based methods for RNNs, both classical and gated cell architectures. We show that RNN saliency vanishes over time, biasing detection of salient features only to later time steps and are, therefore, incapable of reliably detecting important features at arbitrary time intervals. To address this vanishing saliency problem, we propose a novel RNN cell structure (input-cell attention), which can extend any RNN cell architecture. At each time step, instead of only looking at the current input vector, input-cell attention uses a fixed-size matrix embedding, each row of the matrix attending to different inputs from current or previous time steps. Using synthetic data, we show that the saliency map produced by the input-cell attention RNN is able to faithfully detect important features regardless of their occurrence in time. We also apply the input-cell attention RNN on a neuroscience task analyzing functional Magnetic Resonance Imaging (fMRI) data for human subjects performing a variety of tasks. In this case, we use saliency to characterize brain regions (input features) for which activity is important to distinguish between tasks. We show that standard RNN architectures are only capable of detecting important brain regions in the last few time steps of the fMRI data, while the input-cell attention model is able to detect important brain region activity across time without latter time step biases.
△ Less
Submitted 27 October, 2019;
originally announced October 2019.
-
Improving Long-Horizon Forecasts with Expectation-Biased LSTM Networks
Authors:
Aya Abdelsalam Ismail,
Timothy Wood,
Héctor Corrada Bravo
Abstract:
State-of-the-art forecasting methods using Recurrent Neural Net- works (RNN) based on Long-Short Term Memory (LSTM) cells have shown exceptional performance targeting short-horizon forecasts, e.g given a set of predictor features, forecast a target value for the next few time steps in the future. However, in many applica- tions, the performance of these methods decays as the forecasting horizon ex…
▽ More
State-of-the-art forecasting methods using Recurrent Neural Net- works (RNN) based on Long-Short Term Memory (LSTM) cells have shown exceptional performance targeting short-horizon forecasts, e.g given a set of predictor features, forecast a target value for the next few time steps in the future. However, in many applica- tions, the performance of these methods decays as the forecasting horizon extends beyond these few time steps. This paper aims to explore the challenges of long-horizon forecasting using LSTM networks. Here, we illustrate the long-horizon forecasting problem in datasets from neuroscience and energy supply management. We then propose expectation-biasing, an approach motivated by the literature of Dynamic Belief Networks, as a solution to improve long-horizon forecasting using LSTMs. We propose two LSTM ar- chitectures along with two methods for expectation biasing that significantly outperforms standard practice.
△ Less
Submitted 18 April, 2018;
originally announced April 2018.
-
Enhancement of local electric field in core-shell orientation of ellipsoidal metal/dielectric nanoparticles
Authors:
A. A. Ismail,
A. V. Gholap,
Y. A. Abbo
Abstract:
In this paper it is shown that the enhancement factor of the local electric field in metal covered ellipsoidal nanoparticles embedded in a dielectric host matrix has two maxima at two different frequencies. The second maximum for the metal covered inclusions with large dielectric core (small metal fraction $p$) is comparatively large. This maximum strongly depends on the depolarization factor of t…
▽ More
In this paper it is shown that the enhancement factor of the local electric field in metal covered ellipsoidal nanoparticles embedded in a dielectric host matrix has two maxima at two different frequencies. The second maximum for the metal covered inclusions with large dielectric core (small metal fraction $p$) is comparatively large. This maximum strongly depends on the depolarization factor of the core $L_{z}^{(1)}$, kee** that of the shell $L_{z}^{(2)}$ constant and is less than $L_{z}^{(1)}$. If the frequency of the external radiation approaches the frequency of surface plasmons of a metal, the local field in the particle considerably increases. The importance of maximum value of enhancement factor $|A|^{2}$ of the ellipsoidal inclusion is emphasized in the case where the dielectric core exceeds metal fraction of the inclusion. The results of numerical computations for typical small silver particles are presented graphically.
△ Less
Submitted 22 June, 2017;
originally announced June 2017.
-
Local field enhancement at the core of cylindrical nanoinclusions embedded in a linear dielectric host matrix
Authors:
Y. A. Abbo,
V. N. Mal'nev,
A. A. Ismail
Abstract:
In this paper we have discussed theoretical concepts and presented numerical results of local field enhancement at the core of different assemblages of metal/dielectric cylindrical nanoinclusions embedded in a linear dielectric host matrix. The obtained results show that for a composite with metal coated inclusions there exist two peak values of the enhancement factor at two different resonant fre…
▽ More
In this paper we have discussed theoretical concepts and presented numerical results of local field enhancement at the core of different assemblages of metal/dielectric cylindrical nanoinclusions embedded in a linear dielectric host matrix. The obtained results show that for a composite with metal coated inclusions there exist two peak values of the enhancement factor at two different resonant frequencies. The existence of the second maxima becomes more important for a larger volume fraction of the metal part of the inclusion. For dielectric coated metal core inclusions and pure metal inclusions there is only one resonant frequency and one peak value of the enhancement factor. The enhancement of an electromagnetic wave is promising for the existence of nonlinear optical phenomena such as optical bistability which is important in optical communication and in optical computing such as optical switch and memory elements.
△ Less
Submitted 15 September, 2016;
originally announced September 2016.