-
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop
Authors:
Anum Afzal,
Alexander Kowsik,
Rajna Fani,
Florian Matthes
Abstract:
Large Language Models have found application in various mundane and repetitive tasks including Human Resource (HR) support. We worked with the domain experts of SAP SE to develop an HR support chatbot as an efficient and effective tool for addressing employee inquiries. We inserted a human-in-the-loop in various parts of the development cycles such as dataset collection, prompt optimization, and evaluation of generated output. By enhancing the LLM-driven chatbot's response quality and exploring alternative retrieval methods, we have created an efficient, scalable, and flexible tool for HR professionals to address employee inquiries effectively. Our experiments and evaluation conclude that GPT-4 outperforms other models and can overcome inconsistencies in data through internal reasoning capabilities. Additionally, through expert analysis, we infer that reference-free evaluation metrics such as G-Eval and Prometheus demonstrate reliability closely aligned with that of human evaluation.
Submitted 8 July, 2024;
originally announced July 2024.
-
REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates
Authors:
Arshia Afzal,
Grigorios Chrysos,
Volkan Cevher,
Mahsa Shoaran
Abstract:
EEG-based seizure detection models face challenges in terms of inference speed and memory efficiency, limiting their real-time implementation in clinical devices. This paper introduces a novel graph-based residual state update mechanism (REST) for real-time EEG signal analysis in applications such as epileptic seizure detection. By leveraging a combination of graph neural networks and recurrent structures, REST efficiently captures both non-Euclidean geometry and temporal dependencies within EEG data. Our model demonstrates high accuracy in both seizure detection and classification tasks. Notably, REST achieves a remarkable 9-fold acceleration in inference speed compared to state-of-the-art models, while simultaneously demanding substantially less memory than the smallest model employed for this task. These attributes position REST as a promising candidate for real-time implementation in clinical devices, such as Responsive Neurostimulation or seizure alert systems.
Submitted 3 June, 2024;
originally announced June 2024.
-
Primordial Black Holes and Scalar-induced Gravitational Waves in Radiative Hybrid Inflation
Authors:
Adeela Afzal,
Anish Ghoshal
Abstract:
We study the possibility that primordial black holes (PBHs) can be formed from large curvature perturbations generated during the waterfall phase transition due to the effects of one-loop radiative corrections of Yukawa couplings between the inflaton and a dark fermion in a non-supersymmetric hybrid inflationary model. We obtain a spectral index $n_s$ and a tensor-to-scalar ratio $r$ consistent with the current Planck data. Our findings show that the abundance of PBHs is correlated with the dark fermion mass $m_N$ and the peak in the GW spectrum. We identify parameter space where PBHs can be the entire dark matter (DM) candidate of the universe or a fraction of it. Our predictions are consistent with existing constraints on PBHs from microlensing, BBN, CMB, etc. Moreover, the scenario is also testable via induced gravitational waves (GWs) from first-order scalar perturbations detectable in future observatories such as LISA and ET. For instance, with inflaton mass $m \sim 2\times 10^{12}$ GeV and $m_N \sim 5.4\times 10^{15}$ GeV, we obtain PBHs of around $10^{-13}\, M_\odot$ mass that can explain the entire abundance of DM and predict GWs with amplitude $\Omega_{\rm GW}h^2 \sim 10^{-9}$ and peak frequency $f \sim 0.1$ Hz in LISA. By explicitly estimating fine-tuning we show the model has very mild tuning. We discuss successful reheating at the end of the inflationary phase via the conversion of the waterfall field into standard model (SM) particles. We also briefly speculate on a scenario where the dark fermion can be the heavy right-handed neutrino (RHN) which is responsible for generating the SM neutrino masses via the seesaw mechanism. The RHN can be produced by waterfall field decay, and its subsequent decay may also explain the observed baryon asymmetry in the universe via leptogenesis.
Submitted 9 February, 2024;
originally announced February 2024.
-
Gravitational wave emission from metastable current-carrying strings in $E_6$
Authors:
Adeela Afzal,
Qaisar Shafi,
Amit Tiwari
Abstract:
We discuss $E_6$ based extensions of the Standard Model (SM) containing two varieties of superheavy metastable cosmic strings (CSs) that respectively have neutral and electrically charged current carriers. We employ an extended version of the velocity-dependent one-scale (VOS) model, recently discussed by some authors, to estimate the gravitational wave (GW) spectrum emitted by metastable strings with a dimensionless string tension $G\mu \approx 10^{-6}$ that carry a right-handed neutrino (RHN) current. We find that with a low to moderate amount of current, the spectrum is compatible with the LIGO O3 run and also consistent at the $1\sigma$ level with the recent PTA signals.
Submitted 9 February, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
Study of linear energy transfer effect on rib fracture in breast patients receiving pencil-beam-scanning proton therapy
Authors:
Yunze Yang,
Kimberly R. Gergelis,
Jiajian Shen,
Arslan Afzal,
Trey C. Mullikin,
Robert W. Gao,
Khaled Aziz,
Dean A. Shumway,
Kimberly S. Corbin,
Wei Liu,
Robert W. Mutter
Abstract:
Purpose: To study the effect of proton linear energy transfer (LET) on rib fracture in breast cancer patients treated with pencil-beam scanning proton therapy (PBS) using a novel tool of dose-LET volume histogram (DLVH).
Methods: From a prospective registry of patients treated with post-mastectomy proton therapy to the chest wall and regional lymph nodes for breast cancer between 2015 and 2020, we retrospectively identified rib fracture cases detected after completing treatment. Contemporaneously treated control patients who did not develop rib fracture were matched to fracture patients 2:1 considering prescription dose, boost location, reconstruction status, laterality, chest wall thickness, and treatment year. The DLVH index, V(d, l), defined as the volume (V) of the structure receiving at least dose (d) and LET (l), was calculated. DLVH plots of the fracture and control groups were compared. A conditional logistic regression (CLR) model was used to establish the relation between V(d, l) and the observed fracture at each combination of d and l. The p-value derived from the CLR model shows the statistical difference between fracture patients and the matched control group. Using the 2D p-value map, the DLVH features associated with the patient outcomes were extracted.
Results: Seven rib fracture patients were identified, and fourteen matched patients were selected for the control group. The median time from the completion of proton therapy to rib fracture diagnosis was 12 months (range 5 to 14 months). Two patients had grade 2 symptomatic rib fracture, while the remaining five had grade 1 fractures incidentally detected on imaging. The derived p-value map demonstrated larger V(0-36 Gy[RBE], 4.0-5.0 keV/μm) in patients experiencing fracture (p<0.1).
Conclusions: In breast cancer patients receiving PBS, a larger volume of chest wall receiving moderate dose and high LET may result in increased risk of rib fracture.
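Illustrative note (not part of the abstract): the DLVH index V(d, l) described in the Methods can be sketched in a few lines. This is a minimal sketch, assuming per-voxel dose and LET arrays for one structure and a uniform voxel volume; the function name `dlvh_index` and the toy numbers are hypothetical and are not taken from the paper.

```python
import numpy as np

def dlvh_index(dose, let, d, l, voxel_volume=1.0):
    """Volume of a structure receiving at least dose d AND at least LET l.

    dose, let    : per-voxel dose and LET values (same shape); assumed inputs
    d, l         : dose and LET thresholds
    voxel_volume : volume of a single voxel (assumed uniform)
    """
    dose = np.asarray(dose, dtype=float)
    let = np.asarray(let, dtype=float)
    if dose.shape != let.shape:
        raise ValueError("dose and let must have the same shape")
    # A voxel counts only if it passes both thresholds simultaneously.
    mask = (dose >= d) & (let >= l)
    return float(mask.sum()) * voxel_volume

# Toy data (made up for illustration): four voxels of 0.5 volume units each.
toy_dose = np.array([10.0, 20.0, 30.0, 40.0])
toy_let = np.array([2.0, 4.5, 5.0, 1.0])
toy_volume = dlvh_index(toy_dose, toy_let, d=20.0, l=4.0, voxel_volume=0.5)
```

Evaluating such a function on a grid of (d, l) thresholds would give the two-dimensional surface on which a per-threshold statistical comparison, like the one the abstract describes, could be run.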
Submitted 31 October, 2023;
originally announced October 2023.
-
Physical Oscillator Model for Supercomputing
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
A parallel program together with the parallel hardware it is running on is not only a vehicle to solve numerical problems, it is also a complex system with interesting dynamical behavior: resynchronization and desynchronization of parallel processes, propagating phases of idleness, and the peculiar effects of noise and system topology are just a few examples. We propose a physical oscillator model (POM) to describe aspects of the dynamics of interacting parallel processes. Motivated by the well-known Kuramoto Model, a process with its regular compute-communicate cycles is modeled as an oscillator which is coupled to other oscillators (processes) via an interaction potential. Instead of a simple all-to-all connectivity, we employ a sparse topology matrix mapping the communication structure and thus the inter-process dependencies of the program onto the oscillator model and propose two interaction potentials that are suitable for different scenarios in parallel computing: resource-scalable and resource-bottlenecked applications. The former are not limited by a resource bottleneck such as memory bandwidth or network contention, while the latter are. Unlike the original Kuramoto model, which has a periodic sinusoidal potential that is attractive for small angles, our characteristic potentials are always attractive for large angles and only differ in the short-distance behavior. We show that the model with appropriate potentials can mimic the propagation of delays and the synchronizing and desynchronizing behavior of scalable and bottlenecked parallel programs, respectively.
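Illustrative note (not part of the abstract): for orientation, the standard Kuramoto model that the abstract refers to couples $N$ oscillators with phases $\theta_i$ and natural frequencies $\omega_i$ all-to-all through a sinusoidal term. The second line is only a schematic of how a sparse topology matrix and a general interaction potential could replace that coupling; the symbols $T_{ij}$ and $V$ are assumed notation, not the authors' equation.

```latex
% Standard Kuramoto model: all-to-all sinusoidal coupling with strength K
\dot{\theta}_i = \omega_i + \frac{K}{N} \sum_{j=1}^{N} \sin(\theta_j - \theta_i)

% Schematic generalization (assumed notation): sparse topology matrix T_{ij}
% and a generic interaction potential V in place of the sine
\dot{\theta}_i = \omega_i + \sum_{j=1}^{N} T_{ij} \, V(\theta_j - \theta_i)
```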
Submitted 9 October, 2023;
originally announced October 2023.
-
SPEChpc 2021 Benchmarks on Ice Lake and Sapphire Rapids Infiniband Clusters: A Performance and Energy Case Study
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
In this work, fundamental performance, power, and energy characteristics of the full SPEChpc 2021 benchmark suite are assessed on two different clusters based on Intel Ice Lake and Sapphire Rapids CPUs using the MPI-only code variants. We use memory bandwidth, data volume, and scalability metrics in order to categorize the benchmarks and pinpoint relevant performance and scalability bottlenecks on the node and cluster levels. Common patterns such as memory bandwidth limitation, dominating communication and synchronization overhead, MPI serialization, superlinear scaling, and alignment issues could be identified, in isolation or in combination, showing that SPEChpc 2021 is representative of many HPC workloads. Power dissipation and energy measurements indicate that the modern Intel server CPUs have such a high idle power level that race-to-idle is the paramount strategy for energy to solution and energy-delay product minimization. On the chip level, only memory-bound code shows a clear advantage of Sapphire Rapids compared to Ice Lake in terms of energy to solution.
Submitted 14 September, 2023; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Supersymmetric hybrid inflation and metastable cosmic strings in $SU(4)_c \times SU(2)_L \times U(1)_R$
Authors:
Adeela Afzal,
Maria Mehmood,
Mansoor Ur Rehman,
Qaisar Shafi
Abstract:
We construct a realistic supersymmetric model for superheavy metastable cosmic strings (CSs) that can be investigated in the current pulsar timing array (PTA) experiments. We consider shifted $\mu$ hybrid inflation in which the symmetry breaking $SU(4)_c \times SU(2)_L \times U(1)_R\rightarrow SU(3)_c\times SU(2)_L \times U(1)_{B-L}\times U(1)_R$ proceeds along an inflationary trajectory such that the topologically unstable primordial monopoles are inflated away. The breaking of $U(1)_{B-L} \times U(1)_R \rightarrow U(1)_Y$ after inflation ends yields the metastable CSs that generate the stochastic gravitational wave background (SGWB) which is consistent with the current PTA data set. The scalar spectral index $n_s$ and the tensor to scalar ratio $r$ are also compatible with Planck 2018. We briefly discuss both reheating and leptogenesis in this model.
Submitted 22 August, 2023;
originally announced August 2023.
-
Challenges in Domain-Specific Abstractive Summarization and How to Overcome them
Authors:
Anum Afzal,
Juraj Vladika,
Daniel Braun,
Florian Matthes
Abstract:
Large Language Models work quite well with general-purpose data and many tasks in Natural Language Processing. However, they show several limitations when used for a task such as domain-specific abstractive text summarization. This paper identifies three of those limitations as research problems in the context of abstractive text summarization: 1) Quadratic complexity of transformer-based models with respect to the input text length; 2) Model Hallucination, which is a model's tendency to generate factually incorrect text; and 3) Domain Shift, which happens when the distributions of the model's training and test corpora are not the same. Along with a discussion of the open research questions, this paper also provides an assessment of existing state-of-the-art techniques relevant to domain-specific text summarization to address the research gaps.
Submitted 3 July, 2023;
originally announced July 2023.
-
The NANOGrav 15-year Data Set: Search for Signals from New Physics
Authors:
Adeela Afzal,
Gabriella Agazie,
Akash Anumarlapudi,
Anne M. Archibald,
Zaven Arzoumanian,
Paul T. Baker,
Bence Bécsy,
Jose Juan Blanco-Pillado,
Laura Blecha,
Kimberly K. Boddy,
Adam Brazier,
Paul R. Brook,
Sarah Burke-Spolaor,
Rand Burnette,
Robin Case,
Maria Charisi,
Shami Chatterjee,
Katerina Chatziioannou,
Belinda D. Cheeseboro,
Siyuan Chen,
Tyler Cohen,
James M. Cordes,
Neil J. Cornish,
Fronefield Crawford,
H. Thankful Cromartie
, et al. (98 additional authors not shown)
Abstract:
The 15-year pulsar timing data set collected by the North American Nanohertz Observatory for Gravitational Waves (NANOGrav) shows positive evidence for the presence of a low-frequency gravitational-wave (GW) background. In this paper, we investigate potential cosmological interpretations of this signal, specifically cosmic inflation, scalar-induced GWs, first-order phase transitions, cosmic strings, and domain walls. We find that, with the exception of stable cosmic strings of field theory origin, all these models can reproduce the observed signal. When compared to the standard interpretation in terms of inspiraling supermassive black hole binaries (SMBHBs), many cosmological models seem to provide a better fit resulting in Bayes factors in the range from 10 to 100. However, these results strongly depend on modeling assumptions about the cosmic SMBHB population and, at this stage, should not be regarded as evidence for new physics. Furthermore, we identify excluded parameter regions where the predicted GW signal from cosmological sources significantly exceeds the NANOGrav signal. These parameter constraints are independent of the origin of the NANOGrav signal and illustrate how pulsar timing data provide a new way to constrain the parameter space of these models. Finally, we search for deterministic signals produced by models of ultralight dark matter (ULDM) and dark matter substructures in the Milky Way. We find no evidence for either of these signals and thus report updated constraints on these models. In the case of ULDM, these constraints outperform torsion balance and atomic clock constraints for ULDM coupled to electrons, muons, or gluons.
Submitted 28 June, 2023;
originally announced June 2023.
-
Making Applications Faster by Asynchronous Execution: Slowing Down Processes or Relaxing MPI Collectives
Authors:
Ayesha Afzal,
Georg Hager,
Stefano Markidis,
Gerhard Wellein
Abstract:
Comprehending the performance bottlenecks at the core of the intricate hardware-software interactions exhibited by highly parallel programs on HPC clusters is crucial. This paper sheds light on the issue of automatically asynchronous MPI communication in memory-bound parallel programs on multicore clusters and how it can be facilitated. For instance, slowing down MPI processes by deliberate injection of delays can improve performance if certain conditions are met. This leads to the counter-intuitive conclusion that noise, independent of its source, is not always detrimental but can be leveraged for performance improvements. We employ phase-space graphs as a new tool to visualize parallel program dynamics. They are useful in spotting certain patterns in parallel execution that will easily go unnoticed with traditional tracing tools. We investigate five different microbenchmarks and applications on different supercomputer platforms: an MPI-augmented STREAM Triad, two implementations of Lattice-Boltzmann fluid solvers, and the LULESH and HPCG proxy applications.
Submitted 24 February, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Investigating Conversational Search Behavior For Domain Exploration
Authors:
Phillip Schneider,
Anum Afzal,
Juraj Vladika,
Daniel Braun,
Florian Matthes
Abstract:
Conversational search has evolved as a new information retrieval paradigm, marking a shift from traditional search systems towards interactive dialogues with intelligent search agents. This change especially affects exploratory information-seeking contexts, where conversational search systems can guide the discovery of unfamiliar domains. In these scenarios, users often find it difficult to express their information goals due to insufficient background knowledge. Conversational interfaces can provide assistance by eliciting information needs and narrowing down the search space. However, due to the complexity of information-seeking behavior, the design of conversational interfaces for retrieving information remains a great challenge. Although prior work has employed user studies to empirically ground the system design, most existing studies are limited to well-defined search tasks or known domains, thus being less exploratory in nature. Therefore, we conducted a laboratory study to investigate open-ended search behavior for navigation through unknown information landscapes. The study comprised 26 participants who were restricted in their search to a text chat interface. Based on the collected dialogue transcripts, we applied statistical analyses and process mining techniques to uncover general information-seeking patterns across five different domains. We not only identify core dialogue acts and their interrelations that enable users to discover domain knowledge, but also derive design suggestions for conversational search systems.
Submitted 27 February, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Pressure-dependent semiconductor-metal transition and elastic, electronic, optical, and thermophysical properties of SnS binary chalcogenide
Authors:
Ayesha Tasnim,
Md. Mahamudujjaman,
Md. Asif Afzal,
R. S. Islam,
S. H. Naqib
Abstract:
A density functional theory based study of the pressure dependent physical properties of the binary SnS compound has been carried out. The computed elastic constants reveal that SnS is mechanically stable and brittle under ambient conditions. With increasing pressure, the compound becomes ductile. The Poisson's ratio also indicates a brittle-ductile transition with increasing pressure. The hardness of SnS increases significantly with pressure. The compound possesses elastic anisotropy. The ground state electronic band structure is semiconducting with a small band gap and becomes metallic under pressure. The bands become more and more dispersive with the increase in pressure while the electronic correlations decrease as pressure is raised. Both the Debye temperature and the phonon thermal conductivity of SnS increase sharply with pressure. The melting temperature of the compound is low. Mixed bonding characteristics are found with ionic and covalent contributions. SnS is a good absorber of ultraviolet light. The reflectivity of the material increases with the increase in pressure. The reflectivity is nonselective over a wide spectral range. The low energy refractive index is high. All these optical characteristics are useful for prospective optoelectronic device applications. The optical anisotropy is low.
Submitted 28 October, 2022;
originally announced October 2022.
-
Comparative analysis of physical properties of some binary transition metal carbides XC (X = Nb, Ta, Ti): Insights from a comprehensive ab-initio study
Authors:
Razu Ahmed,
Md. Mahamudujjaman,
Md. Asif Afzal,
Md. Sajidul Islam,
R. S. Islam,
S. H. Naqib
Abstract:
Binary metallic carbides belong to a technologically prominent class of materials. We have explored the structural, mechanical, electronic, optical, and some thermophysical properties of XC (X = Nb, Ta, Ti) binary metallic carbides in detail employing a first-principles method. Some of the results obtained are novel. A comparative analysis has been made.
Submitted 23 September, 2022;
originally announced September 2022.
-
Exploring Techniques for the Analysis of Spontaneous Asynchronicity in MPI-Parallel Applications
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein,
Stefano Markidis
Abstract:
This paper studies the utility of using data analytics and machine learning techniques for identifying, classifying, and characterizing the dynamics of large-scale parallel (MPI) programs. To this end, we run microbenchmarks and realistic proxy applications with the regular compute-communicate structure on two different supercomputing platforms and choose the per-process performance and MPI time per time step as relevant observables. Using principal component analysis, clustering techniques, correlation functions, and a new "phase space plot," we show how desynchronization patterns (or lack thereof) can be readily identified from a data set that is much smaller than a full MPI trace. Our methods also lead the way towards a more general classification of parallel program dynamics.
Submitted 27 May, 2022;
originally announced May 2022.
-
The Role of Idle Waves, Desynchronization, and Bottleneck Evasion in the Performance of Parallel Programs
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
The performance of highly parallel applications on distributed-memory systems is influenced by many factors. Analytic performance modeling techniques aim to provide insight into performance limitations and are often the starting point of optimization efforts. However, coupling analytic models across the system hierarchy (socket, node, network) fails to encompass the intricate interplay between the program code and the hardware, especially when execution and communication bottlenecks are involved. In this paper we investigate the effect of "bottleneck evasion" and how it can lead to automatic overlap of communication overhead with computation. Bottleneck evasion leads to a gradual loss of the initial bulk-synchronous behavior of a parallel code so that its processes become desynchronized. This occurs most prominently in memory-bound programs, which is why we choose memory-bound benchmark and application codes, specifically an MPI-augmented STREAM Triad, sparse matrix-vector multiplication, and a collective-avoiding Chebyshev filter diagonalization code to demonstrate the consequences of desynchronization on two different supercomputing platforms. We investigate the role of idle waves as possible triggers for desynchronization and show the impact of automatic asynchronous communication for a spectrum of code properties and parameters, such as saturation point, matrix structures, domain decomposition, and communication concurrency. Our findings reveal how eliminating synchronization points (such as collective communication or barriers) precipitates performance improvements that go beyond what can be expected by simply subtracting the overhead of the collective from the overall runtime.
Submitted 9 May, 2022;
originally announced May 2022.
-
Challenges and Opportunities of Edge AI for Next-Generation Implantable BMIs
Authors:
MohammadAli Shaeri,
Arshia Afzal,
Mahsa Shoaran
Abstract:
Neuroscience and neurotechnology are currently being revolutionized by artificial intelligence (AI) and machine learning. AI is widely used to study and interpret neural signals (analytical applications), assist people with disabilities (prosthetic applications), and treat underlying neurological symptoms (therapeutic applications). In this brief, we will review the emerging opportunities of on-chip AI for the next-generation implantable brain-machine interfaces (BMIs), with a focus on state-of-the-art prosthetic BMIs. Major technological challenges for the effectiveness of AI models will be discussed. Finally, we will present algorithmic and IC design solutions to enable a new generation of AI-enhanced and high-channel-count BMIs.
Submitted 13 April, 2022; v1 submitted 4 April, 2022;
originally announced April 2022.
-
$\mu$-hybrid Inflation, Gravitino Dark Matter and Stochastic Gravitational Wave Background from Cosmic Strings
Authors:
Adeela Afzal,
Waqas Ahmed,
Mansoor Ur Rehman,
Qaisar Shafi
Abstract:
We present a successful realization of a supersymmetric $μ$-hybrid inflation model based on a gauged $U(1)_{B-L}$ extension of the minimal supersymmetric standard model, with the soft supersymmetry breaking terms playing an important role. Successful non-thermal leptogenesis with gravitino dark matter yields a reheat temperature in the range $2 \times 10^{7} \lesssim T_R \lesssim 5 \times 10^{9}$ GeV. This corresponds to the predictions $2 \times 10^{-18} \lesssim r\lesssim 4 \times 10^{-13}$ for the tensor-to-scalar ratio, and $-2 \times 10^{-6} \lesssim dn_s/d\ln k \lesssim -5 \times 10^{-11}$ for the running of the scalar spectral index. The $B-L$ breaking scale is estimated as $ 6 \times 10^{14}\lesssim M/ \text{GeV}\lesssim 10^{16}$, calculated at the central value of the scalar spectral index, $n_s =0.9655$, reported by Planck 2018. Finally, in a grand unified theory setup the dimensionless string tension parameter associated with the metastable strings is in the range $ 10^{-9} \lesssim Gμ_\text{cs} \lesssim 10^{-6}$ corresponding to a stochastic gravitational wave background lying within the 2$σ$ bound of the recent NANOGrav 12.5-yr data.
Submitted 1 June, 2022; v1 submitted 15 February, 2022;
originally announced February 2022.
-
A comprehensive DFT based insights into the physical properties of tetragonal Mo5PB2
Authors:
M. I. Naher,
M. A. Afzal,
S. H. Naqib
Abstract:
Tetragonal Mo5PB2, a recently discovered superconductor, belongs to a technologically important class of materials. Surprisingly, a large number of physical properties of Mo5PB2, including elastic properties and their anisotropy, acoustic behavior, electronic properties (charge density distribution, electron density difference), thermo-physical properties, bonding characteristics, and optical properties, have not been investigated at all. In the present work we have explored all these properties in detail for the first time with a density functional theory based first-principles method. Mo5PB2 is found to be a mechanically stable, elastically anisotropic compound with ductile character. Moreover, the chemical bonding is interpreted by calculating the electronic energy density of states, electron density distribution, elastic properties and Mulliken bond population analysis. Mo5PB2 has a combination of mainly ionic, metallic, and some covalent bonding characteristics. The compound possesses a high level of machinability. The band structure along with a large electronic density of states at the Fermi level reveals metallic character. Calculated values of different thermal parameters of Mo5PB2 are closely related to the elastic properties. The energy dependent optical parameters show close agreement with the underlying electronic band structure. The optical absorption and reflectivity spectra and the low energy index of refraction of Mo5PB2 show that the compound holds promise for use in the optoelectronic device sector. Unlike the notable anisotropy found in elastic, mechanical properties and minimum thermal conductivity, the optical parameters are found to be almost isotropic with respect to the polarization direction of the incident electric field.
Submitted 11 May, 2021;
originally announced May 2021.
-
Current Status and Performance Analysis of Table Recognition in Document Images with Deep Neural Networks
Authors:
Khurram Azeem Hashmi,
Marcus Liwicki,
Didier Stricker,
Muhammad Adnan Afzal,
Muhammad Ahtsham Afzal,
Muhammad Zeshan Afzal
Abstract:
The first phase of table recognition is to detect the tabular area in a document. Subsequently, the tabular structures are recognized in the second phase in order to extract information from the respective cells. Table detection and structural recognition are pivotal problems in the domain of table understanding. However, table analysis is a perplexing task due to the colossal amount of diversity and asymmetry in tables. Therefore, it is an active area of research in document image analysis. Recent advances in the computing capabilities of graphical processing units have enabled deep neural networks to outperform traditional state-of-the-art machine learning methods. Table understanding has substantially benefited from the recent breakthroughs in deep neural networks. However, there has not been a consolidated description of the deep learning methods for table detection and table structure recognition. This review paper provides a thorough analysis of the modern methodologies that utilize deep neural networks. This work provides a thorough understanding of the current state-of-the-art and related challenges of table understanding in document images. Furthermore, the leading datasets and their intricacies are elaborated along with the quantitative results. Moreover, a brief overview is given regarding the promising directions that can serve as a guide to further improve table analysis in document images.
Submitted 8 May, 2021; v1 submitted 29 April, 2021;
originally announced April 2021.
-
GzScenic: Automatic Scene Generation for Gazebo Simulator
Authors:
Afsoon Afzal,
Claire Le Goues,
Christopher S. Timperley
Abstract:
Testing robotic and cyberphysical systems in simulation requires specifications of the simulated environments (i.e., scenes). The Scenic domain-specific language provides a high-level probabilistic programming language that allows users to specify scenarios for simulation. Scenic automatically generates concrete scenes that can be rendered by simulators. However, Scenic is mainly designed for autonomous vehicle simulation and does not support the most popular general-purpose simulator: Gazebo. In this work, we present GzScenic, a tool that automatically generates scenes for simulation in Gazebo. GzScenic automatically generates both the models required for running Scenic on the scenarios, and the models that Gazebo requires for running the simulation.
Submitted 17 April, 2021;
originally announced April 2021.
-
Analytic Modeling of Idle Waves in Parallel Programs: Communication, Cluster Topology, and Noise Impact
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
Most distributed-memory bulk-synchronous parallel programs in HPC assume that compute resources are available continuously and homogeneously across the allocated set of compute nodes. However, long one-off delays on individual processes can cause global disturbances, so-called idle waves, by rippling through the system. This process is mainly governed by the communication topology of the underlying parallel code. This paper makes significant contributions to the understanding of idle wave dynamics. We study the propagation mechanisms of idle waves across the ranks of MPI-parallel programs. We present a validated analytic model for their propagation velocity with respect to communication parameters and topology, with a special emphasis on sparse communication patterns. We study the interaction of idle waves with MPI collectives and show that, depending on the implementation, a collective may be transparent to the wave. Finally we analyze two mechanisms of idle wave decay: topological decay, which is rooted in differences in communication characteristics among parts of the system, and noise-induced decay, which is caused by system or application noise. We show that noise-induced decay is largely independent of noise characteristics but depends only on the overall noise power. An analytic expression for idle wave decay rate with respect to noise power is derived. For model validation we use microbenchmarks and stencil algorithms on three different supercomputing platforms.
Submitted 4 March, 2021;
originally announced March 2021.
-
Structural, elastic, bonding, optoelectronic, and some thermo-physical properties of transition metal dichalcogenides ZrX2 (X = S, Se, Te): Insights from ab-initio calculations
Authors:
Md. Mahamudujjaman,
Md. Asif Afzal,
R. S. Islam,
S. H. Naqib
Abstract:
Transition metal dichalcogenides (TMDCs) are technologically important compounds. We have explored the structural, elastic, bonding, optoelectronic and some thermo-physical properties of ZrX2 (X = S, Se, Te) TMDCs in detail via an ab-initio technique in this study. Elastic anisotropy indices, atomic bonding character, optoelectronic properties and thermo-physical parameters including melting temperature and minimum phonon thermal conductivity are investigated for the first time. All the TMDCs under investigation possess significant elastic anisotropy and layered structural features. ZrX2 (X = S, Se, Te) compounds are fairly machinable, and ZrS2 and ZrSe2 are moderately hard. ZrTe2, on the other hand, is significantly softer. Both covalent and ionic bonding contribute in the crystals. Electronic band structure calculations display semiconducting behavior for ZrS2 and ZrSe2 and metallic behavior for ZrTe2. Energy dependent optoelectronic parameters exhibit good correspondence with the underlying electronic energy density of states features. ZrX2 (X = S, Se, Te) compounds absorb ultraviolet radiation effectively. The reflectivity spectrum, R(w), remains over 50% in the energy range from 0 eV to 20 eV for ZrTe2. Therefore, this TMDC has wide band and nonselective high reflectivity and can be used as an efficient reflector to reduce solar heating. Debye temperature, melting point and minimum phonon thermal conductivity of the compounds under study are low and show excellent correspondence with each other and also with the elastic and bonding characteristics.
Submitted 24 February, 2021;
originally announced February 2021.
-
An analytic performance model for overlapping execution of memory-bound loop kernels on multicore CPUs
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
Complex applications running on multicore processors show a rich performance phenomenology. The growing number of cores per ccNUMA domain complicates performance analysis of memory-bound code since system noise, load imbalance, or task-based programming models can lead to thread desynchronization. Hence, the simplifying assumption that all cores execute the same loop cannot be upheld. Motivated by observations on plain and modified versions of the HPCG benchmark, we construct a performance model of execution of memory-bound loop kernels. It can predict the memory bandwidth share per kernel on a memory contention domain depending on the number of active cores and which other workload the kernel is paired with. The only code features required are the single-thread cache line access frequency per kernel, which is directly related to the single-thread memory bandwidth, and its saturated bandwidth. It can either be measured directly or predicted using the Execution-Cache-Memory (ECM) performance model. The computational intensity of the kernels and the detailed structure of the code are of no significance. We validate our model on Intel Broadwell, Intel Cascade Lake, and AMD Rome processors pairing various streaming and stencil kernels. The error in predicting the bandwidth share per kernel is less than 8%.
Submitted 31 October, 2020;
originally announced November 2020.
-
A DFT based first-principles investigation of the physical properties of Bi2Te2Se topological insulator
Authors:
Md. Asif Afzal,
S. H. Naqib
Abstract:
A topological insulator possesses a bulk energy gap splitting the lowest empty band from the highest occupied electronic band. The electronic states at the surface (or edge in two dimensions) of a topological insulator, on the other hand, are gapless and are protected by time reversal symmetry. Such systems are promising for a variety of optoelectronic, superconducting, thermoelectric and quantum computation related applications. We have studied the elastic, mechanical, electronic and optical properties, the bonding character and the electronic charge density distribution of the ternary Bi2Te2Se topological insulator. The compound under study is mechanically stable and elastically anisotropic. The electronic band structure calculations reveal a high degree of anisotropy in the energy dispersion. The electronic effective mass is high in the c-direction compared to that in the ab-plane. The optical constants show a moderate level of variation with respect to the polarization of the electric field of the incident radiation. The optical spectra are consistent with the electronic band structure and electronic density of states features. Both the electronic band structure and the optical constants show clear indications of a direct band gap of 0.610 eV for Bi2Te2Se. It is also found that Bi2Te2Se possesses a high refractive index at low photon energies in the infrared and visible region. It has low reflectivity in the ultraviolet region. Bi2Te2Se absorbs photons strongly at ultraviolet energies. All these features make Bi2Te2Se suitable for a diverse class of optoelectronic device applications.
Submitted 27 May, 2020;
originally announced May 2020.
-
A Study on the Challenges of Using Robotics Simulators for Testing
Authors:
Afsoon Afzal,
Deborah S. Katz,
Claire Le Goues,
Christopher S. Timperley
Abstract:
Robotics simulation plays an important role in the design, development, and verification and validation of robotic systems. Recent studies have shown that simulation may be used as a cheaper, safer, and more reliable alternative to the manual, and widely used, process of field testing. This is particularly important in the context of continuous integration pipelines, where integrated automated testing is key to reducing costs while maintaining system safety. However, simulation and automated testing are not seeing the degree of widespread adoption in practice that their potential would motivate. Our goal in this paper is to develop a principled understanding of the ways developers use simulation in their process, and the challenges they face in doing so. This type of understanding can guide the development of more effective simulators and testing techniques for modern robotics development.
To that end, we conduct a survey of 82 robotics developers from a diversity of backgrounds that addresses the current capabilities and limits of simulation technology in practice. We find that simulation is used by 85% of our participants for testing, and that many participants desire to use simulation as part of their test automation. We identify 10 high-level challenges that impede developers from using simulation for manual and automated testing, and general purposes. These challenges include the gap between simulation and reality, a lack of reproducibility, and considerable resource costs associated with using simulators. Finally, we outline avenues for improvement in the development of new simulators that can help simulation reach its potential as a means of verification and validation.
Submitted 15 April, 2020;
originally announced April 2020.
-
Desynchronization and Wave Pattern Formation in MPI-Parallel and Hybrid Memory-Bound Programs
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
Analytic, first-principles performance modeling of distributed-memory parallel codes is notoriously imprecise. Even for applications with extremely regular and homogeneous compute-communicate phases, simply adding communication time to computation time often does not yield a satisfactory prediction of parallel runtime due to deviations from the expected simple lockstep pattern caused by system noise, variations in communication time, and inherent load imbalance. In this paper, we highlight the specific cases of provoked and spontaneous desynchronization of memory-bound, bulk-synchronous pure MPI and hybrid MPI+OpenMP programs. Using simple microbenchmarks we observe that although desynchronization can introduce increased waiting time per process, it does not necessarily cause lower resource utilization but can lead to an increase in available bandwidth per core. In case of significant communication overhead, even natural noise can shove the system into a state of automatic overlap of communication and computation, improving the overall time to solution. The saturation point, i.e., the number of processes per memory domain required to achieve full memory bandwidth, is pivotal in the dynamics of this process and the emerging stable wave pattern. We also demonstrate how hybrid MPI+OpenMP programming can prevent desirable desynchronization by eliminating the bandwidth bottleneck among processes. A Chebyshev filter diagonalization application is used to demonstrate some of the observed effects in a realistic setting.
Submitted 7 February, 2020;
originally announced February 2020.
-
Propagation and Decay of Injected One-Off Delays on Clusters: A Case Study
Authors:
Ayesha Afzal,
Georg Hager,
Gerhard Wellein
Abstract:
Analytic, first-principles performance modeling of distributed-memory applications is difficult due to a wide spectrum of random disturbances caused by the application and the system. These disturbances (commonly called "noise") destroy the assumptions of regularity that one usually employs when constructing simple analytic models. Despite numerous efforts to quantify, categorize, and reduce such effects, a comprehensive quantitative understanding of their performance impact is not available, especially for long delays that have global consequences for the parallel application. In this work, we investigate various traces collected from synthetic benchmarks that mimic real applications on simulated and real message-passing systems in order to pinpoint the mechanisms behind delay propagation. We analyze the dependence of the propagation speed of idle waves emanating from injected delays with respect to the execution and communication properties of the application, study how such delays decay under increased noise levels, and how they interact with each other. We also show how fine-grained noise can make a system immune against the adverse effects of propagating idle waves. Our results contribute to a better understanding of the collective phenomena that manifest themselves in distributed-memory parallel applications.
Submitted 28 August, 2019; v1 submitted 25 May, 2019;
originally announced May 2019.
-
High-Throughput Computational Studies in Catalysis and Materials Research, and their Impact on Rational Design
Authors:
Mohammad Atif Faiz Afzal,
Johannes Hachmann
Abstract:
In the 21st century, many technology fields have become reliant on advancements in process automation. We have seen dramatic growth in areas and industries that have successfully implemented a high level of automation. In drug discovery, for example, it has alleviated an otherwise extremely complex and tedious process and has resulted in the development of several new drugs. Over the last decade, these automation techniques have begun to be adapted in the chemical and materials community as well, with the goal of exploring chemical space and pursuing the discovery and design of novel compounds for various applications. The impact of new materials on industrial and economic development has been stimulating tremendous research efforts by the materials community, and embracing automation as well as tools from computational and data science has led to acceleration and streamlining of the discovery process. In particular, virtual high-throughput screening (HTPS) is now becoming a mainstream technique to search for materials with properties that are tailored for specific applications. Its efficiency combined with the increasing availability of open-source codes and large computational resources makes it a powerful and attractive tool in materials research. Herein, we will review a selection of recent, high-profile HTPS projects for new materials and catalysts. In the case of catalysts, we focus on the HTPS studies for oxygen reduction reaction, oxygen evolution reaction, hydrogen evolution reaction, and carbon dioxide reduction reaction. For other materials applications, we emphasize the HTPS studies for photovoltaics, gas separation, high-refractive-index materials, and OLEDs.
Submitted 10 February, 2019;
originally announced February 2019.
-
Area Product and Mass Formula for Kerr-Newman Black Hole in Quintessence
Authors:
Ayesha Zakria,
Asma Afzal
Abstract:
In this research work, we primarily obtain the area, angular velocity, entropy, surface gravity and Hawking temperature of the inner and outer horizons for a Kerr-Newman black hole in the presence of quintessence. Additionally, we determine the area sum, area product, entropy sum and entropy product. We find that the area product and entropy product are free from mass $M$ but they surely rely upon the angular momentum $J$, charge $q$, spin parameter $a$ and the normalization factor $c$. We observe that these thermodynamic products are universal. We find that the area sum and entropy sum rely upon the mass $M$, charge $q$, spin parameter $a$ and the normalization factor $c$, so these sums are not universal. The black hole mass and Christodoulou-Ruffini mass for the Kerr-Newman black hole in quintessence are also found. We extract the entropy bound from the area bound. We derive the Penrose inequality and discuss the microscopic nature of the entropy.
Submitted 19 September, 2018; v1 submitted 11 August, 2018;
originally announced August 2018.
-
Estimating Probability Distributions using "Dirac" Kernels (via Rademacher-Walsh Polynomial Basis Functions)
Authors:
Hamse Y. Mussa,
Avid M. Afzal
Abstract:
In many applications (in particular information systems, such as pattern recognition, machine learning, cheminformatics, bioinformatics to name but a few) the assessment of uncertainty is essential - i.e., the estimation of the underlying probability distribution function. More often than not, the form of this function is unknown and it becomes necessary to non-parametrically construct/estimate it from a given sample. One of the methods of choice to non-parametrically estimate the unknown probability distribution function for a given random variable (defined on binary space) has been the expansion of the estimation function in Rademacher-Walsh Polynomial basis functions. In this paper we demonstrate that the expansion of the probability distribution function estimation in Rademacher-Walsh Polynomial basis functions is equivalent to the expansion of the function estimation in a set of "Dirac kernel" functions. The latter approach can ameliorate the computational bottleneck and notational awkwardness often associated with the Rademacher-Walsh Polynomial basis functions approach, in particular when the binary input space is large.
Submitted 23 September, 2016;
originally announced September 2016.
-
Information-Centric Offloading in Cellular Networks with Coordinated Device-to-Device Communication
Authors:
Asma Afzal,
Syed Ali Raza Zaidi,
Des McLernon,
Mounir Ghogho
Abstract:
In this paper, we develop a comprehensive analytical framework for cache enabled cellular networks overlaid with coordinated device-to-device (D2D) communication. We follow an approach similar to LTE Direct, where the base station (BS) is responsible for establishing D2D links. We consider that an arbitrary requesting user is offloaded to D2D mode to communicate with one of its 'k' closest D2D helpers within the macrocell subject to content availability and helper selection scheme. We consider two different D2D helper selection schemes: 1) uniform selection (US), where the D2D helper is selected uniformly and 2) nearest selection (NS), where the nearest helper possessing the content is selected. Employing tools from stochastic geometry, we model the locations of BSs and D2D helpers using independent homogeneous Poisson point processes (HPPPs). We characterize the D2D mode probability of an arbitrary user for both the NS and US schemes. The distribution of the distance between an arbitrary user and its ith neighboring D2D helper within the macrocell is derived using disk approximation for the Voronoi cell, which is shown to be reasonably accurate. We fully characterize the overall coverage probability and the average ergodic rate of an arbitrary user requesting a particular content. We show that significant performance gains can be achieved compared to conventional cellular communication under both the NS and US schemes when popular contents are requested, and that the NS scheme always outperforms the US scheme. Our analysis reveals an interesting trade-off between the performance metrics and the number of candidate D2D helpers 'k'. We conclude that enhancing D2D opportunities for the users does not always result in better performance and the network parameters have to be carefully tuned to harness maximum gains.
Submitted 20 December, 2017; v1 submitted 11 August, 2016;
originally announced August 2016.
-
Target Fishing: A Single-Label or Multi-Label Problem?
Authors:
Avid M. Afzal,
Hamse Y. Mussa,
Richard E. Turner,
Andreas Bender,
Robert C. Glen
Abstract:
According to Cobanoglu et al and Murphy, it is now widely acknowledged that the single target paradigm (one protein or target, one disease, one drug) that has been the dominant premise in drug development in the recent past is untenable. More often than not, a drug-like compound (ligand) can be promiscuous - that is, it can interact with more than one target protein. In recent years, in silico target prediction methods have approached the promiscuity issue computationally in different ways. In this study we confine attention to the so-called ligand-based target prediction machine learning approaches, commonly referred to as target-fishing. With a few exceptions, the target-fishing approaches that are currently ubiquitous in cheminformatics literature can be essentially viewed as single-label multi-classification schemes; these approaches inherently bank on the single target paradigm assumption that a ligand can home in on one specific target. In order to address the ligand promiscuity issue, one might be able to cast target-fishing as a multi-label multi-class classification problem. For illustrative and comparison purposes, single-label and multi-label Naive Bayes classification models (denoted here by SMM and MMM, respectively) for target-fishing were implemented. The models were constructed and tested on 65,587 compounds and 308 targets retrieved from the ChEMBL17 database. SMM and MMM performed differently: for 16,344 test compounds, the MMM model returned recall and precision values of 0.8058 and 0.6622, respectively; the corresponding recall and precision values yielded by the SMM model were 0.7805 and 0.7596, respectively. However, at a significance level of 0.05 and one degree of freedom, a McNemar test performed on the target prediction results returned by SMM and MMM for the 16,344 test ligands gave a chi-squared value of 15.656, in favour of the MMM approach.
Submitted 23 November, 2014;
originally announced November 2014.