Search | arXiv e-print repository

Cooperative Advisory Residual Policies for Congestion Mitigation

Authors: Aamir Hasan, Neeloy Chakraborty, Haonan Chen, Jung-Hoon Cho, Cathy Wu, Katherine Driggs-Campbell

Abstract: Fleets of autonomous vehicles can mitigate traffic congestion through simple actions, thus improving many socioeconomic factors such as commute time and gas costs. However, these approaches are limited in practice as they assume precise control over autonomous vehicle fleets, incur extensive installation costs for a centralized sensor ecosystem, and also fail to account for uncertainty in driver b… ▽ More Fleets of autonomous vehicles can mitigate traffic congestion through simple actions, thus improving many socioeconomic factors such as commute time and gas costs. However, these approaches are limited in practice as they assume precise control over autonomous vehicle fleets, incur extensive installation costs for a centralized sensor ecosystem, and also fail to account for uncertainty in driver behavior. To this end, we develop a class of learned residual policies that can be used in cooperative advisory systems and only require the use of a single vehicle with a human driver. Our policies advise drivers to behave in ways that mitigate traffic congestion while accounting for diverse driver behaviors, particularly drivers' reactions to instructions, to provide an improved user experience. To realize such policies, we introduce an improved reward function that explicitly addresses congestion mitigation and driver attitudes to advice. We show that our residual policies can be personalized by conditioning them on an inferred driver trait that is learned in an unsupervised manner with a variational autoencoder. Our policies are trained in simulation with our novel instruction adherence driver model, and evaluated in simulation and through a user study (N=16) to capture the sentiments of human drivers. Our results show that our approaches successfully mitigate congestion while adapting to different driver behaviors, with up to 20% and 40% improvement as measured by a combination metric of speed and deviations in speed across time over baselines in our simulation tests and user study, respectively. Our user study further shows that our policies are human-compatible and personalize to drivers. △ Less

Submitted 29 June, 2024; originally announced July 2024.

arXiv:2406.15638 [pdf, other]

Root Cause Analysis of Anomalies in 5G RAN Using Graph Neural Network and Transformer

Authors: Antor Hasan, Conrado Boeira, Khaleda Papry, Yue Ju, Zhongwen Zhu, Israat Haque

Abstract: The emergence of 5G technology marks a significant milestone in develo** telecommunication networks, enabling exciting new applications such as augmented reality and self-driving vehicles. However, these improvements bring an increased management complexity and a special concern in dealing with failures, as the applications 5G intends to support heavily rely on high network performance and low l… ▽ More The emergence of 5G technology marks a significant milestone in develo** telecommunication networks, enabling exciting new applications such as augmented reality and self-driving vehicles. However, these improvements bring an increased management complexity and a special concern in dealing with failures, as the applications 5G intends to support heavily rely on high network performance and low latency. Thus, automatic self-healing solutions have become effective in dealing with this requirement, allowing a learning-based system to automatically detect anomalies and perform Root Cause Analysis (RCA). However, there are inherent challenges to the implementation of such intelligent systems. First, there is a lack of suitable data for anomaly detection and RCA, as labelled data for failure scenarios is uncommon. Secondly, current intelligent solutions are tailored to LTE networks and do not fully capture the spatio-temporal characteristics present in the data. Considering this, we utilize a calibrated simulator, Simu5G, and generate open-source data for normal and failure scenarios. Using this data, we propose Simba, a state-of-the-art approach for anomaly detection and root cause analysis in 5G Radio Access Networks (RANs). We leverage Graph Neural Networks to capture spatial relationships while a Transformer model is used to learn the temporal dependencies of the data. We implement a prototype of Simba and evaluate it over multiple failures. The outcomes are compared against existing solutions to confirm the superiority of Simba. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.15045 [pdf, other]

Harnessing Knowledge Retrieval with Large Language Models for Clinical Report Error Correction

Authors: **ge Wu, Zhaolong Wu, Abul Hasan, Yunsoo Kim, Jason P. Y. Cheung, Teng Zhang, Honghan Wu

Abstract: This study proposes an approach for error correction in clinical radiology reports, leveraging large language models (LLMs) and retrieval-augmented generation (RAG) techniques. The proposed framework employs internal and external retrieval mechanisms to extract relevant medical entities and relations from the report and external knowledge sources. A three-stage inference process is introduced, dec… ▽ More This study proposes an approach for error correction in clinical radiology reports, leveraging large language models (LLMs) and retrieval-augmented generation (RAG) techniques. The proposed framework employs internal and external retrieval mechanisms to extract relevant medical entities and relations from the report and external knowledge sources. A three-stage inference process is introduced, decomposing the task into error detection, localization, and correction subtasks, which enhances the explainability and performance of the system. The effectiveness of the approach is evaluated using a benchmark dataset created by corrupting real-world radiology reports with realistic errors, guided by domain experts. Experimental results demonstrate the benefits of the proposed methods, with the combination of internal and external retrieval significantly improving the accuracy of error detection, localization, and correction across various state-of-the-art LLMs. The findings contribute to the development of more robust and reliable error correction systems for clinical documentation. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.14312 [pdf, other]

Infusing clinical knowledge into tokenisers for language models

Authors: Abul Hasan, **ge Wu, Quang Ngoc Nguyen, Salomé Andres, Imane Guellil, Huayu Zhang, Arlene Casey, Beatrice Alex, Bruce Guthrie, Honghan Wu

Abstract: This study introduces a novel knowledge enhanced tokenisation mechanism, K-Tokeniser, for clinical text processing. Technically, at initialisation stage, K-Tokeniser populates global representations of tokens based on semantic types of domain concepts (such as drugs or diseases) from either a domain ontology like Unified Medical Language System or the training data of the task related corpus. At t… ▽ More This study introduces a novel knowledge enhanced tokenisation mechanism, K-Tokeniser, for clinical text processing. Technically, at initialisation stage, K-Tokeniser populates global representations of tokens based on semantic types of domain concepts (such as drugs or diseases) from either a domain ontology like Unified Medical Language System or the training data of the task related corpus. At training or inference stage, sentence level localised context will be utilised for choosing the optimal global token representation to realise the semantic-based tokenisation. To avoid pretraining using the new tokeniser, an embedding initialisation approach is proposed to generate representations for new tokens. Using three transformer-based language models, a comprehensive set of experiments are conducted on four real-world datasets for evaluating K-Tokeniser in a wide range of clinical text analytics tasks including clinical concept and relation extraction, automated clinical coding, clinical phenotype identification, and clinical research article classification. Overall, our models demonstrate consistent improvements over their counterparts in all tasks. In particular, substantial improvements are observed in the automated clinical coding task with 13\% increase on Micro $F_1$ score. Furthermore, K-Tokeniser also shows significant capacities in facilitating quicker converge of language models. Specifically, using K-Tokeniser, the language models would only require 50\% of the training data to achieve the best performance of the baseline tokeniser using all training data in the concept extraction task and less than 20\% of the data for the automated coding task. It is worth mentioning that all these improvements require no pre-training process, making the approach generalisable. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: 18 pages, 6 figures

arXiv:2406.09103 [pdf, other]

Chain-of-Though (CoT) prompting strategies for medical error detection and correction

Authors: Zhaolong Wu, Abul Hasan, **ge Wu, Yunsoo Kim, Jason P. Y. Cheung, Teng Zhang, Honghan Wu

Abstract: This paper describes our submission to the MEDIQA-CORR 2024 shared task for automatically detecting and correcting medical errors in clinical notes. We report results for three methods of few-shot In-Context Learning (ICL) augmented with Chain-of-Thought (CoT) and reason prompts using a large language model (LLM). In the first method, we manually analyse a subset of train and validation dataset to… ▽ More This paper describes our submission to the MEDIQA-CORR 2024 shared task for automatically detecting and correcting medical errors in clinical notes. We report results for three methods of few-shot In-Context Learning (ICL) augmented with Chain-of-Thought (CoT) and reason prompts using a large language model (LLM). In the first method, we manually analyse a subset of train and validation dataset to infer three CoT prompts by examining error types in the clinical notes. In the second method, we utilise the training dataset to prompt the LLM to deduce reasons about their correctness or incorrectness. The constructed CoTs and reasons are then augmented with ICL examples to solve the tasks of error detection, span identification, and error correction. Finally, we combine the two methods using a rule-based ensemble method. Across the three sub-tasks, our ensemble method achieves a ranking of 3rd for both sub-task 1 and 2, while securing 7th place in sub-task 3 among all submissions. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: accepted as NAACL workshop

arXiv:2406.07649 [pdf, other]

Intrinsic compressibility effects in near-wall turbulence

Authors: Asif Manzoor Hasan, Pedro Costa, Johan Larsson, Sergio Pirozzoli, Rene Pecnik

Abstract: The impact of intrinsic compressibility effects -- changes in fluid volume due to pressure variations -- on high-speed wall-bounded turbulence has often been overlooked or incorrectly attributed to mean property variations. To unambiguously quantify these intrinsic compressibility effects, we perform direct numerical simulations of compressible turbulent channel flows with nearly uniform mean prop… ▽ More The impact of intrinsic compressibility effects -- changes in fluid volume due to pressure variations -- on high-speed wall-bounded turbulence has often been overlooked or incorrectly attributed to mean property variations. To unambiguously quantify these intrinsic compressibility effects, we perform direct numerical simulations of compressible turbulent channel flows with nearly uniform mean properties. Our simulations reveal that intrinsic compressibility effects yield a significant upward shift in the logarithmic mean velocity profile that can be attributed to the reduction in the turbulent shear stress. This reduction stems from the weakening of the near-wall quasi-streamwise vortices. We in turn attribute this weakening to the spontaneous opposition of sweeps and ejections from the near-wall expansions and contractions of the fluid, and provide a theoretical explanation for this mechanism. Our results also demonstrate that intrinsic compressibility effects are responsible for the increase in the inner-scaled streamwise turbulence intensity in compressible flows compared to incompressible flows, previously regarded to be an effect of mean property variations. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 29 pages, 17 figures

arXiv:2406.03916 [pdf, other]

ArMeme: Propagandistic Content in Arabic Memes

Authors: Firoj Alam, Abul Hasnat, Fatema Ahmed, Md Arid Hasan, Maram Hasanain

Abstract: With the rise of digital communication, memes have become a significant medium for cultural and political expression that is often used to mislead audiences. Identification of such misleading and persuasive multimodal content has become more important among various stakeholders, including social media platforms, policymakers, and the broader society as they often cause harm to individuals, organiz… ▽ More With the rise of digital communication, memes have become a significant medium for cultural and political expression that is often used to mislead audiences. Identification of such misleading and persuasive multimodal content has become more important among various stakeholders, including social media platforms, policymakers, and the broader society as they often cause harm to individuals, organizations, and/or society. While there has been effort to develop AI-based automatic systems for resource-rich languages (e.g., English), it is relatively little to none for medium to low resource languages. In this study, we focused on develo** an Arabic memes dataset with manual annotations of propagandistic content. We annotated ~6K Arabic memes collected from various social media platforms, which is a first resource for Arabic multimodal research. We provide a comprehensive analysis aiming to develop computational tools for their detection. We will make them publicly available for the community. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: disinformation, misinformation, factuality, harmfulness, fake news, propaganda, multimodality, text, images

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2406.03062 [pdf, other]

RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization

Authors: **ge Wu, Abul Hasan, Honghan Wu

Abstract: Radiology report summarization is a crucial task that can help doctors quickly identify clinically significant findings without the need to review detailed sections of reports. This study proposes RadBARTsum, a domain-specific and ontology facilitated adaptation of the BART model for abstractive radiology report summarization. The approach involves two main steps: 1) re-training the BART model on… ▽ More Radiology report summarization is a crucial task that can help doctors quickly identify clinically significant findings without the need to review detailed sections of reports. This study proposes RadBARTsum, a domain-specific and ontology facilitated adaptation of the BART model for abstractive radiology report summarization. The approach involves two main steps: 1) re-training the BART model on a large corpus of radiology reports using a novel entity masking strategy to improving biomedical domain knowledge learning, and 2) fine-tuning the model for the summarization task using the Findings and Background sections to predict the Impression section. Experiments are conducted using different masking strategies. Results show that the re-training process with domain knowledge facilitated masking improves performances consistently across various settings. This work contributes a domain-specific generative language model for radiology report summarization and a method for utilising medical knowledge to realise entity masking language model. The proposed approach demonstrates a promising direction of enhancing the efficiency of language models by deepening its understanding of clinical knowledge in radiology reports. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2405.19218 [pdf, other]

The $SL_2(\mathbb{R})$ duality and the non-invertible $U(1)$ symmetry of Maxwell theory

Authors: Azeem Hasan, Shani Meynet, Daniele Migliorati

Abstract: Recent proposals for the Symmetry Topological Field Theory (SymTFT) of Maxwell theory admit a 0-form symmetry compatible with the classical $SL_2(\mathbb{R})$ duality of electromagnetism. We describe how to realize these automorphisms of the SymTFT in terms of its operators and we detail their effects on the dynamical theory and its global variants. In the process, we show that the classical… ▽ More Recent proposals for the Symmetry Topological Field Theory (SymTFT) of Maxwell theory admit a 0-form symmetry compatible with the classical $SL_2(\mathbb{R})$ duality of electromagnetism. We describe how to realize these automorphisms of the SymTFT in terms of its operators and we detail their effects on the dynamical theory and its global variants. In the process, we show that the classical $U(1)$ symmetry, corresponding to the stabilizer of $SL_2(\mathbb{R})$, can be restored as a non-invertible one, by means of an infinite series of discrete gauging. This provides an example of the reemergence of a classical symmetry in the quantum regime, which was not broken by anomalies, but rather by the quantization of electromagnetic fluxes. However, this procedure comes at the price of introducing "continuous" condensates that trivialize all line operators. △ Less

Submitted 11 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

Comments: v2: 19 pages, 1 figure, references added

arXiv:2405.17455 [pdf, other]

WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets

Authors: Adib Hasan, Mardavij Roozbehani, Munther Dahleh

Abstract: This paper introduces WeatherFormer, a transformer encoder-based model designed to learn robust weather features from minimal observations. It addresses the challenge of modeling complex weather dynamics from small datasets, a bottleneck for many prediction tasks in agriculture, epidemiology, and climate science. WeatherFormer was pretrained on a large pretraining dataset comprised of 39 years of… ▽ More This paper introduces WeatherFormer, a transformer encoder-based model designed to learn robust weather features from minimal observations. It addresses the challenge of modeling complex weather dynamics from small datasets, a bottleneck for many prediction tasks in agriculture, epidemiology, and climate science. WeatherFormer was pretrained on a large pretraining dataset comprised of 39 years of satellite measurements across the Americas. With a novel pretraining task and fine-tuning, WeatherFormer achieves state-of-the-art performance in county-level soybean yield prediction and influenza forecasting. Technical innovations include a unique spatiotemporal encoding that captures geographical, annual, and seasonal variations, adapting the transformer architecture to continuous weather data, and a pretraining strategy to learn representations that are robust to missing weather features. This paper for the first time demonstrates the effectiveness of pretraining large transformer encoder models for weather-dependent applications across multiple domains. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.11188 [pdf, other]

Wind Power Prediction across Different Locations using Deep Domain Adaptive Learning

Authors: Md Saiful Islam Sajol, Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf

Abstract: Accurate prediction of wind power is essential for the grid integration of this intermittent renewable source and aiding grid planners in forecasting available wind capacity. Spatial differences lead to discrepancies in climatological data distributions between two geographically dispersed regions, consequently making the prediction task more difficult. Thus, a prediction model that learns from th… ▽ More Accurate prediction of wind power is essential for the grid integration of this intermittent renewable source and aiding grid planners in forecasting available wind capacity. Spatial differences lead to discrepancies in climatological data distributions between two geographically dispersed regions, consequently making the prediction task more difficult. Thus, a prediction model that learns from the data of a particular climatic region can suffer from being less robust. A deep neural network (DNN) based domain adaptive approach is proposed to counter this drawback. Effective weather features from a large set of weather parameters are selected using a random forest approach. A pre-trained model from the source domain is utilized to perform the prediction task, assuming no source data is available during target domain prediction. The weights of only the last few layers of the DNN model are updated throughout the task, kee** the rest of the network unchanged, making the model faster compared to the traditional approaches. The proposed approach demonstrates higher accuracy ranging from 6.14% to even 28.44% compared to the traditional non-adaptive method. △ Less

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2405.11029 [pdf, other]

Generative Artificial Intelligence: A Systematic Review and Applications

Authors: Sandeep Singh Sengar, Affan Bin Hasan, Sanjay Kumar, Fiona Carroll

Abstract: In recent years, the study of artificial intelligence (AI) has undergone a paradigm shift. This has been propelled by the groundbreaking capabilities of generative models both in supervised and unsupervised learning scenarios. Generative AI has shown state-of-the-art performance in solving perplexing real-world conundrums in fields such as image translation, medical diagnostics, textual imagery fu… ▽ More In recent years, the study of artificial intelligence (AI) has undergone a paradigm shift. This has been propelled by the groundbreaking capabilities of generative models both in supervised and unsupervised learning scenarios. Generative AI has shown state-of-the-art performance in solving perplexing real-world conundrums in fields such as image translation, medical diagnostics, textual imagery fusion, natural language processing, and beyond. This paper documents the systematic review and analysis of recent advancements and techniques in Generative AI with a detailed discussion of their applications including application-specific models. Indeed, the major impact that generative AI has made to date, has been in language generation with the development of large language models, in the field of image translation and several other interdisciplinary applications of generative AI. Moreover, the primary contribution of this paper lies in its coherent synthesis of the latest advancements in these areas, seamlessly weaving together contemporary breakthroughs in the field. Particularly, how it shares an exploration of the future trajectory for generative AI. In conclusion, the paper ends with a discussion of Responsible AI principles, and the necessary ethical considerations for the sustainability and growth of these generative models. △ Less

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2404.12958 [pdf, other]

Improving Pediatric Pneumonia Diagnosis with Adult Chest X-ray Images Utilizing Contrastive Learning and Embedding Similarity

Authors: Mohammad Zunaed, Anwarul Hasan, Taufiq Hasan

Abstract: Despite the advancement of deep learning-based computer-aided diagnosis (CAD) methods for pneumonia from adult chest x-ray (CXR) images, the performance of CAD methods applied to pediatric images remains suboptimal, mainly due to the lack of large-scale annotated pediatric imaging datasets. Establishing a proper framework to leverage existing adult large-scale CXR datasets can thus enhance pediatr… ▽ More Despite the advancement of deep learning-based computer-aided diagnosis (CAD) methods for pneumonia from adult chest x-ray (CXR) images, the performance of CAD methods applied to pediatric images remains suboptimal, mainly due to the lack of large-scale annotated pediatric imaging datasets. Establishing a proper framework to leverage existing adult large-scale CXR datasets can thus enhance pediatric pneumonia detection performance. In this paper, we propose a three-branch parallel path learning-based framework that utilizes both adult and pediatric datasets to improve the performance of deep learning models on pediatric test datasets. The paths are trained with pediatric only, adult only, and both types of CXRs, respectively. Our proposed framework utilizes the multi-positive contrastive loss to cluster the classwise embeddings and the embedding similarity loss among these three parallel paths to make the classwise embeddings as close as possible to reduce the effect of domain shift. Experimental evaluations on open-access adult and pediatric CXR datasets show that the proposed method achieves a superior AUROC score of 0.8464 compared to 0.8348 obtained using the conventional approach of join training on both datasets. The proposed approach thus paves the way for generalized CAD models that are effective for both adult and pediatric age groups. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: Accepted to International Conference of IEEE Engineering in Medicine and Biology Society (EMBC), 2024

arXiv:2404.12132 [pdf, other]

Enhancing Suicide Risk Assessment: A Speech-Based Automated Approach in Emergency Medicine

Authors: Shahin Amiriparian, Maurice Gerczuk, Justina Lutz, Wolfgang Strube, Irina Papazova, Alkomiet Hasan, Alexander Kathan, Björn W. Schuller

Abstract: The delayed access to specialized psychiatric assessments and care for patients at risk of suicidal tendencies in emergency departments creates a notable gap in timely intervention, hindering the provision of adequate mental health support during critical situations. To address this, we present a non-invasive, speech-based approach for automatic suicide risk assessment. For our study, we have coll… ▽ More The delayed access to specialized psychiatric assessments and care for patients at risk of suicidal tendencies in emergency departments creates a notable gap in timely intervention, hindering the provision of adequate mental health support during critical situations. To address this, we present a non-invasive, speech-based approach for automatic suicide risk assessment. For our study, we have collected a novel dataset of speech recordings from $20$ patients from which we extract three sets of features, including wav2vec, interpretable speech and acoustic features, and deep learning-based spectral representations. We proceed by conducting a binary classification to assess suicide risk in a leave-one-subject-out fashion. Our most effective speech model achieves a balanced accuracy of $66.2\,\%$. Moreover, we show that integrating our speech model with a series of patients' metadata, such as the history of suicide attempts or access to firearms, improves the overall result. The metadata integration yields a balanced accuracy of $94.4\,\%$, marking an absolute improvement of $28.2\,\%$, demonstrating the efficacy of our proposed approaches for automatic suicide risk assessment in emergency medicine. △ Less

Submitted 18 April, 2024; originally announced April 2024.

ACM Class: I.2

arXiv:2404.10924 [pdf, other]

Binder: Hierarchical Concept Representation through Order Embedding of Binary Vectors

Authors: Croix Gyurek, Niloy Talukder, Mohammad Al Hasan

Abstract: For natural language understanding and generation, embedding concepts using an order-based representation is an essential task. Unlike traditional point vector based representation, an order-based representation imposes geometric constraints on the representation vectors for explicitly capturing various semantic relationships that may exist between a pair of concepts. In existing literature, sever… ▽ More For natural language understanding and generation, embedding concepts using an order-based representation is an essential task. Unlike traditional point vector based representation, an order-based representation imposes geometric constraints on the representation vectors for explicitly capturing various semantic relationships that may exist between a pair of concepts. In existing literature, several approaches on order-based embedding have been proposed, mostly focusing on capturing hierarchical relationships; examples include vectors in Euclidean space, complex, Hyperbolic, order, and Box Embedding. Box embedding creates region-based rich representation of concepts, but along the process it sacrifices simplicity, requiring a custom-made optimization scheme for learning the representation. Hyperbolic embedding improves embedding quality by exploiting the ever-expanding property of Hyperbolic space, but it also suffers from the same fate as box embedding as gradient descent like optimization is not simple in the Hyperbolic space. In this work, we propose Binder, a novel approach for order-based representation. Binder uses binary vectors for embedding, so the embedding vectors are compact with an order of magnitude smaller footprint than other methods. Binder uses a simple and efficient optimization scheme for learning representation vectors with a linear time complexity. Our comprehensive experimental results show that Binder is very accurate, yielding competitive results on the representation task. But Binder stands out from its competitors on the transitive closure link prediction task as it can learn concept embeddings just from the direct edges, whereas all existing order-based approaches rely on the indirect edges. △ Less

Submitted 16 April, 2024; originally announced April 2024.

arXiv:2404.10643 [pdf, other]

A Calibrated and Automated Simulator for Innovations in 5G

Authors: Conrado Boeira, Antor Hasan, Khaleda Papry, Yue Ju, Zhongwen Zhu, Israat Haque

Abstract: The rise of 5G deployments has created the environment for many emerging technologies to flourish. Self-driving vehicles, Augmented and Virtual Reality, and remote operations are examples of applications that leverage 5G networks' support for extremely low latency, high bandwidth, and increased throughput. However, the complex architecture of 5G hinders innovation due to the lack of accessibility… ▽ More The rise of 5G deployments has created the environment for many emerging technologies to flourish. Self-driving vehicles, Augmented and Virtual Reality, and remote operations are examples of applications that leverage 5G networks' support for extremely low latency, high bandwidth, and increased throughput. However, the complex architecture of 5G hinders innovation due to the lack of accessibility to testbeds or realistic simulators with adequate 5G functionalities. Also, configuring and managing simulators are complex and time consuming. Finally, the lack of adequate representative data hinders the data-driven designs in 5G campaigns. Thus, we calibrated a system-level open-source simulator, Simu5G, following 3GPP guidelines to enable faster innovation in the 5G domain. Furthermore, we developed an API for automatic simulator configuration without knowing the underlying architectural details. Finally, we demonstrate the usage of the calibrated and automated simulator by develo** an ML-based anomaly detection in a 5G Radio Access Network (RAN). △ Less

Submitted 16 April, 2024; originally announced April 2024.

arXiv:2404.09402 [pdf, other]

Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes

Authors: Haoming Yang, Ali Hasan, Yuting Ng, Vahid Tarokh

Abstract: McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, an… ▽ More McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, and corresponding estimators for inferring parameters from data based on the properties of the MV-SDE. We analyze the characteristics of the different architectures and estimators, and consider their applicability in relevant machine learning problems. We empirically compare the performance of the different architectures and estimators on real and synthetic datasets for time series and probabilistic modeling. The results suggest that explicitly including distributional dependence in the parameterization of the SDE is effective in modeling temporal data with interaction under an exchangeability assumption while maintaining strong performance for standard Itô-SDEs due to the richer class of probability flows associated with MV-SDEs. △ Less

Submitted 14 April, 2024; originally announced April 2024.

Comments: Appears in AISTATS 2024

arXiv:2404.00470 [pdf]

Classification of Short Segment Pediatric Heart Sounds Based on a Transformer-Based Convolutional Neural Network

Authors: Md Hassanuzzaman, Nurul Akhtar Hasan, Mohammad Abdullah Al Mamun, Khawza I Ahmed, Ahsan H Khandoker, Raqibul Mostafa

Abstract: Congenital anomalies arising as a result of a defect in the structure of the heart and great vessels are known as congenital heart diseases or CHDs. A PCG can provide essential details about the mechanical conduction system of the heart and point out specific patterns linked to different kinds of CHD. This study aims to investigate the minimum signal duration required for the automatic classificat… ▽ More Congenital anomalies arising as a result of a defect in the structure of the heart and great vessels are known as congenital heart diseases or CHDs. A PCG can provide essential details about the mechanical conduction system of the heart and point out specific patterns linked to different kinds of CHD. This study aims to investigate the minimum signal duration required for the automatic classification of heart sounds. This study also investigated the optimum signal quality assessment indicator (Root Mean Square of Successive Differences) RMSSD and (Zero Crossings Rate) ZCR value. Mel-frequency cepstral coefficients (MFCCs) based feature is used as an input to build a Transformer-Based residual one-dimensional convolutional neural network, which is then used for classifying the heart sound. The study showed that 0.4 is the ideal threshold for getting suitable signals for the RMSSD and ZCR indicators. Moreover, a minimum signal length of 5s is required for effective heart sound classification. It also shows that a shorter signal (3 s heart sound) does not have enough information to categorize heart sounds accurately, and the longer signal (15 s heart sound) may contain more noise. The best accuracy, 93.69%, is obtained for the 5s signal to distinguish the heart sound. △ Less

Submitted 30 March, 2024; originally announced April 2024.

Comments: 16 pages,11 Figures

arXiv:2403.15087 [pdf]

doi 10.4236/ijaa.2024.141002

Does the Redshift Distribution of Swift Long GRBs Trace the Star-Formation Rate?

Authors: Ali M. Hasan, Walid J. Azzam

Abstract: Gamma-ray bursts (GRBs) are extremely powerful explosions that have been traditionally classified into two categories: long bursts (LGRBs) with an observed duration T90 > 2 s, and short bursts (SGRBs) with an observed duration T90 < 2 s, where T90 is the time interval during which 90% of the fluence is detected. LGRBs are believed to emanate from the core-collapse of massive stars, while SGRBs are… ▽ More Gamma-ray bursts (GRBs) are extremely powerful explosions that have been traditionally classified into two categories: long bursts (LGRBs) with an observed duration T90 > 2 s, and short bursts (SGRBs) with an observed duration T90 < 2 s, where T90 is the time interval during which 90% of the fluence is detected. LGRBs are believed to emanate from the core-collapse of massive stars, while SGRBs are believed to result from the merging of two compact objects, like two neutron stars. Because LGRBs are produced by the violent death of massive stars, we expect that their redshift distribution should trace the star-formation rate (SFR). The purpose of our study is to investigate the extent to which the redshift distribution of LGRBs follows and reflects the SFR. We use a sample of 370 LGRBs taken from the Swift catalog, and we investigate different models for the LGRB redshift distribution. We also carry out Monte Carlo simulations to check the consistency of our results. Our results indicate that the SFR can describe the LGRB redshift distribution well for high redshift bursts, but it needs an evolution term to fit the distribution well at low redshift. △ Less

Submitted 22 March, 2024; originally announced March 2024.

Comments: 37 pages, 7 figures, 3 tables, 45 references

Journal ref: International Journal of Astronomy and Astrophysics, Volume 14, pp. 20-44, 2024

arXiv:2403.10468 [pdf, other]

An Empirical Study on Developers Shared Conversations with ChatGPT in GitHub Pull Requests and Issues

Authors: Huizi Hao, Kazi Amit Hasan, Hong Qin, Marcos Macedo, Yuan Tian, Steven H. H. Ding, Ahmed E. Hassan

Abstract: ChatGPT has significantly impacted software development practices, providing substantial assistance to developers in a variety of tasks, including coding, testing, and debugging. Despite its widespread adoption, the impact of ChatGPT as an assistant in collaborative coding remains largely unexplored. In this paper, we analyze a dataset of 210 and 370 developers shared conversations with ChatGPT in… ▽ More ChatGPT has significantly impacted software development practices, providing substantial assistance to developers in a variety of tasks, including coding, testing, and debugging. Despite its widespread adoption, the impact of ChatGPT as an assistant in collaborative coding remains largely unexplored. In this paper, we analyze a dataset of 210 and 370 developers shared conversations with ChatGPT in GitHub pull requests (PRs) and issues. We manually examined the content of the conversations and characterized the dynamics of the sharing behavior, i.e., understanding the rationale behind the sharing, identifying the locations where the conversations were shared, and determining the roles of the developers who shared them. Our main observations are: (1) Developers seek ChatGPT assistance across 16 types of software engineering inquiries. In both conversations shared in PRs and issues, the most frequently encountered inquiry categories include code generation, conceptual questions, how-to guides, issue resolution, and code review. (2) Developers frequently engage with ChatGPT via multi-turn conversations where each prompt can fulfill various roles, such as unveiling initial or new tasks, iterative follow-up, and prompt refinement. Multi-turn conversations account for 33.2% of the conversations shared in PRs and 36.9% in issues. (3) In collaborative coding, developers leverage shared conversations with ChatGPT to facilitate their role-specific contributions, whether as authors of PRs or issues, code reviewers, or collaborators on issues. Our work serves as the first step towards understanding the dynamics between developers and ChatGPT in collaborative software development and opens up new directions for future research on the topic. △ Less

Submitted 15 March, 2024; originally announced March 2024.

arXiv:2403.06436 [pdf]

Designing a K-state P-bit Engine

Authors: Mohammad Khairul Bashar, Abir Hasan, Nikhil Shukla

Abstract: Probabilistic bit (p-bit)-based compute engines utilize the unique capability of a p-bit to probabilistically switch between two states to solve computationally challenging problems. However, when solving problems that require more than two states (e.g., problems such as Max-3-Cut, verifying if a graph is K-partite (K>2) etc.), additional pre-processing steps such as graph reduction are required t… ▽ More Probabilistic bit (p-bit)-based compute engines utilize the unique capability of a p-bit to probabilistically switch between two states to solve computationally challenging problems. However, when solving problems that require more than two states (e.g., problems such as Max-3-Cut, verifying if a graph is K-partite (K>2) etc.), additional pre-processing steps such as graph reduction are required to make the problem compatible with a two-state p-bit platform. Moreover, this not only increases the problem size by entailing the use of auxiliary variables but can also degrade the solution quality. In this work, we develop a unique framework for implementing a K-state (K>2) p-bit engine. Furthermore, from an implementation standpoint, we show that such a K-state p-bit engine can be implemented using N traditional (2-state) p-bits, and one multi-state p-bit -- a novel concept proposed here. Augmenting traditional p-bit platforms, our approach enables us to solve an archetypal combinatoric problem class requiring multiple states, namely Max-K-Cut (K=3, 4 shown here), without using any additional auxiliary variables. Thus, our work fundamentally advances the functional capability of p-bit engines, enabling them to solve a broader class of computationally challenging problems more efficiently. △ Less

Submitted 27 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

arXiv:2403.06060 [pdf, other]

Ensemble Language Models for Multilingual Sentiment Analysis

Authors: Md Arid Hasan

Abstract: The rapid advancement of social media enables us to analyze user opinions. In recent times, sentiment analysis has shown a prominent research gap in understanding human sentiment based on the content shared on social media. Although sentiment analysis for commonly spoken languages has advanced significantly, low-resource languages like Arabic continue to get little research due to resource limitat… ▽ More The rapid advancement of social media enables us to analyze user opinions. In recent times, sentiment analysis has shown a prominent research gap in understanding human sentiment based on the content shared on social media. Although sentiment analysis for commonly spoken languages has advanced significantly, low-resource languages like Arabic continue to get little research due to resource limitations. In this study, we explore sentiment analysis on tweet texts from SemEval-17 and the Arabic Sentiment Tweet dataset. Moreover, We investigated four pretrained language models and proposed two ensemble language models. Our findings include monolingual models exhibiting superior performance and ensemble models outperforming the baseline while the majority voting ensemble outperforms the English language. △ Less

Submitted 9 March, 2024; originally announced March 2024.

Comments: This is one of my graduate course project reports and currently, I'm not planning to submit to any conferences

ACM Class: I.2.7

arXiv:2403.03312 [pdf, other]

Beyond the Dashboard: Investigating Distracted Driver Communication Preferences for ADAS

Authors: Aamir Hasan, D. Livingston McPherson, Melissa Miles, Katherine Driggs-Campbell

Abstract: Distracted driving is a major cause of road fatalities. With improvements in driver (in)attention detection, these distracted situations can be caught early to alert drivers and improve road safety and comfort. However, drivers may have differing preferences for the modes of such communication based on the driving scenario and their current distraction state. To this end, we present an (N=147) whe… ▽ More Distracted driving is a major cause of road fatalities. With improvements in driver (in)attention detection, these distracted situations can be caught early to alert drivers and improve road safety and comfort. However, drivers may have differing preferences for the modes of such communication based on the driving scenario and their current distraction state. To this end, we present an (N=147) where videos of simulated driving scenarios were utilized to learn drivers preferences for modes of communication and their evolution with the drivers changing attention. The survey queried participants preferred modes of communication for scenarios such as collisions or stagnation at a green light. that inform the future of communication between drivers and their vehicles. We showcase the different driver preferences based on the nature of the driving scenario and also show that they evolve as the drivers distraction state changes △ Less

Submitted 23 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: 10 pages, 6 figures. All materials associated with the study can be found at https://sites.google.com/illinois.edu/driver-preference-for-modes

arXiv:2402.11544 [pdf, ps, other]

On efficient normal bases over binary fields

Authors: Mohamadou Sall, M. Anwar Hasan

Abstract: Binary field extensions are fundamental to many applications, such as multivariate public key cryptography, code-based cryptography, and error-correcting codes. Their implementation requires a foundation in number theory and algebraic geometry and necessitates the utilization of efficient bases. The continuous increase in the power of computation, and the design of new (quantum) computers increase… ▽ More Binary field extensions are fundamental to many applications, such as multivariate public key cryptography, code-based cryptography, and error-correcting codes. Their implementation requires a foundation in number theory and algebraic geometry and necessitates the utilization of efficient bases. The continuous increase in the power of computation, and the design of new (quantum) computers increase the threat to the security of systems and impose increasingly demanding encryption standards with huge polynomial or extension degrees. For cryptographic purposes or other common implementations of finite fields arithmetic, it is essential to explore a wide range of implementations with diverse bases. Unlike some bases, polynomial and Gaussian normal bases are well-documented and widely employed. In this paper, we explore other forms of bases of $\mathbb{F}_{2^n}$ over $\mathbb{F}_2$ to demonstrate efficient implementation of operations within different ranges. To achieve this, we leverage results on fast computations and elliptic periods introduced by Couveignes and Lercier, and subsequently expanded upon by Ezome and Sall. This leads to the establishment of new tables for efficient computation over binary fields. △ Less

Submitted 18 February, 2024; originally announced February 2024.

arXiv:2402.05736 [pdf, ps, other]

Numerical solution of the Newtonian plane Couette flow with linear dynamic wall slip

Authors: Muner M. A. Hasan, Ethar A. A. Ahmed, Ahmed F. Ghaleb, Moustafa S. Abou-Dina, Georgios C. Georgiou

Abstract: An efficient numerical approach based on weighted average finite differences is used to solve the Newtonian plane Couette flow with wall slip, obeying a dynamic slip law that generalizes the Navier slip law with the inclusion of a relaxation term. Slip is exhibited only along the fixed plate, and the motion is triggered by the motion of the other plate. Three different cases are considered for the… ▽ More An efficient numerical approach based on weighted average finite differences is used to solve the Newtonian plane Couette flow with wall slip, obeying a dynamic slip law that generalizes the Navier slip law with the inclusion of a relaxation term. Slip is exhibited only along the fixed plate, and the motion is triggered by the motion of the other plate. Three different cases are considered for the motion of the moving plate, i.e., constant speed, oscillating speed, and a single-period sinusoidal speed. The velocity and the volumetric flow rate are calculated in all cases and comparisons are made with the results of other methods and available results in the literature. The numerical outcomes confirm the dam** with time and the lagging effects arising from the Navier and dynamic wall slip conditions and demonstrate the hysteretic behavior of the slip velocity in following the harmonic boundary motion. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 21 pages, 15 figures

arXiv:2401.14908 [pdf, ps, other]

doi 10.1145/3630106.3658957

A Framework for Assurance Audits of Algorithmic Systems

Authors: Khoa Lam, Benjamin Lange, Borhane Blili-Hamelin, Jovana Davidovic, Shea Brown, Ali Hasan

Abstract: An increasing number of regulations propose AI audits as a mechanism for achieving transparency and accountability for artificial intelligence (AI) systems. Despite some converging norms around various forms of AI auditing, auditing for the purpose of compliance and assurance currently lacks agreed-upon practices, procedures, taxonomies, and standards. We propose the criterion audit as an operatio… ▽ More An increasing number of regulations propose AI audits as a mechanism for achieving transparency and accountability for artificial intelligence (AI) systems. Despite some converging norms around various forms of AI auditing, auditing for the purpose of compliance and assurance currently lacks agreed-upon practices, procedures, taxonomies, and standards. We propose the criterion audit as an operationalizable compliance and assurance external audit framework. We model elements of this approach after financial auditing practices, and argue that AI audits should similarly provide assurance to their stakeholders about AI organizations' ability to govern their algorithms in ways that mitigate harms and uphold human values. We discuss the necessary conditions for the criterion audit and provide a procedural blueprint for performing an audit engagement in practice. We illustrate how this framework can be adapted to current regulations by deriving the criteria on which bias audits can be performed for in-scope hiring algorithms, as required by the recently effective New York City Local Law 144 of 2021. We conclude by offering a critical discussion on the benefits, inherent limitations, and implementation challenges of applying practices of the more mature financial auditing industry to AI auditing where robust guardrails against quality assurance issues are only starting to emerge. Our discussion -- informed by experiences in performing these audits in practice -- highlights the critical role that an audit ecosystem plays in ensuring the effectiveness of audits. △ Less

Submitted 28 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Journal ref: The 2024 ACM Conference on Fairness, Accountability, and Transparency

arXiv:2401.14422 [pdf, other]

Location Agnostic Source-Free Domain Adaptive Learning to Predict Solar Power Generation

Authors: Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf, Md Saiful Islam Sajol, Farhana Akter Tumpa

Abstract: The prediction of solar power generation is a challenging task due to its dependence on climatic characteristics that exhibit spatial and temporal variability. The performance of a prediction model may vary across different places due to changes in data distribution, resulting in a model that works well in one region but not in others. Furthermore, as a consequence of global warming, there is a no… ▽ More The prediction of solar power generation is a challenging task due to its dependence on climatic characteristics that exhibit spatial and temporal variability. The performance of a prediction model may vary across different places due to changes in data distribution, resulting in a model that works well in one region but not in others. Furthermore, as a consequence of global warming, there is a notable acceleration in the alteration of weather patterns on an annual basis. This phenomenon introduces the potential for diminished efficacy of existing models, even within the same geographical region, as time progresses. In this paper, a domain adaptive deep learning-based framework is proposed to estimate solar power generation using weather features that can solve the aforementioned challenges. A feed-forward deep convolutional network model is trained for a known location dataset in a supervised manner and utilized to predict the solar power of an unknown location later. This adaptive data-driven approach exhibits notable advantages in terms of computing speed, storage efficiency, and its ability to improve outcomes in scenarios where state-of-the-art non-adaptive methods fail. Our method has shown an improvement of $10.47 \%$, $7.44 \%$, $5.11\%$ in solar power prediction accuracy compared to best performing non-adaptive method for California (CA), Florida (FL) and New York (NY), respectively. △ Less

Submitted 6 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

arXiv:2401.10862 [pdf, other]

Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning

Authors: Adib Hasan, Ileana Rugina, Alex Wang

Abstract: Large Language Models (LLMs) are susceptible to `jailbreaking' prompts, which can induce the generation of harmful content. This paper demonstrates that moderate WANDA pruning (Sun et al., 2023) can increase their resistance to such attacks without the need for fine-tuning, while maintaining performance on standard benchmarks. Our findings suggest that the benefits of pruning correlate with the in… ▽ More Large Language Models (LLMs) are susceptible to `jailbreaking' prompts, which can induce the generation of harmful content. This paper demonstrates that moderate WANDA pruning (Sun et al., 2023) can increase their resistance to such attacks without the need for fine-tuning, while maintaining performance on standard benchmarks. Our findings suggest that the benefits of pruning correlate with the initial safety levels of the model, indicating a regularizing effect of WANDA pruning. We introduce a dataset of 225 harmful tasks across five categories to systematically evaluate this safety enhancement. We argue that safety improvements can be understood through a regularization perspective. First, we show that pruning helps LLMs focus more effectively on task-relevant tokens within jailbreaking prompts. Then, we analyze the effects of pruning on the perplexity of malicious prompts before and after their integration into jailbreak templates. Finally, we demonstrate statistically significant performance improvements under domain shifts when applying WANDA to linear models. △ Less

Submitted 28 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

arXiv:2312.10903 [pdf, other]

Robust Node Representation Learning via Graph Variational Diffusion Networks

Authors: Jun Zhuang, Mohammad Al Hasan

Abstract: Node representation learning by using Graph Neural Networks (GNNs) has been widely explored. However, in recent years, compelling evidence has revealed that GNN-based node representation learning can be substantially deteriorated by delicately-crafted perturbations in a graph structure. To learn robust node representation in the presence of perturbations, various works have been proposed to safegu… ▽ More Node representation learning by using Graph Neural Networks (GNNs) has been widely explored. However, in recent years, compelling evidence has revealed that GNN-based node representation learning can be substantially deteriorated by delicately-crafted perturbations in a graph structure. To learn robust node representation in the presence of perturbations, various works have been proposed to safeguard GNNs. Within these existing works, Bayesian label transition has been proven to be more effective, but this method is extensively reliant on a well-built prior distribution. The variational inference could address this limitation by sampling the latent node embedding from a Gaussian prior distribution. Besides, leveraging the Gaussian distribution (noise) in hidden layers is an appealing strategy to strengthen the robustness of GNNs. However, our experiments indicate that such a strategy can cause over-smoothing issues during node aggregation. In this work, we propose the Graph Variational Diffusion Network (GVDN), a new node encoder that effectively manipulates Gaussian noise to safeguard robustness on perturbed graphs while alleviating over-smoothing issues through two mechanisms: Gaussian diffusion and node embedding propagation. Thanks to these two mechanisms, our model can generate robust node embeddings for recovery. Specifically, we design a retraining mechanism using the generated node embedding to recover the performance of node classifications in the presence of perturbations. The experiments verify the effectiveness of our proposed model across six public datasets. △ Less

Submitted 17 December, 2023; originally announced December 2023.

Comments: preprint, under review

arXiv:2312.08656 [pdf, other]

MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training

Authors: Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Jiahui Zhao, Shaoyi Huang, Omer Khan, David Kaeli, Caiwen Ding

Abstract: In the acceleration of deep neural network training, the GPU has become the mainstream platform. GPUs face substantial challenges on GNNs, such as workload imbalance and memory access irregularities, leading to underutilized hardware. Existing solutions such as PyG, DGL with cuSPARSE, and GNNAdvisor frameworks partially address these challenges but memory traffic is still significant. We argue t… ▽ More In the acceleration of deep neural network training, the GPU has become the mainstream platform. GPUs face substantial challenges on GNNs, such as workload imbalance and memory access irregularities, leading to underutilized hardware. Existing solutions such as PyG, DGL with cuSPARSE, and GNNAdvisor frameworks partially address these challenges but memory traffic is still significant. We argue that drastic performance improvements can only be achieved by the vertical optimization of algorithm and system innovations, rather than treating the speedup optimization as an "after-thought" (i.e., (i) given a GNN algorithm, designing an accelerator, or (ii) given hardware, mainly optimizing the GNN algorithm). In this paper, we present MaxK-GNN, an advanced high-performance GPU training system integrating algorithm and system innovation. (i) We introduce the MaxK nonlinearity and provide a theoretical analysis of MaxK nonlinearity as a universal approximator, and present the Compressed Balanced Sparse Row (CBSR) format, designed to store the data and index of the feature matrix after nonlinearity; (ii) We design a coalescing enhanced forward computation with row-wise product-based SpGEMM Kernel using CBSR for input feature matrix fetching and strategic placement of a sparse output accumulation buffer in shared memory; (iii) We develop an optimized backward computation with outer product-based and SSpMM Kernel. We conduct extensive evaluations of MaxK-GNN and report the end-to-end system run-time. Experiments show that MaxK-GNN system could approach the theoretical speedup limit according to Amdahl's law. We achieve comparable accuracy to SOTA GNNs, but at a significantly increased speed: 3.22/4.24 times speedup (vs. theoretical limits, 5.52/7.27 times) on Reddit compared to DGL and GNNAdvisor implementations. △ Less

Submitted 18 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: ASPLOS 2024 accepted publication

ACM Class: I.2; C.5

arXiv:2311.05197 [pdf]

doi 10.1016/j.eswa.2023.123029

Deep learning in computed tomography pulmonary angiography imaging: a dual-pronged approach for pulmonary embolism detection

Authors: Fabiha Bushra, Muhammad E. H. Chowdhury, Rusab Sarmun, Saidul Kabir, Menatalla Said, Sohaib Bassam Zoghoul, Adam Mushtak, Israa Al-Hashimi, Abdulrahman Alqahtani, Anwarul Hasan

Abstract: The increasing reliance on Computed Tomography Pulmonary Angiography (CTPA) for Pulmonary Embolism (PE) diagnosis presents challenges and a pressing need for improved diagnostic solutions. The primary objective of this study is to leverage deep learning techniques to enhance the Computer Assisted Diagnosis (CAD) of PE. With this aim, we propose a classifier-guided detection approach that effective… ▽ More The increasing reliance on Computed Tomography Pulmonary Angiography (CTPA) for Pulmonary Embolism (PE) diagnosis presents challenges and a pressing need for improved diagnostic solutions. The primary objective of this study is to leverage deep learning techniques to enhance the Computer Assisted Diagnosis (CAD) of PE. With this aim, we propose a classifier-guided detection approach that effectively leverages the classifier's probabilistic inference to direct the detection predictions, marking a novel contribution in the domain of automated PE diagnosis. Our classification system includes an Attention-Guided Convolutional Neural Network (AG-CNN) that uses local context by employing an attention mechanism. This approach emulates a human expert's attention by looking at both global appearances and local lesion regions before making a decision. The classifier demonstrates robust performance on the FUMPE dataset, achieving an AUROC of 0.927, sensitivity of 0.862, specificity of 0.879, and an F1-score of 0.805 with the Inception-v3 backbone architecture. Moreover, AG-CNN outperforms the baseline DenseNet-121 model, achieving an 8.1% AUROC gain. While previous research has mostly focused on finding PE in the main arteries, our use of cutting-edge object detection models and ensembling techniques greatly improves the accuracy of detecting small embolisms in the peripheral arteries. Finally, our proposed classifier-guided detection approach further refines the detection metrics, contributing new state-of-the-art to the community: mAP$_{50}$, sensitivity, and F1-score of 0.846, 0.901, and 0.779, respectively, outperforming the former benchmark with a significant 3.7% improvement in mAP$_{50}$. Our research aims to elevate PE patient care by integrating AI solutions into clinical workflows, highlighting the potential of human-AI collaboration in medical diagnostics. △ Less

Submitted 5 January, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: Published in Expert Systems With Applications

Journal ref: Expert Systems With Applications, Volume 245, 1 July 2024, 123029

arXiv:2310.16183 [pdf, other]

doi 10.18653/v1/2023.banglalp-1.48

BLP-2023 Task 2: Sentiment Analysis

Authors: Md. Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, Afiyat Anjum

Abstract: We present an overview of the BLP Sentiment Shared Task, organized as part of the inaugural BLP 2023 workshop, co-located with EMNLP 2023. The task is defined as the detection of sentiment in a given piece of social media text. This task attracted interest from 71 participants, among whom 29 and 30 teams submitted systems during the development and evaluation phases, respectively. In total, partic… ▽ More We present an overview of the BLP Sentiment Shared Task, organized as part of the inaugural BLP 2023 workshop, co-located with EMNLP 2023. The task is defined as the detection of sentiment in a given piece of social media text. This task attracted interest from 71 participants, among whom 29 and 30 teams submitted systems during the development and evaluation phases, respectively. In total, participants submitted 597 runs. However, a total of 15 teams submitted system description papers. The range of approaches in the submitted systems spans from classical machine learning models, fine-tuning pre-trained models, to leveraging Large Language Model (LLMs) in zero- and few-shot settings. In this paper, we provide a detailed account of the task setup, including dataset development and evaluation setup. Additionally, we provide a brief overview of the systems submitted by the participants. All datasets and evaluation scripts from the shared task have been made publicly available for the research community, to foster further research in this domain. △ Less

Submitted 21 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

Comments: Accepted in BLP Workshop at EMNLP-23

ACM Class: I.2.7

arXiv:2310.15685 [pdf, other]

doi 10.1117/12.2677192

Atmospheric dispersion corrector for a multi-object spectroscopic mode of HROS-TMT

Authors: Manjunath Bestha, Amirul Hasan, Devika Divakar, Arun Surya, S. Sriram, T. Sivarani, A** Prakash, Parvathy M, Sudharsan Yadav

Abstract: Highly multiplexed spectroscopic surveys have changed the astronomy landscape in recent years. However, these surveys are limited to low and medium spectral resolution. High spectral resolution spectroscopy is often photon starved and will benefit from a large telescope aperture. Multiplexed high-resolution surveys require a wide field of view and a large aperture for a suitable large number of br… ▽ More Highly multiplexed spectroscopic surveys have changed the astronomy landscape in recent years. However, these surveys are limited to low and medium spectral resolution. High spectral resolution spectroscopy is often photon starved and will benefit from a large telescope aperture. Multiplexed high-resolution surveys require a wide field of view and a large aperture for a suitable large number of bright targets. This requirement introduces several practical difficulties, especially for large telescopes, such as the future ELTs. Some of the challenges are the need for a wide field atmospheric dispersion corrector and to deal with the curved non-telecentric focal plane. Here, we present a concept of Multi-Object Spectroscopy (MOS) mode for TMT High-Resolution Optical Spectrograph (HROS), we have designed an atmospheric dispersion corrector for individual objects that fit inside a fiber positioner. We present the ZEMAX design and the performance of the atmospheric dispersion corrector for all elevations accessible by TMT. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Journal ref: Proc. SPIE 12677, Astronomical Optics: Design, Manufacture, and Test of Space and Ground Systems IV, 126770Q (4 October 2023)

arXiv:2310.03931 [pdf]

Optical Properties and Behavior of Whispering Gallery Mode Resonators in Complex Microsphere Configurations: Insights for Sensing and Information Processing Applications

Authors: Yaser M. Banad, Syed Mohammad Abid Hasan, Sarah S. Sharif, Georgios Veronis, Manas Ranjan Gartia

Abstract: Whispering gallery mode (WGM) resonators are garnering significant attention due to their unique characteristics and remarkable properties. When integrated with optical sensing and processing technology, WGM resonators offer numerous advantages, including compact size, high sensitivity, rapid response, and tunability. This paper comprehensively investigates the optical properties and behavior of W… ▽ More Whispering gallery mode (WGM) resonators are garnering significant attention due to their unique characteristics and remarkable properties. When integrated with optical sensing and processing technology, WGM resonators offer numerous advantages, including compact size, high sensitivity, rapid response, and tunability. This paper comprehensively investigates the optical properties and behavior of WGMs in complex microsphere resonator configurations. The findings underscore the potential of WGMs in sensing applications and their role in advancing future optical information processing. The study explores the impact of configuration, size, excitation, polarization, and coupling effects on the WGMs properties. The paper provides crucial insights and valuable guidance for designing and optimizing microsphere resonator systems, enabling their realization for practical applications. △ Less

Submitted 5 October, 2023; originally announced October 2023.

Comments: 11 pages, 13 figures

arXiv:2310.01720 [pdf, other]

Perceiver-based CDF Modeling for Time Series Forecasting

Authors: Cat P. Le, Chris Cannella, Ali Hasan, Yuting Ng, Vahid Tarokh

Abstract: Transformers have demonstrated remarkable efficacy in forecasting time series data. However, their extensive dependence on self-attention mechanisms demands significant computational resources, thereby limiting their practical applicability across diverse tasks, especially in multimodal problems. In this work, we propose a new architecture, called perceiver-CDF, for modeling cumulative distributio… ▽ More Transformers have demonstrated remarkable efficacy in forecasting time series data. However, their extensive dependence on self-attention mechanisms demands significant computational resources, thereby limiting their practical applicability across diverse tasks, especially in multimodal problems. In this work, we propose a new architecture, called perceiver-CDF, for modeling cumulative distribution functions (CDF) of time series data. Our approach combines the perceiver architecture with a copula-based attention mechanism tailored for multimodal time series prediction. By leveraging the perceiver, our model efficiently transforms high-dimensional and multimodal data into a compact latent space, thereby significantly reducing computational demands. Subsequently, we implement a copula-based attention mechanism to construct the joint distribution of missing data for prediction. Further, we propose an output variance testing mechanism to effectively mitigate error propagation during prediction. To enhance efficiency and reduce complexity, we introduce midpoint inference for the local attention mechanism. This enables the model to efficiently capture dependencies within nearby imputed samples without considering all previous samples. The experiments on the unimodal and multimodal benchmarks consistently demonstrate a 20% improvement over state-of-the-art methods while utilizing less than half of the computational resources. △ Less

Submitted 24 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: Accepted in Winter Simulation Conference 2024

arXiv:2309.05865 [pdf, other]

Force-directed graph embedding with hops distance

Authors: Hamidreza Lotfalizadeh, Mohammad Al Hasan

Abstract: Graph embedding has become an increasingly important technique for analyzing graph-structured data. By representing nodes in a graph as vectors in a low-dimensional space, graph embedding enables efficient graph processing and analysis tasks like node classification, link prediction, and visualization. In this paper, we propose a novel force-directed graph embedding method that utilizes the steady… ▽ More Graph embedding has become an increasingly important technique for analyzing graph-structured data. By representing nodes in a graph as vectors in a low-dimensional space, graph embedding enables efficient graph processing and analysis tasks like node classification, link prediction, and visualization. In this paper, we propose a novel force-directed graph embedding method that utilizes the steady acceleration kinetic formula to embed nodes in a way that preserves graph topology and structural features. Our method simulates a set of customized attractive and repulsive forces between all node pairs with respect to their hop distance. These forces are then used in Newton's second law to obtain the acceleration of each node. The method is intuitive, parallelizable, and highly scalable. We evaluate our method on several graph analysis tasks and show that it achieves competitive performance compared to state-of-the-art unsupervised embedding techniques. △ Less

Submitted 11 September, 2023; originally announced September 2023.

arXiv:2309.02188 [pdf, other]

Incorporating Dictionaries into a Neural Network Architecture to Extract COVID-19 Medical Concepts From Social Media

Authors: Abul Hasan, Mark Levene, David Weston

Abstract: We investigate the potential benefit of incorporating dictionary information into a neural network architecture for natural language processing. In particular, we make use of this architecture to extract several concepts related to COVID-19 from an on-line medical forum. We use a sample from the forum to manually curate one dictionary for each concept. In addition, we use MetaMap, which is a tool… ▽ More We investigate the potential benefit of incorporating dictionary information into a neural network architecture for natural language processing. In particular, we make use of this architecture to extract several concepts related to COVID-19 from an on-line medical forum. We use a sample from the forum to manually curate one dictionary for each concept. In addition, we use MetaMap, which is a tool for extracting biomedical concepts, to identify a small number of semantic concepts. For a supervised concept extraction task on the forum data, our best model achieved a macro $F_1$ score of 90\%. A major difficulty in medical concept extraction is obtaining labelled data from which to build supervised models. We investigate the utility of our models to transfer to data derived from a different source in two ways. First for producing labels via weak learning and second to perform concept extraction. The dataset we use in this case comprises COVID-19 related tweets and we achieve an $F_1$ score 81\% for symptom concept extraction trained on weakly labelled data. The utility of our dictionaries is compared with a COVID-19 symptom dictionary that was constructed directly from Twitter. Further experiments that incorporate BERT and a COVID-19 version of BERTweet demonstrate that the dictionaries provide a commensurate result. Our results show that incorporating small domain dictionaries to deep learning models can improve concept extraction tasks. Moreover, models built using dictionaries generalize well and are transferable to different datasets on a similar task. △ Less

Submitted 5 September, 2023; originally announced September 2023.

arXiv:2308.11825 [pdf, other]

Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks

Authors: Xi Xie, Hongwu Peng, Amit Hasan, Shaoyi Huang, Jiahui Zhao, Haowen Fang, Wei Zhang, Tong Geng, Omer Khan, Caiwen Ding

Abstract: Graph Convolutional Networks (GCNs) are pivotal in extracting latent information from graph data across various domains, yet their acceleration on mainstream GPUs is challenged by workload imbalance and memory access irregularity. To address these challenges, we present Accel-GCN, a GPU accelerator architecture for GCNs. The design of Accel-GCN encompasses: (i) a lightweight degree sorting stage t… ▽ More Graph Convolutional Networks (GCNs) are pivotal in extracting latent information from graph data across various domains, yet their acceleration on mainstream GPUs is challenged by workload imbalance and memory access irregularity. To address these challenges, we present Accel-GCN, a GPU accelerator architecture for GCNs. The design of Accel-GCN encompasses: (i) a lightweight degree sorting stage to group nodes with similar degree; (ii) a block-level partition strategy that dynamically adjusts warp workload sizes, enhancing shared memory locality and workload balance, and reducing metadata overhead compared to designs like GNNAdvisor; (iii) a combined warp strategy that improves memory coalescing and computational parallelism in the column dimension of dense matrices. Utilizing these principles, we formulated a kernel for sparse matrix multiplication (SpMM) in GCNs that employs block-level partitioning and combined warp strategy. This approach augments performance and multi-level memory efficiency and optimizes memory bandwidth by exploiting memory coalescing and alignment. Evaluation of Accel-GCN across 18 benchmark graphs reveals that it outperforms cuSPARSE, GNNAdvisor, and graph-BLAST by factors of 1.17 times, 1.86 times, and 2.94 times respectively. The results underscore Accel-GCN as an effective solution for enhancing GCN computational efficiency. △ Less

Submitted 22 August, 2023; originally announced August 2023.

Comments: ICCAD 2023 accepted publication

ACM Class: I.2; B.6; C.3

arXiv:2308.10783 [pdf, other]

Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

Authors: Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori

Abstract: The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare. While there have been significant advancements in sentiment analysis for widely spoken languages, low-resource languages, such as Bangla, remain largely under-researched due to resource constraints. Furthermore, the rec… ▽ More The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare. While there have been significant advancements in sentiment analysis for widely spoken languages, low-resource languages, such as Bangla, remain largely under-researched due to resource constraints. Furthermore, the recent unprecedented performance of Large Language Models (LLMs) in various applications highlights the need to evaluate them in the context of low-resource languages. In this study, we present a sizeable manually annotated dataset encompassing 33,606 Bangla news tweets and Facebook comments. We also investigate zero- and few-shot in-context learning with several language models, including Flan-T5, GPT-4, and Bloomz, offering a comparative analysis against fine-tuned models. Our findings suggest that monolingual transformer-based models consistently outperform other models, even in zero and few-shot scenarios. To foster continued exploration, we intend to make this dataset and our research tools publicly available to the broader research community. △ Less

Submitted 4 April, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

Comments: Accepted at LREC-COLING 2024. Zero-Shot Prompting, Few-Shot Prompting, LLMs, Comparative Study, Fine-tuned Models, Bangla, Sentiment Analysis

MSC Class: 68T50 ACM Class: I.2.7

arXiv:2308.00864 [pdf, other]

PeRP: Personalized Residual Policies For Congestion Mitigation Through Co-operative Advisory Systems

Authors: Aamir Hasan, Neeloy Chakraborty, Haonan Chen, Jung-Hoon Cho, Cathy Wu, Katherine Driggs-Campbell

Abstract: Intelligent driving systems can be used to mitigate congestion through simple actions, thus improving many socioeconomic factors such as commute time and gas costs. However, these systems assume precise control over autonomous vehicle fleets, and are hence limited in practice as they fail to account for uncertainty in human behavior. Piecewise Constant (PC) Policies address these issues by structu… ▽ More Intelligent driving systems can be used to mitigate congestion through simple actions, thus improving many socioeconomic factors such as commute time and gas costs. However, these systems assume precise control over autonomous vehicle fleets, and are hence limited in practice as they fail to account for uncertainty in human behavior. Piecewise Constant (PC) Policies address these issues by structurally modeling the likeness of human driving to reduce traffic congestion in dense scenarios to provide action advice to be followed by human drivers. However, PC policies assume that all drivers behave similarly. To this end, we develop a co-operative advisory system based on PC policies with a novel driver trait conditioned Personalized Residual Policy, PeRP. PeRP advises drivers to behave in ways that mitigate traffic congestion. We first infer the driver's intrinsic traits on how they follow instructions in an unsupervised manner with a variational autoencoder. Then, a policy conditioned on the inferred trait adapts the action of the PC policy to provide the driver with a personalized recommendation. Our system is trained in simulation with novel driver modeling of instruction adherence. We show that our approach successfully mitigates congestion while adapting to different driver behaviors, with 4 to 22% improvement in average speed over baselines. △ Less

Submitted 15 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: Accepted to ITSC 2023. Additional material and code is available at the project webpage: https://sites.google.com/illinois.edu/perp

arXiv:2307.06924 [pdf, other]

doi 10.1109/LRA.2024.3362591

DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding

Authors: Shui**g Liu, Aamir Hasan, Kaiwen Hong, Runxuan Wang, Peixin Chang, Zachary Mizrachi, Justin Lin, D. Livingston McPherson, Wendy A. Rogers, Katherine Driggs-Campbell

Abstract: Persons with visual impairments (PwVI) have difficulties understanding and navigating spaces around them. Current wayfinding technologies either focus solely on navigation or provide limited communication about the environment. Motivated by recent advances in visual-language grounding and semantic navigation, we propose DRAGON, a guiding robot powered by a dialogue system and the ability to associ… ▽ More Persons with visual impairments (PwVI) have difficulties understanding and navigating spaces around them. Current wayfinding technologies either focus solely on navigation or provide limited communication about the environment. Motivated by recent advances in visual-language grounding and semantic navigation, we propose DRAGON, a guiding robot powered by a dialogue system and the ability to associate the environment with natural language. By understanding the commands from the user, DRAGON is able to guide the user to the desired landmarks on the map, describe the environment, and answer questions from visual observations. Through effective utilization of dialogue, the robot can ground the user's free-form descriptions to landmarks in the environment, and give the user semantic information through spoken language. We conduct a user study with blindfolded participants in an everyday indoor environment. Our results demonstrate that DRAGON is able to communicate with the user smoothly, provide a good guiding experience, and connect users with their surrounding environment in an intuitive manner. Videos and code are available at https://sites.google.com/view/dragon-wayfinding/home. △ Less

Submitted 5 March, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

Comments: Published in IEEE Robotics and Automation Letters (RA-L)

arXiv:2307.06782 [pdf]

doi 10.15407/ujpe68.9.577

Proton and Neutron Pairing Properties within a mixed volume-surface pairing force using SKI3-HFB Theory

Authors: Malik A. Hasan, Ali H. Taqi

Abstract: This work aims at a systematic investigations of the pairing properties and Fermi properties from the proton drip-line to the neutron drip-line. In order to provide more accurate mass formula with skyrme SKI3 force, the global descriptive power of the SKI3-HFB model for pairing properties are made in this study. Systematic Skyrme SKI3-Hartree-Fock-Bogoliubov calculations with a mixed volume-surfac… ▽ More This work aims at a systematic investigations of the pairing properties and Fermi properties from the proton drip-line to the neutron drip-line. In order to provide more accurate mass formula with skyrme SKI3 force, the global descriptive power of the SKI3-HFB model for pairing properties are made in this study. Systematic Skyrme SKI3-Hartree-Fock-Bogoliubov calculations with a mixed volume-surface pairing force are carried out to study the ground-state neutron and proton pairing gap, neutron and proton pairing energy and neutron and proton Fermi energy for about 2095 even-even nuclei ranging from $2\le{Z}\le110 $ to $2\le{N}\le236$. The calculated results of neutron and proton pairing gap are compared with experimental data using the difference-point formulas $Δ^{(3)}, Δ^{(4)}$ and $Δ^{(5)}$, and also compared with the neutron and proton pairing gap of Lipkin-Nogami model. It is shown that the Skyrme-SKI3 functional with the mixed volume-surface pairing force can be successfully used for describing the ground-state pairing and Fermi properties of the investigated nuclei, in particularly the neutron-rich nuclei and the exotic nuclei near the neutron drip-line. On the other hand, the calculated neutron and proton pairing gap are in good agreement with the available experimental values of the neutron and proton pairing gap of the difference-point formulas $Δ^{(3)}, Δ^{(4)}$ and $Δ^{(5)}$ and with the data of Lipkin-Nogami model over the whole nuclear chart. △ Less

Submitted 13 July, 2023; originally announced July 2023.

Comments: 18 pages, 9 figures

arXiv:2307.05203 [pdf, other]

Best practices for quantum error mitigation with digital zero-noise extrapolation

Authors: Ritajit Majumdar, Pedro Rivero, Friederike Metz, Areeq Hasan, Derek S Wang

Abstract: Digital zero-noise extrapolation (dZNE) has emerged as a common approach for quantum error mitigation (QEM) due to its conceptual simplicity, accessibility, and resource efficiency. In practice, however, properly applying dZNE to extend the computational reach of noisy quantum processors is rife with subtleties. Here, based on literature review and original experiments on noisy simulators and real… ▽ More Digital zero-noise extrapolation (dZNE) has emerged as a common approach for quantum error mitigation (QEM) due to its conceptual simplicity, accessibility, and resource efficiency. In practice, however, properly applying dZNE to extend the computational reach of noisy quantum processors is rife with subtleties. Here, based on literature review and original experiments on noisy simulators and real quantum hardware, we define best practices for QEM with dZNE for each step of the workflow, including noise amplification, execution on the quantum device, extrapolation to the zero-noise limit, and composition with other QEM methods. We anticipate that this effort to establish best practices for dZNE will be extended to other QEM methods, leading to more reproducible and rigorous calculations on noisy quantum hardware. △ Less

Submitted 20 July, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

Comments: 10 pages, 11 figures, submitted to IEEE Quantum Week 2023

arXiv:2307.02199 [pdf, other]

Estimating mean profiles and fluxes in high-speed turbulent boundary layers using inner/outer-layer transformations

Authors: Asif Manzoor Hasan, Johan Larsson, Sergio Pirozzoli, Rene Pecnik

Abstract: Accurately predicting drag and heat transfer for compressible high-speed flows is of utmost importance for a range of engineering applications. This requires the precise knowledge of the entire velocity and temperature profiles. A common approach is to use compressible velocity scaling laws (transformation), that inverse transform the velocity profile of an incompressible flow, together with a tem… ▽ More Accurately predicting drag and heat transfer for compressible high-speed flows is of utmost importance for a range of engineering applications. This requires the precise knowledge of the entire velocity and temperature profiles. A common approach is to use compressible velocity scaling laws (transformation), that inverse transform the velocity profile of an incompressible flow, together with a temperature-velocity relation. In this Note, we use distinct velocity transformations for the inner and outer layers. In the inner layer, we utilize a recently proposed scaling law that appropriately incorporates variable property and intrinsic compressibility effects, while the outer layer profile is inverse-transformed with the well-known Van Driest transformation. The result is an analytical expression for the mean shear valid in the entire boundary layer, which combined with a temperature-velocity relationship, provides predictions of mean velocity and temperature profiles at unprecedented accuracy. Using these profiles, drag and heat transfer is evaluated with an accuracy of +/-4% and +/-8%, respectively, for a wide range of compressible turbulent boundary layers up to Mach numbers of 14. △ Less

Submitted 5 July, 2023; originally announced July 2023.

Comments: 9 pages, 3 figures

arXiv:2306.11697 [pdf, other]

Treatment Effects in Extreme Regimes

Authors: Ahmed Aloui, Ali Hasan, Yuting Ng, Miroslav Pajic, Vahid Tarokh

Abstract: Understanding treatment effects in extreme regimes is important for characterizing risks associated with different interventions. This is hindered by the unavailability of counterfactual outcomes and the rarity and difficulty of collecting extreme data in practice. To address this issue, we propose a new framework based on extreme value theory for estimating treatment effects in extreme regimes. W… ▽ More Understanding treatment effects in extreme regimes is important for characterizing risks associated with different interventions. This is hindered by the unavailability of counterfactual outcomes and the rarity and difficulty of collecting extreme data in practice. To address this issue, we propose a new framework based on extreme value theory for estimating treatment effects in extreme regimes. We quantify these effects using variations in tail decay rates of potential outcomes in the presence and absence of treatments. We establish algorithms for calculating these quantities and develop related theoretical results. We demonstrate the efficacy of our approach on various standard synthetic and semi-synthetic datasets. △ Less

Submitted 22 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.00762 [pdf, other]

Inference and Sampling of Point Processes from Diffusion Excursions

Authors: Ali Hasan, Yu Chen, Yuting Ng, Mohamed Abdelghani, Anderson Schneider, Vahid Tarokh

Abstract: Point processes often have a natural interpretation with respect to a continuous process. We propose a point process construction that describes arrival time observations in terms of the state of a latent diffusion process. In this framework, we relate the return times of a diffusion in a continuous path space to new arrivals of the point process. This leads to a continuous sample path that is use… ▽ More Point processes often have a natural interpretation with respect to a continuous process. We propose a point process construction that describes arrival time observations in terms of the state of a latent diffusion process. In this framework, we relate the return times of a diffusion in a continuous path space to new arrivals of the point process. This leads to a continuous sample path that is used to describe the underlying mechanism generating the arrival distribution. These models arise in many disciplines, such as financial settings where actions in a market are determined by a hidden continuous price or in neuroscience where a latent stimulus generates spike trains. Based on the developments in Itô's excursion theory, we propose methods for inferring and sampling from the point process derived from the latent diffusion process. We illustrate the approach with numerical examples using both simulated and real data. The proposed methods and framework provide a basis for interpreting point processes through the lens of diffusions. △ Less

Submitted 1 June, 2023; originally announced June 2023.

Comments: In UAI 2023

arXiv:2305.10422 [pdf, other]

Four-manifolds and Symmetry Categories of 2d CFTs

Authors: Vladimir Bashmakov, Michele Del Zotto, Azeem Hasan

Abstract: In this paper we study the geometric origin of non-invertible symmetries of 2d theories arising from the reduction of 6d $(2,0)$ theories on four-manifolds. This generalizes and extends our previous results in the context of class $\mathcal S$ theories to a wider realm of models. In particular, we find that relative 2d field theories, such as the chiral boson, have a higher dimensional origin in f… ▽ More In this paper we study the geometric origin of non-invertible symmetries of 2d theories arising from the reduction of 6d $(2,0)$ theories on four-manifolds. This generalizes and extends our previous results in the context of class $\mathcal S$ theories to a wider realm of models. In particular, we find that relative 2d field theories, such as the chiral boson, have a higher dimensional origin in four-manifolds that are not null cobordant. Moreover, we see that for the 2d theories with a 6d origin, the non-invertible symmetries have a geometric origin as a sum over topologies from the perspective of the 7d symmetry TFT. In particular, we show that the Tambara-Yamagami non-invertible symmetries $TY(\mathbb Z_N)$ can be given a geometric origin of this kind. We focus on examples that do not depend on spin structures, but we analyse the simplest of such cases, finding an interesting parallel between the extra choices arising in that context and symmetry fractionalization in Maxwell theories. △ Less

Submitted 11 January, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

Comments: Comments are very much welcome

arXiv:2305.06712 [pdf, ps, other]

Incorporating intrinsic compressibility effects in velocity transformations for wall-bounded turbulent flows

Authors: Asif Manzoor Hasan, Johan Larsson, Sergio Pirozzoli, Rene Pecnik

Abstract: A transformation that relates a compressible wall-bounded turbulent flow with non-uniform fluid properties to an equivalent incompressible flow with uniform fluid properties is derived and validated. The transformation accounts for both variable-property and intrinsic compressibility effects, the latter being the key improvement over the current state-of-the-art. The importance of intrinsic compre… ▽ More A transformation that relates a compressible wall-bounded turbulent flow with non-uniform fluid properties to an equivalent incompressible flow with uniform fluid properties is derived and validated. The transformation accounts for both variable-property and intrinsic compressibility effects, the latter being the key improvement over the current state-of-the-art. The importance of intrinsic compressibility effects contradicts the renowned Morkovin's hypothesis. △ Less

Submitted 2 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

Comments: 7 pages, 2 figures

arXiv:2304.13081 [pdf, other]

Organizational Governance of Emerging Technologies: AI Adoption in Healthcare

Authors: Jee Young Kim, William Boag, Freya Gulamali, Alifia Hasan, Henry David Jeffry Hogg, Mark Lifson, Deirdre Mulligan, Manesh Patel, Inioluwa Deborah Raji, Ajai Sehgal, Keo Shaw, Danny Tobey, Alexandra Valladares, David Vidal, Suresh Balu, Mark Sendak

Abstract: Private and public sector structures and norms refine how emerging technology is used in practice. In healthcare, despite a proliferation of AI adoption, the organizational governance surrounding its use and integration is often poorly understood. What the Health AI Partnership (HAIP) aims to do in this research is to better define the requirements for adequate organizational governance of AI syst… ▽ More Private and public sector structures and norms refine how emerging technology is used in practice. In healthcare, despite a proliferation of AI adoption, the organizational governance surrounding its use and integration is often poorly understood. What the Health AI Partnership (HAIP) aims to do in this research is to better define the requirements for adequate organizational governance of AI systems in healthcare settings and support health system leaders to make more informed decisions around AI adoption. To work towards this understanding, we first identify how the standards for the AI adoption in healthcare may be designed to be used easily and efficiently. Then, we map out the precise decision points involved in the practical institutional adoption of AI technology within specific health systems. Practically, we achieve this through a multi-organizational collaboration with leaders from major health systems across the United States and key informants from related fields. Working with the consultancy IDEO [dot] org, we were able to conduct usability-testing sessions with healthcare and AI ethics professionals. Usability analysis revealed a prototype structured around mock key decision points that align with how organizational leaders approach technology adoption. Concurrently, we conducted semi-structured interviews with 89 professionals in healthcare and other relevant fields. Using a modified grounded theory approach, we were able to identify 8 key decision points and comprehensive procedures throughout the AI adoption lifecycle. This is one of the most detailed qualitative analyses to date of the current governance structures and processes involved in AI adoption by health systems in the United States. We hope these findings can inform future efforts to build capabilities to promote the safe, effective, and responsible adoption of emerging technologies in healthcare. △ Less

Submitted 10 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

arXiv:2304.12820 [pdf]

Direct Simulation Monte Carlo Analysis on Thrust Vectoring of a Supersonic Micro Nozzle using Bypass Mass Injection

Authors: Maruf Md. Ikram, Abu Taqui Md. Tahsin, A. B. M. Toufique Hasan

Abstract: Converging diverging micro nozzle is fundamentally intended for flow acceleration through the generation of kinetic energy for the advanced micro-propulsion systems. Such supersonic micro nozzles have significant applications in the launching, propulsion and rapid directional control of the micro-satellites for a better maneuver. Micro scale analysis of such flow devices is extended to the rarefie… ▽ More Converging diverging micro nozzle is fundamentally intended for flow acceleration through the generation of kinetic energy for the advanced micro-propulsion systems. Such supersonic micro nozzles have significant applications in the launching, propulsion and rapid directional control of the micro-satellites for a better maneuver. Micro scale analysis of such flow devices is extended to the rarefied flow regime. Present study has addressed thrust vectoring in a planar converging-diverging supersonic micro nozzle by the bypass mass injection technique. Direct Simulation Monte Carlo (DSMC) method has been used for numerical modelling. Primary focus is given on the thrust vectoring control of the micro nozzle with a throat height of 20 micro meter and an expansion ratio of 1.7. For the secondary injection, a rectangular channel of 5 different bypass widths (2 - 12 micro meter) is considered for two different outlet pressures (Pout = 10 kPa and 40 kPa) while kee** the inlet pressure (Pin) and temperature (Tin) fixed at 1 atm and 300 k respectively. The physical behavior of the micro nozzle is acknowledged through the analysis of Mach, pressure, temperature and density contours. Numerical results reveal that the secondary flow injection is adapted into the primary flow through the formation of a pressure bump in the diverging section. Moreover, the total mass flow rate, secondary flow percentage, thrust force, the thrust coefficient and specific impulse increase with the bypass channel width. A change in thrust direction is obtained which in turn produces a considerable vectoring effect in the supersonic micro nozzle. The vectoring angle for Pout of 40 kPa peaks at 6 micro meter bypass channel whereas a gradual increase of the vectoring angle with the bypass channel width is observed for Pout = 10 kPa. △ Less

Submitted 7 July, 2023; v1 submitted 13 December, 2022; originally announced April 2023.

Comments: Journal paper and it's under review

Showing 1–50 of 188 results for author: Hasan, A