-
Lung-CADex: Fully automatic Zero-Shot Detection and Classification of Lung Nodules in Thoracic CT Images
Authors:
Furqan Shaukat,
Syed Muhammad Anwar,
Abhijeet Parida,
Van Khanh Lam,
Marius George Linguraru,
Mubarak Shah
Abstract:
Lung cancer has been one of the major threats to human life for decades. Computer-aided diagnosis can help with early lung nodul detection and facilitate subsequent nodule characterization. Large Visual Language models (VLMs) have been found effective for multiple downstream medical tasks that rely on both imaging and text data. However, lesion level detection and subsequent diagnosis using VLMs h…
▽ More
Lung cancer has been one of the major threats to human life for decades. Computer-aided diagnosis can help with early lung nodul detection and facilitate subsequent nodule characterization. Large Visual Language models (VLMs) have been found effective for multiple downstream medical tasks that rely on both imaging and text data. However, lesion level detection and subsequent diagnosis using VLMs have not been explored yet. We propose CADe, for segmenting lung nodules in a zero-shot manner using a variant of the Segment Anything Model called MedSAM. CADe trains on a prompt suite on input computed tomography (CT) scans by using the CLIP text encoder through prefix tuning. We also propose, CADx, a method for the nodule characterization as benign/malignant by making a gallery of radiomic features and aligning image-feature pairs through contrastive learning. Training and validation of CADe and CADx have been done using one of the largest publicly available datasets, called LIDC. To check the generalization ability of the model, it is also evaluated on a challenging dataset, LUNGx. Our experimental results show that the proposed methods achieve a sensitivity of 0.86 compared to 0.76 that of other fully supervised methods.The source code, datasets and pre-processed data can be accessed using the link:
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions
Authors:
Hareem Nisar,
Syed Muhammad Anwar,
Zhifan Jiang,
Abhijeet Parida,
Vishwesh Nath,
Holger R. Roth,
Marius George Linguraru
Abstract:
Large vision language models (VLMs) have progressed incredibly from research to applicability for general-purpose use cases. LLaVA-Med, a pioneering large language and vision assistant for biomedicine, can perform multi-modal biomedical image and data analysis to provide a natural language interface for radiologists. While it is highly generalizable and works with multi-modal data, it is currently…
▽ More
Large vision language models (VLMs) have progressed incredibly from research to applicability for general-purpose use cases. LLaVA-Med, a pioneering large language and vision assistant for biomedicine, can perform multi-modal biomedical image and data analysis to provide a natural language interface for radiologists. While it is highly generalizable and works with multi-modal data, it is currently limited by well-known challenges that exist in the large language model space. Hallucinations and imprecision in responses can lead to misdiagnosis which currently hinder the clinical adaptability of VLMs. To create precise, user-friendly models in healthcare, we propose D-Rax -- a domain-specific, conversational, radiologic assistance tool that can be used to gain insights about a particular radiologic image. In this study, we enhance the conversational analysis of chest X-ray (CXR) images to support radiological reporting, offering comprehensive insights from medical imaging and aiding in the formulation of accurate diagnosis. D-Rax is achieved by fine-tuning the LLaVA-Med architecture on our curated enhanced instruction-following data, comprising of images, instructions, as well as disease diagnosis and demographic predictions derived from MIMIC-CXR imaging data, CXR-related visual question answer (VQA) pairs, and predictive outcomes from multiple expert AI models. We observe statistically significant improvement in responses when evaluated for both open and close-ended conversations. Leveraging the power of state-of-the-art diagnostic models combined with VLMs, D-Rax empowers clinicians to interact with medical images using natural language, which could potentially streamline their decision-making process, enhance diagnostic accuracy, and conserve their time.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Quantum interrogation using weak value measurement
Authors:
Muhammad Abdullah Ijaz,
Syed Bilal Hyder Shah,
Muhammad Sabieh Anwar
Abstract:
We propose a scheme for quantum interrogation measurements using constructive interference and post-selection to achieve single-pass high-efficiency detection for imperfect or semi-transparent absorbers. We illustrate that our method works for heralded single-photon as well as weak attenuated sources. We also study the influence of error from our equipment and show that post-selection renders robu…
▽ More
We propose a scheme for quantum interrogation measurements using constructive interference and post-selection to achieve single-pass high-efficiency detection for imperfect or semi-transparent absorbers. We illustrate that our method works for heralded single-photon as well as weak attenuated sources. We also study the influence of error from our equipment and show that post-selection renders robustness to our scheme against noise. We further demonstrate that with a small extension, we can quantify the transmittance of the imperfect absorber by using the process of weak value amplification (WVA)
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Geometric and Harmonic Aging Intensity function and a Reliability Perspective
Authors:
Subarna Bhattacharjee,
Ananda Sen,
Sabana Anwar,
Aninda Kumar Nanda
Abstract:
In this paper, we introduce some new notions of aging based on geometric, harmonic means of failure rate and aging intensity function. We define a generalized version of aging functions called specific interval-average geometric hazard rate, specific interval-average harmonic hazard rate. We focus on some characterization results and their inter-relationships among the resulting non-parametric cla…
▽ More
In this paper, we introduce some new notions of aging based on geometric, harmonic means of failure rate and aging intensity function. We define a generalized version of aging functions called specific interval-average geometric hazard rate, specific interval-average harmonic hazard rate. We focus on some characterization results and their inter-relationships among the resulting non-parametric classes of distributions. Monotonic nature of so defined aging classes are exhibited by some well known probability distributions. Probabilistic orders based on these functions are taken up for further study. The work is illustrated through case studies and a simulated data having applications in reliability/survival analysis.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation
Authors:
A B M Ashikur Rahman,
Saeed Anwar,
Muhammad Usman,
Ajmal Mian
Abstract:
Large Language Models (LLMs) have demonstrated remarkable capabilities, revolutionizing the integration of AI in daily life applications. However, they are prone to hallucinations, generating claims that contradict established facts, deviating from prompts, and producing inconsistent responses when the same prompt is presented multiple times. Addressing these issues is challenging due to the lack…
▽ More
Large Language Models (LLMs) have demonstrated remarkable capabilities, revolutionizing the integration of AI in daily life applications. However, they are prone to hallucinations, generating claims that contradict established facts, deviating from prompts, and producing inconsistent responses when the same prompt is presented multiple times. Addressing these issues is challenging due to the lack of comprehensive and easily assessable benchmark datasets. Most existing datasets are small and rely on multiple-choice questions, which are inadequate for evaluating the generative prowess of LLMs. To measure hallucination in LLMs, this paper introduces a comprehensive benchmark dataset comprising over 75,000 prompts across eight domains. These prompts are designed to elicit definitive, concise, and informative answers. The dataset is divided into two segments: one publicly available for testing and assessing LLM performance and a hidden segment for benchmarking various LLMs. In our experiments, we tested six LLMs-GPT-3.5, LLama 2, LLama 3, Gemini, Mixtral, and Zephyr-revealing that overall factual hallucination ranges from 59% to 82% on the public dataset and 57% to 76% in the hidden benchmark. Prompt misalignment hallucination ranges from 6% to 95% in the public dataset and 17% to 94% in the hidden counterpart. Average consistency ranges from 21% to 61% and 22% to 63%, respectively. Domain-wise analysis shows that LLM performance significantly deteriorates when asked for specific numeric information while performing moderately with person, location, and date queries. Our dataset demonstrates its efficacy and serves as a comprehensive benchmark for LLM performance evaluation. Our dataset and LLMs responses are available at \href{https://github.com/ashikiut/DefAn}{https://github.com/ashikiut/DefAn}.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
FSBI: Deepfakes Detection with Frequency Enhanced Self-Blended Images
Authors:
Ahmed Abul Hasanaath,
Hamzah Luqman,
Raed Katib,
Saeed Anwar
Abstract:
Advances in deepfake research have led to the creation of almost perfect manipulations undetectable by human eyes and some deepfakes detection tools. Recently, several techniques have been proposed to differentiate deepfakes from realistic images and videos. This paper introduces a Frequency Enhanced Self-Blended Images (FSBI) approach for deepfakes detection. This proposed approach utilizes Discr…
▽ More
Advances in deepfake research have led to the creation of almost perfect manipulations undetectable by human eyes and some deepfakes detection tools. Recently, several techniques have been proposed to differentiate deepfakes from realistic images and videos. This paper introduces a Frequency Enhanced Self-Blended Images (FSBI) approach for deepfakes detection. This proposed approach utilizes Discrete Wavelet Transforms (DWT) to extract discriminative features from the self-blended images (SBI) to be used for training a convolutional network architecture model. The SBIs blend the image with itself by introducing several forgery artifacts in a copy of the image before blending it. This prevents the classifier from overfitting specific artifacts by learning more generic representations. These blended images are then fed into the frequency features extractor to detect artifacts that can not be detected easily in the time domain. The proposed approach has been evaluated on FF++ and Celeb-DF datasets and the obtained results outperformed the state-of-the-art techniques with the cross-dataset evaluation protocol.
△ Less
Submitted 25 June, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Global synchronization in generalized multilayer higher-order networks
Authors:
Palash Kumar Pal,
Md Sayeed Anwar,
Matjaz Perc,
Dibakar Ghosh
Abstract:
Networks incorporating higher-order interactions are increasingly recognized for their ability to introduce novel dynamics into various processes, including synchronization. Previous studies on synchronization within multilayer networks have often been limited to specific models, such as the Kuramoto model, or have focused solely on higher-order interactions within individual layers. Here, we pres…
▽ More
Networks incorporating higher-order interactions are increasingly recognized for their ability to introduce novel dynamics into various processes, including synchronization. Previous studies on synchronization within multilayer networks have often been limited to specific models, such as the Kuramoto model, or have focused solely on higher-order interactions within individual layers. Here, we present a comprehensive framework for investigating synchronization, particularly global synchronization, in multilayer networks with higher-order interactions. Our framework considers interactions beyond pairwise connections, both within and across layers. We demonstrate the existence of a stable global synchronous state, with a condition resembling the master stability function, contingent on the choice of coupling functions. Our theoretical findings are supported by simulations using Hindmarsh-Rose neuronal and Rössler oscillators. These simulations illustrate how synchronization is facilitated by higher-order interactions, both within and across layers, highlighting the advantages over scenarios involving interactions within single layers.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Brain Tumor Segmentation (BraTS) Challenge 2024: Meningioma Radiotherapy Planning Automated Segmentation
Authors:
Dominic LaBella,
Katherine Schumacher,
Michael Mix,
Kevin Leu,
Shan McBurney-Lin,
Pierre Nedelec,
Javier Villanueva-Meyer,
Jonathan Shapey,
Tom Vercauteren,
Kazumi Chia,
Omar Al-Salihi,
Justin Leu,
Lia Halasz,
Yury Velichko,
Chunhao Wang,
John Kirkpatrick,
Scott Floyd,
Zachary J. Reitman,
Trey Mullikin,
Ulas Bagci,
Sean Sachdev,
Jona A. Hattangadi-Gluth,
Tyler Seibert,
Nikdokht Farid,
Connor Puett
, et al. (45 additional authors not shown)
Abstract:
The 2024 Brain Tumor Segmentation Meningioma Radiotherapy (BraTS-MEN-RT) challenge aims to advance automated segmentation algorithms using the largest known multi-institutional dataset of radiotherapy planning brain MRIs with expert-annotated target labels for patients with intact or post-operative meningioma that underwent either conventional external beam radiotherapy or stereotactic radiosurger…
▽ More
The 2024 Brain Tumor Segmentation Meningioma Radiotherapy (BraTS-MEN-RT) challenge aims to advance automated segmentation algorithms using the largest known multi-institutional dataset of radiotherapy planning brain MRIs with expert-annotated target labels for patients with intact or post-operative meningioma that underwent either conventional external beam radiotherapy or stereotactic radiosurgery. Each case includes a defaced 3D post-contrast T1-weighted radiotherapy planning MRI in its native acquisition space, accompanied by a single-label "target volume" representing the gross tumor volume (GTV) and any at-risk post-operative site. Target volume annotations adhere to established radiotherapy planning protocols, ensuring consistency across cases and institutions. For pre-operative meningiomas, the target volume encompasses the entire GTV and associated nodular dural tail, while for post-operative cases, it includes at-risk resection cavity margins as determined by the treating institution. Case annotations were reviewed and approved by expert neuroradiologists and radiation oncologists. Participating teams will develop, containerize, and evaluate automated segmentation models using this comprehensive dataset. Model performance will be assessed using the lesion-wise Dice Similarity Coefficient and the 95% Hausdorff distance. The top-performing teams will be recognized at the Medical Image Computing and Computer Assisted Intervention Conference in October 2024. BraTS-MEN-RT is expected to significantly advance automated radiotherapy planning by enabling precise tumor segmentation and facilitating tailored treatment, ultimately improving patient outcomes.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Analysis of the BraTS 2023 Intracranial Meningioma Segmentation Challenge
Authors:
Dominic LaBella,
Ujjwal Baid,
Omaditya Khanna,
Shan McBurney-Lin,
Ryan McLean,
Pierre Nedelec,
Arif Rashid,
Nourel Hoda Tahon,
Talissa Altes,
Radhika Bhalerao,
Yaseen Dhemesh,
Devon Godfrey,
Fathi Hilal,
Scott Floyd,
Anastasia Janas,
Anahita Fathi Kazerooni,
John Kirkpatrick,
Collin Kent,
Florian Kofler,
Kevin Leu,
Nazanin Maleki,
Bjoern Menze,
Maxence Pajot,
Zachary J. Reitman,
Jeffrey D. Rudie
, et al. (96 additional authors not shown)
Abstract:
We describe the design and results from the BraTS 2023 Intracranial Meningioma Segmentation Challenge. The BraTS Meningioma Challenge differed from prior BraTS Glioma challenges in that it focused on meningiomas, which are typically benign extra-axial tumors with diverse radiologic and anatomical presentation and a propensity for multiplicity. Nine participating teams each developed deep-learning…
▽ More
We describe the design and results from the BraTS 2023 Intracranial Meningioma Segmentation Challenge. The BraTS Meningioma Challenge differed from prior BraTS Glioma challenges in that it focused on meningiomas, which are typically benign extra-axial tumors with diverse radiologic and anatomical presentation and a propensity for multiplicity. Nine participating teams each developed deep-learning automated segmentation models using image data from the largest multi-institutional systematically expert annotated multilabel multi-sequence meningioma MRI dataset to date, which included 1000 training set cases, 141 validation set cases, and 283 hidden test set cases. Each case included T2, T2/FLAIR, T1, and T1Gd brain MRI sequences with associated tumor compartment labels delineating enhancing tumor, non-enhancing tumor, and surrounding non-enhancing T2/FLAIR hyperintensity. Participant automated segmentation models were evaluated and ranked based on a scoring system evaluating lesion-wise metrics including dice similarity coefficient (DSC) and 95% Hausdorff Distance. The top ranked team had a lesion-wise median dice similarity coefficient (DSC) of 0.976, 0.976, and 0.964 for enhancing tumor, tumor core, and whole tumor, respectively and a corresponding average DSC of 0.899, 0.904, and 0.871, respectively. These results serve as state-of-the-art benchmarks for future pre-operative meningioma automated segmentation algorithms. Additionally, we found that 1286 of 1424 cases (90.3%) had at least 1 compartment voxel abutting the edge of the skull-stripped image edge, which requires further investigation into optimal pre-processing face anonymization steps.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Bird's-Eye View to Street-View: A Survey
Authors:
Khawlah Bajbaa,
Muhammad Usman,
Saeed Anwar,
Ibrahim Radwan,
Abdul Bais
Abstract:
In recent years, street view imagery has grown to become one of the most important sources of geospatial data collection and urban analytics, which facilitates generating meaningful insights and assisting in decision-making. Synthesizing a street-view image from its corresponding satellite image is a challenging task due to the significant differences in appearance and viewpoint between the two do…
▽ More
In recent years, street view imagery has grown to become one of the most important sources of geospatial data collection and urban analytics, which facilitates generating meaningful insights and assisting in decision-making. Synthesizing a street-view image from its corresponding satellite image is a challenging task due to the significant differences in appearance and viewpoint between the two domains. In this study, we screened 20 recent research papers to provide a thorough review of the state-of-the-art of how street-view images are synthesized from their corresponding satellite counterparts. The main findings are: (i) novel deep learning techniques are required for synthesizing more realistic and accurate street-view images; (ii) more datasets need to be collected for public usage; and (iii) more specific evaluation metrics need to be investigated for evaluating the generated images appropriately. We conclude that, due to applying outdated deep learning techniques, the recent literature failed to generate detailed and diverse street-view images.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Dynamic FMR and magneto-optical response of hydrogenated FCC phase Fe25Pd75 thin films and micro patterned devices
Authors:
Shahbaz Khan,
Satyajit Sarkar,
Nicolas B. Lawler,
Ali Akbar,
Muhammad Sabieh Anwar,
Mariusz Martyniuk,
K. Swaminathan Iyer,
Mikhail Kostylev
Abstract:
In this work, we investigate the effects of H2 on the physical properties of Fe25Pd75. Broadband ferromagnetic resonance (FMR) spectroscopy revealed a significant FMR peak shift induced by H2 absorption for the FCC phased Fe25Pd75. The peak shifted towards higher applied fields, which is contrary to what was previously observed for CoPd alloys. Additionally, we conducted structural and magneto-opt…
▽ More
In this work, we investigate the effects of H2 on the physical properties of Fe25Pd75. Broadband ferromagnetic resonance (FMR) spectroscopy revealed a significant FMR peak shift induced by H2 absorption for the FCC phased Fe25Pd75. The peak shifted towards higher applied fields, which is contrary to what was previously observed for CoPd alloys. Additionally, we conducted structural and magneto-optical Kerr ellipsometric studies on the Fe25Pd75 film and performed density functional theory calculations to explore the electronic and magnetic properties in both hydrogenated and dehydrogenated states. In the final part of this study, we deposited a Fe25Pd75 layer on top of a microscopic coplanar transmission line and investigated the FMR response of the layer while driven by a microwave current in the coplanar line. We observed a large amplitude FMR response upon hydrogen absorption, as well as desorption rates when cycling between pure N2 and a mixture of 3% H2 + 97% N2.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Deep Models for Multi-View 3D Object Recognition: A Review
Authors:
Mona Alzahrani,
Muhammad Usman,
Salma Kammoun,
Saeed Anwar,
Tarek Helmy
Abstract:
Human decision-making often relies on visual information from multiple perspectives or views. In contrast, machine learning-based object recognition utilizes information from a single image of the object. However, the information conveyed by a single image may not be sufficient for accurate decision-making, particularly in complex recognition problems. The utilization of multi-view 3D representati…
▽ More
Human decision-making often relies on visual information from multiple perspectives or views. In contrast, machine learning-based object recognition utilizes information from a single image of the object. However, the information conveyed by a single image may not be sufficient for accurate decision-making, particularly in complex recognition problems. The utilization of multi-view 3D representations for object recognition has thus far demonstrated the most promising results for achieving state-of-the-art performance. This review paper comprehensively covers recent progress in multi-view 3D object recognition methods for 3D classification and retrieval tasks. Specifically, we focus on deep learning-based and transformer-based techniques, as they are widely utilized and have achieved state-of-the-art performance. We provide detailed information about existing deep learning-based and transformer-based multi-view 3D object recognition models, including the most commonly used 3D datasets, camera configurations and number of views, view selection strategies, pre-trained CNN architectures, fusion strategies, and recognition performance on 3D classification and 3D retrieval tasks. Additionally, we examine various computer vision applications that use multi-view classification. Finally, we highlight key findings and future directions for develo** multi-view 3D object recognition methods to provide readers with a comprehensive understanding of the field.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)
Authors:
Anahita Fathi Kazerooni,
Nastaran Khalili,
Deep Gandhi,
Xinyang Liu,
Zhifan Jiang,
Syed Muhammed Anwar,
Jake Albrecht,
Maruf Adewole,
Udunna Anazodo,
Hannah Anderson,
Sina Bagheri,
Ujjwal Baid,
Timothy Bergquist,
Austin J. Borja,
Evan Calabrese,
Verena Chung,
Gian-Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Anurag Gottipati,
Debanjan Haldar,
Shuvanjan Haldar
, et al. (51 additional authors not shown)
Abstract:
Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we pr…
▽ More
Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we present the CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs challenge, focused on pediatric brain tumors with data acquired across multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. The CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs challenge brings together clinicians and AI/imaging scientists to lead to faster development of automated segmentation techniques that could benefit clinical trials, and ultimately the care of children with brain tumors.
△ Less
Submitted 29 April, 2024; v1 submitted 23 April, 2024;
originally announced April 2024.
-
On weighted failure rate, its means and associated quantile version
Authors:
Subarna Bhattacharjee,
S. M. Sunoj,
Sabana Anwar
Abstract:
In this paper, we define weighted failure rate and their different means from the stand point of an application. We begin by emphasizing that the formation of n independent component series system having weighted failure rates with sum of weight functions being unity is same as a mixture of n distributions. We derive some parametric and non-parametric characterization results. We discuss on the fo…
▽ More
In this paper, we define weighted failure rate and their different means from the stand point of an application. We begin by emphasizing that the formation of n independent component series system having weighted failure rates with sum of weight functions being unity is same as a mixture of n distributions. We derive some parametric and non-parametric characterization results. We discuss on the form invariance property of baseline failure rate for a specific choice of weight function. Some bounds on means of aging functions are obtained. Here, we establish that weighted IFRA class is not closed under formation of coherent systems unlike the IFRA class. An interesting application of the present work is credited to the fact that the quantile version of means of failure rate is obtained as a special case of weighted means of failure rate.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Cluster formation due to repulsive spanning trees in attractively coupled networks
Authors:
Sayantan Nag Chowdhury,
Md Sayeed Anwar,
Dibakar Ghosh
Abstract:
Ensembles of coupled nonlinear oscillators are a popular paradigm and an ideal benchmark for analyzing complex collective behaviors. The onset of cluster synchronization is found to be at the core of various technological and biological processes. The current literature has investigated cluster synchronization by focusing mostly on the case of attractive coupling among the oscillators. However, th…
▽ More
Ensembles of coupled nonlinear oscillators are a popular paradigm and an ideal benchmark for analyzing complex collective behaviors. The onset of cluster synchronization is found to be at the core of various technological and biological processes. The current literature has investigated cluster synchronization by focusing mostly on the case of attractive coupling among the oscillators. However, the case of two coexisting competing interactions is of practical interest due to their relevance in diverse natural settings, including neuronal networks consisting of excitatory and inhibitory neurons, the coevolving social model with voters of opposite opinions, ecological plant communities with both facilitation and competition, to name a few. In the present article, we investigate the impact of repulsive spanning trees on cluster formation within a connected network of attractively coupled limit cycle oscillators. We successfully predict which nodes belong to each cluster and the emergent frustration of the connected networks independent of the particular local dynamics at the network nodes. We also determine local asymptotic stability of the cluster states using an approach based on the formulation of a master stability function. We additionally validate the emergence of solitary states and antisynchronization for some specific choices of spanning trees and networks.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
DiCoM -- Diverse Concept Modeling towards Enhancing Generalizability in Chest X-Ray Studies
Authors:
Abhieet Parida,
Daniel Capellan-Martin,
Sara Atito,
Muhammad Awais,
Maria J. Ledesma-Carbayo,
Marius G. Linguraru,
Syed Muhammad Anwar
Abstract:
Chest X-Ray (CXR) is a widely used clinical imaging modality and has a pivotal role in the diagnosis and prognosis of various lung and heart related conditions. Conventional automated clinical diagnostic tool design strategies relying on radiology reads and supervised learning, entail the cumbersome requirement of high quality annotated training data. To address this challenge, self-supervised pre…
▽ More
Chest X-Ray (CXR) is a widely used clinical imaging modality and has a pivotal role in the diagnosis and prognosis of various lung and heart related conditions. Conventional automated clinical diagnostic tool design strategies relying on radiology reads and supervised learning, entail the cumbersome requirement of high quality annotated training data. To address this challenge, self-supervised pre-training has proven to outperform supervised pre-training in numerous downstream vision tasks, representing a significant breakthrough in the field. However, medical imaging pre-training significantly differs from pre-training with natural images (e.g., ImageNet) due to unique attributes of clinical images. In this context, we introduce Diverse Concept Modeling (DiCoM), a novel self-supervised training paradigm that leverages a student teacher framework for learning diverse concepts and hence effective representation of the CXR data. Hence, expanding beyond merely modeling a single primary label within an image, instead, effectively harnessing the information from all the concepts inherent in the CXR. The pre-trained model is subsequently fine-tuned to address diverse domain-specific tasks. Our proposed paradigm consistently demonstrates robust performance across multiple downstream tasks on multiple datasets, highlighting the success and generalizability of the pre-training strategy. To establish the efficacy of our methods we analyze both the power of learned representations and the speed of convergence (SoC) of our models. For diverse data and tasks, DiCoM is able to achieve in most cases better results compared to other state-of-the-art pre-training strategies. This when combined with the higher SoC and generalization capabilities positions DiCoM to be established as a foundation model for CXRs, a widely used imaging modality.
△ Less
Submitted 22 February, 2024;
originally announced February 2024.
-
Filter Bubble or Homogenization? Disentangling the Long-Term Effects of Recommendations on User Consumption Patterns
Authors:
Md Sanzeed Anwar,
Grant Schoenebeck,
Paramveer S. Dhillon
Abstract:
Recommendation algorithms play a pivotal role in sha** our media choices, which makes it crucial to comprehend their long-term impact on user behavior. These algorithms are often linked to two critical outcomes: homogenization, wherein users consume similar content despite disparate underlying preferences, and the filter bubble effect, wherein individuals with differing preferences only consume…
▽ More
Recommendation algorithms play a pivotal role in sha** our media choices, which makes it crucial to comprehend their long-term impact on user behavior. These algorithms are often linked to two critical outcomes: homogenization, wherein users consume similar content despite disparate underlying preferences, and the filter bubble effect, wherein individuals with differing preferences only consume content aligned with their preferences (without much overlap with other users). Prior research assumes a trade-off between homogenization and filter bubble effects and then shows that personalized recommendations mitigate filter bubbles by fostering homogenization. However, because of this assumption of a tradeoff between these two effects, prior work cannot develop a more nuanced view of how recommendation systems may independently impact homogenization and filter bubble effects. We develop a more refined definition of homogenization and the filter bubble effect by decomposing them into two key metrics: how different the average consumption is between users (inter-user diversity) and how varied an individual's consumption is (intra-user diversity). We then use a novel agent-based simulation framework that enables a holistic view of the impact of recommendation systems on homogenization and filter bubble effects. Our simulations show that traditional recommendation algorithms (based on past behavior) mainly reduce filter bubbles by affecting inter-user diversity without significantly impacting intra-user diversity. Building on these findings, we introduce two new recommendation algorithms that take a more nuanced approach by accounting for both types of diversity.
△ Less
Submitted 7 March, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
Zero-Shot Pediatric Tuberculosis Detection in Chest X-Rays using Self-Supervised Learning
Authors:
Daniel Capellán-Martín,
Abhijeet Parida,
Juan J. Gómez-Valverde,
Ramon Sanchez-Jacob,
Pooneh Roshanitabrizi,
Marius G. Linguraru,
María J. Ledesma-Carbayo,
Syed M. Anwar
Abstract:
Tuberculosis (TB) remains a significant global health challenge, with pediatric cases posing a major concern. The World Health Organization (WHO) advocates for chest X-rays (CXRs) for TB screening. However, visual interpretation by radiologists can be subjective, time-consuming and prone to error, especially in pediatric TB. Artificial intelligence (AI)-driven computer-aided detection (CAD) tools,…
▽ More
Tuberculosis (TB) remains a significant global health challenge, with pediatric cases posing a major concern. The World Health Organization (WHO) advocates for chest X-rays (CXRs) for TB screening. However, visual interpretation by radiologists can be subjective, time-consuming and prone to error, especially in pediatric TB. Artificial intelligence (AI)-driven computer-aided detection (CAD) tools, especially those utilizing deep learning, show promise in enhancing lung disease detection. However, challenges include data scarcity and lack of generalizability. In this context, we propose a novel self-supervised paradigm leveraging Vision Transformers (ViT) for improved TB detection in CXR, enabling zero-shot pediatric TB detection. We demonstrate improvements in TB detection performance ($\sim$12.7% and $\sim$13.4% top AUC/AUPR gains in adults and children, respectively) when conducting self-supervised pre-training when compared to fully-supervised (i.e., non pre-trained) ViT models, achieving top performances of 0.959 AUC and 0.962 AUPR in adult TB detection, and 0.697 AUC and 0.607 AUPR in zero-shot pediatric TB detection. As a result, this work demonstrates that self-supervised learning on adult CXRs effectively extends to challenging downstream tasks such as pediatric TB detection, where data are scarce.
△ Less
Submitted 22 February, 2024;
originally announced February 2024.
-
Quantitative Metrics for Benchmarking Medical Image Harmonization
Authors:
Abhijeet Parida,
Zhifan Jiang,
Roger J. Packer,
Robert A. Avery,
Syed M. Anwar,
Marius G. Linguraru
Abstract:
Image harmonization is an important preprocessing strategy to address domain shifts arising from data acquired using different machines and scanning protocols in medical imaging. However, benchmarking the effectiveness of harmonization techniques has been a challenge due to the lack of widely available standardized datasets with ground truths. In this context, we propose three metrics: two intensi…
▽ More
Image harmonization is an important preprocessing strategy to address domain shifts arising from data acquired using different machines and scanning protocols in medical imaging. However, benchmarking the effectiveness of harmonization techniques has been a challenge due to the lack of widely available standardized datasets with ground truths. In this context, we propose three metrics: two intensity harmonization metrics and one anatomy preservation metric for medical images during harmonization, where no ground truths are required. Through extensive studies on a dataset with available harmonization ground truth, we demonstrate that our metrics are correlated with established image quality assessment metrics. We show how these novel metrics may be applied to real-world scenarios where no harmonization ground truth exists. Additionally, we provide insights into different interpretations of the metric values, shedding light on their significance in the context of the harmonization process. As a result of our findings, we advocate for the adoption of these quantitative harmonization metrics as a standard for benchmarking the performance of image harmonization techniques.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Personality Trait Recognition using ECG Spectrograms and Deep Learning
Authors:
Muhammad Mohsin Altaf,
Saadat Ullah Khan,
Muhammad Majd,
Syed Muhammad Anwar
Abstract:
This paper presents an innovative approach to recognizing personality traits using deep learning (DL) methods applied to electrocardiogram (ECG) signals. Within the framework of detecting the big five personality traits model encompassing extra-version, neuroticism, agreeableness, conscientiousness, and openness, the research explores the potential of ECG-derived spectrograms as informative featur…
▽ More
This paper presents an innovative approach to recognizing personality traits using deep learning (DL) methods applied to electrocardiogram (ECG) signals. Within the framework of detecting the big five personality traits model encompassing extra-version, neuroticism, agreeableness, conscientiousness, and openness, the research explores the potential of ECG-derived spectrograms as informative features. Optimal window sizes for spectrogram generation are determined, and a convolutional neural network (CNN), specifically Resnet-18, and visual transformer (ViT) are employed for feature extraction and personality trait classification. The study utilizes the publicly available ASCERTAIN dataset, which comprises various physiological signals, including ECG recordings, collected from 58 participants during the presentation of video stimuli categorized by valence and arousal levels. The outcomes of this study demonstrate noteworthy performance in personality trait classification, consistently achieving F1-scores exceeding 0.9 across different window sizes and personality traits. These results emphasize the viability of ECG signal spectrograms as a valuable modality for personality trait recognition, with Resnet-18 exhibiting effectiveness in discerning distinct personality traits.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Human Emotions Analysis and Recognition Using EEG Signals in Response to 360$^\circ$ Videos
Authors:
Haseeb ur Rahman Abbasi,
Zeeshan Rashid,
Muhammad Majid,
Syed Muhammad Anwar
Abstract:
Emotion recognition (ER) technology is an integral part for develo** innovative applications such as drowsiness detection and health monitoring that plays a pivotal role in contemporary society. This study delves into ER using electroencephalography (EEG), within immersive virtual reality (VR) environments. There are four main stages in our proposed methodology including data acquisition, pre-pr…
▽ More
Emotion recognition (ER) technology is an integral part for develo** innovative applications such as drowsiness detection and health monitoring that plays a pivotal role in contemporary society. This study delves into ER using electroencephalography (EEG), within immersive virtual reality (VR) environments. There are four main stages in our proposed methodology including data acquisition, pre-processing, feature extraction, and emotion classification. Acknowledging the limitations of existing 2D datasets, we introduce a groundbreaking 3D VR dataset to elevate the precision of emotion elicitation. Leveraging the Interaxon Muse headband for EEG recording and Oculus Quest 2 for VR stimuli, we meticulously recorded data from 40 participants, prioritizing subjects without reported mental illnesses. Pre-processing entails rigorous cleaning, uniform truncation, and the application of a Savitzky-Golay filter to the EEG data. Feature extraction encompasses a comprehensive analysis of metrics such as power spectral density, correlation, rational and divisional asymmetry, and power spectrum. To ensure the robustness of our model, we employed a 10-fold cross-validation, revealing an average validation accuracy of 85.54\%, with a noteworthy maximum accuracy of 90.20\% in the best fold. Subsequently, the trained model demonstrated a commendable test accuracy of 82.03\%, promising favorable outcomes.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Self-organized bistability on globally coupled higher-order networks
Authors:
Md Sayeed Anwar,
Nikita Frolov,
Alexander E. Hramov,
Dibakar Ghosh
Abstract:
Self-organized bistability (SOB) stands as a critical behavior for the systems delicately adjusting themselves to the brink of bistability, characterized by a first-order transition. Its essence lies in the inherent ability of the system to undergo enduring shifts between the coexisting states, achieved through the self-regulation of a controlling parameter. Recently, SOB has been established in a…
▽ More
Self-organized bistability (SOB) stands as a critical behavior for the systems delicately adjusting themselves to the brink of bistability, characterized by a first-order transition. Its essence lies in the inherent ability of the system to undergo enduring shifts between the coexisting states, achieved through the self-regulation of a controlling parameter. Recently, SOB has been established in a scale-free network as a recurrent transition to a short-living state of global synchronization. Here, we embark on a theoretical exploration that extends the boundaries of the SOB concept on a higher-order network (implicitly embedded microscopically within a simplicial complex) while considering the limitations imposed by coupling constraints. By applying Ott-Antonsen dimensionality reduction in the thermodynamic limit to the higher-order network, we derive SOB requirements under coupling limits that are in good agreement with numerical simulations on systems of finite size. We use continuous synchronization diagrams and statistical data from spontaneous synchronized events to demonstrate the crucial role SOB plays in initiating and terminating temporary synchronized events. We show that under weak coupling consumption, these spontaneous occurrences closely resemble the statistical traits of the epileptic brain functioning.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
A solvable two-dimensional swarmalator model
Authors:
Kevin O'Keeffe,
Gourab Kumar Sar,
Md Sayeed Anwar,
Joao U. F. Lizárraga,
Marcus A. M. de Aguiar,
Dibakar Ghosh
Abstract:
Swarmalators are oscillators that swarm through space as they synchronize in time. Introduced a few years ago to model many systems which mix synchrony with self-assembly, they remain poorly understood theoretically. Here we obtain the first analytic results on swarmalators moving in two-dimensional (2D) plane by enforcing periodic boundary conditions; this simpler topology allows expressions for…
▽ More
Swarmalators are oscillators that swarm through space as they synchronize in time. Introduced a few years ago to model many systems which mix synchrony with self-assembly, they remain poorly understood theoretically. Here we obtain the first analytic results on swarmalators moving in two-dimensional (2D) plane by enforcing periodic boundary conditions; this simpler topology allows expressions for order parameters, stabilities, and bifurcations to be derived exactly. We suggest some future directions for swarmalator research and point out some connections to the Kuramoto model and the Vicsek model from active matter; these are intended as a call-to-arms for the sync community and other researchers looking for new problems and puzzles to work on.
△ Less
Submitted 22 December, 2023; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Photonic spin Hall effect in Haldane materials
Authors:
Muzamil Shah,
Muhammad Sabieh Anwar,
Reza Asgari,
Gao Xianlong
Abstract:
The photonic spin Hall effect of light beams reflected from the surfaces of various two-dimensional hexagonal crystalline structures, considering their associated time-reversal $\mathcal{T}$ and inversion $\mathcal{I}$ symmetries, is investigated. Employing the Haldane model with tunable parameters as a generic model, we examine the longitudinal and transverse spin-separations of the reflected bea…
▽ More
The photonic spin Hall effect of light beams reflected from the surfaces of various two-dimensional hexagonal crystalline structures, considering their associated time-reversal $\mathcal{T}$ and inversion $\mathcal{I}$ symmetries, is investigated. Employing the Haldane model with tunable parameters as a generic model, we examine the longitudinal and transverse spin-separations of the reflected beam in both topological non-trivial and trivial systems. The study reveals that the sign switching of the PSHE in these materials is attributed to the non-trivial and trivial topology. By manipulating the interplay between spin-orbit coupling and external electric fields, we demonstrate topological phase transitions in buckled Xene monolayer materials through the photonic spin Hall effect. Different behaviors of the photonic spin Hall effect are observed in various topological phases within these materials. Additionally, we explore the reflected spin and valley-polarized spatial shifts in monolayer transition metal dichalcogenides. The photonic spin Hall effect in buckled Xene monolayer materials and transition metal dichalcogenides is highly influenced by the spin and valley degrees of freedom of charge carriers, offering a promising avenue to explore spintronics and valleytronics in these hexagonal materials. We propose that the photonic spin Hall effect in Haldane materials can serve as a metrological tool for optical parameter characterization and as a promising method for determining Chern numbers and topological phase transitions through direct optical weak measurement techniques.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Desynchrony induced by higher-order interactions in triplex metapopulations
Authors:
Palash Kumar Pal,
Md Sayeed Anwar,
Dibakar Ghosh
Abstract:
In a predator-prey metapopulation, the two traits are adversely related: synchronization and persistence. A decrease in synchrony apparently leads to an increase in persistence and, therefore, necessitates the study of desynchrony in a metapopulation. In this article, we study predator-prey patches that communicate with one another while being interconnected through distinct dispersal structures i…
▽ More
In a predator-prey metapopulation, the two traits are adversely related: synchronization and persistence. A decrease in synchrony apparently leads to an increase in persistence and, therefore, necessitates the study of desynchrony in a metapopulation. In this article, we study predator-prey patches that communicate with one another while being interconnected through distinct dispersal structures in the layers of a three-layer multiplex network. We investigate the synchronization phenomenon among the patches of the outer layers by introducing higher-order interactions (specifically three-body interactions) in the middle layer. We observe a decrease in the synchronous behavior or, alternatively, an increase in desynchrony due to the inclusion of group interactions among the patches of the middle layer. The advancement of desynchrony becomes more prominent with increasing strength and numbers of three-way interactions in the middle layer. We analytically validated our numerical results by performing the stability analysis of the referred synchronous solution using the master stability function approach. Additionally, we verify our findings by taking into account two distinct predator-prey models and dispersal topologies, which ultimately assert that the findings are generalizable across various models and dispersal structures.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
Improved Crop and Weed Detection with Diverse Data Ensemble Learning
Authors:
Muhammad Hamza Asad,
Saeed Anwar,
Abdul Bais
Abstract:
Modern agriculture heavily relies on Site-Specific Farm Management practices, necessitating accurate detection, localization, and quantification of crops and weeds in the field, which can be achieved using deep learning techniques. In this regard, crop and weed-specific binary segmentation models have shown promise. However, uncontrolled field conditions limit their performance from one field to t…
▽ More
Modern agriculture heavily relies on Site-Specific Farm Management practices, necessitating accurate detection, localization, and quantification of crops and weeds in the field, which can be achieved using deep learning techniques. In this regard, crop and weed-specific binary segmentation models have shown promise. However, uncontrolled field conditions limit their performance from one field to the other. To improve semantic model generalization, existing methods augment and synthesize agricultural data to account for uncontrolled field conditions. However, given highly varied field conditions, these methods have limitations. To overcome the challenges of model deterioration in such conditions, we propose utilizing data specific to other crops and weeds for our specific target problem. To achieve this, we propose a novel ensemble framework. Our approach involves utilizing different crop and weed models trained on diverse datasets and employing a teacher-student configuration. By using homogeneous stacking of base models and a trainable meta-architecture to combine their outputs, we achieve significant improvements for Canola crops and Kochia weeds on unseen test data, surpassing the performance of single semantic segmentation models. We identify the UNET meta-architecture as the most effective in this context. Finally, through ablation studies, we demonstrate and validate the effectiveness of our proposed model. We observe that including base models trained on other target crops and weeds can help generalize the model to capture varied field conditions. Lastly, we propose two novel datasets with varied conditions for comparisons.
△ Less
Submitted 14 June, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
ChatGPT Performance on Standardized Testing Exam -- A Proposed Strategy for Learners
Authors:
Umer Farooq,
Saira Anwar
Abstract:
This study explores the problem solving capabilities of ChatGPT and its prospective applications in standardized test preparation, focusing on the GRE quantitative exam. Prior research has shown great potential for the utilization of ChatGPT for academic purposes in revolutionizing the approach to studying across various disciplines. We investigate how ChatGPT performs across various question type…
▽ More
This study explores the problem solving capabilities of ChatGPT and its prospective applications in standardized test preparation, focusing on the GRE quantitative exam. Prior research has shown great potential for the utilization of ChatGPT for academic purposes in revolutionizing the approach to studying across various disciplines. We investigate how ChatGPT performs across various question types in the GRE quantitative domain, and how modifying question prompts impacts its accuracy. More specifically this study addressed two research questions: 1. How does ChatGPT perform in answering GRE-based quantitative questions across various content areas? 2. How does the accuracy of ChatGPT vary with modifying the question prompts? The dataset consisting of 100 randomly selected GRE quantitative questions was collected from the ETS official guide to GRE test preparation. We used quantitative evaluation to answer our first research question, and t-test to examine the statistical association between prompt modification and ChatGPT's accuracy. Results show a statistical improvement in the ChatGPT's accuracy after applying instruction priming and contextual prompts to the original questions. ChatGPT showed 84% accuracy with the modified prompts compared to 69% with the original data. The study discusses the areas where ChatGPT struggled with certain questions and how modifications can be helpful for preparing for standardized tests like GRE and provides future directions for prompt modifications.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Collective dynamics of swarmalators with higher-order interactions
Authors:
Md Sayeed Anwar,
Gourab Kumar Sar,
Matjaz Perc,
Dibakar Ghosh
Abstract:
Higher-order interactions shape collective dynamics, but how they affect transitions between different states in swarmalator systems is yet to be determined. To that effect, we here study an analytically tractable swarmalator model that incorporates both pairwise and higher-order interactions, resulting in four distinct collective states: async, phase wave, mixed, and sync states. We show that eve…
▽ More
Higher-order interactions shape collective dynamics, but how they affect transitions between different states in swarmalator systems is yet to be determined. To that effect, we here study an analytically tractable swarmalator model that incorporates both pairwise and higher-order interactions, resulting in four distinct collective states: async, phase wave, mixed, and sync states. We show that even a minute fraction of higher-order interactions induces abrupt transitions from the async state to the phase wave and the sync state. We also show that higher-order interactions facilitate an abrupt transition from the phase wave to the sync state by bypassing the intermediate mixed state. Moreover, elevated levels of higher-order interactions can sustain the presence of phase wave and sync state, even when pairwise interactions lean towards repulsion. The insights gained from these findings unveil self-organizing processes that hold the potential to explain sudden transitions between various collective states in numerous real-world systems.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
Automatic assessment of text-based responses in post-secondary education: A systematic review
Authors:
Rujun Gao,
Hillary E. Merzdorf,
Saira Anwar,
M. Cynthia Hipwell,
Arun Srinivasa
Abstract:
Text-based open-ended questions in academic formative and summative assessments help students become deep learners and prepare them to understand concepts for a subsequent conceptual assessment. However, grading text-based questions, especially in large courses, is tedious and time-consuming for instructors. Text processing models continue progressing with the rapid development of Artificial Intel…
▽ More
Text-based open-ended questions in academic formative and summative assessments help students become deep learners and prepare them to understand concepts for a subsequent conceptual assessment. However, grading text-based questions, especially in large courses, is tedious and time-consuming for instructors. Text processing models continue progressing with the rapid development of Artificial Intelligence (AI) tools and Natural Language Processing (NLP) algorithms. Especially after breakthroughs in Large Language Models (LLM), there is immense potential to automate rapid assessment and feedback of text-based responses in education. This systematic review adopts a scientific and reproducible literature search strategy based on the PRISMA process using explicit inclusion and exclusion criteria to study text-based automatic assessment systems in post-secondary education, screening 838 papers and synthesizing 93 studies. To understand how text-based automatic assessment systems have been developed and applied in education in recent years, three research questions are considered. All included studies are summarized and categorized according to a proposed comprehensive framework, including the input and output of the system, research motivation, and research outcomes, aiming to answer the research questions accordingly. Additionally, the typical studies of automated assessment systems, research methods, and application domains in these studies are investigated and summarized. This systematic review provides an overview of recent educational applications of text-based assessment systems for understanding the latest AI/NLP developments assisting in text-based assessments in higher education. Findings will particularly benefit researchers and educators incorporating LLMs such as ChatGPT into their educational activities.
△ Less
Submitted 13 January, 2024; v1 submitted 30 August, 2023;
originally announced August 2023.
-
Harmonization Across Imaging Locations(HAIL): One-Shot Learning for Brain MRI
Authors:
Abhijeet Parida,
Zhifan Jiang,
Syed Muhammad Anwar,
Nicholas Foreman,
Nicholas Stence,
Michael J. Fisher,
Roger J. Packer,
Robert A. Avery,
Marius George Linguraru
Abstract:
For machine learning-based prognosis and diagnosis of rare diseases, such as pediatric brain tumors, it is necessary to gather medical imaging data from multiple clinical sites that may use different devices and protocols. Deep learning-driven harmonization of radiologic images relies on generative adversarial networks (GANs). However, GANs notoriously generate pseudo structures that do not exist…
▽ More
For machine learning-based prognosis and diagnosis of rare diseases, such as pediatric brain tumors, it is necessary to gather medical imaging data from multiple clinical sites that may use different devices and protocols. Deep learning-driven harmonization of radiologic images relies on generative adversarial networks (GANs). However, GANs notoriously generate pseudo structures that do not exist in the original training data, a phenomenon known as "hallucination". To prevent hallucination in medical imaging, such as magnetic resonance images (MRI) of the brain, we propose a one-shot learning method where we utilize neural style transfer for harmonization. At test time, the method uses one image from a clinical site to generate an image that matches the intensity scale of the collaborating sites. Our approach combines learning a feature extractor, neural style transfer, and adaptive instance normalization. We further propose a novel strategy to evaluate the effectiveness of image harmonization approaches with evaluation metrics that both measure image style harmonization and assess the preservation of anatomical structures. Experimental results demonstrate the effectiveness of our method in preserving patient anatomy while adjusting the image intensities to a new clinical site. Our general harmonization model can be used on unseen data from new sites, making it a valuable tool for real-world medical applications and clinical trials.
△ Less
Submitted 21 August, 2023;
originally announced August 2023.
-
Position Uncertainty in a Sequential Public Goods Game: An Experiment
Authors:
Chowdhury Mohammad Sakib Anwar,
Konstantinos Georgalos
Abstract:
Gallice and Monzón (2019) present a natural environment that sustains full co-operation in one-shot social dilemmas among a finite number of self-interested agents. They demonstrate that in a sequential public goods game, where agents lack knowledge of their position in the sequence but can observe some predecessors' actions, full contribution emerges in equilibrium due to agents' incentive to ind…
▽ More
Gallice and Monzón (2019) present a natural environment that sustains full co-operation in one-shot social dilemmas among a finite number of self-interested agents. They demonstrate that in a sequential public goods game, where agents lack knowledge of their position in the sequence but can observe some predecessors' actions, full contribution emerges in equilibrium due to agents' incentive to induce potential successors to follow suit. In this study, we aim to test the theoretical predictions of this model through an economic experiment. We conducted three treatments, varying the amount of information about past actions that a subject can observe, as well as their positional awareness. Through rigorous structural econometric analysis, we found that approximately 25% of the subjects behaved in line with the theoretical predictions. However, we also observed the presence of alternative behavioural types among the remaining subjects. The majority were classified as conditional co-operators, showing a willingness to cooperate based on others' actions. Some subjects exhibited altruistic tendencies, while only a small minority engaged in free-riding behaviour.
△ Less
Submitted 19 April, 2024; v1 submitted 31 July, 2023;
originally announced August 2023.
-
P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
Authors:
Ruikai Cui,
Shi Qiu,
Saeed Anwar,
Jiawei Liu,
Chaoyue Xing,
**g Zhang,
Nick Barnes
Abstract:
Point cloud completion aims to recover the complete shape based on a partial observation. Existing methods require either complete point clouds or multiple partial observations of the same object for learning. In contrast to previous approaches, we present Partial2Complete (P2C), the first self-supervised framework that completes point cloud objects using training samples consisting of only a sing…
▽ More
Point cloud completion aims to recover the complete shape based on a partial observation. Existing methods require either complete point clouds or multiple partial observations of the same object for learning. In contrast to previous approaches, we present Partial2Complete (P2C), the first self-supervised framework that completes point cloud objects using training samples consisting of only a single incomplete point cloud per object. Specifically, our framework groups incomplete point clouds into local patches as input and predicts masked patches by learning prior information from different partial objects. We also propose Region-Aware Chamfer Distance to regularize shape mismatch without limiting completion capability, and devise the Normal Consistency Constraint to incorporate a local planarity assumption, encouraging the recovered shape surface to be continuous and complete. In this way, P2C no longer needs multiple observations or complete point clouds as ground truth. Instead, structural cues are learned from a category-specific dataset to complete partial point clouds of objects. We demonstrate the effectiveness of our approach on both synthetic ShapeNet data and real-world ScanNet data, showing that P2C produces comparable results to methods trained with complete shapes, and outperforms methods learned with multiple partial observations. Code is available at https://github.com/CuiRuikai/Partial2Complete.
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
A Comprehensive Overview of Large Language Models
Authors:
Humza Naveed,
Asad Ullah Khan,
Shi Qiu,
Muhammad Saqib,
Saeed Anwar,
Muhammad Usman,
Naveed Akhtar,
Nick Barnes,
Ajmal Mian
Abstract:
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics, datasets,…
▽ More
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works encompass diverse topics such as architectural innovations, better training strategies, context length improvements, fine-tuning, multi-modal LLMs, robotics, datasets, benchmarking, efficiency, and more. With the rapid development of techniques and regular breakthroughs in LLM research, it has become considerably challenging to perceive the bigger picture of the advances in this direction. Considering the rapidly emerging plethora of literature on LLMs, it is imperative that the research community is able to benefit from a concise yet comprehensive overview of the recent developments in this field. This article provides an overview of the existing literature on a broad range of LLM-related concepts. Our self-contained comprehensive overview of LLMs discusses relevant background concepts along with covering the advanced topics at the frontier of research in LLMs. This review article is intended to not only provide a systematic survey but also a quick comprehensive reference for the researchers and practitioners to draw insights from extensive informative summaries of the existing works to advance the LLM research.
△ Less
Submitted 9 April, 2024; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Global synchronization on time-varying higher-order structures
Authors:
Md Sayeed Anwar,
Dibakar Ghosh,
Timoteo Carletti
Abstract:
Synchronization has received a lot of attention from the scientific community for systems evolving on static networks or higher-order structures, such as hypergraphs and simplicial complexes. In many relevant real world applications, the latter are not static but do evolve in time, in this paper we thus discuss the impact of the time-varying nature of high-order structures in the emergence of glob…
▽ More
Synchronization has received a lot of attention from the scientific community for systems evolving on static networks or higher-order structures, such as hypergraphs and simplicial complexes. In many relevant real world applications, the latter are not static but do evolve in time, in this paper we thus discuss the impact of the time-varying nature of high-order structures in the emergence of global synchronization.
To achieve this goal we extend the master stability formalism to account, in a general way, for the additional contributions arising from the time evolution of the higher-order structure supporting the dynamical systems. The theory is successfully challenged against two illustrative examples, the Stuart-Landau nonlinear oscillator and the Lorenz chaotic oscillator.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
SelfFed: Self-supervised Federated Learning for Data Heterogeneity and Label Scarcity in IoMT
Authors:
Sunder Ali Khowaja,
Kapal Dev,
Syed Muhammad Anwar,
Marius George Linguraru
Abstract:
Self-supervised learning in federated learning paradigm has been gaining a lot of interest both in industry and research due to the collaborative learning capability on unlabeled yet isolated data. However, self-supervised based federated learning strategies suffer from performance degradation due to label scarcity and diverse data distributions, i.e., data heterogeneity. In this paper, we propose…
▽ More
Self-supervised learning in federated learning paradigm has been gaining a lot of interest both in industry and research due to the collaborative learning capability on unlabeled yet isolated data. However, self-supervised based federated learning strategies suffer from performance degradation due to label scarcity and diverse data distributions, i.e., data heterogeneity. In this paper, we propose the SelfFed framework for Internet of Medical Things (IoMT). Our proposed SelfFed framework works in two phases. The first phase is the pre-training paradigm that performs augmentive modeling using Swin Transformer based encoder in a decentralized manner. The first phase of SelfFed framework helps to overcome the data heterogeneity issue. The second phase is the fine-tuning paradigm that introduces contrastive network and a novel aggregation strategy that is trained on limited labeled data for a target task in a decentralized manner. This fine-tuning stage overcomes the label scarcity problem. We perform our experimental analysis on publicly available medical imaging datasets and show that our proposed SelfFed framework performs better when compared to existing baselines concerning non-independent and identically distributed (IID) data and label scarcity. Our method achieves a maximum improvement of 8.8% and 4.1% on Retina and COVID-FL datasets on non-IID dataset. Further, our proposed method outperforms existing baselines even when trained on a few (10%) labeled instances.
△ Less
Submitted 4 July, 2023;
originally announced July 2023.
-
UnLoc: A Universal Localization Method for Autonomous Vehicles using LiDAR, Radar and/or Camera Input
Authors:
Muhammad Ibrahim,
Naveed Akhtar,
Saeed Anwar,
Ajmal Mian
Abstract:
Localization is a fundamental task in robotics for autonomous navigation. Existing localization methods rely on a single input data modality or train several computational models to process different modalities. This leads to stringent computational requirements and sub-optimal results that fail to capitalize on the complementary information in other data streams. This paper proposes UnLoc, a nove…
▽ More
Localization is a fundamental task in robotics for autonomous navigation. Existing localization methods rely on a single input data modality or train several computational models to process different modalities. This leads to stringent computational requirements and sub-optimal results that fail to capitalize on the complementary information in other data streams. This paper proposes UnLoc, a novel unified neural modeling approach for localization with multi-sensor input in all weather conditions. Our multi-stream network can handle LiDAR, Camera and RADAR inputs for localization on demand, i.e., it can work with one or more input sensors, making it robust to sensor failure. UnLoc uses 3D sparse convolutions and cylindrical partitioning of the space to process LiDAR frames and implements ResNet blocks with a slot attention-based feature filtering module for the Radar and image modalities. We introduce a unique learnable modality encoding scheme to distinguish between the input sensor data. Our method is extensively evaluated on Oxford Radar RobotCar, ApolloSouthBay and Perth-WA datasets. The results ascertain the efficacy of our technique.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI
Authors:
Ahmed W. Moawad,
Anastasia Janas,
Ujjwal Baid,
Divya Ramakrishnan,
Rachit Saluja,
Nader Ashraf,
Leon Jekel,
Raisa Amiruddin,
Maruf Adewole,
Jake Albrecht,
Udunna Anazodo,
Sanjay Aneja,
Syed Muhammad Anwar,
Timothy Bergquist,
Evan Calabrese,
Veronica Chiang,
Verena Chung,
Gian Marco Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Juan Eugenio Iglesias,
Zhifan Jiang
, et al. (206 additional authors not shown)
Abstract:
The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and chara…
▽ More
The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and characterizes the challenging cases that impacted the performance of the winning algorithms. Untreated brain metastases on standard anatomic MRI sequences (T1, T2, FLAIR, T1PG) from eight contributed international datasets were annotated in stepwise method: published UNET algorithms, student, neuroradiologist, final approver neuroradiologist. Segmentations were ranked based on lesion-wise Dice and Hausdorff distance (HD95) scores. False positives (FP) and false negatives (FN) were rigorously penalized, receiving a score of 0 for Dice and a fixed penalty of 374 for HD95. Eight datasets comprising 1303 studies were annotated, with 402 studies (3076 lesions) released on Synapse as publicly available datasets to challenge competitors. Additionally, 31 studies (139 lesions) were held out for validation, and 59 studies (218 lesions) were used for testing. Segmentation accuracy was measured as rank across subjects, with the winning team achieving a LesionWise mean score of 7.9. Common errors among the leading teams included false negatives for small lesions and misregistration of masks in space.The BraTS-METS 2023 challenge successfully curated well-annotated, diverse datasets and identified common errors, facilitating the translation of BM segmentation across varied clinical environments and providing personalized volumetric reports to patients undergoing BM treatment.
△ Less
Submitted 17 June, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)
Authors:
Anahita Fathi Kazerooni,
Nastaran Khalili,
Xinyang Liu,
Debanjan Haldar,
Zhifan Jiang,
Syed Muhammed Anwar,
Jake Albrecht,
Maruf Adewole,
Udunna Anazodo,
Hannah Anderson,
Sina Bagheri,
Ujjwal Baid,
Timothy Bergquist,
Austin J. Borja,
Evan Calabrese,
Verena Chung,
Gian-Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Shuvanjan Haldar,
Juan Eugenio Iglesias,
Anastasia Janas
, et al. (48 additional authors not shown)
Abstract:
Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20\%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. The MICCA…
▽ More
Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20\%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. The MICCAI Brain Tumor Segmentation (BraTS) Challenge is a landmark community benchmark event with a successful history of 12 years of resource creation for the segmentation and analysis of adult glioma. Here we present the CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs 2023 challenge, which represents the first BraTS challenge focused on pediatric brain tumors with data acquired across multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. The BraTS-PEDs 2023 challenge focuses on benchmarking the development of volumentric segmentation algorithms for pediatric brain glioma through standardized quantitative performance evaluation metrics utilized across the BraTS 2023 cluster of challenges. Models gaining knowledge from the BraTS-PEDs multi-parametric structural MRI (mpMRI) training data will be evaluated on separate validation and unseen test mpMRI dataof high-grade pediatric glioma. The CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs 2023 challenge brings together clinicians and AI/imaging scientists to lead to faster development of automated segmentation techniques that could benefit clinical trials, and ultimately the care of children with brain tumors.
△ Less
Submitted 23 May, 2024; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Investigation of UAV Detection in Images with Complex Backgrounds and Rainy Artifacts
Authors:
Adnan Munir,
Abdul Jabbar Siddiqui,
Saeed Anwar
Abstract:
To detect unmanned aerial vehicles (UAVs) in real-time, computer vision and deep learning approaches are evolving research areas. Interest in this problem has grown due to concerns regarding the possible hazards and misuse of employing UAVs in many applications. These include potential privacy violations. To address the concerns, vision-based object detection methods have been developed for UAV de…
▽ More
To detect unmanned aerial vehicles (UAVs) in real-time, computer vision and deep learning approaches are evolving research areas. Interest in this problem has grown due to concerns regarding the possible hazards and misuse of employing UAVs in many applications. These include potential privacy violations. To address the concerns, vision-based object detection methods have been developed for UAV detection. However, UAV detection in images with complex backgrounds and weather artifacts like rain has yet to be reasonably studied. Hence, for this purpose, we prepared two training datasets. The first dataset has the sky as its background and is called the Sky Background Dataset (SBD). The second training dataset has more complex scenes (with diverse backgrounds) and is named the Complex Background Dataset (CBD). Additionally, two test sets were prepared: one containing clear images and the other with images with three rain artifacts, named the Rainy Test Set (RTS). This work also focuses on benchmarking state-of-the-art object detection models, and to the best of our knowledge, it is the first to investigate the performance of recent and popular vision-based object detection methods for UAV detection under challenging conditions such as complex backgrounds, varying UAV sizes, and low-to-heavy rainy conditions. The findings presented in the paper shall help provide insights concerning the performance of the selected models for UAV detection under challenging conditions and pave the way to develop more robust UAV detection methods. The codes and datasets are available at: https://github.com/AdnanMunir294/UAVD-CBRA.
△ Less
Submitted 5 December, 2023; v1 submitted 25 May, 2023;
originally announced May 2023.
-
The Brain Tumor Segmentation (BraTS) Challenge 2023: Brain MR Image Synthesis for Tumor Segmentation (BraSyn)
Authors:
Hongwei Bran Li,
Gian Marco Conte,
Syed Muhammad Anwar,
Florian Kofler,
Ivan Ezhov,
Koen van Leemput,
Marie Piraud,
Maria Diaz,
Byrone Cole,
Evan Calabrese,
Jeff Rudie,
Felix Meissen,
Maruf Adewole,
Anastasia Janas,
Anahita Fathi Kazerooni,
Dominic LaBella,
Ahmed W. Moawad,
Keyvan Farahani,
James Eddy,
Timothy Bergquist,
Verena Chung,
Russell Takeshi Shinohara,
Farouk Dako,
Walter Wiggins,
Zachary Reitman
, et al. (43 additional authors not shown)
Abstract:
Automated brain tumor segmentation methods have become well-established and reached performance levels offering clear clinical utility. These methods typically rely on four input magnetic resonance imaging (MRI) modalities: T1-weighted images with and without contrast enhancement, T2-weighted images, and FLAIR images. However, some sequences are often missing in clinical practice due to time const…
▽ More
Automated brain tumor segmentation methods have become well-established and reached performance levels offering clear clinical utility. These methods typically rely on four input magnetic resonance imaging (MRI) modalities: T1-weighted images with and without contrast enhancement, T2-weighted images, and FLAIR images. However, some sequences are often missing in clinical practice due to time constraints or image artifacts, such as patient motion. Consequently, the ability to substitute missing modalities and gain segmentation performance is highly desirable and necessary for the broader adoption of these algorithms in the clinical routine. In this work, we present the establishment of the Brain MR Image Synthesis Benchmark (BraSyn) in conjunction with the Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2023. The primary objective of this challenge is to evaluate image synthesis methods that can realistically generate missing MRI modalities when multiple available images are provided. The ultimate aim is to facilitate automated brain tumor segmentation pipelines. The image dataset used in the benchmark is diverse and multi-modal, created through collaboration with various hospitals and research institutions.
△ Less
Submitted 28 June, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
The Brain Tumor Segmentation (BraTS) Challenge 2023: Local Synthesis of Healthy Brain Tissue via Inpainting
Authors:
Florian Kofler,
Felix Meissen,
Felix Steinbauer,
Robert Graf,
Eva Oswald,
Ezequiel de da Rosa,
Hongwei Bran Li,
Ujjwal Baid,
Florian Hoelzl,
Oezguen Turgut,
Izabela Horvath,
Diana Waldmannstetter,
Christina Bukas,
Maruf Adewole,
Syed Muhammad Anwar,
Anastasia Janas,
Anahita Fathi Kazerooni,
Dominic LaBella,
Ahmed W Moawad,
Keyvan Farahani,
James Eddy,
Timothy Bergquist,
Verena Chung,
Russell Takeshi Shinohara,
Farouk Dako
, et al. (43 additional authors not shown)
Abstract:
A myriad of algorithms for the automatic analysis of brain MR images is available to support clinicians in their decision-making. For brain tumor patients, the image acquisition time series typically starts with a scan that is already pathological. This poses problems, as many algorithms are designed to analyze healthy brains and provide no guarantees for images featuring lesions. Examples include…
▽ More
A myriad of algorithms for the automatic analysis of brain MR images is available to support clinicians in their decision-making. For brain tumor patients, the image acquisition time series typically starts with a scan that is already pathological. This poses problems, as many algorithms are designed to analyze healthy brains and provide no guarantees for images featuring lesions. Examples include but are not limited to algorithms for brain anatomy parcellation, tissue segmentation, and brain extraction. To solve this dilemma, we introduce the BraTS 2023 inpainting challenge. Here, the participants' task is to explore inpainting techniques to synthesize healthy brain scans from lesioned ones. The following manuscript contains the task formulation, dataset, and submission procedure. Later it will be updated to summarize the findings of the challenge. The challenge is organized as part of the BraTS 2023 challenge hosted at the MICCAI 2023 conference in Vancouver, Canada.
△ Less
Submitted 9 August, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
The ASNR-MICCAI Brain Tumor Segmentation (BraTS) Challenge 2023: Intracranial Meningioma
Authors:
Dominic LaBella,
Maruf Adewole,
Michelle Alonso-Basanta,
Talissa Altes,
Syed Muhammad Anwar,
Ujjwal Baid,
Timothy Bergquist,
Radhika Bhalerao,
Sully Chen,
Verena Chung,
Gian-Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Devon Godfrey,
Fathi Hilal,
Ariana Familiar,
Keyvan Farahani,
Juan Eugenio Iglesias,
Zhifan Jiang,
Elaine Johanson,
Anahita Fathi Kazerooni,
Collin Kent,
John Kirkpatrick,
Florian Kofler
, et al. (35 additional authors not shown)
Abstract:
Meningiomas are the most common primary intracranial tumor in adults and can be associated with significant morbidity and mortality. Radiologists, neurosurgeons, neuro-oncologists, and radiation oncologists rely on multiparametric MRI (mpMRI) for diagnosis, treatment planning, and longitudinal treatment monitoring; yet automated, objective, and quantitative tools for non-invasive assessment of men…
▽ More
Meningiomas are the most common primary intracranial tumor in adults and can be associated with significant morbidity and mortality. Radiologists, neurosurgeons, neuro-oncologists, and radiation oncologists rely on multiparametric MRI (mpMRI) for diagnosis, treatment planning, and longitudinal treatment monitoring; yet automated, objective, and quantitative tools for non-invasive assessment of meningiomas on mpMRI are lacking. The BraTS meningioma 2023 challenge will provide a community standard and benchmark for state-of-the-art automated intracranial meningioma segmentation models based on the largest expert annotated multilabel meningioma mpMRI dataset to date. Challenge competitors will develop automated segmentation models to predict three distinct meningioma sub-regions on MRI including enhancing tumor, non-enhancing tumor core, and surrounding nonenhancing T2/FLAIR hyperintensity. Models will be evaluated on separate validation and held-out test datasets using standardized metrics utilized across the BraTS 2023 series of challenges including the Dice similarity coefficient and Hausdorff distance. The models developed during the course of this challenge will aid in incorporation of automated meningioma MRI segmentation into clinical practice, which will ultimately improve care of patients with meningioma.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
HeteroEdge: Addressing Asymmetry in Heterogeneous Collaborative Autonomous Systems
Authors:
Mohammad Saeid Anwar,
Emon Dey,
Maloy Kumar Devnath,
Indrajeet Ghosh,
Naima Khan,
Jade Freeman,
Timothy Gregory,
Niranjan Suri,
Kasthuri Jayaraja,
Sreenivasan Ramasamy Ramamurthy,
Nirmalya Roy
Abstract:
Gathering knowledge about surroundings and generating situational awareness for IoT devices is of utmost importance for systems developed for smart urban and uncontested environments. For example, a large-area surveillance system is typically equipped with multi-modal sensors such as cameras and LIDARs and is required to execute deep learning algorithms for action, face, behavior, and object recog…
▽ More
Gathering knowledge about surroundings and generating situational awareness for IoT devices is of utmost importance for systems developed for smart urban and uncontested environments. For example, a large-area surveillance system is typically equipped with multi-modal sensors such as cameras and LIDARs and is required to execute deep learning algorithms for action, face, behavior, and object recognition. However, these systems face power and memory constraints due to their ubiquitous nature, making it crucial to optimize data processing, deep learning algorithm input, and model inference communication. In this paper, we propose a self-adaptive optimization framework for a testbed comprising two Unmanned Ground Vehicles (UGVs) and two NVIDIA Jetson devices. This framework efficiently manages multiple tasks (storage, processing, computation, transmission, inference) on heterogeneous nodes concurrently. It involves compressing and masking input image frames, identifying similar frames, and profiling devices to obtain boundary conditions for optimization.. Finally, we propose and optimize a novel parameter split-ratio, which indicates the proportion of the data required to be offloaded to another device while considering the networking bandwidth, busy factor, memory (CPU, GPU, RAM), and power constraints of the devices in the testbed. Our evaluations captured while executing multiple tasks (e.g., PoseNet, SegNet, ImageNet, DetectNet, DepthNet) simultaneously, reveal that executing 70% (split-ratio=70%) of the data on the auxiliary node minimizes the offloading latency by approx. 33% (18.7 ms/image to 12.5 ms/image) and the total operation time by approx. 47% (69.32s to 36.43s) compared to the baseline configuration (executing on the primary node).
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
A Systematic Study on Object Recognition Using Millimeter-wave Radar
Authors:
Maloy Kumar Devnath,
Avijoy Chakma,
Mohammad Saeid Anwar,
Emon Dey,
Zahid Hasan,
Marc Conn,
Biplab Pal,
Nirmalya Roy
Abstract:
Due to its light and weather-independent sensing, millimeter-wave (MMW) radar is essential in smart environments. Intelligent vehicle systems and industry-grade MMW radars have integrated such capabilities. Industry-grade MMW radars are expensive and hard to get for community-purpose smart environment applications. However, commercially available MMW radars have hidden underpinning challenges that…
▽ More
Due to its light and weather-independent sensing, millimeter-wave (MMW) radar is essential in smart environments. Intelligent vehicle systems and industry-grade MMW radars have integrated such capabilities. Industry-grade MMW radars are expensive and hard to get for community-purpose smart environment applications. However, commercially available MMW radars have hidden underpinning challenges that need to be investigated for tasks like recognizing objects and activities, real-time person tracking, object localization, etc. Image and video data are straightforward to gather, understand, and annotate for such jobs. Image and video data are light and weather-dependent, susceptible to the occlusion effect, and present privacy problems. To eliminate dependence and ensure privacy, commercial MMW radars should be tested. MMW radar's practicality and performance in varied operating settings must be addressed before promoting it. To address the problems, we collected a dataset using Texas Instruments' Automotive mmWave Radar (AWR2944) and reported the best experimental settings for object recognition performance using different deep learning algorithms. Our extensive data gathering technique allows us to systematically explore and identify object identification task problems under cross-ambience conditions. We investigated several solutions and published detailed experimental data.
△ Less
Submitted 3 May, 2023;
originally announced May 2023.
-
Upper Limb Movement Execution Classification using Electroencephalography for Brain Computer Interface
Authors:
Saadat Ullah Khan,
Muhammad Majid,
Syed Muhammad Anwar
Abstract:
An accurate classification of upper limb movements using electroencephalography (EEG) signals is gaining significant importance in recent years due to the prevalence of brain-computer interfaces. The upper limbs in the human body are crucial since different skeletal segments combine to make a range of motion that helps us in our trivial daily tasks. Decoding EEG-based upper limb movements can be o…
▽ More
An accurate classification of upper limb movements using electroencephalography (EEG) signals is gaining significant importance in recent years due to the prevalence of brain-computer interfaces. The upper limbs in the human body are crucial since different skeletal segments combine to make a range of motion that helps us in our trivial daily tasks. Decoding EEG-based upper limb movements can be of great help to people with spinal cord injury (SCI) or other neuro-muscular diseases such as amyotrophic lateral sclerosis (ALS), primary lateral sclerosis, and periodic paralysis. This can manifest in a loss of sensory and motor function, which could make a person reliant on others to provide care in day-to-day activities. We can detect and classify upper limb movement activities, whether they be executed or imagined using an EEG-based brain-computer interface (BCI). Toward this goal, we focus our attention on decoding movement execution (ME) of the upper limb in this study. For this purpose, we utilize a publicly available EEG dataset that contains EEG signal recordings from fifteen subjects acquired using a 61-channel EEG device. We propose a method to classify four ME classes for different subjects using spectrograms of the EEG data through pre-trained deep learning (DL) models. Our proposed method of using EEG spectrograms for the classification of ME has shown significant results, where the highest average classification accuracy (for four ME classes) obtained is 87.36%, with one subject achieving the best classification accuracy of 97.03%.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
Efficient Public Good Provision in a Multipolar World
Authors:
Chowdhury Mohammad Sakib Anwar,
Jorge Bruno,
Renaud Foucart,
Sonali SenGupta
Abstract:
We model a public goods game with groups, position uncertainty, and observational learning. Contributions are simultaneous within groups, but groups play sequentially based on their observation of an incomplete sample of past contributions. We show that full cooperation between and within groups is possible with self-interested players on a fixed horizon. Position uncertainty implies the existence…
▽ More
We model a public goods game with groups, position uncertainty, and observational learning. Contributions are simultaneous within groups, but groups play sequentially based on their observation of an incomplete sample of past contributions. We show that full cooperation between and within groups is possible with self-interested players on a fixed horizon. Position uncertainty implies the existence of an equilibrium where groups of players conditionally cooperate in the hope of influencing further groups. Conditional cooperation implies that each group member is pivotal, so that efficient simultaneous provision within groups is an equilibrium.
△ Less
Submitted 29 July, 2023; v1 submitted 18 March, 2023;
originally announced March 2023.
-
Diurnal and Seasonal Map** of Martian Ices With EMIRS
Authors:
Aurélien Stcherbinine,
Christopher S. Edwards,
Michael D. Smith,
Michael J. Wolff,
Christopher Haberle,
Eman Al Tunaiji,
Nathan M. Smith,
Kezman Saboi,
Saadat Anwar,
Lucas Lange,
Philip R. Christensen
Abstract:
Condensation and sublimation of ices at the surface of the planet is a key part of both the Martian H$_2$O and CO$_2$ cycles, either from a seasonal or diurnal aspect. While most of the ice is located within the polar caps, surface frost is known to be formed during nighttime down to equatorial latitudes. Here, we use data from the Emirates Mars Infrared Spectrometer (EMIRS) onboard the Emirates M…
▽ More
Condensation and sublimation of ices at the surface of the planet is a key part of both the Martian H$_2$O and CO$_2$ cycles, either from a seasonal or diurnal aspect. While most of the ice is located within the polar caps, surface frost is known to be formed during nighttime down to equatorial latitudes. Here, we use data from the Emirates Mars Infrared Spectrometer (EMIRS) onboard the Emirates Mars Mission (EMM) to monitor the diurnal and seasonal evolution of the ices at the surface of Mars over almost one Martian year. The unique local time coverage provided by the instrument allows us to observe the apparition of equatorial CO$_2$ frost in the second half of the Martian night around the equinoxes, to its sublimation at sunrise.
△ Less
Submitted 27 June, 2023; v1 submitted 14 March, 2023;
originally announced March 2023.
-
Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps
Authors:
Muhammad Ibrahim,
Naveed Akhtar,
Saeed Anwar,
Michael Wise,
Ajmal Mian
Abstract:
Precise localization is critical for autonomous vehicles. We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data. We propose a pre-text task that reorganizes the slices of a $360^\circ$ LiDAR scan to leverage its axial properties. Our model, called Slice Transformer, employs multi-head attention while systemat…
▽ More
Precise localization is critical for autonomous vehicles. We present a self-supervised learning method that employs Transformers for the first time for the task of outdoor localization using LiDAR data. We propose a pre-text task that reorganizes the slices of a $360^\circ$ LiDAR scan to leverage its axial properties. Our model, called Slice Transformer, employs multi-head attention while systematically processing the slices. To the best of our knowledge, this is the first instance of leveraging multi-head attention for outdoor point clouds. We additionally introduce the Perth-WA dataset, which provides a large-scale LiDAR map of Perth city in Western Australia, covering $\sim$4km$^2$ area. Localization annotations are provided for Perth-WA. The proposed localization method is thoroughly evaluated on Perth-WA and Appollo-SouthBay datasets. We also establish the efficacy of our self-supervised learning approach for the common downstream task of object classification using ModelNet40 and ScanNN datasets. The code and Perth-WA data will be publicly released.
△ Less
Submitted 13 August, 2023; v1 submitted 21 January, 2023;
originally announced January 2023.
-
PointCaM: Cut-and-Mix for Open-Set Point Cloud Learning
Authors:
Jie Hong,
Shi Qiu,
Weihao Li,
Saeed Anwar,
Mehrtash Harandi,
Nick Barnes,
Lars Petersson
Abstract:
Point cloud learning is receiving increasing attention, however, most existing point cloud models lack the practical ability to deal with the unavoidable presence of unknown objects. This paper mainly discusses point cloud learning under open-set settings, where we train the model without data from unknown classes and identify them in the inference stage. Basically, we propose to solve open-set po…
▽ More
Point cloud learning is receiving increasing attention, however, most existing point cloud models lack the practical ability to deal with the unavoidable presence of unknown objects. This paper mainly discusses point cloud learning under open-set settings, where we train the model without data from unknown classes and identify them in the inference stage. Basically, we propose to solve open-set point cloud learning using a novel Point Cut-and-Mix mechanism consisting of Unknown-Point Simulator and Unknown-Point Estimator modules. Specifically, we use the Unknown-Point Simulator to simulate out-of-distribution data in the training stage by manipulating the geometric context of partial known data. Based on this, the Unknown-Point Estimator module learns to exploit the point cloud's feature context for discriminating the known and unknown data. Extensive experiments show the plausibility of open-set point cloud learning and the effectiveness of our proposed solutions. Our code is available at \url{https://github.com/ShiQiu0419/pointcam}.
△ Less
Submitted 24 August, 2023; v1 submitted 4 December, 2022;
originally announced December 2022.
-
Synchronization in temporal simplicial complexes
Authors:
Md Sayeed Anwar,
Dibakar Ghosh
Abstract:
The stability analysis of synchronization in time-varying higher-order networked structures (simplicial complexes) is one of the challenging problem due to the presence of time-varying group interactions. In this context, most of the previous studies have been done either on temporal pairwise networks or on static simplicial complexes. Here, for the first time, we propose a general framework to st…
▽ More
The stability analysis of synchronization in time-varying higher-order networked structures (simplicial complexes) is one of the challenging problem due to the presence of time-varying group interactions. In this context, most of the previous studies have been done either on temporal pairwise networks or on static simplicial complexes. Here, for the first time, we propose a general framework to study the synchronization phenomenon in temporal simplicial complexes. We show that the synchronous state exists as an invariant solution and obtain the necessary condition for it to be emerged as a stable state in fast switching regime. We prove that the time-averaged simplicial complex plays the role of synchronization indicator whenever the switching among simplicial topologies are adequately fast. We attempt to transform the stability problem into a master stability function form. Unfortunately, for the general circumstances, the dimension reduction of the master stability equation is cumbersome due to the presence of group interactions. However, we overcome this difficulty in two interesting situations based on either the functional forms of the coupling schemes or the connectivity structure of the simplicial complex, and demonstrate that the necessary condition mimics the form of a master stability function in these cases. We verify our analytical findings by applying them on synthetic and real-world networked systems. In addition, our results also reveal that with sufficient higher-order coupling and adequately fast rewiring, the temporal simplicial complex achieves synchrony even in a very low connectivity regime.
△ Less
Submitted 2 December, 2022;
originally announced December 2022.