Skip to main content

Showing 1–50 of 133 results for author: Khan, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08714  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator

    Authors: Mandovi Mukherjee, Xiangyu Mao, Nael Rahman, Coleman DeLude, Joe Driscoll, Sudarshan Sharma, Payman Behnam, Uday Kamal, Jongseok Woo, Daehyun Kim, Sharjeel Khan, Jianming Tong, Jamin Seo, Prachi Sinha, Madhavan Swaminathan, Tushar Krishna, Santosh Pande, Justin Romberg, Saibal Mukhopadhyay

    Abstract: A near memory hardware accelerator, based on a novel direct path computational model, for real-time emulation of radio frequency systems is demonstrated. Our evaluation of hardware performance uses both application-specific integrated circuits (ASIC) and field programmable gate arrays (FPGA) methodologies: 1). The ASIC testchip implementation, using TSMC 28nm CMOS, leverages distributed autonomous… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.08710  [pdf, other

    eess.SP

    Real-time Digital RF Emulation -- I: The Direct Path Computational Model

    Authors: Coleman DeLude, Joe Driscoll, Mandovi Mukherjee, Nael Rahman, Uday Kamal, Xiangyu Mao, Sharjeel Khan, Hariharan Sivaraman, Eric Huang, Jeffrey McHarg, Madhavan Swaminathan, Santosh Pande, Saibal Mukhopadhyay, Justin Romberg

    Abstract: In this paper we consider the problem of develo** a computational model for emulating an RF channel. The motivation for this is that an accurate and scalable emulator has the potential to minimize the need for field testing, which is expensive, slow, and difficult to replicate. Traditionally, emulators are built using a tapped delay line model where long filters modeling the physical interaction… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.08486  [pdf, other

    eess.IV cs.CV

    On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models

    Authors: Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan

    Abstract: Volumetric medical segmentation models have achieved significant success on organ and tumor-based segmentation tasks in recent years. However, their vulnerability to adversarial attacks remains largely unexplored, raising serious concerns regarding the real-world deployment of tools employing such models in the healthcare sector. This underscores the importance of investigating the robustness of e… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2406.00667  [pdf, other

    eess.IV cs.AI cs.CL cs.CV cs.LG

    An Early Investigation into the Utility of Multimodal Large Language Models in Medical Imaging

    Authors: Sulaiman Khan, Md. Rafiul Biswas, Alina Murad, Hazrat Ali, Zubair Shah

    Abstract: Recent developments in multimodal large language models (MLLMs) have spurred significant interest in their potential applications across various medical imaging domains. On the one hand, there is a temptation to use these generative models to synthesize realistic-looking medical image data, while on the other hand, the ability to identify synthetic image data in a pool of data is also significantl… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted in Fifth IEEE Workshop on Artificial Intelligence for HealthCare, IEEE 25th International Conference on Information Reuse and Integration for Data Science

  5. arXiv:2406.00449  [pdf, other

    eess.IV cs.CV

    Dual Hyperspectral Mamba for Efficient Spectral Compressive Imaging

    Authors: Jiahua Dong, Hui Yin, Hongliu Li, Wenbo Li, Yulun Zhang, Salman Khan, Fahad Shahbaz Khan

    Abstract: Deep unfolding methods have made impressive progress in restoring 3D hyperspectral images (HSIs) from 2D measurements through convolution neural networks or Transformers in spectral compressive imaging. However, they cannot efficiently capture long-range dependencies using global receptive fields, which significantly limits their performance in HSI reconstruction. Moreover, these methods may suffe… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures

  6. arXiv:2405.12986  [pdf

    eess.IV cs.AI cs.CV

    A Novel Feature Map Enhancement Technique Integrating Residual CNN and Transformer for Alzheimer Diseases Diagnosis

    Authors: Saddam Hussain Khan

    Abstract: Alzheimer diseases (ADs) involves cognitive decline and abnormal brain protein accumulation, necessitating timely diagnosis for effective treatment. Therefore, CAD systems leveraging deep learning advancements have demonstrated success in AD detection but pose computational intricacies and the dataset minor contrast, structural, and texture variations. In this regard, a novel hybrid FME-Residual-H… ▽ More

    Submitted 25 May, 2024; v1 submitted 30 March, 2024; originally announced May 2024.

    Comments: 28 Pages, 11 Figures, 3 Tables

  7. arXiv:2404.11771  [pdf

    eess.SY

    IoT-Driven Cloud-based Energy and Environment Monitoring System for Manufacturing Industry

    Authors: Nitol Saha, Md Masruk Aulia, Md. Mostafizur Rahman, Mohammed Shafiul Alam Khan

    Abstract: This research focused on the development of a cost-effective IoT solution for energy and environment monitoring geared towards manufacturing industries. The proposed system is developed using open-source software that can be easily deployed in any manufacturing environment. The system collects real-time temperature, humidity, and energy data from different devices running on different communicatio… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  8. arXiv:2403.08398  [pdf, other

    eess.SY

    Remote UGV Control via Practical Wireless Channels: A Model Predictive Control Approach

    Authors: inghao Cao, Subhan Khan, Wanchun Liu, Yonghui Li, Branka Vucetic

    Abstract: In addressing wireless networked control systems (WNCS) subject to unexpected packet loss and uncertainties, this paper presents a practical Model Predictive Control (MPC) based control scheme with considerations of of packet dropouts, latency, process noise and measurement noise. A discussion of the quasi-static Rayleigh fading channel is presented herein to enhance the realism of the underlying… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  9. arXiv:2403.05415  [pdf

    eess.SY

    An Overview of Automated Vehicle Platooning Strategies

    Authors: M Sabbir Salek, Mugdha Basu Thakur, Pardha Sai Krishna Ala, Mashrur Chowdhury, Matthias Schmid, Pamela Murray-Tuite, Sakib Mahmud Khan, Venkat Krovi

    Abstract: Automated vehicle (AV) platooning has the potential to improve the safety, operational, and energy efficiency of surface transportation systems by limiting or eliminating human involvement in the driving tasks. The theoretical validity of the AV platooning strategies has been established and practical applications are being tested under real-world conditions. The emergence of sensors, communicatio… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  10. arXiv:2402.18102  [pdf, other

    eess.IV cs.CV

    Passive Snapshot Coded Aperture Dual-Pixel RGB-D Imaging

    Authors: Bhargav Ghanekar, Salman Siddique Khan, Pranav Sharma, Shreyas Singh, Vivek Boominathan, Kaushik Mitra, Ashok Veeraraghavan

    Abstract: Passive, compact, single-shot 3D sensing is useful in many application areas such as microscopy, medical imaging, surgical navigation, and autonomous driving where form factor, time, and power constraints can exist. Obtaining RGB-D scene information over a short imaging distance, in an ultra-compact form factor, and in a passive, snapshot manner is challenging. Dual-pixel (DP) sensors are a potent… ▽ More

    Submitted 30 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  11. arXiv:2402.17725  [pdf, other

    eess.IV cs.CV

    MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation

    Authors: Hanan Gani, Muzammal Naseer, Fahad Khan, Salman Khan

    Abstract: Volumetric medical segmentation is a critical component of 3D medical image analysis that delineates different semantic regions. Deep neural networks have significantly improved volumetric medical segmentation, but they generally require large-scale annotated data to achieve better performance, which can be expensive and prohibitive to obtain. To address this limitation, existing works typically p… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Code available at https://github.com/hananshafi/MedContext

  12. arXiv:2402.04326  [pdf, other

    cs.HC cs.LG eess.SP

    Personality Trait Recognition using ECG Spectrograms and Deep Learning

    Authors: Muhammad Mohsin Altaf, Saadat Ullah Khan, Muhammad Majd, Syed Muhammad Anwar

    Abstract: This paper presents an innovative approach to recognizing personality traits using deep learning (DL) methods applied to electrocardiogram (ECG) signals. Within the framework of detecting the big five personality traits model encompassing extra-version, neuroticism, agreeableness, conscientiousness, and openness, the research explores the potential of ECG-derived spectrograms as informative featur… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  13. arXiv:2312.01077  [pdf, other

    eess.IV

    OpEnCam: Lensless Optical Encryption Camera

    Authors: Salman S. Khan, Xiang Yu, Kaushik Mitra, Manmohan Chandraker, Francesco Pittaluga

    Abstract: Lensless cameras multiplex the incoming light before it is recorded by the sensor. This ability to multiplex the incoming light has led to the development of ultra-thin, high-speed, and single-shot 3D imagers. Recently, there have been various attempts at demonstrating another useful aspect of lensless cameras - their ability to preserve the privacy of a scene by capturing encrypted measurements.… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 11 pages, 11 figures, 3 tables

  14. arXiv:2312.00634  [pdf

    eess.IV cs.CV

    A Recent Survey of Vision Transformers for Medical Image Segmentation

    Authors: Asifullah Khan, Zunaira Rauf, Abdul Rehman Khan, Saima Rathore, Saddam Hussain Khan, Najmus Saher Shah, Umair Farooq, Hifsa Asif, Aqsa Asif, Umme Zahoora, Rafi Ullah Khalil, Suleman Qamar, Umme Hani Asif, Faiza Babar Khan, Abdul Majid, Jeonghwan Gwak

    Abstract: Medical image segmentation plays a crucial role in various healthcare applications, enabling accurate diagnosis, treatment planning, and disease monitoring. Traditionally, convolutional neural networks (CNNs) dominated this domain, excelling at local feature extraction. However, their limitations in capturing long-range dependencies across image regions pose challenges for segmenting complex, inte… ▽ More

    Submitted 18 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

  15. arXiv:2311.10754  [pdf

    eess.IV cs.CV

    A Recent Survey of the Advancements in Deep Learning Techniques for Monkeypox Disease Detection

    Authors: Saddam Hussain Khan, Rashid Iqbal, Saeeda Naz

    Abstract: Monkeypox (MPox) is a zoonotic infectious disease induced by the MPox Virus, part of the poxviridae orthopoxvirus group initially discovered in Africa and gained global attention in mid-2022 with cases reported outside endemic areas. Symptoms include headaches, chills, fever, smallpox, measles, and chickenpox-like skin manifestations and the WHO officially announced MPox as a global public health… ▽ More

    Submitted 23 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 53 pages, 16 figures, 7 tables

  16. arXiv:2310.20140  [pdf

    eess.IV cs.CV

    Synthesizing Diabetic Foot Ulcer Images with Diffusion Model

    Authors: Reza Basiri, Karim Manji, Francois Harton, Alisha Poonja, Milos R. Popovic, Shehroz S. Khan

    Abstract: Diabetic Foot Ulcer (DFU) is a serious skin wound requiring specialized care. However, real DFU datasets are limited, hindering clinical training and research activities. In recent years, generative adversarial networks and diffusion models have emerged as powerful tools for generating synthetic images with remarkable realism and diversity in many applications. This paper explores the potential of… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 8 pages, 3 figures, 6th Workshop on AI for Aging, Rehabilitation and Intelligent Assisted Living at European Conference on Machine Learning, Italy, 2023

  17. arXiv:2310.17142  [pdf

    eess.AS cs.SD

    Single channel speech enhancement by colored spectrograms

    Authors: Sania Gul, Muhammad Salman Khan, Muhammad Fazeel

    Abstract: Speech enhancement concerns the processes required to remove unwanted background sounds from the target speech to improve its quality and intelligibility. In this paper, a novel approach for single-channel speech enhancement is presented, using colored spectrograms. We propose the use of a deep neural network (DNN) architecture adapted from the pix2pix generative adversarial network (GAN) and trai… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 18 pages, 6 figures, 5 tables

  18. arXiv:2310.11651  [pdf, other

    eess.SY cs.CR

    US Microelectronics Packaging Ecosystem: Challenges and Opportunities

    Authors: Rouhan Noor, Himanandhan Reddy Kottur, Patrick J Craig, Liton Kumar Biswas, M Shafkat M Khan, Nitin Varshney, Hamed Dalir, Elif Akçalı, Bahareh Ghane Motlagh, Charles Woychik, Yong-Kyu Yoon, Navid Asadizanjani

    Abstract: The semiconductor industry is experiencing a significant shift from traditional methods of shrinking devices and reducing costs. Chip designers actively seek new technological solutions to enhance cost-effectiveness while incorporating more features into the silicon footprint. One promising approach is Heterogeneous Integration (HI), which involves advanced packaging techniques to integrate indepe… ▽ More

    Submitted 30 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: 22 pages, 8 figures

  19. arXiv:2310.06434  [pdf, other

    cs.CL cs.AI cs.MM cs.SD eess.AS

    Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

    Authors: Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

    Abstract: We introduce a new cross-modal fusion technique designed for generative error correction in automatic speech recognition (ASR). Our methodology leverages both acoustic information and external linguistic representations to generate accurate speech transcription contexts. This marks a step towards a fresh paradigm in generative error correction within the realm of n-best hypotheses. Unlike the exis… ▽ More

    Submitted 16 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 as main paper. 10 pages. Revised math notations. GitHub: https://github.com/Srijith-rkr/Whispering-LLaMA

  20. arXiv:2309.11784  [pdf, other

    eess.SY eess.SP

    Collaborative Fault-Identification & Reconstruction in Multi-Agent Systems

    Authors: Shiraz Khan, Inseok Hwang

    Abstract: The conventional solutions for fault-detection, identification, and reconstruction (FDIR) require centralized decision-making mechanisms which are typically combinatorial in their nature, necessitating the design of an efficient distributed FDIR mechanism that is suitable for multi-agent applications. To this end, we develop a general framework for efficiently reconstructing a sparse vector being… ▽ More

    Submitted 22 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

  21. arXiv:2308.13833  [pdf, other

    cs.NI eess.SP

    A Cognitive Network Architecture for Vehicle-to-Network (V2N) Communications over Smart Meters for URLLC

    Authors: Shoaib Ahmed, Sayonto Khan, Kumudu S. Munasinghe, Md. Farhad Hossain

    Abstract: With the rapid advancement of smart city infrastructure, vehicle-to-network (V2N) communication has emerged as a crucial technology to enable intelligent transportation systems (ITS). The investigation of new methods to improve V2N communications is sparked by the growing need for high-speed and dependable communications in vehicular networks. To achieve ultra-reliable low latency communication (U… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 12 pages, 19 figures, IEEE format

  22. arXiv:2308.01981  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    CartiMorph: a framework for automated knee articular cartilage morphometrics

    Authors: Yongcheng Yao, Junru Zhong, Li** Zhang, Sheheryar Khan, Weitian Chen

    Abstract: We introduce CartiMorph, a framework for automated knee articular cartilage morphometrics. It takes an image as input and generates quantitative metrics for cartilage subregions, including the percentage of full-thickness cartilage loss (FCL), mean thickness, surface area, and volume. CartiMorph leverages the power of deep learning models for hierarchical image feature representation. Deep learnin… ▽ More

    Submitted 20 November, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: This preprint is an proofread version of a paper published in Medical Image Analysis (2023), which can be found at https://doi.org/10.1016/j.media.2023.103035

  23. arXiv:2308.00856  [pdf, other

    cs.LG cs.CR eess.IV

    Differential Privacy for Adaptive Weight Aggregation in Federated Tumor Segmentation

    Authors: Muhammad Irfan Khan, Esa Alhoniemi, Elina Kontio, Suleiman A. Khan, Mojtaba Jafaritadi

    Abstract: Federated Learning (FL) is a distributed machine learning approach that safeguards privacy by creating an impartial global model while respecting the privacy of individual client data. However, the conventional FL method can introduce security risks when dealing with diverse client data, potentially compromising privacy and data integrity. To address these challenges, we present a differential pri… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  24. arXiv:2308.00274  [pdf, other

    eess.SY

    Exploiting Sparsity for Localization of Large-Scale Wireless Sensor Networks

    Authors: Shiraz Khan, Inseok Hwang, James Goppert

    Abstract: Wireless Sensor Network (WSN) localization refers to the problem of determining the position of each of the agents in a WSN using noisy measurement information. In many cases, such as in distance and bearing-based localization, the measurement model is a nonlinear function of the agents' positions, leading to pairwise interconnections between the agents. As the optimal solution for the WSN localiz… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  25. arXiv:2308.00268  [pdf, other

    eess.SP

    Distributed Gaussian Mixture PHD Filtering under Communication Constraints

    Authors: Shiraz Khan, Yi-Chieh Sun, Inseok Hwang

    Abstract: The Gaussian Mixture Probability Hypothesis Density (GM-PHD) filter is an almost exact closed-form approximation to the Bayes-optimal multi-target tracking algorithm. Due to its optimality guarantees and ease of implementation, it has been studied extensively in the literature. However, the challenges involved in implementing the GM-PHD filter efficiently in a distributed (multi-sensor) setting ha… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  26. arXiv:2307.13642  [pdf, other

    cs.LG cs.AI eess.SY

    Safety Margins for Reinforcement Learning

    Authors: Alexander Grushin, Walt Woods, Alvaro Velasquez, Simon Khan

    Abstract: Any autonomous controller will be unsafe in some situations. The ability to quantitatively identify when these unsafe situations are about to occur is crucial for drawing timely human oversight in, e.g., freight transportation applications. In this work, we demonstrate that the true criticality of an agent's situation can be robustly defined as the mean reduction in reward given some number of ran… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: 2 pages, 2 figures. Presented at the 2023 IEEE Conference on Artificial Intelligence (CAI), Santa Clara, CA

    MSC Class: 68T07 ACM Class: I.2.6

  27. arXiv:2307.12078  [pdf, other

    eess.SY eess.SP

    Recovery of Localization Errors in Sensor Networks using Inter-Agent Measurements

    Authors: Shiraz Khan, Inseok Hwang

    Abstract: A practical challenge which arises in the operation of sensor networks is the presence of sensor faults, biases, or adversarial attacks, which can lead to significant errors incurring in the localization of the agents, thereby undermining the security and performance of the network. We consider the problem of identifying and correcting the localization errors using inter-agent measurements, such a… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  28. arXiv:2307.10814  [pdf, other

    cs.CL cs.NE cs.SD eess.AS

    Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages

    Authors: Ephrem Afele Retta, Richard Sutcliffe, Jabar Mahmood, Michael Abebe Berwo, Eiad Almekhlafi, Sajjad Ahmed Khan, Shehzad Ashraf Chaudhry, Mustafa Mhamed, Jun Feng

    Abstract: In a conventional Speech emotion recognition (SER) task, a classifier for a given language is trained on a pre-existing dataset for that same language. However, where training data for a language does not exist, data from other languages can be used instead. We experiment with cross-lingual and multilingual SER, working with Amharic, English, German and URDU. For Amharic, we use our own publicly-a… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 16 pages, 9 tables, 5 figures

  29. arXiv:2307.07822  [pdf, other

    eess.SY

    Design Analysis and Experimental Validation of Relaxation Oscillator-Based Circuit for R-C Sensors

    Authors: Mohamad Idris Wani, Sadan Saquib Khan, Benish Jan, Meraj Ahmad, Maryam Shojaei Baghini, Laxmeesha Somappa, Shahid Malik

    Abstract: Relaxation oscillator-based circuits are widely used for interfacing various resistive and capacitive sensors. The electrical equivalent of most resistive and capacitive sensors is represented using a parallel combination of resistor and capacitor. The relaxation oscillator-based circuits are not suitable for parallel R-C sensors. In this paper, we propose a modified circuit for parallel R-C senso… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

  30. arXiv:2307.07414  [pdf, other

    eess.SY

    An Embedded Auto-Calibrated Offset Current Compensation Technique for PPG/fNIRS System

    Authors: Sadan Saquib Khan, Sumit Kumar, Benish Jan, Laxmeesha Somappa, Shahid Malik

    Abstract: Usually, the current generated by the photodiode proportional to the oxygenated blood in the photoplethysmography (PPG) and functional infrared spectroscopy (fNIRS) based recording systems is small as compared to the offset-current. The offset current is the combination of the dark current of the photodiode, the current due to ambient light, and the current due to the reflected light from fat and… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  31. arXiv:2307.07269  [pdf, other

    eess.IV cs.CV cs.LG

    Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation

    Authors: Asif Hanif, Muzammal Naseer, Salman Khan, Mubarak Shah, Fahad Shahbaz Khan

    Abstract: It is imperative to ensure the robustness of deep learning models in critical applications such as, healthcare. While recent advances in deep learning have improved the performance of volumetric medical image segmentation models, these models cannot be deployed for real-world applications immediately due to their vulnerability to adversarial attacks. We present a 3D frequency domain adversarial at… ▽ More

    Submitted 20 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted in MICCAI 2023 conference

  32. arXiv:2306.14255  [pdf, other

    eess.IV cs.CV

    AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net

    Authors: Akib Mohammed Khan, Alif Ashrafee, Fahim Shahriar Khan, Md. Bakhtiar Hasan, Md. Hasanul Kabir

    Abstract: Manually inspecting polyps from a colonoscopy for colorectal cancer or performing a biopsy on skin lesions for skin cancer are time-consuming, laborious, and complex procedures. Automatic medical image segmentation aims to expedite this diagnosis process. However, numerous challenges exist due to significant variations in the appearance and sizes of objects with no distinct boundaries. This paper… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted in 2023 International Joint Conference on Neural Networks (IJCNN 2023)

  33. arXiv:2306.09320  [pdf, other

    eess.IV cs.CV

    Learnable Weight Initialization for Volumetric Medical Image Segmentation

    Authors: Shahina Kunhimon, Abdelrahman Shaker, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

    Abstract: Hybrid volumetric medical image segmentation models, combining the advantages of local convolution and global attention, have recently received considerable attention. While mainly focusing on architectural modifications, most existing hybrid approaches still use conventional data-independent weight initialization schemes which restrict their performance due to ignoring the inherent volumetric nat… ▽ More

    Submitted 3 April, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted at Elsevier AI in Medicine Journal

  34. arXiv:2305.16789  [pdf, other

    cs.LG cs.CV eess.SP

    Modulate Your Spectrum in Self-Supervised Learning

    Authors: Xi Weng, Yunhao Ni, Tengwei Song, Jie Luo, Rao Muhammad Anwer, Salman Khan, Fahad Shahbaz Khan, Lei Huang

    Abstract: Whitening loss offers a theoretical guarantee against feature collapse in self-supervised learning (SSL) with joint embedding architectures. Typically, it involves a hard whitening approach, transforming the embedding and applying loss to the whitened output. In this work, we introduce Spectral Transformation (ST), a framework to modulate the spectrum of embedding and to seek for functions beyond… ▽ More

    Submitted 21 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ICLR 2024. The code is available at https://github.com/winci-ai/intl

  35. arXiv:2305.11244  [pdf, other

    cs.CL cs.AI cs.LG cs.NE eess.AS

    A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model

    Authors: Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

    Abstract: In this work, we explore Parameter-Efficient-Learning (PEL) techniques to repurpose a General-Purpose-Speech (GSM) model for Arabic dialect identification (ADI). Specifically, we investigate different setups to incorporate trainable features into a multi-layer encoder-decoder GSM formulation under frozen pre-trained settings. Our architecture includes residual adapter and model reprogramming (inpu… ▽ More

    Submitted 3 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to Interspeech 2023, 5 pages. Code is available at: https://github.com/Srijith-rkr/KAUST-Whisper-Adapter under MIT license

  36. arXiv:2304.14922  [pdf, other

    eess.SP cs.AI cs.LG

    Supervised and Unsupervised Deep Learning Approaches for EEG Seizure Prediction

    Authors: Zakary Georgis-Yap, Milos R. Popovic, Shehroz S. Khan

    Abstract: Epilepsy affects more than 50 million people worldwide, making it one of the world's most prevalent neurological diseases. The main symptom of epilepsy is seizures, which occur abruptly and can cause serious injury or death. The ability to predict the occurrence of an epileptic seizure could alleviate many risks and stresses people with epilepsy face. We formulate the problem of detecting preictal… ▽ More

    Submitted 3 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 16 figures, 9 tables

    Journal ref: Journal of Health Informatics Research, 2024

  37. arXiv:2304.06036  [pdf, other

    eess.SP cs.HC

    Upper Limb Movement Execution Classification using Electroencephalography for Brain Computer Interface

    Authors: Saadat Ullah Khan, Muhammad Majid, Syed Muhammad Anwar

    Abstract: An accurate classification of upper limb movements using electroencephalography (EEG) signals is gaining significant importance in recent years due to the prevalence of brain-computer interfaces. The upper limbs in the human body are crucial since different skeletal segments combine to make a range of motion that helps us in our trivial daily tasks. Decoding EEG-based upper limb movements can be o… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  38. arXiv:2304.03307  [pdf, other

    cs.CV eess.IV

    Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting

    Authors: Syed Talal Wasim, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Adopting contrastive image-text pretrained models like CLIP towards video classification has gained attention due to its cost-effectiveness and competitive performance. However, recent works in this area face a trade-off. Finetuning the pretrained model to achieve strong supervised performance results in low zero-shot generalization. Similarly, freezing the backbone to retain zero-shot capability… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR-2023. Codes/models available at https://github.com/TalalWasim/Vita-CLIP

  39. arXiv:2304.02836  [pdf, other

    eess.IV cs.CV cs.LG

    Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification

    Authors: Thomas Z. Li, John M. Still, Kaiwen Xu, Ho Hin Lee, Leon Y. Cai, Aravind R. Krishnan, Riqiang Gao, Mirza S. Khan, Sanja Antic, Michael Kammer, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman, Thomas A. Lasko

    Abstract: The accuracy of predictive models for solitary pulmonary nodule (SPN) diagnosis can be greatly increased by incorporating repeat imaging and medical context, such as electronic health records (EHRs). However, clinically routine modalities such as imaging and diagnostic codes can be asynchronous and irregularly sampled over different time scales which are obstacles to longitudinal multimodal learni… ▽ More

    Submitted 29 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to MICCAI 2023

  40. arXiv:2304.01992  [pdf, other

    eess.IV cs.CV

    Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification

    Authors: Amandeep Kumar, Ankan kumar Bhunia, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Jorma Laaksonen, Fahad Shahbaz Khan

    Abstract: In this work, we propose a few-shot colorectal tissue image generation method for addressing the scarcity of histopathological training data for rare cancer tissues. Our few-shot generation method, named XM-GAN, takes one base and a pair of reference tissue images as input and generates high-quality yet diverse images. Within our XM-GAN, a novel controllable fusion block densely aggregates local r… ▽ More

    Submitted 4 July, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Early Accept in MICCAI 2023

  41. arXiv:2303.12073  [pdf, other

    eess.IV cs.CV

    3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers

    Authors: Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner, Mubarak Shah, Fahad Shahbaz Khan

    Abstract: Accurate 3D mitochondria instance segmentation in electron microscopy (EM) is a challenging problem and serves as a prerequisite to empirically analyze their distributions and morphology. Most existing approaches employ 3D convolutions to obtain representative features. However, these convolution-based approaches struggle to effectively capture long-range dependencies in the volume mitochondria da… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 5 Tables, 2 page references

  42. arXiv:2303.00307  [pdf, other

    cs.CR cs.NI eess.SP

    Access-based Lightweight Physical Layer Authentication for the Internet of Things Devices

    Authors: Saud Khan, Chandra Thapa, Salman Durrani, Seyit Camtepe

    Abstract: Physical-layer authentication is a popular alternative to the conventional key-based authentication for internet of things (IoT) devices due to their limited computational capacity and battery power. However, this approach has limitations due to poor robustness under channel fluctuations, reconciliation overhead, and no clear safeguard distance to ensure the secrecy of the generated authentication… ▽ More

    Submitted 6 November, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in IEEE Internet of Things Journal

    Journal ref: IEEE Internet of Things Journal, vol. 11, no. 7, pp. 11312-11326, April, 2024

  43. arXiv:2302.03224  [pdf, other

    cs.LG eess.SP

    Undersampling and Cumulative Class Re-decision Methods to Improve Detection of Agitation in People with Dementia

    Authors: Zhidong Meng, Andrea Iaboni, Bing Ye, Kristine Newman, Alex Mihailidis, Zhihong Deng, Shehroz S. Khan

    Abstract: Agitation is one of the most prevalent symptoms in people with dementia (PwD) that can place themselves and the caregiver's safety at risk. Develo** objective agitation detection approaches is important to support health and safety of PwD living in a residential setting. In a previous study, we collected multimodal wearable sensor data from 17 participants for 600 days and developed machine lear… ▽ More

    Submitted 15 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 19 pages, 6 figures

  44. arXiv:2302.02619  [pdf

    eess.IV cs.CV cs.LG

    COVID-19 Infection Analysis Framework using Novel Boosted CNNs and Radiological Images

    Authors: Saddam Hussain Khan

    Abstract: COVID-19 is a new pathogen that first appeared in the human population at the end of 2019, and it can lead to novel variants of pneumonia after infection. COVID-19 is a rapidly spreading infectious disease that infects humans faster. Therefore, efficient diagnostic systems may accurately identify infected patients and thus help control their spread. In this regard, a new two-stage analysis framewo… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 26 Pages, 11 Figures, 6 Tables. arXiv admin note: text overlap with arXiv:2209.10963

  45. arXiv:2212.14618  [pdf

    cs.SD cs.LG eess.AS

    Blind Restoration of Real-World Audio by 1D Operational GANs

    Authors: Turker Ince, Serkan Kiranyaz, Ozer Can Devecioglu, Muhammad Salman Khan, Muhammad Chowdhury, Moncef Gabbouj

    Abstract: Objective: Despite numerous studies proposed for audio restoration in the literature, most of them focus on an isolated restoration problem such as denoising or dereverberation, ignoring other artifacts. Moreover, assuming a noisy or reverberant environment with limited number of fixed signal-to-distortion ratio (SDR) levels is a common practice. However, real-world audio is often corrupted by a b… ▽ More

    Submitted 20 January, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

  46. arXiv:2212.02477  [pdf

    eess.IV cs.CV cs.LG

    Malaria Parasitic Detection using a New Deep Boosted and Ensemble Learning Framework

    Authors: Saddam Hussain Khan, Tahani Jaser Alahmadi

    Abstract: Malaria is a potentially fatal plasmodium parasite injected by female anopheles mosquitoes that infect red blood cells and millions worldwide yearly. However, specialists' manual screening in clinical practice is laborious and prone to error. Therefore, a novel Deep Boosted and Ensemble Learning (DBEL) framework, comprising the stacking of new Boosted-BR-STM convolutional neural networks (CNN) and… ▽ More

    Submitted 19 March, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 26 pages, 10 figures, 9 Tables

  47. arXiv:2211.16571  [pdf

    eess.IV cs.CV cs.LG

    Brain Tumor MRI Classification using a Novel Deep Residual and Regional CNN

    Authors: Mirza Mumtaz Zahoor, Saddam Hussain Khan

    Abstract: Brain tumor classification is crucial for clinical analysis and an effective treatment plan to cure patients. Deep learning models help radiologists to accurately and efficiently analyze tumors without manual intervention. However, brain tumor analysis is challenging because of its complex structure, texture, size, location, and appearance. Therefore, a novel deep residual and regional-based Res-B… ▽ More

    Submitted 10 December, 2022; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 21 pages, 11 figures, 4 tables

  48. arXiv:2211.13114  [pdf, other

    cs.CV eess.SP

    Step Counting with Attention-based LSTM

    Authors: Shehroz S. Khan, Ali Abedi

    Abstract: Physical activity is recognized as an essential component of overall health. One measure of physical activity, the step count, is well known as a predictor of long-term morbidity and mortality. Step Counting (SC) is the automated counting of the number of steps an individual takes over a specified period of time and space. Due to the ubiquity of smartphones and smartwatches, most current SC approa… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Report number: EFI-94-11

  49. arXiv:2211.08350  [pdf, other

    cs.HC cs.LG eess.SP q-bio.NC

    Motor imagery classification using EEG spectrograms

    Authors: Saadat Ullah Khan, Muhammad Majid, Syed Muhammad Anwar

    Abstract: The loss of limb motion arising from damage to the spinal cord is a disability that could effect people while performing their day-to-day activities. The restoration of limb movement would enable people with spinal cord injury to interact with their environment more naturally and this is where a brain-computer interface (BCI) system could be beneficial. The detection of limb movement imagination (… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Submitted to ISBI 2023

  50. arXiv:2211.03615  [pdf, other

    cs.LG cs.AI cs.DC eess.SP

    MAISON -- Multimodal AI-based Sensor platform for Older Individuals

    Authors: Ali Abedi, Faranak Dayyani, Charlene Chu, Shehroz S. Khan

    Abstract: There is a global aging population requiring the need for the right tools that can enable older adults' greater independence and the ability to age at home, as well as assist healthcare workers. It is feasible to achieve this objective by building predictive models that assist healthcare workers in monitoring and analyzing older adults' behavioral, functional, and psychological data. To develop su… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.