Search | arXiv e-print repository

Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture

Authors: Zarif Ahmed, Chowdhury Nur E Alam Siddiqi, Fardifa Fathmiul Alam, Tasnim Ahmed, Tareque Mohmud Chowdhury

Abstract: Nuclei instance segmentation is crucial in oncological diagnosis and cancer pathology research. H&E stained images are commonly used for medical diagnosis, but pre-processing is necessary before using them for image processing tasks. Two principal pre-processing methods are formalin-fixed paraffin-embedded samples (FFPE) and frozen tissue samples (FS). While FFPE is widely used, it is time-consumi… ▽ More Nuclei instance segmentation is crucial in oncological diagnosis and cancer pathology research. H&E stained images are commonly used for medical diagnosis, but pre-processing is necessary before using them for image processing tasks. Two principal pre-processing methods are formalin-fixed paraffin-embedded samples (FFPE) and frozen tissue samples (FS). While FFPE is widely used, it is time-consuming, while FS samples can be processed quickly. Analyzing H&E stained images derived from fast sample preparation, staining, and scanning can pose difficulties due to the swift process, which can result in the degradation of image quality. This paper proposes a method that leverages the unique optical characteristics of H&E stained images. A three-branch U-Net architecture has been implemented, where each branch contributes to the final segmentation results. The process includes applying watershed algorithm to separate overlap** regions and enhance accuracy. The Triple U-Net architecture comprises an RGB branch, a Hematoxylin branch, and a Segmentation branch. This study focuses on a novel dataset named CryoNuSeg. The results obtained through robust experiments outperform the state-of-the-art results across various metrics. The benchmark score for this dataset is AJI 52.5 and PQ 47.7, achieved through the implementation of U-Net Architecture. However, the proposed Triple U-Net architecture achieves an AJI score of 67.41 and PQ of 50.56. The proposed architecture improves more on AJI than other evaluation metrics, which further justifies the superiority of the Triple U-Net architecture over the baseline U-Net model, as AJI is a more strict evaluation metric. The use of the three-branch U-Net model, followed by watershed post-processing, significantly surpasses the benchmark scores, showing substantial improvement in the AJI score △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: To be published in "6th IVPR & 11th ICIEV"

arXiv:2311.18260 [pdf, other]

Consensus, dissensus and synergy between clinicians and specialist foundation models in radiology report generation

Authors: Ryutaro Tanno, David G. T. Barrett, Andrew Sellergren, Sumedh Ghaisas, Sumanth Dathathri, Abigail See, Johannes Welbl, Karan Singhal, Shekoofeh Azizi, Tao Tu, Mike Schaekermann, Rhys May, Roy Lee, SiWai Man, Zahra Ahmed, Sara Mahdavi, Yossi Matias, Joelle Barral, Ali Eslami, Danielle Belgrave, Vivek Natarajan, Shravya Shetty, Pushmeet Kohli, Po-Sen Huang, Alan Karthikesalingam , et al. (1 additional authors not shown)

Abstract: Radiology reports are an instrumental part of modern medicine, informing key clinical decisions such as diagnosis and treatment. The worldwide shortage of radiologists, however, restricts access to expert care and imposes heavy workloads, contributing to avoidable errors and delays in report delivery. While recent progress in automated report generation with vision-language models offer clear pote… ▽ More Radiology reports are an instrumental part of modern medicine, informing key clinical decisions such as diagnosis and treatment. The worldwide shortage of radiologists, however, restricts access to expert care and imposes heavy workloads, contributing to avoidable errors and delays in report delivery. While recent progress in automated report generation with vision-language models offer clear potential in ameliorating the situation, the path to real-world adoption has been stymied by the challenge of evaluating the clinical quality of AI-generated reports. In this study, we build a state-of-the-art report generation system for chest radiographs, $\textit{Flamingo-CXR}$, by fine-tuning a well-known vision-language foundation model on radiology data. To evaluate the quality of the AI-generated reports, a group of 16 certified radiologists provide detailed evaluations of AI-generated and human written reports for chest X-rays from an intensive care setting in the United States and an inpatient setting in India. At least one radiologist (out of two per case) preferred the AI report to the ground truth report in over 60$\%$ of cases for both datasets. Amongst the subset of AI-generated reports that contain errors, the most frequently cited reasons were related to the location and finding, whereas for human written reports, most mistakes were related to severity and finding. This disparity suggested potential complementarity between our AI system and human experts, prompting us to develop an assistive scenario in which Flamingo-CXR generates a first-draft report, which is subsequently revised by a clinician. This is the first demonstration of clinician-AI collaboration for report writing, and the resultant reports are assessed to be equivalent or preferred by at least one radiologist to reports written by experts alone in 80$\%$ of in-patient cases and 60$\%$ of intensive care cases. △ Less

Submitted 20 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

arXiv:2309.09090 [pdf, other]

Optimal Photodetector Size for High-Speed Free-Space Optics Receivers

Authors: Muhammad Salman Bashir, Qasim Zeeshan Ahmed, Mohamed-Slim Alouini

Abstract: The selection of an optimal photodetector area is closely linked to the attainment of higher data rates in optical wireless communication receivers. If the photodetector area is too large, the channel capacity degrades due to lower modulation bandwidth of the detector. A smaller photodetector maximizes the bandwidth, but minimizes the captured signal power and the subsequent signal-to-noise ratio.… ▽ More The selection of an optimal photodetector area is closely linked to the attainment of higher data rates in optical wireless communication receivers. If the photodetector area is too large, the channel capacity degrades due to lower modulation bandwidth of the detector. A smaller photodetector maximizes the bandwidth, but minimizes the captured signal power and the subsequent signal-to-noise ratio. Therein lies an opportunity in this trade-off to maximize the channel rate by choosing the optimal photodetector area. In this study, we have optimized the photodetector area in order to maximize the channel capacity of a free-space optical link for a diverse set of communication scenarios. We believe that the study in this paper in general -- and the closed-form solutions derived in this study in particular -- will be helpful to maximize achievable data rates of a wide gamut of optical wireless communication systems: from long range deep space optical links to short range indoor visible light communication systems. △ Less

Submitted 16 September, 2023; originally announced September 2023.

arXiv:2309.01319 [pdf, other]

An ML-assisted OTFS vs. OFDM adaptable modem

Authors: I. Zakir Ahmed, Hamid R. Sadjadpour

Abstract: The Orthogonal-Time-Frequency-Space (OTFS) signaling is known to be resilient to doubly-dispersive channels, which impacts high mobility scenarios. On the other hand, the Orthogonal-Frequency-Division-Multiplexing (OFDM) waveforms enjoy the benefits of the reuse of legacy architectures, simplicity of receiver design, and low-complexity detection. Several studies that compare the performance of OFD… ▽ More The Orthogonal-Time-Frequency-Space (OTFS) signaling is known to be resilient to doubly-dispersive channels, which impacts high mobility scenarios. On the other hand, the Orthogonal-Frequency-Division-Multiplexing (OFDM) waveforms enjoy the benefits of the reuse of legacy architectures, simplicity of receiver design, and low-complexity detection. Several studies that compare the performance of OFDM and OTFS have indicated mixed outcomes due to the plethora of system parameters at play beyond high-mobility conditions. In this work, we exemplify this observation using simulations and propose a deep neural network (DNN)-based adaptation scheme to switch between using either an OTFS or OFDM signal processing chain at the transmitter and receiver for optimal mean-squared-error (MSE) performance. The DNN classifier is trained to switch between the two schemes by observing the channel condition, received SNR, and modulation format. We compare the performance of the OTFS, OFDM, and the proposed switched-waveform scheme. The simulations indicate superior performance with the proposed scheme with a well-trained DNN, thus improving the MSE performance of the communication significantly. △ Less

Submitted 19 October, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

Comments: Accepted for publication in IEEE Future Networks World Forum 2023

arXiv:2309.00723 [pdf, other]

Contextual Biasing of Named-Entities with Large Language Models

Authors: Chuanneng Sun, Zeeshan Ahmed, Yingyi Ma, Zhe Liu, Lucas Kabela, Yutong Pang, Ozlem Kalinli

Abstract: This paper studies contextual biasing with Large Language Models (LLMs), where during second-pass rescoring additional contextual information is provided to a LLM to boost Automatic Speech Recognition (ASR) performance. We propose to leverage prompts for a LLM without fine tuning during rescoring which incorporate a biasing list and few-shot examples to serve as additional information when calcula… ▽ More This paper studies contextual biasing with Large Language Models (LLMs), where during second-pass rescoring additional contextual information is provided to a LLM to boost Automatic Speech Recognition (ASR) performance. We propose to leverage prompts for a LLM without fine tuning during rescoring which incorporate a biasing list and few-shot examples to serve as additional information when calculating the score for the hypothesis. In addition to few-shot prompt learning, we propose multi-task training of the LLM to predict both the entity class and the next token. To improve the efficiency for contextual biasing and to avoid exceeding LLMs' maximum sequence lengths, we propose dynamic prompting, where we select the most likely class using the class tag prediction, and only use entities in this class as contexts for next token prediction. Word Error Rate (WER) evaluation is performed on i) an internal calling, messaging, and dictation dataset, and ii) the SLUE-Voxpopuli dataset. Results indicate that biasing lists and few-shot examples can achieve 17.8% and 9.6% relative improvement compared to first pass ASR, and that multi-task training and dynamic prompting can achieve 20.0% and 11.3% relative WER improvement, respectively. △ Less

Submitted 21 September, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

Comments: 5 pages, 4 figures. Conference: ICASSP 2024

MSC Class: 68T10 ACM Class: I.2.7

arXiv:2304.11091 [pdf, other]

doi 10.1109/JSEN.2022.3198680.

Feature-Based Generalized Gaussian Distribution Method for NLoS Detection in Ultra-Wideband (UWB) Indoor Positioning System

Authors: Fuhu Che, Qasim Zeeshan Ahmed, Jaron Fontaine, Ben Van Herbruggen, Adnan Shahid, Eli De Poorter, Pavlos I. Lazaridis

Abstract: Non-Line-of-Sight (NLoS) propagation condition is a crucial factor affecting the precision of the localization in the Ultra-Wideband (UWB) Indoor Positioning System (IPS). Numerous supervised Machine Learning (ML) approaches have been applied for NLoS identification to improve the accuracy of the IPS. However, it is difficult for existing ML approaches to maintain a high classification accuracy wh… ▽ More Non-Line-of-Sight (NLoS) propagation condition is a crucial factor affecting the precision of the localization in the Ultra-Wideband (UWB) Indoor Positioning System (IPS). Numerous supervised Machine Learning (ML) approaches have been applied for NLoS identification to improve the accuracy of the IPS. However, it is difficult for existing ML approaches to maintain a high classification accuracy when the database contains a small number of NLoS signals and a large number of Line-of-Sight (LoS) signals. The inaccurate localization of the target node caused by this small number of NLoS signals can still be problematic. To solve this issue, we propose feature-based Gaussian Distribution (GD) and Generalized Gaussian Distribution (GGD) NLoS detection algorithms. By employing our detection algorithm for the imbalanced dataset, a classification accuracy of $96.7\%$ and $98.0\%$ can be achieved. We also compared the proposed algorithm with the existing cutting-edge such as Support-Vector-Machine (SVM), Decision Tree (DT), Naive Bayes (NB), and Neural Network (NN), which can achieve an accuracy of $92.6\%$, $92.8\%$, $93.2\%$, and $95.5\%$, respectively. The results demonstrate that the GGD algorithm can achieve high classification accuracy with the imbalanced dataset. Finally, the proposed algorithm can also achieve a higher classification accuracy for different ratios of LoS and NLoS signals which proves the robustness and effectiveness of the proposed method. △ Less

Submitted 14 April, 2023; originally announced April 2023.

Journal ref: IEEE Sensors Journal, vol. 22, no. 19, pp. 18726-18739, 1 Oct.1, 2022,

arXiv:2304.11067 [pdf, other]

Novel Fine-Tuned Attribute Weighted Naïve Bayes NLoS Classifier for UWB Positioning

Authors: Fuhu Che, Qasim Zeeshan Ahmed, Fahd Ahmed Khan, Faheem A. Khan

Abstract: In this paper, we propose a novel Fine-Tuned attribute Weighted Naïve Bayes (FT-WNB) classifier to identify the Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) for UltraWide Bandwidth (UWB) signals in an Indoor Positioning System (IPS). The FT-WNB classifier assigns each signal feature a specific weight and fine-tunes its probabilities to address the mismatch between the predicted and actual clas… ▽ More In this paper, we propose a novel Fine-Tuned attribute Weighted Naïve Bayes (FT-WNB) classifier to identify the Line-of-Sight (LoS) and Non-Line-of-Sight (NLoS) for UltraWide Bandwidth (UWB) signals in an Indoor Positioning System (IPS). The FT-WNB classifier assigns each signal feature a specific weight and fine-tunes its probabilities to address the mismatch between the predicted and actual class. The performance of the FT-WNB classifier is compared with the state-of-the-art Machine Learning (ML) classifiers such as minimum Redundancy Maximum Relevance (mRMR)- $k$-Nearest Neighbour (KNN), Support Vector Machine (SVM), Decision Tree (DT), Naïve Bayes (NB), and Neural Network (NN). It is demonstrated that the proposed classifier outperforms other algorithms by achieving a high NLoS classification accuracy of $99.7\%$ with imbalanced data and $99.8\%$ with balanced data. The experimental results indicate that our proposed FT-WNB classifier significantly outperforms the existing state-of-the-art ML methods for LoS and NLoS signals in IPS in the considered scenario. △ Less

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2301.06228 [pdf, other]

An information-theoretic branch-and-prune algorithm for discrete phase optimization of RIS in massive MIMO

Authors: I. Zakir Ahmed, Hamid R. Sadjadpour, Shahram Yousefi

Abstract: In this paper, we consider passive RIS-assisted multi-user communication between wireless nodes to improve the blocked line-of-sight (LOS) link performance. The wireless nodes are assumed to be equipped with Massive Multiple-Input Multiple-Output antennas, hybrid precoder, combiner, and low-resolution analog-to-digital converters (ADCs). We first derive the expression for the Cramer-Rao lower boun… ▽ More In this paper, we consider passive RIS-assisted multi-user communication between wireless nodes to improve the blocked line-of-sight (LOS) link performance. The wireless nodes are assumed to be equipped with Massive Multiple-Input Multiple-Output antennas, hybrid precoder, combiner, and low-resolution analog-to-digital converters (ADCs). We first derive the expression for the Cramer-Rao lower bound (CRLB) of the Mean Squared Error (MSE) of the received and combined signal at the intended receiver under interference. By appropriate design of the hybrid precoder, combiner, and RIS phase settings, it can be shown that the MSE achieves the CRLB. We further show that minimizing the MSE w.r.t. the phase settings of the RIS is equivalent to maximizing the throughput and energy efficiency of the system. We then propose a novel Information-Directed Branch-and-Prune (IDBP) algorithm to derive the phase settings of the RIS. We, for the first time in the literature, use an information-theoretic measure to decide on the pruning rules in a tree-search algorithm to arrive at the RIS phase-setting solution, which is vastly different compared to the traditional branch-and-bound algorithm that uses bounds of the cost function to define the pruning rules. In addition, we provide the theoretical guarantees of the near-optimality of the RIS phase-setting solution thus obtained using the Asymptotic Equipartition property. This also ensures near-optimal throughput and MSE performance. △ Less

Submitted 15 January, 2023; originally announced January 2023.

Comments: Accepted for publication in "IEEE Transactions on Vehicular Technology"

arXiv:2203.12599 [pdf, other]

Single Carrier Frequency Domain Detectors for Internet of Underwater Things

Authors: Amer Aljanabi, Osama Alluhaibi, Qasim Z. Ahmed, Fahd A. Khan, Waqas-Bin-Abbas, Pavlos I. Lazaridis

Abstract: This paper proposes low complexity detection for internet of underwater things (IoUT)s communication. The signal is transmitted from the source to the destination using several sensors. To simplify the computational operations at the transmitter and the sensory nodes, a single carrier frequency domain equalizer (SC-FDE) is proposed and amplify-and-forward (AF) protocols are employed. Fast Fourier… ▽ More This paper proposes low complexity detection for internet of underwater things (IoUT)s communication. The signal is transmitted from the source to the destination using several sensors. To simplify the computational operations at the transmitter and the sensory nodes, a single carrier frequency domain equalizer (SC-FDE) is proposed and amplify-and-forward (AF) protocols are employed. Fast Fourier transform (FFT) and use of cyclic prefix (CP) are also proposed to simplify these algorithms when compared to time-domain equalization. As precise channel data is difficult to capture in underwater communications, the adaptive implementation of FDE is proposed as a solution that can be employed when the channel experiences a fast doppler shift. The two adaptive detectors are based on the least mean-square (LMS) and recursive least square (RLS) principles. Numerical simulations show that the performance of the bit error rate (BER) performance of the proposed detectors is close to that of the ideal minimum mean square error (MMSE). △ Less

Submitted 18 March, 2022; originally announced March 2022.

Journal ref: Wireless Personal Communications 2022

arXiv:2202.11696 [pdf, other]

Double Threshold based Optimal Device Selection Scheme for D2D or Sidelink Network

Authors: Shamganth Kumarapandian, Qasim Zeeshan Ahmed, Faheem A. Khan

Abstract: Device-to-device (D2D) or Sidelink aided communication is regarded as one of the most promising technologies to improve the spectral efficiency of the 5G and beyond communication system. However, two main challenges exist: 1) the selection of the optimal number of devices for improving the spectral efficiency, and 2) improving the physical layer security of such a communication system. The optimal… ▽ More Device-to-device (D2D) or Sidelink aided communication is regarded as one of the most promising technologies to improve the spectral efficiency of the 5G and beyond communication system. However, two main challenges exist: 1) the selection of the optimal number of devices for improving the spectral efficiency, and 2) improving the physical layer security of such a communication system. The optimal device improves the secrecy capacity, and the selection of optimal devices enhances the physical layer security. Therefore, in this paper, we propose a double threshold-based optimal device selection (ODS) scheme for a cooperative wireless network with amplify and forward (AF) protocol in the presence and absence of an eavesdropper to enhance the physical layer security for the D2D network. The proposed scheme is analyzed with different distance cases, device scenarios, and modulation schemes. The bit error rate (BER) performance analysis concludes that a performance gain of more than 4 dB is achieved at the BER of 0:004 for the proposed double threshold-based ODS scheme compared to the existing ODS scheme at a high signal to noise ratio (SNR). Furthermore, the proposed scheme enhances the physical layer security. △ Less

Submitted 3 February, 2022; originally announced February 2022.

Comments: 13 pages, 7 figures,

arXiv:2202.08532 [pdf, other]

Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition

Authors: Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko

Abstract: In this work, we aim to enhance the system robustness of end-to-end automatic speech recognition (ASR) against adversarially-noisy speech examples. We focus on a rigorous and empirical "closed-model adversarial robustness" setting (e.g., on-device or cloud applications). The adversarial noise is only generated by closed-model optimization (e.g., evolutionary and zeroth-order estimation) without ac… ▽ More In this work, we aim to enhance the system robustness of end-to-end automatic speech recognition (ASR) against adversarially-noisy speech examples. We focus on a rigorous and empirical "closed-model adversarial robustness" setting (e.g., on-device or cloud applications). The adversarial noise is only generated by closed-model optimization (e.g., evolutionary and zeroth-order estimation) without accessing gradient information of a targeted ASR model directly. We propose an advanced Bayesian neural network (BNN) based adversarial detector, which could model latent distributions against adaptive adversarial perturbation with divergence measurement. We further simulate deployment scenarios of RNN Transducer, Conformer, and wav2vec-2.0 based ASR systems with the proposed adversarial detection system. Leveraging the proposed BNN based detection system, we improve detection rate by +2.77 to +5.42% (relative +3.03 to +6.26%) and reduce the word error rate by 5.02 to 7.47% on LibriSpeech datasets compared to the current model enhancement methods against the adversarial speech examples. △ Less

Submitted 17 February, 2022; originally announced February 2022.

Comments: Accepted to ICASSP 2022

arXiv:2112.03512 [pdf, ps, other]

Constrained Resource Allocation Problems in Communications: An Information-assisted Approach

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: We consider a class of resource allocation problems given a set of unconditional constraints whose objective function satisfies Bellman's optimality principle. Such problems are ubiquitous in wireless communication, signal processing, and networking. These constrained combinatorial optimization problems are, in general, NP-Hard. This paper proposes two algorithms to solve this class of problems us… ▽ More We consider a class of resource allocation problems given a set of unconditional constraints whose objective function satisfies Bellman's optimality principle. Such problems are ubiquitous in wireless communication, signal processing, and networking. These constrained combinatorial optimization problems are, in general, NP-Hard. This paper proposes two algorithms to solve this class of problems using a dynamic programming framework assisted by an information-theoretic measure. We demonstrate that the proposed algorithms ensure optimal solutions under carefully chosen conditions and use significantly reduced computational resources. We substantiate our claims by solving the power-constrained bit allocation problem in 5G massive Multiple-Input Multiple-Output receivers using the proposed approach. △ Less

Submitted 7 December, 2021; originally announced December 2021.

Comments: Accepted for publication in IEEE Military Communications Conference 2021

arXiv:2108.10210 [pdf, ps, other]

Anomaly Detection Based on Generalized Gaussian Distribution approach for Ultra-Wideband (UWB) Indoor Positioning System

Authors: Fuhu Che, Qasim Zeeshan Ahmed, Faheem A. Khan, Pavlos I. Lazaridis

Abstract: With the rapid development of the Internet of Things (IoT), Indoor Positioning System (IPS) has attracted significant interest in academic research. Ultra-Wideband (UWB) is an emerging technology that can be employed for IPS as it offers centimetre-level accuracy. However, the UWB system still faces several technical challenges in practice, one of which is Non-Line-of-Sight (NLoS) signal propagati… ▽ More With the rapid development of the Internet of Things (IoT), Indoor Positioning System (IPS) has attracted significant interest in academic research. Ultra-Wideband (UWB) is an emerging technology that can be employed for IPS as it offers centimetre-level accuracy. However, the UWB system still faces several technical challenges in practice, one of which is Non-Line-of-Sight (NLoS) signal propagation. Several machine learning approaches have been applied for the NLoS component identification. However, when the data contains a very small amount of NLoS components it becomes very difficult for existing algorithms to classify them. This paper focuses on employing an anomaly detection approach based on Gaussian Distribution (GD) and Generalized Gaussian Distribution (GGD) algorithms to detect and identify the NLoS components. The simulation results indicate that the proposed approach can provide a robust NLoS component identification which improves the NLoS signal classification accuracy which results in significant improvement in the UWB positioning system. △ Less

Submitted 9 August, 2021; originally announced August 2021.

arXiv:2106.07708 [pdf]

CathAI: Fully Automated Interpretation of Coronary Angiograms Using Neural Networks

Authors: Robert Avram, Jeffrey E. Olgin, Alvin Wan, Zeeshan Ahmed, Louis Verreault-Julien, Sean Abreau, Derek Wan, Joseph E. Gonzalez, Derek Y. So, Krishan Soni, Geoffrey H. Tison

Abstract: Coronary heart disease (CHD) is the leading cause of adult death in the United States and worldwide, and for which the coronary angiography procedure is the primary gateway for diagnosis and clinical management decisions. The standard-of-care for interpretation of coronary angiograms depends upon ad-hoc visual assessment by the physician operator. However, ad-hoc visual interpretation of angiogram… ▽ More Coronary heart disease (CHD) is the leading cause of adult death in the United States and worldwide, and for which the coronary angiography procedure is the primary gateway for diagnosis and clinical management decisions. The standard-of-care for interpretation of coronary angiograms depends upon ad-hoc visual assessment by the physician operator. However, ad-hoc visual interpretation of angiograms is poorly reproducible, highly variable and bias prone. Here we show for the first time that fully-automated angiogram interpretation to estimate coronary artery stenosis is possible using a sequence of deep neural network algorithms. The algorithmic pipeline we developed--called CathAI--achieves state-of-the art performance across the sequence of tasks required to accomplish automated interpretation of unselected, real-world angiograms. CathAI (Algorithms 1-2) demonstrated positive predictive value, sensitivity and F1 score of >=90% to identify the projection angle overall and >=93% for left or right coronary artery angiogram detection, the primary anatomic structures of interest. To predict obstructive coronary artery stenosis (>=70% stenosis), CathAI (Algorithm 4) exhibited an area under the receiver operating characteristic curve (AUC) of 0.862 (95% CI: 0.843-0.880). When externally validated in a healthcare system in another country, CathAI AUC was 0.869 (95% CI: 0.830-0.907) to predict obstructive coronary artery stenosis. Our results demonstrate that multiple purpose-built neural networks can function in sequence to accomplish the complex series of tasks required for automated analysis of real-world angiograms. Deployment of CathAI may serve to increase standardization and reproducibility in coronary stenosis assessment, while providing a robust foundation to accomplish future tasks for algorithmic angiographic interpretation. △ Less

Submitted 14 June, 2021; originally announced June 2021.

Comments: 62 pages, 3 main figures, 2 main tables

ACM Class: I.4.9; I.2.10; J.3

arXiv:2105.06085 [pdf, ps, other]

doi 10.1109/LATINCOM50620.2020.9282342

A Low-Complexity Multi-Survivor Dynamic Programming for Constrained Discrete Optimization

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: Constrained discrete optimization problems are encountered in many areas of communication and machine learning. We consider the case where the objective function satisfies Bellman's optimality principle without the constraints on which we place no conditions. We first show that these problems are a generalization of optimization in constrained Markov decision processes with finite horizon used in… ▽ More Constrained discrete optimization problems are encountered in many areas of communication and machine learning. We consider the case where the objective function satisfies Bellman's optimality principle without the constraints on which we place no conditions. We first show that these problems are a generalization of optimization in constrained Markov decision processes with finite horizon used in reinforcement learning and are NP-Hard. We then present a novel multi-survivor dynamic programming (msDP) algorithm that guarantees optimality at significant computational savings. We demonstrate this by solving 5G quantizer bit allocation and DNA fragment assembly problems. The results are very promising and suggest that msDP can be used for many applications. △ Less

Submitted 13 May, 2021; originally announced May 2021.

Journal ref: 2020 IEEE Latin-American Conference on Communications (LATINCOM)

arXiv:2104.05186 [pdf, ps, other]

doi 10.1109/TGCN.2020.3039282

An Optimal Low-Complexity Energy-Efficient ADC Bit Allocation for Massive MIMO

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: Fixed low-resolution Analog to Digital Converters (ADC) help reduce the power consumption in millimeter-wave Massive Multiple-Input Multiple-Output (Ma-MIMO) receivers operating at large bandwidths. However, they do not guarantee optimal Energy Efficiency (EE). It has been shown that adopting variable-resolution (VR) ADCs in Ma-MIMO receivers can improve performance with Mean Squared Error (MSE) a… ▽ More Fixed low-resolution Analog to Digital Converters (ADC) help reduce the power consumption in millimeter-wave Massive Multiple-Input Multiple-Output (Ma-MIMO) receivers operating at large bandwidths. However, they do not guarantee optimal Energy Efficiency (EE). It has been shown that adopting variable-resolution (VR) ADCs in Ma-MIMO receivers can improve performance with Mean Squared Error (MSE) and throughput while providing better EE. In this paper, we present an optimal energy-efficient bit allocation (BA) algorithm for Ma-MIMO receivers equipped with VR ADCs under a power constraint. We derive an expression for EE as a function of the Cramer-Rao Lower Bound on the MSE of the received, combined, and quantized signal. An optimal BA condition is derived by maximizing EE under a power constraint. We show that the optimal BA thus obtained is exactly the same as that obtained using the brute-force BA with a significant reduction in computational complexity. We also study the EE performance and computational complexity of a heuristic algorithm that yields a near-optimal solution. △ Less

Submitted 11 April, 2021; originally announced April 2021.

Comments: arXiv admin note: text overlap with arXiv:1902.03375

Journal ref: EEE Transactions on Green Communications and Networking, vol. 5, no. 1, pp. 61-71, March 2021

arXiv:2103.10614 [pdf, other]

Hyperspectral Image Super-Resolution in Arbitrary Input-Output Band Settings

Authors: Zhongyang Zhang, Zhiyang Xu, Zia Ahmed, Asif Salekin, Tauhidur Rahman

Abstract: Hyperspectral image (HSI) with narrow spectral bands can capture rich spectral information, but it sacrifices its spatial resolution in the process. Many machine-learning-based HSI super-resolution (SR) algorithms have been proposed recently. However, one of the fundamental limitations of these approaches is that they are highly dependent on image and camera settings and can only learn to map an i… ▽ More Hyperspectral image (HSI) with narrow spectral bands can capture rich spectral information, but it sacrifices its spatial resolution in the process. Many machine-learning-based HSI super-resolution (SR) algorithms have been proposed recently. However, one of the fundamental limitations of these approaches is that they are highly dependent on image and camera settings and can only learn to map an input HSI with one specific setting to an output HSI with another. However, different cameras capture images with different spectral response functions and bands numbers due to the diversity of HSI cameras. Consequently, the existing machine-learning-based approaches fail to learn to super-resolve HSIs for a wide variety of input-output band settings. We propose a single Meta-Learning-Based Super-Resolution (MLSR) model, which can take in HSI images at an arbitrary number of input bands' peak wavelengths and generate SR HSIs with an arbitrary number of output bands' peak wavelengths. We leverage NTIRE2020 and ICVL datasets to train and validate the performance of the MLSR model. The results show that the single proposed model can successfully generate super-resolved HSI bands at arbitrary input-output band settings. The results are better or at least comparable to baselines that are separately trained on a specific input-output band setting. △ Less

Submitted 15 November, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

Comments: Accepted by WACV 2022 Workshop WACI(Workshop on Applications of Computational Imaging)

arXiv:2011.08972 [pdf, ps, other]

Reducing the Mutual Outage Probability of Cooperative Non-Orthogonal Multiple Access

Authors: Sana Riaz, Fahd Ahmed Khan, Sajid Saleem, Qasim Zeeshan Ahmed

Abstract: In this letter, a new power allocation scheme is proposed to improve the reliability of cooperative non-orthogonal multiple access (CO-NOMA). The strong user is allocated the maximum power, whereas the weak user is allocated the minimum power. This power allocation alters the decoding sequence along with the signal-to-interference plus noise ratio (SINR), at the users. The weak user benefits from… ▽ More In this letter, a new power allocation scheme is proposed to improve the reliability of cooperative non-orthogonal multiple access (CO-NOMA). The strong user is allocated the maximum power, whereas the weak user is allocated the minimum power. This power allocation alters the decoding sequence along with the signal-to-interference plus noise ratio (SINR), at the users. The weak user benefits from receiving multiple copies of the signal whereas the strong user benefits from the higher power allocation. Numerical simulation results show that the proposed scheme has a lower mutual outage probability (MOP) and offers better reliability as compared to the conventional power allocation scheme for CONOMA. An exact closed-form expression of MOP is derived for the two-user CO-NOMA system and it is shown that each user achieves full diversity. The proposed allocation is able to achieve approximately 30% higher transmission rate at 15 dB as compared to conventional CO-NOMA in a practical non-power balanced scenario. △ Less

Submitted 24 November, 2020; v1 submitted 28 October, 2020; originally announced November 2020.

arXiv:2008.02493 [pdf, other]

HooliGAN: Robust, High Quality Neural Vocoding

Authors: Ollie McCarthy, Zohaib Ahmed

Abstract: Recent developments in generative models have shown that deep learning combined with traditional digital signal processing (DSP) techniques could successfully generate convincing violin samples [1], that source-excitation combined with WaveNet yields high-quality vocoders [2, 3] and that generative adversarial network (GAN) training can improve naturalness [4, 5]. By combining the ideas in these m… ▽ More Recent developments in generative models have shown that deep learning combined with traditional digital signal processing (DSP) techniques could successfully generate convincing violin samples [1], that source-excitation combined with WaveNet yields high-quality vocoders [2, 3] and that generative adversarial network (GAN) training can improve naturalness [4, 5]. By combining the ideas in these models we introduce HooliGAN, a robust vocoder that has state of the art results, finetunes very well to smaller datasets (<30 minutes of speechdata) and generates audio at 2.2MHz on GPU and 35kHz on CPU. We also show a simple modification to Tacotron-basedmodels that allows seamless integration with HooliGAN. Results from our listening tests show the proposed model's ability to consistently output high-quality audio with a variety of datasets, big and small. We provide samples at the following demo page: https://resemble-ai.github.io/hooligan_demo/ △ Less

Submitted 6 August, 2020; originally announced August 2020.

arXiv:1902.03375 [pdf, other]

Optimal Bit Allocation Variable-Resolution ADC for Massive MIMO

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: In this paper, we derive an optimal ADC bit-allocation (BA) condition for a Single-User (SU) Millimeter wave (mmWave) Massive Multiple-Input Multiple-Output (Ma-MIMO) receiver equipped with variable-resolution ADCs under power constraint with the following criteria: (i) Minimizing the Mean Squared Error (MSE) of the received, quantized and combined symbol vector and (ii) Maximizing the capacity of… ▽ More In this paper, we derive an optimal ADC bit-allocation (BA) condition for a Single-User (SU) Millimeter wave (mmWave) Massive Multiple-Input Multiple-Output (Ma-MIMO) receiver equipped with variable-resolution ADCs under power constraint with the following criteria: (i) Minimizing the Mean Squared Error (MSE) of the received, quantized and combined symbol vector and (ii) Maximizing the capacity of the SU mmWave Ma-MIMO channel encompassing hybrid precoder and combiner. Optimal BA under both criteria results the same. We jointly design the hybrid combiner based on the SVD of the channel. We demonstrate improvement of the proposed optimal BA over the BA based on Minimization of the Mean Square Quantization Error (MSQE). Using Monte-Carlo simulations, it is shown that the MSE and capacity performance of the proposed BA is very close to that of the Exhaustive Search (ES). The computational complexity of the proposed techniques are compared with ES and MQSE BA algorithms. △ Less

Submitted 9 February, 2019; originally announced February 2019.

Comments: This work has been submitted to the IEEE for possible publication

arXiv:1809.02777 [pdf, other]

Capacity analysis and bit allocation design for variable-resolution ADCs in Massive MIMO

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: We derive an expression for the capacity of massive multiple-input multiple-output Millimeter wave (mmWave) channel where the receiver is equipped with a variable-resolution Analog to Digital Converter (ADC) and a hybrid combiner. The capacity is shown to be a function of Cramer-Rao Lower Bound (CRLB) for a given bit-allocation matrix and hybrid combiner. The condition for optimal ADC bit-allocati… ▽ More We derive an expression for the capacity of massive multiple-input multiple-output Millimeter wave (mmWave) channel where the receiver is equipped with a variable-resolution Analog to Digital Converter (ADC) and a hybrid combiner. The capacity is shown to be a function of Cramer-Rao Lower Bound (CRLB) for a given bit-allocation matrix and hybrid combiner. The condition for optimal ADC bit-allocation under a receiver power constraint is derived. This is derived based on the maximization of capacity with respect to bit-allocation matrix for a given channel, hybrid precoder, and hybrid combiner. It is shown that this condition coincides with that obtained using the CRLB minimization proposed by Ahmed et al. Monte-carlo simulations show that the capacity calculated using the proposed condition matches very closely with the capacity obtained using the Exhaustive Search bit allocation. △ Less

Submitted 8 September, 2018; originally announced September 2018.

arXiv:1804.08595 [pdf, ps, other]

Single-User mmWave Massive MIMO: SVD-based ADC Bit Allocation and Combiner Design

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: In this paper, we propose a Singular-Value-Decomposition-based variable-resolution Analog to Digital Converter (ADC) bit allocation design for a single-user Millimeter wave massive Multiple-Input Multiple-Output receiver. We derive the optimality condition for bit allocation under a power constraint. This condition ensures optimal receiver performance in the Mean Squared Error (MSE) sense. We deri… ▽ More In this paper, we propose a Singular-Value-Decomposition-based variable-resolution Analog to Digital Converter (ADC) bit allocation design for a single-user Millimeter wave massive Multiple-Input Multiple-Output receiver. We derive the optimality condition for bit allocation under a power constraint. This condition ensures optimal receiver performance in the Mean Squared Error (MSE) sense. We derive the MSE expression and show that it approaches the Cramer-Rao Lower Bound (CRLB). The CRLB is seen to be a function of the analog combiner, the digital combiner, and the bit allocation matrix. We attempt to minimize the CRLB with respect to the bit allocation matrix by making suitable assumptions regarding the structure of the combiners. In doing so, the bit allocation design reduces to a set of simple inequalities consisting of ADC bits, channel singular values and covariance of the quantization noise along each RF path. This results in a simple and computationally efficient bit allocation algorithm. Using simulations, we show that the MSE performance of our proposed bit allocation is very close to that of the Full Search (FS) bit allocation. We also show that the computational complexity of our proposed method has an order of magnitude improvement compared to FS and Genetic Algorithm based bit allocation of $\cite{Zakir1}$ △ Less

Submitted 23 April, 2018; originally announced April 2018.

Comments: Accepted for publication in SPCOM 2018

arXiv:1711.06706 [pdf, other]

A Joint Combiner and Bit Allocation Design for Massive MIMO Using Genetic Algorithm

Authors: I. Zakir Ahmed, Hamid Sadjadpour, Shahram Yousefi

Abstract: In this paper, we derive a closed-form expression for the combiner of a multiple-input-multiple-output (MIMO) receiver equipped with a minimum-mean-square-error (MMSE) estimator. We propose using variable-bit-resolution analog-to- digital converters (ADC) across radio frequency (RF) paths. The combiner designed is a function of the quantization errors across each RF path. Using very low bit resolu… ▽ More In this paper, we derive a closed-form expression for the combiner of a multiple-input-multiple-output (MIMO) receiver equipped with a minimum-mean-square-error (MMSE) estimator. We propose using variable-bit-resolution analog-to- digital converters (ADC) across radio frequency (RF) paths. The combiner designed is a function of the quantization errors across each RF path. Using very low bit resolution ADCs (1-2bits) is a popular approach with massive MIMO receiver architectures to mitigate large power demands. We show that for certain channel conditions, adopting unequal bit resolution ADCs (e.g., between 1 and 4 bits) on different RF chains, along with the proposed combiner, improves the performance of the MIMO receiver in the Mean Squared Error (MSE) sense. The variable-bit-resolution ADCs is still within the power constraint of using equal bit resolution ADCs on all paths (e.g., 2-bits). We propose a genetic algorithm in conjunction with the derived combiner to arrive at an optimal ADC bit allocation framework with significant reduction in computational complexity. △ Less

Submitted 17 November, 2017; originally announced November 2017.

Comments: Accepted for publication in Asilomar Conference on Signals, Systems, and Computers 2017

Showing 1–23 of 23 results for author: Ahmed, Z