Search | arXiv e-print repository

CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks

Authors: Nick Nikzad, Yongsheng Gao, Jun Zhou

Abstract: In recent years, convolutional neural networks (CNNs) with channel-wise feature refining mechanisms have brought noticeable benefits to modelling channel dependencies. However, current attention paradigms fail to infer an optimal channel descriptor capable of simultaneously exploiting statistical and spatial relationships among feature maps. In this paper, to overcome this shortcoming, we present… ▽ More In recent years, convolutional neural networks (CNNs) with channel-wise feature refining mechanisms have brought noticeable benefits to modelling channel dependencies. However, current attention paradigms fail to infer an optimal channel descriptor capable of simultaneously exploiting statistical and spatial relationships among feature maps. In this paper, to overcome this shortcoming, we present a novel channel-wise spatially autocorrelated (CSA) attention mechanism. Inspired by geographical analysis, the proposed CSA exploits the spatial relationships between channels of feature maps to produce an effective channel descriptor. To the best of our knowledge, this is the f irst time that the concept of geographical spatial analysis is utilized in deep CNNs. The proposed CSA imposes negligible learning parameters and light computational overhead to the deep model, making it a powerful yet efficient attention module of choice. We validate the effectiveness of the proposed CSA networks (CSA-Nets) through extensive experiments and analysis on ImageNet, and MS COCO benchmark datasets for image classification, object detection, and instance segmentation. The experimental results demonstrate that CSA-Nets are able to consistently achieve competitive performance and superior generalization than several state-of-the-art attention-based CNNs over different benchmark tasks and datasets. △ Less

Submitted 13 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2403.07126 [pdf, other]

Heterogeneous Image-based Classification Using Distributional Data Analysis

Authors: Alec Reinhardt, Newsha Nikzad, Raven J. Hollis, Galia Jacobson, Millicent A. Roach, Mohamed Badawy, Peter Chul Park, Laura Beretta, Prasun K Jalal, David T. Fuentes, Eugene J. Koay, Suprateek Kundu

Abstract: Diagnostic imaging has gained prominence as potential biomarkers for early detection and diagnosis in a diverse array of disorders including cancer. However, existing methods routinely face challenges arising from various factors such as image heterogeneity. We develop a novel imaging-based distributional data analysis (DDA) approach that incorporates the probability (quantile) distribution of the… ▽ More Diagnostic imaging has gained prominence as potential biomarkers for early detection and diagnosis in a diverse array of disorders including cancer. However, existing methods routinely face challenges arising from various factors such as image heterogeneity. We develop a novel imaging-based distributional data analysis (DDA) approach that incorporates the probability (quantile) distribution of the pixel-level features as covariates. The proposed approach uses a smoothed quantile distribution (via a suitable basis representation) as functional predictors in a scalar-on-functional quantile regression model. Some distinctive features of the proposed approach include the ability to: (i) account for heterogeneity within the image; (ii) incorporate granular information spanning the entire distribution; and (iii) tackle variability in image sizes for unregistered images in cancer applications. Our primary goal is risk prediction in Hepatocellular carcinoma that is achieved via predicting the change in tumor grades at post-diagnostic visits using pre-diagnostic enhancement pattern map** (EPM) images of the liver. Along the way, the proposed DDA approach is also used for case versus control diagnosis and risk stratification objectives. Our analysis reveals that when coupled with global structural radiomics features derived from the corresponding T1-MRI scans, the proposed smoothed quantile distributions derived from EPM images showed considerable improvements in sensitivity and comparable specificity in contrast to classification based on routinely used summary measures that do not account for image heterogeneity. Given that there are limited predictive modeling approaches based on heterogeneous images in cancer, the proposed method is expected to provide considerable advantages in image-based early detection and risk prediction. △ Less

Submitted 11 March, 2024; originally announced March 2024.

Comments: 16, 2 figures, 3 tables

arXiv:2402.06196 [pdf, other]

Large Language Models: A Survey

Authors: Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher, Xavier Amatriain, Jianfeng Gao

Abstract: Large Language Models (LLMs) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022. LLMs' ability of general-purpose language understanding and generation is acquired by training billions of model's parameters on massive amounts of text data, as predicted by scaling laws \cite{kaplan2020scaling,hoffman… ▽ More Large Language Models (LLMs) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022. LLMs' ability of general-purpose language understanding and generation is acquired by training billions of model's parameters on massive amounts of text data, as predicted by scaling laws \cite{kaplan2020scaling,hoffmann2022training}. The research area of LLMs, while very recent, is evolving rapidly in many different ways. In this paper, we review some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discuss their characteristics, contributions and limitations. We also give an overview of techniques developed to build, and augment LLMs. We then survey popular datasets prepared for LLM training, fine-tuning, and evaluation, review widely used LLM evaluation metrics, and compare the performance of several popular LLMs on a set of representative benchmarks. Finally, we conclude the paper by discussing open challenges and future research directions. △ Less

Submitted 20 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2401.14423

arXiv:2309.03980 [pdf, ps, other]

Enhancement Pattern Map** for Detection of Hepatocellular Carcinoma in Patients with Cirrhosis

Authors: Newsha Nikzad, David Thomas Fuentes, Millicent Roach, Tasadduk Chowdhury, Matthew Cagley, Mohamed Badawy, Ahmed Elkhesen, Manal Hassan, Khaled Elsayes, Laura Beretta, Eugene Jon Koay, Prasun Kumar Jalal

Abstract: Background and Aims: Limited methods exist to accurately characterize risk of malignant progression of liver lesions in patients undergoing surveillance for hepatocellular carcinoma (HCC). Enhancement pattern map** (EPM) measures voxel-based root mean square deviation (RMSD) and improves the contrast-to-noise ratio (CNR) of liver lesions on standard of care imaging. This study investigates the u… ▽ More Background and Aims: Limited methods exist to accurately characterize risk of malignant progression of liver lesions in patients undergoing surveillance for hepatocellular carcinoma (HCC). Enhancement pattern map** (EPM) measures voxel-based root mean square deviation (RMSD) and improves the contrast-to-noise ratio (CNR) of liver lesions on standard of care imaging. This study investigates the utilization of EPM to differentiate between HCC versus benign cirrhotic tissue. Methods: Patients with liver cirrhosis undergoing MRI surveillance at a single, tertiary-care hospital were studied prospectively. Controls (n=99) were patients without lesions during surveillance or progression to HCC. Cases (n=48) were defined as patients with LI-RADS 3 and 4 lesions who developed HCC within the study period. RMSD measured with EPM was compared to the signal from MRI arterial and portovenous (PV) phases. EPM signals of liver parenchyma between cases and controls were quantitatively validated on an independent patient set using cross validation. Results: With EPM, RMSD of 0.37 was identified as a quantitative cutoff for distinguishing lesions that progress to HCC from background parenchyma on pre-diagnostic scans with an area under the curve (AUC) of 0.83 (CI: 0.73-0.94) and a sensitivity, specificity, and accuracy of 0.65, 0.97, and 0.89, respectively. At the time of diagnostic scans, a sensitivity, specificity, and accuracy of 0.79, 0.93, and 0.88 was achieved with an AUC of 0.89 (CI: 0.82-0.96). EPM RMSD signals of background parenchyma in cases and controls were similar (case EPM: 0.22 +/- 0.08, control EPM: 0.22 +/- 0.09, p=0.8). Conclusions: EPM differentiates between HCC and non-cancerous parenchyma in a surveillance population and may aid in early detection of HCC. Future directions involve applying EPM for risk stratification of indeterminate lesions. △ Less

Submitted 15 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: Pre-print, 9 pages, 4 figures

arXiv:2004.03705 [pdf, other]

Deep Learning Based Text Classification: A Comprehensive Review

Authors: Shervin Minaee, Nal Kalchbrenner, Erik Cambria, Narjes Nikzad, Meysam Chenaghlu, Jianfeng Gao

Abstract: Deep learning based models have surpassed classical machine learning based approaches in various text classification tasks, including sentiment analysis, news categorization, question answering, and natural language inference. In this paper, we provide a comprehensive review of more than 150 deep learning based models for text classification developed in recent years, and discuss their technical c… ▽ More Deep learning based models have surpassed classical machine learning based approaches in various text classification tasks, including sentiment analysis, news categorization, question answering, and natural language inference. In this paper, we provide a comprehensive review of more than 150 deep learning based models for text classification developed in recent years, and discuss their technical contributions, similarities, and strengths. We also provide a summary of more than 40 popular datasets widely used for text classification. Finally, we provide a quantitative analysis of the performance of different deep learning models on popular benchmarks, and discuss future research directions. △ Less

Submitted 4 January, 2021; v1 submitted 5 April, 2020; originally announced April 2020.

Showing 1–5 of 5 results for author: Nikzad, N