Search | arXiv e-print repository

Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking

Authors: Shourav B. Rabbani, Ivan V. Medri, Manar D. Samad

Abstract: Despite groundbreaking success in image and text learning, deep learning has not achieved significant improvements against traditional machine learning (ML) when it comes to tabular data. This performance gap underscores the need for data-centric treatment and benchmarking of learning algorithms. Recently, attention and contrastive learning breakthroughs have shifted computer vision and natural language processing paradigms. However, the effectiveness of these advanced deep models on tabular data is sparsely studied using a few data sets with very large sample sizes, reporting mixed findings after benchmarking against a limited number of baselines. We argue that the heterogeneity of tabular data sets and selective baselines in the literature can bias the benchmarking outcomes. This article extensively evaluates state-of-the-art attention and contrastive learning methods on a wide selection of 28 tabular data sets (14 easy and 14 hard-to-classify) against traditional deep and machine learning. Our data-centric benchmarking demonstrates when traditional ML is preferred over deep learning and vice versa because no best learning method exists for all tabular data sets. Combining between-sample and between-feature attentions conquers the invincible traditional ML on tabular data sets by a significant margin but fails on high dimensional data, where contrastive learning takes a robust lead. While a hybrid attention-contrastive learning strategy mostly wins on hard-to-classify data sets, traditional methods are frequently superior on easy-to-classify data sets with presumably simpler decision boundaries. To the best of our knowledge, this is the first benchmarking paper with statistical analyses of attention and contrastive learning performances on a diverse selection of tabular data sets against traditional deep and machine learning baselines to facilitate further advances in this field.

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2306.06772

Between-Sample Relationship in Learning Tabular Data Using Graph and Attention Networks

Authors: Shourav B. Rabbani, Manar D. Samad

Abstract: Traditional machine learning assumes samples in tabular data to be independent and identically distributed (i.i.d). This assumption may miss useful information within and between sample relationships in representation learning. This paper relaxes the i.i.d assumption to learn tabular data representations by incorporating between-sample relationships for the first time using graph neural networks (GNN). We investigate our hypothesis using several GNNs and state-of-the-art (SOTA) deep attention models to learn the between-sample relationship on ten tabular data sets by comparing them to traditional machine learning methods. GNN methods show the best performance on tabular data with large feature-to-sample ratios. Our results reveal that attention-based GNN methods outperform traditional machine learning on five data sets and SOTA deep tabular learning methods on three data sets. Between-sample learning via GNN and deep attention methods yield the best classification accuracy on seven of the ten data sets. This suggests that the i.i.d assumption may not always hold for most tabular data sets.

Submitted 11 June, 2023; originally announced June 2023.

Comments: Accepted to the 19th Int. Conf. on Data Science, Las Vegas, NV

arXiv:2304.11704

Interplay of viscosity and wettability controls fluid displacement in porous media

Authors: Saideep Pavuluri, Ran Holtzman, Luqman Kazeem, Malyah Mohammed, Thomas Daniel Seers, Harris Sajjad Rabbani

Abstract: Direct numerical simulations are used to elucidate the interplay of wettability and fluid viscosities on immiscible fluid displacements in a heterogeneous porous medium. We classify the flow regimes using qualitative and quantitative analysis into viscous fingering (low $M$), compact displacement (high $M$), and an intermediate transition regime ($M \approx 1$). We use stability analysis to obtain theoretical phase boundaries between these regimes, which agree well with our analyses. At the macroscopic (sample) scale, we find that wettability strongly controls the threshold $M$ (at which the regimes change). At the pore scale, wettability alters the dominant pore-filling mechanism. At very small $M$ (viscous fingering regime), smaller pore spaces are preferentially invaded during imbibition, with flow of films of invading fluid along the pore walls. In contrast, during drainage, bursts result in filling of pores irrespective of their size. As $M$ increases, the effect of wettability decreases as cooperative filling becomes the dominant mechanism regardless of wettability. This suggests that for imbibition at a given contact angle, decreasing $M$ is associated with a change in effective wetting from neutral-wet (cooperative filling) to strong-wet (film flow).

Submitted 23 April, 2023; originally announced April 2023.

arXiv:2304.03549

Comparative analysis of five NH$_3$/air oxidation mechanisms

Authors: Shahid Rabbani, Dimitris M. Manias, Dimitrios C. Kyritsis, Dimitris A. Goussis

Abstract: Five recently developed chemical kinetics mechanisms for ammonia oxidation are analysed and compared, in the context of homogeneous adiabatic autoignition. The analysis focuses on the ignition delay and is based on the explosive mode that is shown to drive the process. Using algorithmic tools based on the Computational Singular Perturbation algorithm, the reactions responsible for the generation of the explosive mode are identified, along with the variables (species mass fractions and temperature) that associate the most to this mode. Comparison of these sets of reactions and variables, obtained for each mechanism, allows to correlate the differences in the predictive outcomes from the mechanisms with specific reactions. The major differences identified, which lead to different ignition delay times, relate to (i) the relative duration of chemical and thermal runaways (a sizeable chemical runaway develops only in some mechanisms) and (ii) the dominant chemistry during the chemical runaway (chemistry involving species with two nitrogen atoms is active only in some mechanisms). The major similarities identified refer to the thermal runaway and in particular to (i) the chemical activity, which is supported mainly by OH-producing reactions and by reactions producing their reactants and (ii) the thermal activity, which is dominated by strongly exothermic OH-consuming reactions.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 27 pages, 51 figures

arXiv:2301.00802

Deep Clustering of Tabular Data by Weighted Gaussian Distribution Learning

Authors: Shourav B. Rabbani, Ivan V. Medri, Manar D. Samad

Abstract: Deep learning methods are primarily proposed for supervised learning of images or text with limited applications to clustering problems. In contrast, tabular data with heterogeneous features pose unique challenges in representation learning, where deep learning has yet to replace traditional machine learning. This paper addresses these challenges in developing one of the first deep clustering methods for tabular data: Gaussian Cluster Embedding in Autoencoder Latent Space (G-CEALS). G-CEALS is an unsupervised deep clustering framework for learning the parameters of multivariate Gaussian cluster distributions by iteratively updating individual cluster weights. The G-CEALS method presents average rank orderings of 2.9(1.7) and 2.8(1.7) based on clustering accuracy and adjusted Rand index (ARI) scores on sixteen tabular data sets, respectively, and outperforms nine state-of-the-art clustering methods. G-CEALS substantially improves clustering performance compared to traditional K-means and GMM, which are still de facto methods for clustering tabular data. Similar computationally efficient and high-performing deep clustering frameworks are imperative to reap the myriad benefits of deep learning on tabular data over traditional machine learning.

Submitted 17 May, 2024; v1 submitted 2 January, 2023; originally announced January 2023.

arXiv:2112.02145

Exploratory Data Analysis of Urdu Poetry

Authors: Shahid Rabbani, Zahid Ahmed Qureshi

Abstract: The study presented here provides numerical insight into ghazal -- the most appreciated genre in Urdu poetry. Using 48,761 poetic works from 4,754 poets produced over a period of 800 years, this study explores the main features of Urdu ghazal that make it popular and admired more than other forms. A detailed explanation is provided as to the types of words used for expressing love, nature, birds, and flowers etc. Also considered is the way in which the poets addressed their loved ones in their poetry. The style of poetry is numerically analyzed using Multi Dimensional Scaling to reveal the lexical diversity and similarities/differences between the different poetic works that have drawn the attention of critics, such as Iqbal and Ghalib, Mir Taqi Mir and Mir Dard. The analysis produced here is particularly helpful for research in computational stylistics, neurocognitive poetics, and sentiment analysis.

Submitted 3 December, 2021; originally announced December 2021.

Comments: 11 Pages, 12 Figures, Submitted to Scientific Studies of Reading

arXiv:2111.06096

Enhancing Autoignition Characteristics: A Framework to Discover Fuel Additives and Making Predictions Using Machine Learning

Authors: Shahid Rabbani

Abstract: Combustion processes can become more energy efficient and environmentally friendly if used with an appropriate fuel additive. Discovery of fuel additives can be accelerated by applying a hybrid approach that uses chemical kinetics and Machine Learning (ML). In this work, we present a framework that combines the robustness of Machine Learning and the accuracy of chemical kinetics to predict the effect of a fuel additive on the autoignition process. We present a case of making predictions for the Ignition Delay Time (IDT) of the biofuel n-butanol ($C_4H_9OH$) with several fuel additives. The proposed framework was able to predict the IDT of autoignition with high accuracy when used with unseen additives. This framework highlights the potential of ML to exploit chemical mechanisms in exploring and developing fuel additives to obtain the desirable autoignition characteristics.

Submitted 11 November, 2021; originally announced November 2021.

Comments: 6 pages, 1 table, 5 figures, Submitted to International Conference on Applied Energy 2021 (ICAE2021)

arXiv:2107.04566

Multi-level Stress Assessment from ECG in a Virtual Reality Environment using Multimodal Fusion

Authors: Zeeshan Ahmad, Suha Rabbani, Muhammad Rehman Zafar, Syem Ishaque, Sridhar Krishnan, Naimul Khan

Abstract: ECG is an attractive option to assess stress in serious Virtual Reality (VR) applications due to its non-invasive nature. However, the existing Machine Learning (ML) models perform poorly. Moreover, existing studies only perform a binary stress assessment, while to develop a more engaging biofeedback-based application, multi-level assessment is necessary. Existing studies annotate and classify a single experience (e.g. watching a VR video) to a single stress level, which again prevents design of dynamic experiences where real-time in-game stress assessment can be utilized. In this paper, we report our findings on a new study on VR stress assessment, where three stress levels are assessed. ECG data was collected from 9 users experiencing a VR roller coaster. The VR experience was then manually labeled in 10-seconds segments to three stress levels by three raters. We then propose a novel multimodal deep fusion model utilizing spectrogram and 1D ECG that can provide a stress prediction from just a 1-second window. Experimental results demonstrate that the proposed model outperforms the classical HRV-based ML models (9% increase in accuracy) and baseline deep learning models (2.5% increase in accuracy). We also report results on the benchmark WESAD dataset to show the supremacy of the model.

Submitted 9 July, 2021; originally announced July 2021.

Comments: Under review

arXiv:1905.12223

Analysis of evoked EMG using wavelet transformation

Authors: Zaid Bin Mahbub, J H Karami, K Siddique-e Rabbani

Abstract: Evoked EMG M-responses obtained from the thenar muscle in the palm by electrical stimulation of the median nerve demonstrate a well-established smooth bipolar shape for normal healthy subjects while kinks are observed in certain neurological disorders, particularly in cervical spondylotic neuropathy. A first differentiation failed to identify these kinks because of comparable values obtained for normally rising and falling segments of the smooth regions, and due to noise. In this study, the usefulness of the wavelet transform (WT), which provides localized measures of non-stationary signals, is investigated. The Haar WT was used to analyze a total of 36 M-responses recorded from the median nerves of 6 normal subjects (having smooth shape) and 12 subjects with assumed neurological disorders (having kinks), for two points of stimulation on the same nerve. Features in the time-scale representation of the M-responses were studied using WT to distinguish smooth M-responses from ones with kinks. Variations in the coefficient line of the WT were also studied to allow visualization of WT at different scales (inverse of frequency). The high and low frequency regions in the WT came out distinctively, which helped identify kinks, even very subtle ones, in the M-responses that were difficult to detect using the differentiated signal. In conclusion, the wavelet analysis may be a technique of choice in identifying kinks in M-responses in relation to time, thus enhancing the accuracy of neurological diagnosis.

Submitted 29 May, 2019; originally announced May 2019.

Showing 1–9 of 9 results for author: Rabbani, S