Skip to main content

Showing 1–9 of 9 results for author: Babaali, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07327  [pdf

    cs.AI

    Multi-Modal Emotion Recognition by Text, Speech and Video Using Pretrained Transformers

    Authors: Minoo Shayaninasab, Bagher Babaali

    Abstract: Due to the complex nature of human emotions and the diversity of emotion representation methods in humans, emotion recognition is a challenging field. In this research, three input modalities, namely text, audio (speech), and video, are employed to generate multimodal feature vectors. For generating features for each of these modalities, pre-trained Transformer models with fine-tuning are utilized… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  2. arXiv:2402.07326  [pdf

    cs.AI cs.SD eess.AS

    Persian Speech Emotion Recognition by Fine-Tuning Transformers

    Authors: Minoo Shayaninasab, Bagher Babaali

    Abstract: Given the significance of speech emotion recognition, numerous methods have been developed in recent years to create effective and efficient systems in this domain. One of these methods involves the use of pretrained transformers, fine-tuned to address this specific problem, resulting in high accuracy. Despite extensive discussions and global-scale efforts to enhance these systems, the application… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  3. arXiv:2310.11640  [pdf, other

    cs.CR cs.LG

    Free-text Keystroke Authentication using Transformers: A Comparative Study of Architectures and Loss Functions

    Authors: Saleh Momeni, Bagher BabaAli

    Abstract: Keystroke biometrics is a promising approach for user identification and verification, leveraging the unique patterns in individuals' ty** behavior. In this paper, we propose a Transformer-based network that employs self-attention to extract informative features from keystroke sequences, surpassing the performance of traditional Recurrent Neural Networks. We explore two distinct architectures, n… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  4. arXiv:2310.06645  [pdf, other

    cs.LG cs.CL

    Self-Supervised Representation Learning for Online Handwriting Text Classification

    Authors: Pouya Mehralian, Bagher BabaAli, Ashena Gorgan Mohammadi

    Abstract: Self-supervised learning offers an efficient way of extracting rich representations from various types of unlabeled data while avoiding the cost of annotating large-scale datasets. This is achievable by designing a pretext task to form pseudo labels with respect to the modality and domain of the data. Given the evolving applications of online handwritten texts, in this study, we propose the novel… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  5. arXiv:2307.15045  [pdf, other

    cs.CV cs.LG

    A Transformer-based Approach for Arabic Offline Handwritten Text Recognition

    Authors: Saleh Momeni, Bagher BabaAli

    Abstract: Handwriting recognition is a challenging and critical problem in the fields of pattern recognition and machine learning, with applications spanning a wide range of domains. In this paper, we focus on the specific issue of recognizing offline Arabic handwritten text. Existing approaches typically utilize a combination of convolutional neural networks for image feature extraction and recurrent neura… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  6. Writer Identification and Writer Retrieval Based on NetVLAD with Re-ranking

    Authors: Shervin Rasoulzadeh, Bagher Babaali

    Abstract: This paper addresses writer identification and writer retrieval which is considered as a challenging problem in the document analysis and recognition field. In this work, a novel pipeline is proposed for the problem at hand by employing a unified neural network architecture consisting of the ResNet-20 as a feature extractor and an integrated NetVLAD layer, inspired by the vector of locally aggrega… ▽ More

    Submitted 22 February, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: 22 pages, 12 figures

  7. arXiv:1910.00330  [pdf, other

    cs.LG cs.CL eess.AS stat.ML

    A Multi-Modal Feature Embedding Approach to Diagnose Alzheimer Disease from Spoken Language

    Authors: S. Soroush Haj Zargarbashi, Bagher Babaali

    Abstract: Introduction: Alzheimer's disease is a type of dementia in which early diagnosis plays a major rule in the quality of treatment. Among new works in the diagnosis of Alzheimer's disease, there are many of them analyzing the voice stream acoustically, syntactically or both. The mostly used tools to perform these analysis usually include machine learning techniques. Objective: Designing an automatic… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: 14 pages, 4 figures

  8. arXiv:1904.11914  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Statistical feature embedding for heart sound classification

    Authors: Mohammad Adiban, Bagher BabaAli, Saeedreza Shehnepoor

    Abstract: Cardiovascular Disease (CVD) is considered as one of the principal causes of death in the world. Over recent years, this field of study has attracted researchers' attention to investigate heart sounds' patterns for disease diagnostics. In this study, an approach is proposed for normal/abnormal heart sound classification on the Physionet challenge 2016 dataset. For the first time, a fixed-length fe… ▽ More

    Submitted 9 November, 2020; v1 submitted 26 April, 2019; originally announced April 2019.

    Journal ref: Journal of Electrical Engineering, 70(4), 259-272 (2019)

  9. On Usage of Autoencoders and Siamese Networks for Online Handwritten Signature Verification

    Authors: Kian Ahrabian, Bagher Babaali

    Abstract: In this paper, we propose a novel writer-independent global feature extraction framework for the task of automatic signature verification which aims to make robust systems for automatically distinguishing negative and positive samples. Our method consists of an autoencoder for modeling the sample space into a fixed length latent space and a Siamese Network for classifying the fixed-length samples… ▽ More

    Submitted 29 December, 2017; v1 submitted 7 December, 2017; originally announced December 2017.

    Comments: 13 pages, 10 figures, Submitted to Neural Computing and Applications journal