Showing 1–25 of 25 results for author: Togneri, R

  1. arXiv:2308.12792  [pdf, other]

    cs.SD eess.AS

    Sparks of Large Audio Models: A Survey and Outlook

    Authors: Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Erik Cambria, Björn W. Schuller

    Abstract: This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the field of audio signal processing. Audio processing, with its diverse signal representations and a wide range of sources--from human voices to musical instruments and environmental sounds--poses challenges distinct from those found in traditional Natural Language Pr…

    Submitted 21 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Under review, Repo URL: https://github.com/EmulationAI/awesome-large-audio-models

  2. arXiv:2209.13112  [pdf, other]

    eess.AS cs.SD

    Automated Sex Classification of Children's Voices and Changes in Differentiating Factors with Age

    Authors: Fuling Chen, Roberto Togneri, Murray Maybery, Diana Weiting Tan

    Abstract: Sex classification of children's voices allows for an investigation of the development of secondary sex characteristics which has been a key interest in the field of speech analysis. This research investigated a broad range of acoustic features from scripted and spontaneous speech and applied a hierarchical clustering-based machine learning model to distinguish the sex of children aged between 5 a…

    Submitted 26 September, 2022; originally announced September 2022.

  3. arXiv:2209.12702  [pdf, other]

    eess.AS cs.SD

    End-to-End Lyrics Recognition with Self-supervised Learning

    Authors: Xiangyu Zhang, Shuyue Stella Li, Zhanhong He, Roberto Togneri, Leibny Paola Garcia

    Abstract: Lyrics recognition is an important task in music processing. Despite traditional algorithms such as the hybrid HMM-TDNN model achieving good performance, studies on applying end-to-end models and self-supervised learning (SSL) are limited. In this paper, we first establish an end-to-end baseline for lyrics recognition and then explore the performance of SSL models on the lyrics recognition task. We e…

    Submitted 26 October, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 4 pages, 2 figures, 3 tables

  4. Spatio-Temporal Graph Representation Learning for Fraudster Group Detection

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Motivated by potential financial gain, companies may hire fraudster groups to write fake reviews to either demote competitors or promote their own businesses. Such groups are considerably more successful in misleading customers, as people are more likely to be influenced by the opinion of a large group. To detect such groups, a common model is to represent fraudster groups' static networks, conseq…

    Submitted 7 January, 2022; originally announced January 2022.

  5. arXiv:2111.05645  [pdf, other]

    cs.LG cs.AI

    Social Fraud Detection Review: Methods, Challenges and Analysis

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Social reviews have dominated the web and become a plausible source of product information. People and businesses use such information for decision-making. Businesses also make use of social information to spread fake information using a single user, groups of users, or a bot trained to generate fraudulent content. Many studies proposed approaches based on user behaviors and review text to address…

    Submitted 4 January, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

  6. arXiv:2110.04453  [pdf, other]

    cs.IT

    A Novel Quantum Calculus-based Complex Least Mean Square Algorithm (q-CLMS)

    Authors: Alishba Sadiq, Imran Naseem, Shujaat Khan, Muhammad Moinuddin, Roberto Togneri, Mohammed Bennamoun

    Abstract: In this research, a novel adaptive filtering algorithm is proposed for complex domain signal processing. The proposed algorithm is based on Wirtinger calculus and is called the q-Complex Least Mean Square (q-CLMS) algorithm. The proposed algorithm could be considered as an extension of the q-LMS algorithm for the complex domain. Transient and steady-state analyses of the proposed q-CLMS algorithm a…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: 35 pages, 14 figures

  7. arXiv:2106.01370  [pdf, other]

    cs.LG math.QA

    q-RBFNN: A Quantum Calculus-based RBF Neural Network

    Authors: Syed Saiq Hussain, Muhammad Usman, Taha Hasan Masood Siddique, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

    Abstract: In this research, a novel stochastic gradient descent based learning approach for radial basis function neural networks (RBFNN) is proposed. The proposed method is based on the q-gradient, which is also known as the Jackson derivative. In contrast to the conventional gradient, which finds the tangent, the q-gradient finds the secant of the function and takes larger steps towards the optimal solution…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Article is under review. This is a preprint version

  8. HIN-RNN: A Graph Representation Learning Neural Network for Fraudster Group Detection With No Handcrafted Features

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Social reviews are indispensable resources for modern consumers' decision making. For financial gain, companies pay fraudsters, preferably in groups, to demote or promote products and services, since consumers are more likely to be misled by a large number of similar reviews from groups. Recent approaches on fraudster group detection employed handcrafted features of group behaviors without considerin…

    Submitted 24 May, 2021; originally announced May 2021.

  9. arXiv:2102.07982  [pdf, other]

    cs.SD eess.AS

    Voice Gender Scoring and Independent Acoustic Characterization of Perceived Masculinity and Femininity

    Authors: Fuling Chen, Roberto Togneri, Murray Maybery, Diana Tan

    Abstract: Previous research has found that voices can provide reliable information to be used for gender classification with a high level of accuracy. In social psychology, perceived masculinity and femininity (masculinity and femininity rated by humans) has often been considered an important feature when investigating the influence of vocal features on social behaviours. While previous studies have charact…

    Submitted 4 August, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: 24 pages, 7 figures, journal

  10. arXiv:2012.03154  [pdf, other]

    eess.AS cs.CR cs.SD

    Multi-task Learning Based Spoofing-Robust Automatic Speaker Verification System

    Authors: Yuanjun Zhao, Roberto Togneri, Victor Sreeram

    Abstract: Spoofing attacks posed by generating artificial speech can severely degrade the performance of a speaker verification system. Recently, many anti-spoofing countermeasures have been proposed for detecting varying types of attacks from synthetic speech to replay presentations. While there are numerous effective defenses reported on standalone anti-spoofing solutions, the integration for speaker veri…

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: 12 pages, 6 figures, codes used in the experimental section can be found at https://github.com/zhaoyj1122/SRASV

  11. arXiv:2007.02592  [pdf, other]

    cs.LG stat.ML

    Multi-Kernel Fusion for RBF Neural Networks

    Authors: Syed Muhammad Atif, Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

    Abstract: A simple yet effective architectural design of radial basis function neural networks (RBFNN) makes them amongst the most popular conventional neural networks. The current generation of radial basis function neural network is equipped with multiple kernels which provide significant performance benefits compared to the previous generation using only a single kernel. In existing multi-kernel RBF algo…

    Submitted 6 July, 2020; originally announced July 2020.

  12. arXiv:2006.06561  [pdf, other]

    cs.LG cs.CR stat.ML

    ScoreGAN: A Fraud Review Detector based on Multi Task Learning of Regulated GAN with Data Augmentation

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: The promising performance of Deep Neural Networks (DNNs) in text classification has attracted researchers to use them for fraud review detection. However, the lack of trusted labeled data has limited the performance of the current solutions in detecting fraud reviews. The Generative Adversarial Network (GAN) as a semi-supervised method has been demonstrated to be effective for data augmentation purpos…

    Submitted 17 March, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  13. arXiv:2006.05718  [pdf, other]

    cs.LG cs.SI

    DFraud3- Multi-Component Fraud Detection free of Cold-start

    Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Fraud review detection has been a hot research topic in recent years. The Cold-start is a particularly new but significant problem referring to the failure of a detection system to recognize the authenticity of a new user. State-of-the-art solutions employ a translational knowledge graph embedding approach (TransE) to model the interaction of the components of a review system. However, these approaches s…

    Submitted 11 June, 2020; v1 submitted 10 June, 2020; originally announced June 2020.

  14. arXiv:2005.14502  [pdf, other]

    cs.CV cs.LG cs.RO

    Unconstrained Matching of 2D and 3D Descriptors for 6-DOF Pose Estimation

    Authors: Uzair Nadeem, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel

    Abstract: This paper proposes a novel concept to directly match feature descriptors extracted from 2D images with feature descriptors extracted from 3D point clouds. We use this concept to directly localize images in a 3D point cloud. We generate a dataset of matching 2D and 3D points and their corresponding feature descriptors, which is used to learn a Descriptor-Matcher classifier. To localize the pose of…

    Submitted 29 May, 2020; originally announced May 2020.

  15. arXiv:1906.06064  [pdf, other]

    cs.CV

    Direct Image to Point Cloud Descriptors Matching for 6-DOF Camera Localization in Dense 3D Point Cloud

    Authors: Uzair Nadeem, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel

    Abstract: We propose a novel concept to directly match feature descriptors extracted from RGB images, with feature descriptors extracted from 3D point clouds. We use this concept to localize the position and orientation (pose) of the camera of a query image in dense point clouds. We generate a dataset of matching 2D and 3D descriptors, and use it to train a proposed Descriptor-Matcher algorithm. To localize…

    Submitted 14 June, 2019; originally announced June 2019.

  16. arXiv:1905.03546  [pdf, other]

    stat.ML cs.CV cs.LG math.OC

    A Novel Adaptive Kernel for the RBF Neural Networks

    Authors: Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

    Abstract: In this paper, we propose a novel adaptive kernel for the radial basis function (RBF) neural networks. The proposed kernel adaptively fuses the Euclidean and cosine distance measures to exploit the reciprocating properties of the two. The proposed framework dynamically adapts the weights of the participating kernels using the gradient descent method thereby alleviating the need for predetermined w…

    Submitted 9 May, 2019; originally announced May 2019.

    Journal ref: Circuits, Systems, and Signal Processing, vol. 36, no. 4, pp. 1639-1653, 2017

  17. arXiv:1809.09620  [pdf, other]

    q-bio.BM cs.LG stat.AP

    RAFP-Pred: Robust Prediction of Antifreeze Proteins using Localized Analysis of n-Peptide Compositions

    Authors: Shujaat Khan, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

    Abstract: In extreme cold weather, living organisms produce Antifreeze Proteins (AFPs) to counter the otherwise lethal intracellular formation of ice. Structures and sequences of various AFPs exhibit a high degree of heterogeneity; consequently, the prediction of the AFPs is considered to be a challenging task. In this research, we propose to handle this arduous manifold learning task using the notion of loc…

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: 7 pages, 2 figures

    Journal ref: "RAFP-Pred: Robust Prediction of Antifreeze Proteins Using Localized Analysis of n-Peptide Compositions," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 15, no. 1, pp. 244-250, 1 Jan.-Feb. 2018

  18. arXiv:1808.01091  [pdf, ps, other]

    cs.SE

    DataDeps.jl: Repeatable Data Setup for Replicable Data Science

    Authors: Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: We present DataDeps.jl: a Julia package for the reproducible handling of static datasets to enhance the repeatability of scripts used in the data and computational sciences. It is used to automate the data setup part of running software which accompanies a paper to replicate a result. This step is commonly done manually, which expends time and allows for confusion. This functionality is also usefu…

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: Source code: https://github.com/oxinabox/DataDeps.jl/

  19. arXiv:1803.09470  [pdf, other]

    cs.CV

    Real Time Surveillance for Low Resolution and Limited-Data Scenarios: An Image Set Classification Approach

    Authors: Uzair Nadeem, Syed Afaq Ali Shah, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel

    Abstract: This paper proposes a novel image set classification technique based on the concept of linear regression. Unlike most other approaches, the proposed technique does not involve any training or feature extraction. The gallery image sets are represented as subspaces in a high dimensional space. Class specific gallery subspaces are used to estimate regression models for each image of the test image se…

    Submitted 3 March, 2019; v1 submitted 26 March, 2018; originally announced March 2018.

  20. arXiv:1711.04973  [pdf, ps, other]

    math.OC cs.IT math.ST

    A Robust Variable Step Size Fractional Least Mean Square (RVSS-FLMS) Algorithm

    Authors: Shujaat Khan, Muhammad Usman, Imran Naseem, Roberto Togneri, Mohammed Bennamoun

    Abstract: In this paper, we propose an adaptive framework for the variable step size of the fractional least mean square (FLMS) algorithm. The proposed algorithm, named the robust variable step size-FLMS (RVSS-FLMS), dynamically updates the step size of the FLMS to achieve a high convergence rate with low steady-state error. For evaluation purposes, the problem of system identification is considered. The ex…

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: 15 pages, 3 figures, 13th IEEE Colloquium on Signal Processing & its Applications (CSPA 2017)

    Journal ref: 2017 IEEE 13th International Colloquium on Signal Processing & its Applications (CSPA), Batu Ferringhi, 2017, pp. 1-6

  21. arXiv:1709.09360  [pdf, other]

    cs.CL

    Learning of Colors from Color Names: Distribution and Point Estimation

    Authors: Lyndon White, Roberto Togneri, Wei Liu, Mohammed Bennamoun

    Abstract: Color names are often made up of multiple words. As a task in natural language understanding we investigate in depth the capacity of neural networks based on sums of word embeddings (SOWE), recurrence (LSTM and GRU based RNNs) and convolution (CNN), to estimate colors from sequences of terms. We consider both point and distribution estimates of color. We argue that the latter has a particular valu…

    Submitted 10 January, 2020; v1 submitted 27 September, 2017; originally announced September 2017.

    Comments: Implementation available at https://github.com/oxinabox/ColoringNames.jl/

  22. arXiv:1701.02485  [pdf, other]

    cs.CV

    Efficient Image Set Classification using Linear Regression based Image Reconstruction

    Authors: Syed Afaq Ali Shah, Uzair Nadeem, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri

    Abstract: We propose a novel image set classification technique using linear regression models. Downsampled gallery image sets are interpreted as subspaces of a high dimensional space to avoid the computationally expensive training step. We estimate regression models for each test image using the class specific gallery subspaces. Images of the test set are then reconstructed using the regression models. Bas…

    Submitted 10 January, 2017; originally announced January 2017.

  23. arXiv:1606.02009  [pdf, other]

    cs.CV

    Learning deep structured network for weakly supervised change detection

    Authors: Salman H Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri

    Abstract: Conventional change detection methods require a large number of images to learn background models or depend on tedious pixel-level labeling by humans. In this paper, we present a weakly supervised approach that needs only image-level labels to simultaneously detect and localize changes in a pair of images. To this end, we employ a deep neural network with DAG topology to learn patterns of change f…

    Submitted 22 May, 2017; v1 submitted 6 June, 2016; originally announced June 2016.

  24. arXiv:1508.03422  [pdf, other]

    cs.CV

    Cost Sensitive Learning of Deep Feature Representations from Imbalanced Data

    Authors: Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri

    Abstract: Class imbalance is a common problem in the case of real-world object detection and classification tasks. Data of some classes is abundant making them an over-represented majority, and data of other classes is scarce, making them an under-represented minority. This imbalance makes it challenging for a classifier to appropriately learn the discriminating boundaries of the majority and minority class…

    Submitted 23 March, 2017; v1 submitted 14 August, 2015; originally announced August 2015.

  25. A Discriminative Representation of Convolutional Features for Indoor Scene Recognition

    Authors: Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel

    Abstract: Indoor scene recognition is a multi-faceted and challenging problem due to the diverse intra-class variations and the confusing inter-class similarities. This paper presents a novel approach which exploits rich mid-level convolutional features to categorize indoor scenes. Traditionally used convolutional features preserve the global spatial structure, which is a desirable property for general obje…

    Submitted 16 June, 2015; originally announced June 2015.