Showing 1–24 of 24 results for author: Malik, I

Searching in archive cs.
  1. A Perspective Analysis of Handwritten Signature Technology

    Authors: Moises Diaz, Miguel A. Ferrer, Donato Impedovo, Muhammad Imran Malik, Giuseppe Pirlo, Rejean Plamondon

    Abstract: Handwritten signatures are biometric traits at the center of debate in the scientific community. Over the last 40 years, the interest in signature studies has grown steadily, having as its main reference the application of automatic signature verification, as previously published reviews in 1989, 2000, and 2008 bear witness. Ever since, and over the last 10 years, the application of handwritten si…

    Submitted 22 May, 2024; originally announced May 2024.

    Journal ref: ACM Computing Surveys (CSUR), vol. 51, no. 6, pp. 117:1-117:39 (2018)

  2. arXiv:2404.00946  [pdf]

    cs.CV

    Exploring the Efficacy of Group-Normalization in Deep Learning Models for Alzheimer's Disease Classification

    Authors: Gousia Habib, Ishfaq Ahmed Malik, Jameel Ahmad, Imtiaz Ahmed, Shaima Qureshi

    Abstract: Batch Normalization is an important approach to advancing deep learning since it allows multiple networks to train simultaneously. A problem arises when normalizing along the batch dimension because B.N.'s error increases significantly as batch size shrinks because batch statistics estimates are inaccurate. As a result, computer vision tasks like detection, segmentation, and video, which require t…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 19 pages, 3 figures

  3. arXiv:2309.14662  [pdf, other]

    cs.LG cs.CY cs.IR

    Transformer-based classification of user queries for medical consultancy with respect to expert specialization

    Authors: Dmitry Lyutkin, Andrey Soloviev, Dmitry Zhukov, Denis Pozdnyakov, Muhammad Shahid Iqbal Malik, Dmitry I. Ignatov

    Abstract: The need for skilled medical support is growing in the era of digital healthcare. This research presents an innovative strategy, utilizing the RuBERT model, for categorizing user inquiries in the field of medical consultation with a focus on expert specialization. By harnessing the capabilities of transformers, we fine-tuned the pre-trained RuBERT model on a varied dataset, which facilitates preci…

    Submitted 2 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 16 pages, 5 figures

  4. arXiv:2308.04868  [pdf, other]

    cs.CV

    InstantAvatar: Efficient 3D Head Reconstruction via Surface Rendering

    Authors: Antonio Canela, Pol Caselles, Ibrar Malik, Eduard Ramon, Jaime García, Jordi Sánchez-Riera, Gil Triginer, Francesc Moreno-Noguer

    Abstract: Recent advances in full-head reconstruction have been obtained by optimizing a neural field through differentiable surface or volume rendering to represent a single scene. While these techniques achieve an unprecedented accuracy, they take several minutes, or even hours, due to the expensive optimization process required. In this work, we introduce InstantAvatar, a method that recovers full-head a…

    Submitted 5 April, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  5. arXiv:2307.06090  [pdf, other]

    cs.SD eess.AS

    Can Large Language Models Aid in Annotating Speech Emotional Data? Uncovering New Frontiers

    Authors: Siddique Latif, Muhammad Usama, Mohammad Ibrahim Malik, Björn W. Schuller

    Abstract: Despite recent advancements in speech emotion recognition (SER) models, state-of-the-art deep learning (DL) approaches face the challenge of the limited availability of annotated data. Large language models (LLMs) have revolutionised our understanding of natural language, introducing emergent properties that broaden comprehension in language, speech, and vision. This paper examines the potential o…

    Submitted 19 June, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted in IEEE Computational Intelligence Magazine

  6. arXiv:2305.11413  [pdf, other]

    cs.SD eess.AS

    A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model

    Authors: Ibrahim Malik, Siddique Latif, Raja Jurdak, Björn Schuller

    Abstract: In this paper, we propose to utilise diffusion models for data augmentation in speech emotion recognition (SER). In particular, we present an effective approach to utilise improved denoising diffusion probabilistic models (IDDPM) to generate synthetic emotional data. We condition the IDDPM with the textual embedding from bidirectional encoder representations from transformers (BERT) to generate hi…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted Interspeech 2023

  7. arXiv:2305.00725  [pdf, other]

    cs.SD eess.AS

    Emotions Beyond Words: Non-Speech Audio Emotion Recognition With Edge Computing

    Authors: Ibrahim Malik, Siddique Latif, Sanaullah Manzoor, Muhammad Usama, Junaid Qadir, Raja Jurdak

    Abstract: Non-speech emotion recognition has a wide range of applications including healthcare, crime control and rescue, and entertainment, to name a few. Providing these applications using edge computing has great potential, however, recent studies are focused on speech-emotion recognition using complex architectures. In this paper, a non-speech-based emotion recognition system is proposed, which can rely…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Under review

  8. arXiv:2211.06761  [pdf, other]

    cs.CV

    Few-Shot Learning for Biometric Verification

    Authors: Saad Bin Ahmed, Umaid M. Zaffar, Marium Aslam, Muhammad Imran Malik

    Abstract: In machine learning applications, it is common practice to feed as much information as possible. In most cases, the model can handle large data sets that allow to predict more accurately. In the presence of data scarcity, a Few-Shot learning (FSL) approach aims to build more accurate algorithms with limited training data. We propose a novel end-to-end lightweight architecture that verifies biometr…

    Submitted 3 May, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: 19 pages, 7 figures

  9. arXiv:2202.03903  [pdf, other]

    cs.LG cs.AI

    KENN: Enhancing Deep Neural Networks by Leveraging Knowledge for Time Series Forecasting

    Authors: Muhammad Ali Chattha, Ludger van Elst, Muhammad Imran Malik, Andreas Dengel, Sheraz Ahmed

    Abstract: End-to-end data-driven machine learning methods often have exuberant requirements in terms of quality and quantity of training data which are often impractical to fulfill in real-world applications. This is specifically true in time series domain where problems like disaster prediction, anomaly detection, and demand prediction often do not have a large amount of historical data. Moreover, relying…

    Submitted 16 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  10. arXiv:2201.01249  [pdf, ps, other]

    cs.AI cs.LG eess.IV

    ExAID: A Multimodal Explanation Framework for Computer-Aided Diagnosis of Skin Lesions

    Authors: Adriano Lucieri, Muhammad Naseer Bajwa, Stephan Alexander Braun, Muhammad Imran Malik, Andreas Dengel, Sheraz Ahmed

    Abstract: One principal impediment in the successful deployment of AI-based Computer-Aided Diagnosis (CAD) systems in clinical workflows is their lack of transparent decision making. Although commonly used eXplainable AI methods provide some insight into opaque algorithms, such explanations are usually convoluted and not readily comprehensible except by highly trained experts. The explanation of decisions r…

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in Computer Methods and Programs in Biomedicine

  11. arXiv:2110.09294  [pdf]

    eess.IV cs.CV cs.LG

    Comparative Analysis of Deep Learning Algorithms for Classification of COVID-19 X-Ray Images

    Authors: Unsa Maheen, Khawar Iqbal Malik, Gohar Ali

    Abstract: The Coronavirus was first emerged in December, in the city of China named Wuhan in 2019 and spread quickly all over the world. It has very harmful effects all over the global economy, education, social, daily living and general health of humans. To restrict the quick expansion of the disease initially, main difficulty is to explore the positive corona patients as quickly as possible. As there are…

    Submitted 14 October, 2021; originally announced October 2021.

  12. arXiv:2103.02438  [pdf, other]

    stat.ML cs.AI cs.LG stat.CO

    Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design

    Authors: Adam Foster, Desi R. Ivanova, Ilyas Malik, Tom Rainforth

    Abstract: We introduce Deep Adaptive Design (DAD), a method for amortizing the cost of adaptive Bayesian experimental design that allows experiments to be run in real-time. Traditional sequential Bayesian optimal experimental design approaches require substantial computation at each stage of the experiment. This makes them unsuitable for most real-world applications, where decisions must typically be made q…

    Submitted 11 June, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Published as a conference paper at ICML 2021

  13. arXiv:2010.16408  [pdf]

    cs.CL

    Sentiment Analysis for Roman Urdu Text over Social Media, a Comparative Study

    Authors: Irfan Qutab, Khawar Iqbal Malik, Hira Arooj

    Abstract: In present century, data volume is increasing enormously. The data could be in form for image, text, voice, and video. One factor in this huge growth of data is usage of social media where everyone is posting data on daily basis during chatting, exchanging information, and uploading their personal and official credential. Research of sentiments seeks to uncover abstract knowledge in Published text…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 8 Pages, 12 Figures. International Journal of Computer Science and Network - 2020

  14. arXiv:2006.09158  [pdf, other]

    eess.IV cs.CV cs.LG

    G1020: A Benchmark Retinal Fundus Image Dataset for Computer-Aided Glaucoma Detection

    Authors: Muhammad Naseer Bajwa, Gur Amrit Pal Singh, Wolfgang Neumeier, Muhammad Imran Malik, Andreas Dengel, Sheraz Ahmed

    Abstract: Scarcity of large publicly available retinal fundus image datasets for automated glaucoma detection has been the bottleneck for successful application of artificial intelligence towards practical Computer-Aided Diagnosis (CAD). A few small datasets that are available for research community usually suffer from impractical image capturing conditions and stringent inclusion criteria. These shortcomin…

    Submitted 28 May, 2020; originally announced June 2020.

    Comments: Accepted in IJCNN-2020, 7 pages, 5 figures

  15. Combining Fine- and Coarse-Grained Classifiers for Diabetic Retinopathy Detection

    Authors: Muhammad Naseer Bajwa, Yoshinobu Taniguchi, Muhammad Imran Malik, Wolfgang Neumeier, Andreas Dengel, Sheraz Ahmed

    Abstract: Visual artefacts of early diabetic retinopathy in retinal fundus images are usually small in size, inconspicuous, and scattered all over retina. Detecting diabetic retinopathy requires physicians to look at the whole image and fixate on some specific regions to locate potential biomarkers of the disease. Therefore, getting inspiration from ophthalmologist, we propose to combine coarse-grained clas…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: Pages 12, Figures 5

  16. arXiv:2005.14284  [pdf]

    cs.CV cs.LG eess.IV

    Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning

    Authors: Muhammad Naseer Bajwa, Muhammad Imran Malik, Shoaib Ahmed Siddiqui, Andreas Dengel, Faisal Shafait, Wolfgang Neumeier, Sheraz Ahmed

    Abstract: With the advancement of powerful image processing and machine learning techniques, CAD has become ever more prevalent in all fields of medicine including ophthalmology. Since optic disc is the most important part of retinal fundus image for glaucoma detection, this paper proposes a two-stage framework that first detects and localizes optic disc and then classifies it into healthy or glaucomatous.…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 16 Pages, 10 Figures

    Journal ref: BMC medical informatics and decision making 19.1 (2019): 136

  17. arXiv:2005.02000  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    On Interpretability of Deep Learning based Skin Lesion Classifiers using Concept Activation Vectors

    Authors: Adriano Lucieri, Muhammad Naseer Bajwa, Stephan Alexander Braun, Muhammad Imran Malik, Andreas Dengel, Sheraz Ahmed

    Abstract: Deep learning based medical image classifiers have shown remarkable prowess in various application areas like ophthalmology, dermatology, pathology, and radiology. However, the acceptance of these Computer-Aided Diagnosis (CAD) systems in real clinical setups is severely limited primarily because their decision-making process remains largely obscure. This work aims at elucidating a deep learning b…

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: Accepted for the IEEE International Joint Conference on Neural Networks (IJCNN) 2020

    Journal ref: 2020 International Joint Conference on Neural Networks (IJCNN)

  18. arXiv:1912.11356  [pdf]

    q-bio.GN cs.LG

    A Robust and Precise ConvNet for small non-coding RNA classification (RPC-snRC)

    Authors: Muhammad Nabeel Asim, Muhammad Imran Malik, Andreas Dengel, Sheraz Ahmed

    Abstract: Functional or non-coding RNAs are attracting more attention as they are now potentially considered valuable resources in the development of new drugs intended to cure several human diseases. The identification of drugs targeting the regulatory circuits of functional RNAs depends on knowing its family, a task which is known as RNA sequence classification. State-of-the-art small noncoding RNA classi…

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: 34 pages

  19. arXiv:1909.05478  [pdf, other]

    cs.CL cs.IR cs.LG

    A Robust Hybrid Approach for Textual Document Classification

    Authors: Muhammad Nabeel Asim, Muhammad Usman Ghani Khan, Muhammad Imran Malik, Andreas Dengel, Sheraz Ahmed

    Abstract: Text document classification is an important task for diverse natural language processing based applications. Traditional machine learning approaches mainly focused on reducing dimensionality of textual data to perform classification. This although improved the overall classification accuracy, the classifiers still faced sparsity problem due to lack of better data representation techniques. Deep l…

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: ICDAR Conference

  20. arXiv:1902.05653  [pdf, other]

    cs.LG stat.ML

    KINN: Incorporating Expert Knowledge in Neural Networks

    Authors: Muhammad Ali Chattha, Shoaib Ahmed Siddiqui, Muhammad Imran Malik, Ludger van Elst, Andreas Dengel, Sheraz Ahmed

    Abstract: The promise of ANNs to automatically discover and extract useful features/patterns from data without dwelling on domain expertise although seems highly promising but comes at the cost of high reliance on large amount of accurately labeled data, which is often hard to acquire and formulate especially in time-series domains like anomaly detection, natural disaster management, predictive maintenance…

    Submitted 14 February, 2019; originally announced February 2019.

  21. arXiv:1712.06536  [pdf, other]

    stat.ML cs.AI cs.LG

    Nonparametric Inference for Auto-Encoding Variational Bayes

    Authors: Erik Bodin, Iman Malik, Carl Henrik Ek, Neill D. F. Campbell

    Abstract: We would like to learn latent representations that are low-dimensional and highly interpretable. A model that has these characteristics is the Gaussian Process Latent Variable Model. The benefits and negative of the GP-LVM are complementary to the Variational Autoencoder, the former provides interpretable low-dimensional latent representations while the latter is able to handle large amounts of da…

    Submitted 18 December, 2017; originally announced December 2017.

    Comments: Presented at NIPS 2017 Workshop on Advances in Approximate Bayesian Inference

  22. arXiv:1708.03535  [pdf, other]

    cs.SD cs.NE

    Neural Translation of Musical Style

    Authors: Iman Malik, Carl Henrik Ek

    Abstract: Music is an expressive form of communication often used to convey emotion in scenarios where "words are not enough". Part of this information lies in the musical composition where well-defined language exists. However, a significant amount of information is added during a performance as the musician interprets the composition. The performer injects expressiveness into the written score through var…

    Submitted 11 August, 2017; originally announced August 2017.

  23. arXiv:1705.11181  [pdf, other]

    cs.HC

    AirScript - Creating Documents in Air

    Authors: Ayushman Dash, Amit Sahu, Rajveer Shringi, John Cristian Borges Gamboa, Muhammad Zeshan Afzal, Muhammad Imran Malik, Sheraz Ahmed, Andreas Dengel

    Abstract: This paper presents a novel approach, called AirScript, for creating, recognizing and visualizing documents in air. We present a novel algorithm, called 2-DifViz, that converts the hand movements in air (captured by a Myo-armband worn by a user) into a sequence of x, y coordinates on a 2D Cartesian plane, and visualizes them on a canvas. Existing sensor-based approaches either do not provide visua…

    Submitted 30 May, 2017; originally announced May 2017.

  24. arXiv:1605.01189  [pdf, other]

    cs.CV

    A Generic Method for Automatic Ground Truth Generation of Camera-captured Documents

    Authors: Sheraz Ahmed, Muhammad Imran Malik, Muhammad Zeshan Afzal, Koichi Kise, Masakazu Iwamura, Andreas Dengel, Marcus Liwicki

    Abstract: The contribution of this paper is fourfold. The first contribution is a novel, generic method for automatic ground truth generation of camera-captured document images (books, magazines, articles, invoices, etc.). It enables us to build large-scale (i.e., millions of images) labeled camera-captured/scanned documents datasets, without any human intervention. The method is generic, language independe…

    Submitted 4 May, 2016; originally announced May 2016.