Skip to main content

Showing 1–23 of 23 results for author: Hussain, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.16757  [pdf, other

    cs.SD eess.AS

    Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids

    Authors: Jasper Kirton-Wingate, Shafique Ahmed, Adeel Hussain, Mandar Gogate, Kia Dashtipour, Jen-Cheng Hou, Tassadaq Hussain, Yu Tsao, Amir Hussain

    Abstract: Since the advent of Deep Learning (DL), Speech Enhancement (SE) models have performed well under a variety of noise conditions. However, such systems may still introduce sonic artefacts, sound unnatural, and restrict the ability for a user to hear ambient sound which may be of importance. Hearing Aid (HA) users may wish to customise their SE systems to suit their personal preferences and day-to-da… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: This has been submitted to the Trends in Hearing journal

  2. arXiv:2402.16394  [pdf, other

    eess.AS cs.SD

    Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues

    Authors: Tassadaq Hussain, Kia Dashtipour, Yu Tsao, Amir Hussain

    Abstract: In real-world environments, background noise significantly degrades the intelligibility and clarity of human speech. Audio-visual speech enhancement (AVSE) attempts to restore speech quality, but existing methods often fall short, particularly in dynamic noise conditions. This study investigates the inclusion of emotion as a novel contextual cue within AVSE, hypothesizing that incorporating emotio… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  3. arXiv:2306.08402  [pdf, other

    cs.CR

    Fairness and Privacy-Preserving in Federated Learning: A Survey

    Authors: Taki Hasan Rafi, Faiza Anan Noor, Tahmid Hussain, Dong-Kyu Chae

    Abstract: Federated learning (FL) as distributed machine learning has gained popularity as privacy-aware Machine Learning (ML) systems have emerged as a technique that prevents privacy leakage by building a global model and by conducting individualized training of decentralized edge clients on their own private data. The existing works, however, employ privacy mechanisms such as Secure Multiparty Computing… ▽ More

    Submitted 14 July, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 23 pages; 2 figures

  4. arXiv:2305.01111  [pdf, other

    cs.CV cs.AI cs.LG

    Local and Global Contextual Features Fusion for Pedestrian Intention Prediction

    Authors: Mohsen Azarmi, Mahdi Rezaei, Tanveer Hussain, Chenghao Qian

    Abstract: Autonomous vehicles (AVs) are becoming an indispensable part of future transportation. However, safety challenges and lack of reliability limit their real-world deployment. Towards boosting the appearance of AVs on the roads, the interaction of AVs with pedestrians including "prediction of the pedestrian crossing intention" deserves extensive research. This is a highly challenging task as involves… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  5. arXiv:2303.14787  [pdf, other

    cs.DC

    A Generalized Look at Federated Learning: Survey and Perspectives

    Authors: Taki Hasan Rafi, Faiza Anan Noor, Tahmid Hussain, Dong-Kyu Chae, Zhaohui Yang

    Abstract: Federated learning (FL) refers to a distributed machine learning framework involving learning from several decentralized edge clients without sharing local dataset. This distributed strategy prevents data leakage and enables on-device training as it updates the global model based on the local model updates. Despite offering several advantages, including data privacy and scalability, FL poses chall… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: 9 pages, 2 figures

  6. arXiv:2210.17456  [pdf, other

    eess.AS cs.SD

    Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings

    Authors: I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou

    Abstract: AV-HuBERT, a multi-modal self-supervised learning model, has been shown to be effective for categorical problems such as automatic speech recognition and lip-reading. This suggests that useful audio-visual speech representations can be obtained via utilizing multi-modal self-supervised embeddings. Nevertheless, it is unclear if such representations can be generalized to solve real-world multi-moda… ▽ More

    Submitted 31 May, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: ICASSP AMHAT 2023

  7. arXiv:2209.05778  [pdf, other

    cs.CV cs.AI

    Self-supervised motion descriptor for cardiac phase detection in 4D CMR based on discrete vector field estimations

    Authors: Sven Koehler, Tarique Hussain, Hamza Hussain, Daniel Young, Samir Sarikouch, Thomas Pickhardt, Gerald Greil, Sandy Engelhardt

    Abstract: Cardiac magnetic resonance (CMR) sequences visualise the cardiac function voxel-wise over time. Simultaneously, deep learning-based deformable image registration is able to estimate discrete vector fields which warp one time step of a CMR sequence to the following in a self-supervised manner. However, despite the rich source of information included in these 3D+t vector fields, a standardised inter… ▽ More

    Submitted 18 September, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: accepted for the STACOM2022 workshop @ MICCAI2022

  8. arXiv:2204.06788  [pdf, other

    cs.CV

    Pyramidal Attention for Saliency Detection

    Authors: Tanveer Hussain, Abbas Anwar, Saeed Anwar, Lars Petersson, Sung Wook Baik

    Abstract: Salient object detection (SOD) extracts meaningful contents from an input image. RGB-based SOD methods lack the complementary depth clues; hence, providing limited performance for complex scenarios. Similarly, RGB-D models process RGB and depth inputs, but the depth data availability during testing may hinder the model's practical applicability. This paper exploits only RGB images, estimates depth… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted at CVPRW 2022. (2022 IEEE CVPR Workshop on Fair, Data Efficient and Trusted Computer Vision)

  9. arXiv:2202.05756  [pdf, other

    cs.SD cs.LG eess.AS

    A Novel Speech Intelligibility Enhancement Model based on CanonicalCorrelation and Deep Learning

    Authors: Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain

    Abstract: Current deep learning (DL) based approaches to speech intelligibility enhancement in noisy environments are often trained to minimise the feature distance between noise-free speech and enhanced speech signals. Despite improving the speech quality, such approaches do not deliver required levels of speech intelligibility in everyday noisy environments . Intelligibility-oriented (I-O) loss functions… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.04172

  10. arXiv:2202.05662  [pdf, other

    cs.CR cs.SD eess.AS

    A Novel Chaos-based Light-weight Image Encryption Scheme for Multi-modal Hearing Aids

    Authors: Awais Aziz Shah, Ahsan Adeel, Jawad Ahmad, Ahmed Al-Dubai, Mandar Gogate, Abhijeet Bishnu, Muhammad Diyan, Tassadaq Hussain, Kia Dashtipour, Tharm Ratnarajah, Amir Hussain

    Abstract: Multimodal hearing aids (HAs) aim to deliver more intelligible audio in noisy environments by contextually sensing and processing data in the form of not only audio but also visual information (e.g. lip reading). Machine learning techniques can play a pivotal role for the contextually processing of multimodal data. However, since the computational power of HA devices is low, therefore this data mu… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

  11. arXiv:2202.04172   

    eess.AS cs.SD

    A Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning for Hearing-Assistive Technologies

    Authors: Tassadaq Hussain, Muhammad Diyan, Mandar Gogate, Kia Dashtipour, Ahsan Adeel, Yu Tsao, Amir Hussain

    Abstract: Current deep learning (DL) based approaches to speech intelligibility enhancement in noisy environments are generally trained to minimise the distance between clean and enhanced speech features. These often result in improved speech quality however they suffer from a lack of generalisation and may not deliver the required speech intelligibility in everyday noisy situations. In an attempt to addres… ▽ More

    Submitted 15 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: We would like to withdraw this article because we have accidentally uploaded the revised version of the same article from another account. The updated version is titled "A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning" (arXiv:2202.05756)

  12. arXiv:2201.09913  [pdf

    eess.AS cs.SD

    A Novel Temporal Attentive-Pooling based Convolutional Recurrent Architecture for Acoustic Signal Enhancement

    Authors: Tassadaq Hussain, Wei-Chien Wang, Mandar Gogate, Kia Dashtipour, Yu Tsao, Xugang Lu, Adeel Ahsan, Amir Hussain

    Abstract: In acoustic signal processing, the target signals usually carry semantic information, which is encoded in a hierarchal structure of short and long-term contexts. However, the background noise distorts these structures in a nonuniform way. The existing deep acoustic signal enhancement (ASE) architectures ignore this kind of local and global effect. To address this problem, we propose to integrate a… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

  13. arXiv:2112.10223  [pdf, other

    cs.DC

    Parallel Algorithms for Adding a Collection of Sparse Matrices

    Authors: Md Taufique Hussain, Guttu Sai Abhishek, Aydin Buluç, Ariful Azad

    Abstract: We develop a family of parallel algorithms for the SpKAdd operation that adds a collection of k sparse matrices. SpKAdd is a much needed operation in many applications including distributed memory sparse matrix-matrix multiplication (SpGEMM), streaming accumulations of graphs, and algorithmic sparsification of the gradient updates in deep learning. While adding two sparse matrices is a common oper… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

  14. arXiv:2111.09642  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Towards Intelligibility-Oriented Audio-Visual Speech Enhancement

    Authors: Tassadaq Hussain, Mandar Gogate, Kia Dashtipour, Amir Hussain

    Abstract: Existing deep learning (DL) based speech enhancement approaches are generally optimised to minimise the distance between clean and enhanced speech features. These often result in improved speech quality however they suffer from a lack of generalisation and may not deliver the required speech intelligibility in real noisy situations. In an attempt to address these challenges, researchers have explo… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: 6 pages, 4 figures

  15. arXiv:2110.06342  [pdf, other

    cs.RO cs.MA

    Decentralized Connectivity Maintenance for Multi-robot Systems Under Motion and Sensing Uncertainties

    Authors: Akshay Shetty, Timmy Hussain, Grace Gao

    Abstract: Communication connectivity is desirable for safe and efficient operation of multi-robot systems. While decentralized algorithms for connectivity maintenance have been explored in recent literature, the majority of these works do not account for robot motion and sensing uncertainties. These uncertainties are inherent in practical robots and result in robots deviating from their desired positions wh… ▽ More

    Submitted 20 July, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted in NAVIGATION: Journal of The Institute of Navigation

  16. arXiv:2106.14402  [pdf, other

    cs.DC cs.DM cs.PF math.CO

    Combinatorial BLAS 2.0: Scaling combinatorial algorithms on distributed-memory systems

    Authors: Ariful Azad, Oguz Selvitopi, Md Taufique Hussain, John R. Gilbert, Aydin Buluc

    Abstract: Combinatorial algorithms such as those that arise in graph analysis, modeling of discrete systems, bioinformatics, and chemistry, are often hard to parallelize. The Combinatorial BLAS library implements key computational primitives for rapid development of combinatorial algorithms in distributed-memory systems. During the decade since its first introduction, the Combinatorial BLAS library has evol… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: To appear in IEEE Transactions on Parallel and Distributed Systems

  17. arXiv:2104.01161  [pdf, ps, other

    cs.SD eess.AS

    An Audio-Based Deep Learning Framework For BBC Television Programme Classification

    Authors: Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark Plumbley

    Abstract: This paper proposes a deep learning framework for classification of BBC television programmes using audio. The audio is firstly transformed into spectrograms, which are fed into a pre-trained convolutional Neural Network (CNN), obtaining predicted probabilities of sound events occurring in the audio recording. Statistics for the predicted probabilities and detected sound events are then calculated… ▽ More

    Submitted 11 February, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

  18. arXiv:2102.06407  [pdf, other

    cs.CV

    Densely Deformable Efficient Salient Object Detection Network

    Authors: Tanveer Hussain, Saeed Anwar, Amin Ullah, Khan Muhammad, Sung Wook Baik

    Abstract: Salient Object Detection (SOD) domain using RGB-D data has lately emerged with some current models' adequately precise results. However, they have restrained generalization abilities and intensive computational complexity. In this paper, inspired by the best background/foreground separation abilities of deformable convolutions, we employ them in our Densely Deformable Network (DDNet) to achieve ef… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  19. arXiv:2101.07653  [pdf, other

    eess.IV cs.CV cs.LG

    Unsupervised Domain Adaptation from Axial to Short-Axis Multi-Slice Cardiac MR Images by Incorporating Pretrained Task Networks

    Authors: Sven Koehler, Tarique Hussain, Zach Blair, Tyler Huffaker, Florian Ritzmann, Animesh Tandon, Thomas Pickardt, Samir Sarikouch, Heiner Latus, Gerald Greil, Ivo Wolf, Sandy Engelhardt

    Abstract: Anisotropic multi-slice Cardiac Magnetic Resonance (CMR) Images are conventionally acquired in patient-specific short-axis (SAX) orientation. In specific cardiovascular diseases that affect right ventricular (RV) morphology, acquisitions in standard axial (AX) orientation are preferred by some investigators, due to potential superiority in RV volume measurement for treatment planning. Unfortunatel… ▽ More

    Submitted 20 January, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: Accepted for IEEE Transaction on Medical Imaging (TMI) 2021 on 13.01.2021

  20. arXiv:2010.08526  [pdf, other

    cs.DC

    Communication-Avoiding and Memory-Constrained Sparse Matrix-Matrix Multiplication at Extreme Scale

    Authors: Md Taufique Hussain, Oguz Selvitopi, Aydin Buluç, Ariful Azad

    Abstract: Sparse matrix-matrix multiplication (SpGEMM) is a widely used kernel in various graph, scientific computing and machine learning algorithms. In this paper, we consider SpGEMMs performed on hundreds of thousands of processors generating trillions of nonzeros in the output matrix. Distributed SpGEMM at this extreme scale faces two key challenges: (1) high communication cost and (2) inadequate memory… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 14 pages, 15 figures

  21. arXiv:2002.10083  [pdf, other

    cs.DC

    Optimizing High Performance Markov Clustering for Pre-Exascale Architectures

    Authors: Oguz Selvitopi, Md Taufique Hussain, Ariful Azad, Aydın Buluç

    Abstract: HipMCL is a high-performance distributed memory implementation of the popular Markov Cluster Algorithm (MCL) and can cluster large-scale networks within hours using a few thousand CPU-equipped nodes. It relies on sparse matrix computations and heavily makes use of the sparse matrix-sparse matrix multiplication kernel (SpGEMM). The existing parallel algorithms in HipMCL are not scalable to Exascale… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

    Journal ref: 34th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

  22. arXiv:2002.04392  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    How well do U-Net-based segmentation trained on adult cardiac magnetic resonance imaging data generalise to rare congenital heart diseases for surgical planning?

    Authors: Sven Koehler, Animesh Tandon, Tarique Hussain, Heiner Latus, Thomas Pickardt, Samir Sarikouch, Philipp Beerbaum, Gerald Greil, Sandy Engelhardt, Ivo Wolf

    Abstract: Planning the optimal time of intervention for pulmonary valve replacement surgery in patients with the congenital heart disease Tetralogy of Fallot (TOF) is mainly based on ventricular volume and function according to current guidelines. Both of these two biomarkers are most reliably assessed by segmentation of 3D cardiac magnetic resonance (CMR) images. In several grand challenges in the last yea… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: Accepted for SPIE Medical Imaging 2020

  23. arXiv:1904.11576  [pdf

    physics.ao-ph cs.LG stat.ML

    Forecasting Drought Using Multilayer Perceptron Artificial Neural Network Model

    Authors: Zulifqar Ali, Ijaz Hussain, Muhammad Faisal, Hafiza Mamona Nazir, Tajammal Hussain, Muhammad Yousaf Shad, Alaa Mohamd Shoukry, Showkat Hussain Gani

    Abstract: These days human beings are facing many environmental challenges due to frequently occurring drought hazards. It may have an effect on the countrys environment, the community, and industries. Several adverse impacts of drought hazard are continued in Pakistan, including other hazards. However, early measurement and detection of drought can provide guidance to water resources management for employi… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.