Skip to main content

Showing 1–50 of 59 results for author: Shah, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.16932  [pdf, other

    eess.SP cs.LG

    Xi-Net: Transformer Based Seismic Waveform Reconstructor

    Authors: Anshuman Gaharwar, Parth Parag Kulkarni, Joshua Dickey, Mubarak Shah

    Abstract: Missing/erroneous data is a major problem in today's world. Collected seismic data sometimes contain gaps due to multitude of reasons like interference and sensor malfunction. Gaps in seismic waveforms hamper further signal processing to gain valuable information. Plethora of techniques are used for data reconstruction in other domains like image, video, audio, but translation of those methods to… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Oral Presentation at IEEE International Conference on Image Processing(ICIP) 2023 (Multidimensional Signal Processing Track)

  2. arXiv:2405.07354  [pdf, other

    cs.SD cs.IR cs.LG cs.MM eess.AS

    SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset

    Authors: Sushant Gautam, Mehdi Houshmand Sarkhoosh, Jan Held, Cise Midoglu, Anthony Cioppa, Silvio Giancola, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah

    Abstract: The application of Automatic Speech Recognition (ASR) technology in soccer offers numerous opportunities for sports analytics. Specifically, extracting audio commentaries with ASR provides valuable insights into the events of the game, and opens the door to several downstream applications such as automatic highlight generation. This paper presents SoccerNet-Echoes, an augmentation of the SoccerNet… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    ACM Class: I.2.7; I.7

  3. arXiv:2405.07338  [pdf, other

    eess.IV cs.CV

    Explainable Convolutional Neural Networks for Retinal Fundus Classification and Cutting-Edge Segmentation Models for Retinal Blood Vessels from Fundus Images

    Authors: Fatema Tuj Johora Faria, Mukaffi Bin Moin, Pronay Debnath, Asif Iftekher Fahim, Faisal Muhammad Shah

    Abstract: Our research focuses on the critical field of early diagnosis of disease by examining retinal blood vessels in fundus images. While automatic segmentation of retinal blood vessels holds promise for early detection, accurate analysis remains challenging due to the limitations of existing methods, which often lack discrimination power and are susceptible to influences from pathological regions. Our… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  4. arXiv:2404.15009  [pdf, other

    cs.CV eess.IV

    The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Deep Gandhi, Xinyang Liu, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Sina Bagheri, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Anurag Gottipati, Debanjan Haldar, Shuvanjan Haldar , et al. (51 additional authors not shown)

    Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we pr… ▽ More

    Submitted 29 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.17033

  5. arXiv:2403.07937  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Speech Robust Bench: A Robustness Benchmark For Speech Recognition

    Authors: Muhammad A. Shah, David Solans Noguero, Mikko A. Heikkila, Nicolas Kourtellis

    Abstract: As Automatic Speech Recognition (ASR) models become ever more pervasive, it is important to ensure that they make reliable predictions under corruptions present in the physical and digital world. We propose Speech Robust Bench (SRB), a comprehensive benchmark for evaluating the robustness of ASR models to diverse corruptions. SRB is composed of 69 input perturbations which are intended to simulate… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  6. arXiv:2312.11868  [pdf, other

    cs.RO eess.SY

    Dynamic Loco-manipulation on HECTOR: Humanoid for Enhanced ConTrol and Open-source Research

    Authors: Junheng Li, Junchao Ma, Omar Kolt, Manas Shah, Quan Nguyen

    Abstract: Despite their remarkable advancement in locomotion and manipulation, humanoid robots remain challenged by a lack of synchronized loco-manipulation control, hindering their full dynamic potential. In this work, we introduce a versatile and effective approach to controlling and generalizing dynamic locomotion and loco-manipulation on humanoid robots via a Force-and-moment-based Model Predictive Cont… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 14 pages, 13 figures

  7. arXiv:2312.05623  [pdf, other

    cs.IT eess.SP

    Impact of Urban Street Geometry on the Detection Probability of Automotive Radars

    Authors: Mohammad Taha Shah, Ankit Kumar, Gourab Ghatak, Shobha Sundar Ram

    Abstract: Prior works have analyzed the performance of millimeter wave automotive radars in the presence of diverse clutter and interference scenarios using stochastic geometry tools instead of more time-consuming measurement studies or system-level simulations. In these works, the distributions of radars or discrete clutter scatterers were modeled as Poisson point processes in the Euclidean space. However,… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: Submitted to IEEE Radar Conference 2024 (RadarConf24)

  8. arXiv:2311.05548  [pdf, other

    cs.CV eess.IV

    L-WaveBlock: A Novel Feature Extractor Leveraging Wavelets for Generative Adversarial Networks

    Authors: Mirat Shah, Vansh Jain, Anmol Chokshi, Guruprasad Parasnis, Pramod Bide

    Abstract: Generative Adversarial Networks (GANs) have risen to prominence in the field of deep learning, facilitating the generation of realistic data from random noise. The effectiveness of GANs often depends on the quality of feature extraction, a critical aspect of their architecture. This paper introduces L-WaveBlock, a novel and robust feature extractor that leverages the capabilities of the Discrete W… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 12 figures, 8 pages

  9. arXiv:2310.05932  [pdf, other

    cs.MA cs.AI eess.SY

    A Multi-Agent Systems Approach for Peer-to-Peer Energy Trading in Dairy Farming

    Authors: Mian Ibad Ali Shah, Abdul Wahid, Enda Barrett, Karl Mason

    Abstract: To achieve desired carbon emission reductions, integrating renewable generation and accelerating the adoption of peer-to-peer energy trading is crucial. This is especially important for energy-intensive farming, like dairy farming. However, integrating renewables and peer-to-peer trading presents challenges. To address this, we propose the Multi-Agent Peer-to-Peer Dairy Farm Energy Simulator (MAPD… ▽ More

    Submitted 21 August, 2023; originally announced October 2023.

    Comments: Proc. of the Artificial Intelligence for Sustainability, ECAI 2023, Eunika et al. (eds.), Sep 30- Oct 1, 2023, https://sites.google.com/view/ai4s. 2023

  10. A Review on AI Algorithms for Energy Management in E-Mobility Services

    Authors: Sen Yan, Maqsood Hussain Shah, Ji Li, Noel O'Connor, Mingming Liu

    Abstract: E-mobility, or electric mobility, has emerged as a pivotal solution to address pressing environmental and sustainability concerns in the transportation sector. The depletion of fossil fuels, escalating greenhouse gas emissions, and the imperative to combat climate change underscore the significance of transitioning to electric vehicles (EVs). This paper seeks to explore the potential of artificial… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 8 pages, 4 tables, 1 figure

  11. arXiv:2309.13962  [pdf, other

    cs.CV eess.IV

    Egocentric RGB+Depth Action Recognition in Industry-Like Settings

    Authors: Jyoti Kini, Sarah Fleischer, Ishan Dave, Mubarak Shah

    Abstract: Action recognition from an egocentric viewpoint is a crucial perception task in robotics and enables a wide range of human-robot interactions. While most computer vision approaches prioritize the RGB camera, the Depth modality - which can further amplify the subtleties of actions from an egocentric perspective - remains underexplored. Our work focuses on recognizing actions from egocentric RGB and… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  12. arXiv:2308.11405  [pdf, other

    cs.IT eess.SP

    Achievable Sum-rate of variants of QAM over Gaussian Multiple Access Channel with and without security

    Authors: Shifa Showkat, Zahid Bashir Dar, Shahid Mehraj Shah

    Abstract: The performance of next generation wireless systems (5G/6G and beyond) at the physical layer is primarily driven by the choice of digital modulation techniques that are bandwidth and power efficient, while maintaining high data rates. Achievable rates for Gaussian input and some finite constellations (BPSK/QPSK/QAM) are well studied in the literature. However, new variants of Quadrature Amplitude… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: 11 Figures, two tables. Accepted for publication in IEEE International Conference on Signal Processing and Computer Vision (SPCV-2023)

  13. arXiv:2308.09693  [pdf, other

    cs.CV cs.LG eess.IV

    A Lightweight Transformer for Faster and Robust EBSD Data Collection

    Authors: Harry Dong, Sean Donegan, Megna Shah, Yuejie Chi

    Abstract: Three dimensional electron back-scattered diffraction (EBSD) microscopy is a critical tool in many applications in materials science, yet its data quality can fluctuate greatly during the arduous collection process, particularly via serial-sectioning. Fortunately, 3D EBSD data is inherently sequential, opening up the opportunity to use transformers, state-of-the-art deep learning architectures tha… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

  14. arXiv:2307.07395  [pdf, other

    eess.SP

    Flexible Beamforming in B5G for Improving Tethered UAV Coverage over Smart Environments

    Authors: Abdu Saif, Nor Shahida Mohd Shah, Soreen Ameen Fattah, Saeed Hamood Alsamhi, Santosh Kumar, Ali Saad Al khuraib

    Abstract: Unmanned Aerial Vehicles (UAVs) are being used for wireless communications in smart environments. However, the need for mobility, scalability of data transmission over wide areas, and the required coverage area make UAV beamforming essential for better coverage and user experience. To this end, we propose a flexible beamforming approach to improve tethered UAV coverage quality and maximize the num… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 6 pages, 7 figures

  15. arXiv:2307.07269  [pdf, other

    eess.IV cs.CV cs.LG

    Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation

    Authors: Asif Hanif, Muzammal Naseer, Salman Khan, Mubarak Shah, Fahad Shahbaz Khan

    Abstract: It is imperative to ensure the robustness of deep learning models in critical applications such as, healthcare. While recent advances in deep learning have improved the performance of volumetric medical image segmentation models, these models cannot be deployed for real-world applications immediately due to their vulnerability to adversarial attacks. We present a 3D frequency domain adversarial at… ▽ More

    Submitted 20 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted in MICCAI 2023 conference

  16. arXiv:2306.09239  [pdf, ps, other

    q-bio.NC cs.LG eess.IV

    Exploiting the Brain's Network Structure for Automatic Identification of ADHD Subjects

    Authors: Soumyabrata Dey, Ravishankar Rao, Mubarak Shah

    Abstract: Attention Deficit Hyperactive Disorder (ADHD) is a common behavioral problem affecting children. In this work, we investigate the automatic classification of ADHD subjects using the resting state Functional Magnetic Resonance Imaging (fMRI) sequences of the brain. We show that the brain can be modeled as a functional network, and certain properties of the networks differ in ADHD subjects from cont… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  17. arXiv:2306.09209  [pdf, ps, other

    cs.IT cs.GT eess.SP

    Bayesian Game Formulation of Power Allocation in Multiple Access Wiretap Channel with Incomplete CSI

    Authors: Basharat Rashid, Majed Haddad, Shahid Mehraj Shah

    Abstract: In this paper, we address the problem of distributed power allocation in a $K$ user fading multiple access wiretap channel, where global channel state information is limited, i.e., each user has knowledge of their own channel state with respect to Bob and Eve but only knows the distribution of other users' channel states. We model this problem as a Bayesian game, where each user is assumed to self… ▽ More

    Submitted 4 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: 7 Pages, 2 Figures, submitted for possible publication

  18. arXiv:2305.17033  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Debanjan Haldar, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Sina Bagheri, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Shuvanjan Haldar, Juan Eugenio Iglesias, Anastasia Janas , et al. (48 additional authors not shown)

    Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20\%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. The MICCA… ▽ More

    Submitted 23 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  19. arXiv:2304.03307  [pdf, other

    cs.CV eess.IV

    Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting

    Authors: Syed Talal Wasim, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Adopting contrastive image-text pretrained models like CLIP towards video classification has gained attention due to its cost-effectiveness and competitive performance. However, recent works in this area face a trade-off. Finetuning the pretrained model to achieve strong supervised performance results in low zero-shot generalization. Similarly, freezing the backbone to retain zero-shot capability… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR-2023. Codes/models available at https://github.com/TalalWasim/Vita-CLIP

  20. arXiv:2303.17959  [pdf, other

    cs.CV eess.IV

    Diffusion Action Segmentation

    Authors: Daochang Liu, Qiyue Li, AnhDung Dinh, Tingting Jiang, Mubarak Shah, Chang Xu

    Abstract: Temporal action segmentation is crucial for understanding long-form videos. Previous works on this task commonly adopt an iterative refinement paradigm by using multi-stage models. We propose a novel framework via denoising diffusion models, which nonetheless shares the same inherent spirit of such iterative refinement. In this framework, action predictions are iteratively generated from random no… ▽ More

    Submitted 11 August, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: ICCV 2023

  21. arXiv:2303.12073  [pdf, other

    eess.IV cs.CV

    3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers

    Authors: Omkar Thawakar, Rao Muhammad Anwer, Jorma Laaksonen, Orly Reiner, Mubarak Shah, Fahad Shahbaz Khan

    Abstract: Accurate 3D mitochondria instance segmentation in electron microscopy (EM) is a challenging problem and serves as a prerequisite to empirically analyze their distributions and morphology. Most existing approaches employ 3D convolutions to obtain representative features. However, these convolution-based approaches struggle to effectively capture long-range dependencies in the volume mitochondria da… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 5 Tables, 2 page references

  22. arXiv:2303.04417  [pdf

    eess.SY cs.NI eess.SP

    An Efficient Game Theory-Based Power Control Algorithm for D2D Communication in 5G Networks

    Authors: Abdu Saif, Kamarul Ariffin bin Noordin, Kaharudin Dimyati, Nor Shahida Mohd Shah, Yousef Ali Al-Gumaei, Qazwan Abdullah, Kamal Ali Alezabi

    Abstract: Device-to-Device (D2D) communication is one of the enabling technologies for 5G networks that support proximity-based service (ProSe) for wireless network communications. This paper proposes a power control algorithm based on the Nash equilibrium and game theory to eliminate the interference between the cellular user device and D2D links. This leads to reliable connectivity with minimal power cons… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Journal ref: KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS 2020

  23. arXiv:2212.03288  [pdf

    cs.NI eess.SP

    Estimation Large- Scale Fading Channels for Transmit Orthogonal Pilot Reuse Sequences in Massive MIMO System

    Authors: Qazwan Abdullah, Nor Shahida Mohd Shah, Shipun Hamzah, Adeb Salh, Mahathir Mohamad, Shahilah Nordin, Maisarah Abu, Mohammed Abdo Albaom, safwan sadeq

    Abstract: Massive multiple-input multiple-output (MIMO) is a critical technology for future fifth-generation (5G) systems. Reduced pilot contamination (PC) enhanced system performance, and reduced inter-cell interference and improved channel estimation. However, because the pilot sequence transmitted by users in a single cell to neighboring cells is not orthogonal, massive MIMO systems are still constrained… ▽ More

    Submitted 20 October, 2022; originally announced December 2022.

  24. arXiv:2211.08390  [pdf

    cs.IT eess.SP

    A New Technique for Improving Energy Efficiency in 5G Mm-wave Hybrid Precoding Systems

    Authors: Adeb Salh, Qazwan Abdullah, Ghasan Hussain, Razlai Ngah, Lukman Audah, Nor Shahida Mohd Shah, Shipun Hamzah

    Abstract: In this article, we present a new approach to optimizing the energy efficiency of the cost-efficiency of quantized hybrid pre-encoding (HP) design. We present effective alternating minimization algorithms (AMA) based on the zero gradient method to produce completely connected structures (CCSs) and partially connected structures (PCSs). Alternative minimization algorithms offer lower complexity by… ▽ More

    Submitted 20 October, 2022; originally announced November 2022.

  25. arXiv:2210.13336  [pdf, other

    eess.IV cs.CV cs.LG

    Brain Tumor Segmentation using Enhanced U-Net Model with Empirical Analysis

    Authors: MD Abdullah Al Nasim, Abdullah Al Munem, Maksuda Islam, Md Aminul Haque Palash, MD. Mahim Anjum Haque, Faisal Muhammad Shah

    Abstract: Cancer of the brain is deadly and requires careful surgical segmentation. The brain tumors were segmented using U-Net using a Convolutional Neural Network (CNN). When looking for overlaps of necrotic, edematous, growing, and healthy tissue, it might be hard to get relevant information from the images. The 2D U-Net network was improved and trained with the BraTS datasets to find these four areas. U… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: 5 tables, 4 figures, 5 equations

  26. arXiv:2208.01637  [pdf, other

    eess.IV cs.CV

    Comparative Analysis of State-of-the-Art Deep Learning Models for Detecting COVID-19 Lung Infection from Chest X-Ray Images

    Authors: Zeba Ghaffar, Pir Masoom Shah, Hikmat Khan, Syed Farhan Alam Zaidi, Abdullah Gani, Izaz Ahmad Khan, Munam Ali Shah, Saif ul Islam

    Abstract: The ongoing COVID-19 pandemic has already taken millions of lives and damaged economies across the globe. Most COVID-19 deaths and economic losses are reported from densely crowded cities. It is comprehensible that the effective control and prevention of epidemic/pandemic infectious diseases is vital. According to WHO, testing and diagnosis is the best strategy to control pandemics. Scientists wor… ▽ More

    Submitted 30 June, 2022; originally announced August 2022.

  27. arXiv:2206.12815  [pdf, other

    eess.IV cs.CV cs.LG

    Breast Cancer Classification using Deep Learned Features Boosted with Handcrafted Features

    Authors: Unaiza Sajid, Rizwan Ahmed Khan, Shahid Munir Shah, Sheeraz Arif

    Abstract: Breast cancer is one of the leading causes of death among women across the globe. It is difficult to treat if detected at advanced stages, however, early detection can significantly increase chances of survival and improves lives of millions of women. Given the widespread prevalence of breast cancer, it is of utmost importance for the research community to come up with the framework for early dete… ▽ More

    Submitted 16 January, 2023; v1 submitted 26 June, 2022; originally announced June 2022.

    Journal ref: Biomedical Signal Processing and Control 2023

  28. arXiv:2205.08242  [pdf, other

    cs.IT eess.SP math.PR

    Outage Analysis of Energy Efficiency in a Finite-Element-IRS Aided Communication System

    Authors: Aaqib Bulla, Shahid M Shah

    Abstract: In this paper, we study the performance of an energy efficient wireless communication system, assisted by a finite-element-intelligent reflecting surface (IRS). With no instantaneous channel state information (CSI) at the transmitter, we characterize the system performance in terms of the outage probability (OP) of energy efficiency (EE). Depending upon the availability of line-of-sight (LOS) path… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 18 Pages, 6 Figures, 2 Tables

  29. arXiv:2204.10846  [pdf, other

    cs.CV eess.IV

    Self-Supervised Video Object Segmentation via Cutout Prediction and Tagging

    Authors: Jyoti Kini, Fahad Shahbaz Khan, Salman Khan, Mubarak Shah

    Abstract: We propose a novel self-supervised Video Object Segmentation (VOS) approach that strives to achieve better object-background discriminability for accurate object segmentation. Distinct from previous self-supervised VOS methods, our approach is based on a discriminative learning loss formulation that takes into account both object and background information to ensure object-background discriminabil… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  30. arXiv:2204.10765  [pdf, other

    cs.CV eess.IV

    Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation

    Authors: Jyoti Kini, Mubarak Shah

    Abstract: Video Instance Segmentation is a fundamental computer vision task that deals with segmenting and tracking object instances across a video sequence. Most existing methods typically accomplish this task by employing a multi-stage top-down approach that usually involves separate networks to detect and segment objects in each frame, followed by associating these detections in consecutive frames using… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  31. arXiv:2204.09909  [pdf, other

    eess.IV cs.CV

    An Efficient End-to-End Deep Neural Network for Interstitial Lung Disease Recognition and Classification

    Authors: Masum Shah Junayed, Afsana Ahsan Jeny, Md Baharul Islam, Ikhtiar Ahmed, A F M Shahen Shah

    Abstract: The automated Interstitial Lung Diseases (ILDs) classification technique is essential for assisting clinicians during the diagnosis process. Detecting and classifying ILDs patterns is a challenging problem. This paper introduces an end-to-end deep convolution neural network (CNN) for classifying ILDs patterns. The proposed model comprises four convolutional layers with different kernel sizes and R… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Turkish Journal of Electrical Engineering and Computer Sciences

  32. arXiv:2111.08606  [pdf

    eess.IV cs.CV

    Advancement of Deep Learning in Pneumonia and Covid-19 Classification and Localization: A Qualitative and Quantitative Analysis

    Authors: Aakash Shah, Manan Shah

    Abstract: Around 450 million people are affected by pneumonia every year which results in 2.5 million deaths. Covid-19 has also affected 181 million people which has lead to 3.92 million casualties. The chances of death in both of these diseases can be significantly reduced if they are diagnosed early. However, the current methods of diagnosing pneumonia (complaints + chest X-ray) and covid-19 (RT-PCR) requ… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 20 pages, 5 figures, 5 tables

    Report number: CDTM-D-21-00047R2

  33. Artificial Intelligence For Breast Cancer Detection: Trends & Directions

    Authors: Shahid Munir Shah, Rizwan Ahmed Khan, Sheeraz Arif, Unaiza Sajid

    Abstract: In the last decade, researchers working in the domain of computer vision and Artificial Intelligence (AI) have beefed up their efforts to come up with the automated framework that not only detects but also identifies stage of breast cancer. The reason for this surge in research activities in this direction are mainly due to advent of robust AI algorithms (deep learning), availability of hardware t… ▽ More

    Submitted 3 October, 2021; originally announced October 2021.

    Journal ref: Computers in Biology and Medicine 2022

  34. arXiv:2108.13149  [pdf

    cs.NI eess.SP

    An Optimization of Fractal Microstrip Patch Antenna with Partial Ground using Genetic Algorithm Method

    Authors: Hamid M. Q. Rasheda, Norshahida Mohd Shah, Abdu Saif, Qazwan Abdullah, Abbas Ugurenver, Abdul Rashid. O. Mumin, Nan Bin Mad Sahar

    Abstract: Ultra-wideband is increasingly advancing as a high data rate wireless technology after the Federal Communication Commission announced the bandwidth of 7.5 GHz (from 3.1 GHz to 10.6 GHz) for ultra-wideband applications. Furthermore, designing a UWB antenna faces more difficulties than designing a narrow band antenna. A suitable UWB antenna should be able to work over the Federal Communication Commi… ▽ More

    Submitted 30 June, 2021; originally announced August 2021.

    Comments: 6pages

  35. arXiv:2108.10076  [pdf

    eess.SP cs.CC

    Development of A Fully Data-Driven Artificial Intelligence and Deep Learning for URLLC Application in 6G Wireless Systems: A Survey

    Authors: Adeeb Salh, Lukman Audah, Qazwan Abdullah, Abdullah Noorsaliza, Nor Shahida Mohd Shah, Jameel Mukred, Shipun Hamzah

    Abstract: The full future of the sixth generation will develop a fully data-driven that provide terabit rate per second, and adopt an average of 1000+ massive number of connections per person in 10 years 2030 virtually instantaneously. Data-driven for ultra-reliable and low latency communication is a new service paradigm provided by a new application of future sixth-generation wireless communication and net… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Report number: 19

  36. arXiv:2106.12628  [pdf, other

    cs.CV eess.IV

    Florida Wildlife Camera Trap Dataset

    Authors: Crystal Gagne, Jyoti Kini, Daniel Smith, Mubarak Shah

    Abstract: Trail camera imagery has increasingly gained popularity amongst biologists for conservation and ecological research. Minimal human interference required to operate camera traps allows capturing unbiased species activities. Several studies - based on human and wildlife interactions, migratory patterns of various species, risk of extinction in endangered populations - are limited by the lack of rich… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition, CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling Workshop, 2021

  37. arXiv:2106.03664  [pdf

    cs.IT eess.SP

    Optimal Transmit Power and Antenna Selection to Achieve Energy Efficient and Low Complexity in fifth generation Massive MIMO Systems

    Authors: Adeeb Salh, Lukman Audah, Nor Shahida Mohd Shah, Qazwan Abdullah, Noorsaliza Abdullah, Jameel Mukred, Shipun Hamzah

    Abstract: This paper investigates joint antenna selection and optimal transmit power in multi cell massive multiple input multiple output systems. The pilot interference and activated transmit antenna selection plays an essential role in maximizing energy efficiency. We derived the closed-form of maximal energy efficiency with complete knowledge of large-scale fading with maximum ratio transmission while ac… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 10Pages

  38. arXiv:2105.10722  [pdf

    cs.IT eess.SP

    Trade-off Energy and Spectral Efficiency in 5G Massive MIMO System

    Authors: Adeb Salh, Nor Shahida Mohd Shah, Lukman Audah, Qazwan Abdullah, Norsaliza Abdullah, Shipun A. Hamzah, Abdu Saif

    Abstract: A massive multiple input multiple-output system is very important to optimize the trade-off energy efficiency and spectral efficiency in fifth-generation cellular networks. The challenges for the next generation depend on increasing the high data traffic in the wireless communication system for both EE and SE. In this paper, the trade off energy efficiency and spectral efficiency based on the firs… ▽ More

    Submitted 22 May, 2021; originally announced May 2021.

    Comments: 6 pages

  39. arXiv:2104.08892  [pdf

    eess.SP

    Internet of Fly Things For Post-Disaster Recovery Based on Multi-environment

    Authors: Abdu Saif, Kaharudin Bin Dimyati, Kamarul Ariffin Bin Noordin, Nor Shahida Mohd Shah, Qazwan Abdullah, Fadhil Mukhlif, Mahathir Mohamad

    Abstract: Natural disasters such as floods and earthquakes immensely impact the telecommunication network infrastructure, leading to the malfunctioning and interruption of wireless services. Consequently, the user devices under the disaster zone are unable to access the cellular base stations. Wireless coverage on an unmanned aerial vehicle (UAV) is considered for providing coverage service to ground user d… ▽ More

    Submitted 8 May, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

  40. arXiv:2104.06037  [pdf

    eess.IV

    Unmanned Aerial Vehicle and Optimal Relay for Extending Coverage in Post-Disaster Scenarios

    Authors: Abdu Saif, Kaharudin Dimyati, Kamarul Ariffin Noordin, Nor Shahida Mohd Shah, Qazwan Abdullah, Mahathir Mohamad, Mahmod Abd Hakim Mohamad, Ahmed M. Al-Saman

    Abstract: The malfunction or interruption of wireless coverage services has been shown to increase the mortality rate during natural disasters. Wireless coverage by an unmanned aerial vehicle (UAV) provides network coverage to ground user devices during and post-disaster events. The relay hops receive wireless coverage and can be forwarded to user devices that are out of coverage allowing reliable connectiv… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

  41. arXiv:2103.12720  [pdf, other

    cs.IT eess.SP

    Secure Energy Efficiency: Power Allocation and Outage Analysis for SWIPT-in-DAS based IoT

    Authors: Aaqib Bulla, Shahid M Shah

    Abstract: In this paper we study secure energy efficiency (SEE) for simultaneous wireless information and power transfer (SWIPT) in a distributed antenna system (DAS) based IoT network. We consider a system in which both legitimate users (Bobs) and eavesdroppers (Eves) have power splitting (PS) receivers to simultaneously decode information and harvest energy from the received signal. When the channel state… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Submitted to a journal for possible publication

  42. arXiv:2103.07931  [pdf

    eess.SP

    Distributed Clustering for User Devices Under Unmanned Aerial Vehicle Coverage Area during Disaster Recovery

    Authors: Abdu Saif, Kaharudin Bin Dimyati, Kamarul Ariffin Bin Noordin, Nor Shahida Mohd. Shah, S. H. Alsamhi, Qazwan Abdullah, Nabil Farah

    Abstract: An Unmanned Aerial Vehicle (UAV) is a promising technology for providing wireless coverage to ground user devices. For all the infrastructure communication networks destroyed in disasters, UAVs battery life is challenging during service delivery in a post-disaster scenario. Therefore, selecting cluster heads among user devices plays a vital role in detecting UAV signals and processing data for imp… ▽ More

    Submitted 14 March, 2021; originally announced March 2021.

    Comments: conference

  43. arXiv:2011.07491  [pdf, other

    cs.CV cs.LG eess.IV

    Anomaly Detection in Video via Self-Supervised and Multi-Task Learning

    Authors: Mariana-Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah

    Abstract: Anomaly detection in video is a challenging computer vision problem. Due to the lack of anomalous events at training time, anomaly detection requires the design of learning methods without full supervision. In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level. We first utilize a pre-trained detector to detect objects. The… ▽ More

    Submitted 10 September, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted at CVPR 2021. Main paper and supplementary are both included

  44. Federated Learning for Breast Density Classification: A Real-World Implementation

    Authors: Holger R. Roth, Ken Chang, Praveer Singh, Nir Neumark, Wenqi Li, Vikash Gupta, Sharut Gupta, Liangqiong Qu, Alvin Ihsani, Bernardo C. Bizzo, Yuhong Wen, Varun Buch, Meesam Shah, Felipe Kitamura, Matheus Mendonça, Vitor Lavor, Ahmed Harouni, Colin Compas, Jesse Tetreault, Prerna Dogra, Yan Cheng, Selnur Erdal, Richard White, Behrooz Hashemian, Thomas Schultz , et al. (18 additional authors not shown)

    Abstract: Building robust deep learning-based models requires large quantities of diverse training data. In this study, we investigate the use of federated learning (FL) to build medical imaging classification models in a real-world collaborative setting. Seven clinical institutions from across the world joined this FL effort to train a model for breast density classification based on Breast Imaging, Report… ▽ More

    Submitted 20 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: Accepted at the 1st MICCAI Workshop on "Distributed And Collaborative Learning"; add citation to Fig. 1 & 2 and update Fig. 5; fix typo in affiliations

    Journal ref: In: Albarqouni S. et al. (eds) Domain Adaptation and Representation Transfer, and Distributed and Collaborative Learning. DART 2020, DCL 2020. Lecture Notes in Computer Science, vol 12444. Springer, Cham

  45. A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video

    Authors: Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah

    Abstract: Abnormal event detection in video is a complex computer vision problem that has attracted significant attention in recent years. The complexity of the task arises from the commonly-adopted definition of an abnormal event, that is, a rarely occurring event that typically depends on the surrounding context. Following the standard formulation of abnormal event detection as outlier detection, we propo… ▽ More

    Submitted 6 April, 2023; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Accepted in IEEE Transactions on Pattern Analysis and Machine Intelligence

  46. arXiv:2008.09180  [pdf, other

    eess.IV cs.CV cs.IT

    Conditional Entropy Coding for Efficient Video Compression

    Authors: Jerry Liu, Shenlong Wang, Wei-Chiu Ma, Meet Shah, Rui Hu, Pranaab Dhawan, Raquel Urtasun

    Abstract: We propose a very simple and efficient video compression framework that only focuses on modeling the conditional entropy between frames. Unlike prior learning-based approaches, we reduce complexity by not performing any form of explicit transformations between frames and assume each frame is encoded with an independent state-of-the-art deep image compressor. We first show that a simple architectur… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: ECCV 2020

  47. arXiv:2008.00634  [pdf, other

    cs.CV eess.IV

    Deep Photo Cropper and Enhancer

    Authors: Aaron Ott, Amir Mazaheri, Niels D. Lobo, Mubarak Shah

    Abstract: This paper introduces a new type of image enhancement problem. Compared to traditional image enhancement methods, which mostly deal with pixel-wise modifications of a given photo, our proposed task is to crop an image which is embedded within a photo and enhance the quality of the cropped image. We split our proposed approach into two deep networks: deep photo cropper and deep image enhancer. In t… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

  48. arXiv:2007.07355  [pdf, other

    cs.CV eess.IV

    TinyVIRAT: Low-resolution Video Action Recognition

    Authors: Ugur Demir, Yogesh S Rawat, Mubarak Shah

    Abstract: The existing research in action recognition is mostly focused on high-quality videos where the action is distinctly visible. In real-world surveillance environments, the actions in videos are captured at a wide range of resolutions. Most activities occur at a distance with a small resolution and recognizing such activities is a challenging problem. In this work, we focus on recognizing tiny action… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  49. arXiv:2005.03804  [pdf, other

    cs.CV eess.IV

    Text Synopsis Generation for Egocentric Videos

    Authors: Aidean Sharghi, Niels da Vitoria Lobo, Mubarak Shah

    Abstract: Mass utilization of body-worn cameras has led to a huge corpus of available egocentric video. Existing video summarization algorithms can accelerate browsing such videos by selecting (visually) interesting shots from them. Nonetheless, since the system user still has to watch the summary videos, browsing large video databases remain a challenge. Hence, in this work, we propose to generate a textua… ▽ More

    Submitted 21 September, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: ICPR 2020

  50. arXiv:2004.11475  [pdf, other

    cs.CV eess.IV

    Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos

    Authors: Mamshad Nayeem Rizve, Ugur Demir, Praveen Tirupattur, Aayush Jung Rana, Kevin Duarte, Ishan Dave, Yogesh Singh Rawat, Mubarak Shah

    Abstract: Activity detection in security videos is a difficult problem due to multiple factors such as large field of view, presence of multiple activities, varying scales and viewpoints, and its untrimmed nature. The existing research in activity detection is mainly focused on datasets, such as UCF-101, JHMDB, THUMOS, and AVA, which partially address these issues. The requirement of processing the security… ▽ More

    Submitted 19 May, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: 9 pages