Skip to main content

Showing 1–50 of 66 results for author: Asif, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08443  [pdf, other

    cs.CV cs.LG

    Transformation-Dependent Adversarial Attacks

    Authors: Yaoteng Tan, Zikui Cai, M. Salman Asif

    Abstract: We introduce transformation-dependent adversarial attacks, a new class of threats where a single additive perturbation can trigger diverse, controllable mis-predictions by systematically transforming the input (e.g., scaling, blurring, compression). Unlike traditional attacks with static effects, our perturbations embed metamorphic properties to enable different adversarial attacks as a function o… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.02575  [pdf, other

    cs.CL cs.CR cs.LG

    Cross-Modal Safety Alignment: Is textual unlearning all you need?

    Authors: Trishna Chakraborty, Erfan Shayegani, Zikui Cai, Nael Abu-Ghazaleh, M. Salman Asif, Yue Dong, Amit K. Roy-Chowdhury, Chengyu Song

    Abstract: Recent studies reveal that integrating new modalities into Large Language Models (LLMs), such as Vision-Language Models (VLMs), creates a new attack surface that bypasses existing safety training techniques like Supervised Fine-tuning (SFT) and Reinforcement Learning with Human Feedback (RLHF). While further SFT and RLHF-based safety training can be conducted in multi-modal settings, collecting mu… ▽ More

    Submitted 27 May, 2024; originally announced June 2024.

  3. arXiv:2404.08921  [pdf, other

    cs.CV

    PNeRV: Enhancing Spatial Consistency via Pyramidal Neural Representation for Videos

    Authors: Qi Zhao, M. Salman Asif, Zhan Ma

    Abstract: The primary focus of Neural Representation for Videos (NeRV) is to effectively model its spatiotemporal consistency. However, current NeRV systems often face a significant issue of spatial inconsistency, leading to decreased perceptual quality. To address this issue, we introduce the Pyramidal Neural Representation for Videos (PNeRV), which is built on a multi-scale information connection and comp… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  4. Overcoming Distribution Shifts in Plug-and-Play Methods with Test-Time Training

    Authors: Edward P. Chandler, Shirin Shoushtari, Jiaming Liu, M. Salman Asif, Ulugbek S. Kamilov

    Abstract: Plug-and-Play Priors (PnP) is a well-known class of methods for solving inverse problems in computational imaging. PnP methods combine physical forward models with learned prior models specified as image denoisers. A common issue with the learned models is that of a performance drop when there is a distribution shift between the training and testing data. Test-time training (TTT) was recently prop… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Journal ref: 2023 IEEE 9th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP), 2023, pg. 186-190

  5. arXiv:2401.13722  [pdf, other

    cs.HC cs.AI

    Proactive Emotion Tracker: AI-Driven Continuous Mood and Emotion Monitoring

    Authors: Mohammad Asif, Sudhakar Mishra, Ankush Sonker, Sanidhya Gupta, Somesh Kumar Maurya, Uma Shanker Tiwary

    Abstract: This research project aims to tackle the growing mental health challenges in today's digital age. It employs a modified pre-trained BERT model to detect depressive text within social media and users' web browsing data, achieving an impressive 93% test accuracy. Simultaneously, the project aims to incorporate physiological signals from wearable devices, such as smartwatches and EEG sensors, to prov… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  6. arXiv:2401.07892  [pdf, other

    cs.HC

    Deep Fuzzy Framework for Emotion Recognition using EEG Signals and Emotion Representation in Type-2 Fuzzy VAD Space

    Authors: Mohammad Asif, Noman Ali, Sudhakar Mishra, Anushka Dandawate, Uma Shanker Tiwary

    Abstract: Recently, the representation of emotions in the Valence, Arousal and Dominance (VAD) space has drawn enough attention. However, the complex nature of emotions and the subjective biases in self-reported values of VAD make the emotion model too specific to a particular experiment. This study aims to develop a generic model representing emotions using a fuzzy VAD space and improve emotion recognition… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  7. arXiv:2312.16221  [pdf, other

    cs.CV

    STRIDE: Single-video based Temporally Continuous Occlusion Robust 3D Pose Estimation

    Authors: Rohit Lal, Saketh Bachu, Yash Garg, Arindam Dutta, Calvin-Khang Ta, Dripta S. Raychaudhuri, Hannah Dela Cruz, M. Salman Asif, Amit K. Roy-Chowdhury

    Abstract: The capability to accurately estimate 3D human poses is crucial for diverse fields such as action recognition, gait recognition, and virtual/augmented reality. However, a persistent and significant challenge within this field is the accurate prediction of human poses under conditions of severe occlusion. Traditional image-based estimators struggle with heavy occlusions due to a lack of temporal co… ▽ More

    Submitted 13 March, 2024; v1 submitted 24 December, 2023; originally announced December 2023.

  8. arXiv:2312.03864  [pdf, other

    cs.RO

    Geometry Matching for Multi-Embodiment Gras**

    Authors: Maria Attarian, Muhammad Adil Asif, **gzhou Liu, Ruthrash Hari, Animesh Garg, Igor Gilitschenski, Jonathan Tompson

    Abstract: Many existing learning-based gras** approaches concentrate on a single embodiment, provide limited generalization to higher DoF end-effectors and cannot capture a diverse set of grasp modes. We tackle the problem of gras** using multiple embodiments by learning rich geometric representations for both objects and end-effectors using Graph Neural Networks. Our novel method - GeoMatch - applies s… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Journal ref: 7th Annual Conference on Robot Learning, 2023

  9. arXiv:2312.03140  [pdf, other

    cs.LG cs.AI cs.CL cs.DC

    FlexModel: A Framework for Interpretability of Distributed Large Language Models

    Authors: Matthew Choi, Muhammad Adil Asif, John Willes, David Emerson

    Abstract: With the growth of large language models, now incorporating billions of parameters, the hardware prerequisites for their training and deployment have seen a corresponding increase. Although existing tools facilitate model parallelization and distributed training, deeper model interactions, crucial for interpretability and responsible AI techniques, still demand thorough knowledge of distributed co… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 14 pages, 8 figures. To appear at the Socially Responsible Language Modelling Research (SoLaR) Workshop, 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  10. arXiv:2310.06124  [pdf, other

    cs.LG cs.CV

    Factorized Tensor Networks for Multi-Task and Multi-Domain Learning

    Authors: Yash Garg, Nebiyou Yismaw, Rakib Hyder, Ashley Prater-Bennette, M. Salman Asif

    Abstract: Multi-task and multi-domain learning methods seek to learn multiple tasks/domains, jointly or one after another, using a single unified network. The key challenge and opportunity is to exploit shared information across tasks and domains to improve the efficiency of the unified network. The efficiency can be in terms of accuracy, storage cost, computation, or sample complexity. In this paper, we pr… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  11. arXiv:2310.03986  [pdf, other

    cs.CV cs.LG

    Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation

    Authors: Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif

    Abstract: Multimodal learning seeks to utilize data from multiple sources to improve the overall performance of downstream tasks. It is desirable for redundancies in the data to make multimodal systems robust to missing or corrupted observations in some correlated modalities. However, we observe that the performance of several existing multimodal networks significantly deteriorates if one or multiple modali… ▽ More

    Submitted 26 February, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 22 pages, 3 figures, 11 tables

  12. arXiv:2310.00133  [pdf, other

    cs.CV

    Prior Mismatch and Adaptation in PnP-ADMM with a Nonconvex Convergence Analysis

    Authors: Shirin Shoushtari, Jiaming Liu, Edward P. Chandler, M. Salman Asif, Ulugbek S. Kamilov

    Abstract: Plug-and-Play (PnP) priors is a widely-used family of methods for solving imaging inverse problems by integrating physical measurement models with image priors specified using image denoisers. PnP methods have been shown to achieve state-of-the-art performance when the prior is obtained using powerful deep denoisers. Despite extensive work on PnP, the topic of distribution mismatch between the tra… ▽ More

    Submitted 29 September, 2023; originally announced October 2023.

  13. MMSFormer: Multimodal Transformer for Material and Semantic Segmentation

    Authors: Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif

    Abstract: Leveraging information across diverse modalities is known to enhance performance on multimodal segmentation tasks. However, effectively fusing information from different modalities remains challenging due to the unique characteristics of each modality. In this paper, we propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new… ▽ More

    Submitted 7 April, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE Open Journal of Signal Processing. 15 pages, 3 figures, 9 tables

  14. Inter Subject Emotion Recognition Using Spatio-Temporal Features From EEG Signal

    Authors: Mohammad Asif, Diya Srivastava, Aditya Gupta, Uma Shanker Tiwary

    Abstract: Inter-subject or subject-independent emotion recognition has been a challenging task in affective computing. This work is about an easy-to-implement emotion recognition model that classifies emotions from EEG signals subject independently. It is based on the famous EEGNet architecture, which is used in EEG-related BCIs. We used the Dataset on Emotion using Naturalistic Stimuli (DENS) dataset. The… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

    Report number: 2023 27th International Computer Science and Engineering Conference (ICSEC)

  15. PIQI: Perceptual Image Quality Index based on Ensemble of Gaussian Process Regression

    Authors: Nisar Ahmed, Hafiz Muhammad Shahzad Asif, Hassan Khalid

    Abstract: Digital images contain a lot of redundancies, therefore, compression techniques are applied to reduce the image size without loss of reasonable image quality. Same become more prominent in the case of videos which contains image sequences and higher compression ratios are achieved in low throughput networks. Assessment of quality of images in such scenarios has become of particular interest. Subje… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Journal ref: AMultimed Tools Appl 80, 15677 to 15700 (2021)

  16. Deep Ensembling for Perceptual Image Quality Assessment

    Authors: Nisar Ahmed, H. M. Shahzad Asif, Abdul Rauf Bhatti, Atif Khan

    Abstract: Blind image quality assessment is a challenging task particularly due to the unavailability of reference information. Training a deep neural network requires a large amount of training data which is not readily available for image quality. Transfer learning is usually opted to overcome this limitation and different deep architectures are used for this purpose as they learn features differently. Af… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Journal ref: Soft Comput 26, 7601 to 7622 (2022)

  17. arXiv:2304.06544  [pdf, other

    cs.CV

    DNeRV: Modeling Inherent Dynamics via Difference Neural Representation for Videos

    Authors: Qi Zhao, M. Salman Asif, Zhan Ma

    Abstract: Existing implicit neural representation (INR) methods do not fully exploit spatiotemporal redundancies in videos. Index-based INRs ignore the content-specific spatial features and hybrid INRs ignore the contextual dependency on adjacent frames, leading to poor modeling capability for scenes with large motion or dynamics. We analyze this limitation from the perspective of function fitting and revea… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  18. arXiv:2303.14304  [pdf, other

    cs.CV cs.CR cs.LG

    Ensemble-based Blackbox Attacks on Dense Prediction

    Authors: Zikui Cai, Yaoteng Tan, M. Salman Asif

    Abstract: We propose an approach for adversarial attacks on dense prediction models (such as object detectors and segmentation). It is well known that the attacks generated by a single surrogate model do not transfer to arbitrary (blackbox) victim models. Furthermore, targeted attacks are often more challenging than the untargeted attacks. In this paper, we show that a carefully designed ensemble can create… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 Accepted

  19. arXiv:2303.13269  [pdf, other

    cs.CV

    Disguise without Disruption: Utility-Preserving Face De-Identification

    Authors: Zikui Cai, Zhongpai Gao, Benjamin Planche, Meng Zheng, Terrence Chen, M. Salman Asif, Ziyan Wu

    Abstract: With the rise of cameras and smart sensors, humanity generates an exponential amount of data. This valuable information, including underrepresented cases like AI in medical settings, can fuel new deep-learning tools. However, data scientists must prioritize ensuring privacy for individuals in these untapped datasets, especially for images or videos with faces, which are prime targets for identific… ▽ More

    Submitted 18 December, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at AAAI 2024. Paper + supplementary material

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 38(1), 2024

  20. arXiv:2303.06235  [pdf, other

    cs.CV cs.LG

    Compressive Sensing with Tensorized Autoencoder

    Authors: Rakib Hyder, M. Salman Asif

    Abstract: Deep networks can be trained to map images into a low-dimensional latent space. In many cases, different images in a collection are articulated versions of one another; for example, same object with different lighting, background, or pose. Furthermore, in many cases, parts of images can be corrupted by noise or missing entries. In this paper, our goal is to recover images without access to the gro… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Journal ref: ICASSP 2023

  21. arXiv:2301.00321  [pdf, other

    cs.CL

    Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques

    Authors: Muhammad Suleman, Muhammad Asif, Tayyab Zamir, Ayaz Mehmood, Jebran Khan, Nasir Ahmad, Kashif Ahmad

    Abstract: This paper presents our solutions for the MediaEval 2022 task on DisasterMM. The task is composed of two subtasks, namely (i) Relevance Classification of Twitter Posts (RCTP), and (ii) Location Extraction from Twitter Texts (LETT). The RCTP subtask aims at differentiating flood-related and non-relevant social posts while LETT is a Named Entity Recognition (NER) task and aims at the extraction of l… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: 5 pages, 1 figure, and 4 tables

  22. Efficient Visual Computing with Camera RAW Snapshots

    Authors: Zhihao Li, Ming Lu, Xu Zhang, Xin Feng, M. Salman Asif, Zhan Ma

    Abstract: Conventional cameras capture image irradiance on a sensor and convert it to RGB images using an image signal processor (ISP). The images can then be used for photography or visual computing tasks in a variety of applications, such as public safety surveillance and autonomous driving. One can argue that since RAW images contain all the captured information, the conversion of RAW to RGB using an ISP… ▽ More

    Submitted 25 January, 2024; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted by T-PAMI 2024. Homepage: https://njuvision.github.io/rho-vision

  23. arXiv:2211.02637  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    Emotion Recognition With Temporarily Localized 'Emotional Events' in Naturalistic Context

    Authors: Mohammad Asif, Sudhakar Mishra, Majithia Tejas Vinodbhai, Uma Shanker Tiwary

    Abstract: Emotion recognition using EEG signals is an emerging area of research due to its broad applicability in BCI. Emotional feelings are hard to stimulate in the lab. Emotions do not last long, yet they need enough context to be perceived and felt. However, most EEG-related emotion databases either suffer from emotionally irrelevant details (due to prolonged duration stimulus) or have minimal context d… ▽ More

    Submitted 25 October, 2022; originally announced November 2022.

  24. arXiv:2209.12443  [pdf

    cs.CV

    Image Quality Assessment for Foliar Disease Identification (AgroPath)

    Authors: Nisar Ahmed, Hafiz Muhammad Shahzad Asif, Gulshan Saleem, Muhammad Usman Younus

    Abstract: Crop diseases are a major threat to food security and their rapid identification is important to prevent yield loss. Swift identification of these diseases are difficult due to the lack of necessary infrastructure. Recent advances in computer vision and increasing penetration of smartphones have paved the way for smartphone-assisted disease identification. Most of the plant diseases leave particul… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Journal ref: Journal of Agricultural Research 59.2 (2021): 177-186

  25. arXiv:2209.09883  [pdf, other

    cs.CV

    Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial Attacks

    Authors: Abhishek Aich, Shasha Li, Chengyu Song, M. Salman Asif, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury

    Abstract: State-of-the-art generative model-based attacks against image classifiers overwhelmingly focus on single-object (i.e., single dominant object) images. Different from such settings, we tackle a more practical problem of generating adversarial perturbations using multi-object (i.e., multiple dominant objects) images as they are representative of most real-world scenes. Our goal is to design an attac… ▽ More

    Submitted 3 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at WACV 2023 (Round 1), camera-ready version

  26. arXiv:2209.09502  [pdf, other

    cs.CV

    GAMA: Generative Adversarial Multi-Object Scene Attacks

    Authors: Abhishek Aich, Calvin-Khang Ta, Akash Gupta, Chengyu Song, Srikanth V. Krishnamurthy, M. Salman Asif, Amit K. Roy-Chowdhury

    Abstract: The majority of methods for crafting adversarial attacks have focused on scenes with a single dominant object (e.g., images from ImageNet). On the other hand, natural scenes include multiple dominant objects that are semantically related. Thus, it is crucial to explore designing attack strategies that look beyond learning on single-object scenes or attack single-object victim classifiers. Due to t… ▽ More

    Submitted 15 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022; First two authors contributed equally; Includes Supplementary Material

  27. arXiv:2209.04829  [pdf, other

    cs.IT eess.SP

    Energy-Efficient Beamforming and Resource Optimization for AmBSC-Assisted Cooperative NOMA IoT Networks

    Authors: Muhammad Asif, Asim Ihsan, Wali Ullah Khan, Ali Ranjha, Shengli Zhang, Sissi Xiaoxiao Wu

    Abstract: In this manuscript, we present an energy-efficient alternating optimization framework based on the multi-antenna ambient backscatter communication (AmBSC) assisted cooperative non-orthogonal multiple access (NOMA) for next-generation (NG) internet-of-things (IoT) enabled communication networks. Specifically, the energy-efficiency maximization is achieved for the considered AmBSC-enabled multi-clus… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

  28. arXiv:2208.03705  [pdf, other

    cs.IT eess.SP

    Rate Splitting Multiple Access for Next Generation Cognitive Radio Enabled LEO Satellite Networks

    Authors: ali Ullah Khan, Zain Ali, Eva Lagunas, Asad Mahmood, Muhammad Asif, Asim Ihsan, Symeon Chatzinotas, Björn Ottersten, Octavia A. Dobre

    Abstract: This paper proposes a cognitive radio enabled LEO SatCom using RSMA radio access technique with the coexistence of GEO SatCom network. In particular, this work aims to maximize the sum rate of LEO SatCom by simultaneously optimizing the power budget over different beams, RSMA power allocation for users over each beam, and subcarrier user assignment while restricting the interference temperature to… ▽ More

    Submitted 6 February, 2023; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: 32,9. arXiv admin note: substantial text overlap with arXiv:2208.02924

  29. arXiv:2208.03610  [pdf, other

    cs.LG cs.CV

    Blackbox Attacks via Surrogate Ensemble Search

    Authors: Zikui Cai, Chengyu Song, Srikanth Krishnamurthy, Amit Roy-Chowdhury, M. Salman Asif

    Abstract: Blackbox adversarial attacks can be categorized into transfer- and query-based attacks. Transfer methods do not require any feedback from the victim model, but provide lower success rates compared to query-based methods. Query attacks often require a large number of queries for success. To achieve the best of both approaches, recent efforts have tried to combine them, but still require hundreds of… ▽ More

    Submitted 23 November, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

    Comments: Our code is available at https://github.com/CSIPlab/BASES

    Journal ref: NeurIPS 2022

  30. arXiv:2208.02436  [pdf, other

    cs.CV

    H2-Stereo: High-Speed, High-Resolution Stereoscopic Video System

    Authors: Ming Cheng, Yiling Xu, Wang Shen, M. Salman Asif, Chao Ma, Jun Sun, Zhan Ma

    Abstract: High-speed, high-resolution stereoscopic (H2-Stereo) video allows us to perceive dynamic 3D content at fine granularity. The acquisition of H2-Stereo video, however, remains challenging with commodity cameras. Existing spatial super-resolution or temporal frame interpolation methods provide compromised solutions that lack temporal or spatial details, respectively. To alleviate this problem, we pro… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

  31. arXiv:2208.01123  [pdf, other

    cs.IT eess.SP

    Energy-Efficient Backscatter-Assisted Coded Cooperative-NOMA for B5G Wireless Communications

    Authors: Muhammad Asif, Asim Ihsan, Wali Ullah Khan, Ali Ranjha, Shengli Zhang, Sissi Xiaoxiao Wu

    Abstract: In this manuscript, we propose an alternating optimization framework to maximize the energy efficiency of a backscatter-enabled cooperative Non-orthogonal multiple access (NOMA) system by optimizing the transmit power of the source, power allocation coefficients (PAC), and power of the relay node under imperfect successive interference cancellation (SIC) decoding. A three-stage low-complexity ener… ▽ More

    Submitted 20 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: 30, 7

  32. arXiv:2207.09074  [pdf, other

    cs.CV cs.LG

    Incremental Task Learning with Incremental Rank Updates

    Authors: Rakib Hyder, Ken Shao, Boyu Hou, Panos Markopoulos, Ashley Prater-Bennette, M. Salman Asif

    Abstract: Incremental Task learning (ITL) is a category of continual learning that seeks to train a single network for multiple tasks (one after another), where training data for each task is only available during the training of that task. Neural networks tend to forget older tasks when they are trained for the newer tasks; this property is often known as catastrophic forgetting. To address this issue, ITL… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Code will be available at https://github.com/CSIPlab/task-increment-rank-update.git

    Journal ref: ECCV 2022

  33. arXiv:2207.03295  [pdf, other

    cs.IT eess.SP

    Cooperative Backscatter NOMA with Imperfect SIC: Towards Energy Efficient Sum Rate Maximization in Sustainable 6G Networks

    Authors: Manzoor Ahmed, Zain Ali, Wali Ullah Khan, Omer Waqar, Muhammad Asif, Abd Ullah Khan, Muhammad Awais Javed, Fahd N. Al-Wesabi

    Abstract: The combination of backscatter communication with non-orthogonal multiple access (NOMA) has the potential to support low-powered massive connections in upcoming sixth-generation (6G) wireless networks. More specifically, backscatter communication can harvest and use the existing RF signals in the atmosphere for communication, while NOMA provides communication to multiple wireless devices over the… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: 9, 7

  34. arXiv:2204.05172  [pdf, other

    cs.CV

    Event Transformer

    Authors: Bin Jiang, Zhihao Li, M. Salman Asif, Xun Cao, Zhan Ma

    Abstract: The event camera's low power consumption and ability to capture microsecond brightness changes make it attractive for various computer vision tasks. Existing event representation methods typically convert events into frames, voxel grids, or spikes for deep neural networks (DNNs). However, these approaches often sacrifice temporal granularity or require specialized devices for processing. This work… ▽ More

    Submitted 12 June, 2024; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted by ICASSP2024

  35. arXiv:2203.15230  [pdf, other

    cs.CV cs.CR cs.LG

    Zero-Query Transfer Attacks on Context-Aware Object Detectors

    Authors: Zikui Cai, Shantanu Rane, Alejandro E. Brito, Chengyu Song, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury, M. Salman Asif

    Abstract: Adversarial attacks perturb images such that a deep neural network produces incorrect classification results. A promising approach to defend against adversarial attacks on natural multi-object scenes is to impose a context-consistency check, wherein, if the detected objects are not consistent with an appropriately defined context, then an attack is suspected. Stronger attacks are needed to fool su… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 Accepted

  36. arXiv:2203.02026  [pdf, other

    cs.LG

    Provable and Efficient Continual Representation Learning

    Authors: Yingcong Li, Mingchen Li, M. Salman Asif, Samet Oymak

    Abstract: In continual learning (CL), the goal is to design models that can learn a sequence of tasks without catastrophic forgetting. While there is a rich set of techniques for CL, relatively little understanding exists on how representations built by previous tasks benefit new tasks that are added to the network. To address this, we study the problem of continual representation learning (CRL) where we le… ▽ More

    Submitted 7 November, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

  37. arXiv:2201.09493  [pdf, other

    cs.CR eess.SY

    STRIDE-based Cyber Security Threat Modeling for IoT-enabled Precision Agriculture Systems

    Authors: Md. Rashid Al Asif, Khondokar Fida Hasan, Md Zahidul Islam, Rahamatullah Khondoker

    Abstract: The concept of traditional farming is changing rapidly with the introduction of smart technologies like the Internet of Things (IoT). Under the concept of smart agriculture, precision agriculture is gaining popularity to enable Decision Support System (DSS)-based farming management that utilizes widespread IoT sensors and wireless connectivity to enable automated detection and optimization of reso… ▽ More

    Submitted 30 January, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

  38. arXiv:2112.13893  [pdf

    eess.IV cs.CV cs.LG

    Non-Reference Quality Monitoring of Digital Images using Gradient Statistics and Feedforward Neural Networks

    Authors: Nisar Ahmed, Hafiz Muhammad Shahzad Asif, Hassan Khalid

    Abstract: Digital images contain a lot of redundancies, therefore, compressions are applied to reduce the image size without the loss of reasonable image quality. The same become more prominent in the case of videos that contains image sequences and higher compression ratios are achieved in low throughput networks. Assessment of the quality of images in such scenarios becomes of particular interest. Subject… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: Fifth International Conference on Aerospace Science & Engineering (ICASE 2017) (ICASE Proceedings, Page No. 300-305)

    MSC Class: 94A08 ACM Class: I.4.5; I.5.4

  39. arXiv:2112.03223  [pdf, other

    cs.CV cs.AI cs.LG

    Context-Aware Transfer Attacks for Object Detection

    Authors: Zikui Cai, Xinxin Xie, Shasha Li, Mingjun Yin, Chengyu Song, Srikanth V. Krishnamurthy, Amit K. Roy-Chowdhury, M. Salman Asif

    Abstract: Blackbox transfer attacks for image classifiers have been extensively studied in recent years. In contrast, little progress has been made on transfer attacks for object detectors. Object detectors take a holistic view of the image and the detection of one object (or lack thereof) often depends on other objects in the scene. This makes such detectors inherently context-aware and adversarial attacks… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: accepted to AAAI 2022

  40. Coded Illumination for Improved Lensless Imaging

    Authors: Yucheng Zheng, M. Salman Asif

    Abstract: Mask-based lensless cameras can be flat, thin, and light-weight, which makes them suitable for novel designs of computational imaging systems with large surface areas and arbitrary shapes. Despite recent progress in lensless cameras, the quality of images recovered from the lensless cameras is often poor due to the ill-conditioning of the underlying measurement system. In this paper, we propose to… ▽ More

    Submitted 9 January, 2023; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Supplementary material, codes, and data are available at https://github.com/CSIPlab/codedcam

    Journal ref: IEEE Transactions on Computational Imaging, 2023

  41. arXiv:2110.12321  [pdf, other

    cs.CV cs.LG

    ADC: Adversarial attacks against object Detection that evade Context consistency checks

    Authors: Mingjun Yin, Shasha Li, Chengyu Song, M. Salman Asif, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy

    Abstract: Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples, which are slightly perturbed input images which lead DNNs to make wrong predictions. To protect from such examples, various defense strategies have been proposed. A very recent defense strategy for detecting adversarial examples, that has been shown to be robust to current attacks, is to check for intrinsic conte… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: WCAV'22 Acceptted

  42. arXiv:2110.01823  [pdf, other

    cs.CV

    Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations

    Authors: Shasha Li, Abhishek Aich, Shitong Zhu, M. Salman Asif, Chengyu Song, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy

    Abstract: When compared to the image classification models, black-box adversarial attacks against video classification models have been largely understudied. This could be possible because, with video, the temporal dimension poses significant additional challenges in gradient estimation. Query-efficient black-box attacks rely on effectively estimated gradients towards maximizing the probability of misclassi… ▽ More

    Submitted 26 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted at NeurIPS 2021; First two authors contributed equally; Includes Supplementary Material

  43. arXiv:2108.08421  [pdf, other

    cs.CV cs.LG

    Exploiting Multi-Object Relationships for Detecting Adversarial Attacks in Complex Scenes

    Authors: Mingjun Yin, Shasha Li, Zikui Cai, Chengyu Song, M. Salman Asif, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy

    Abstract: Vision systems that deploy Deep Neural Networks (DNNs) are known to be vulnerable to adversarial examples. Recent research has shown that checking the intrinsic consistencies in the input data is a promising way to detect adversarial attacks (e.g., by checking the object co-occurrence relationships in complex scenes). However, existing approaches are tied to specific models and do not offer genera… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: ICCV'21 Accepted

  44. arXiv:2108.07966  [pdf, other

    eess.IV cs.CV

    A Simple Framework for 3D Lensless Imaging with Programmable Masks

    Authors: Yucheng Zheng, Yi Hua, Aswin C. Sankaranarayanan, M. Salman Asif

    Abstract: Lensless cameras provide a framework to build thin imaging systems by replacing the lens in a conventional camera with an amplitude or phase mask near the sensor. Existing methods for lensless imaging can recover the depth and intensity of the scene, but they require solving computationally-expensive inverse problems. Furthermore, existing methods struggle to recover dense scenes with large depth… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: Supplementary material available at https://github.com/CSIPlab/Programmable3Dcam.git

    Journal ref: International Conference on Computer Vision (ICCV) 2021

  45. arXiv:2108.02605  [pdf, other

    cs.CL cs.AI cs.NE

    EENLP: Cross-lingual Eastern European NLP Index

    Authors: Alexey Tikhonov, Alex Malkhasov, Andrey Manoshin, George Dima, Réka Cserháti, Md. Sadek Hossain Asif, Matt Sárdi

    Abstract: Motivated by the sparsity of NLP resources for Eastern European languages, we present a broad index of existing Eastern European language resources (90+ datasets and 45+ models) published as a github repository open for updates from the community. Furthermore, to support the evaluation of commonsense reasoning tasks, we provide hand-crafted cross-lingual datasets for five different semantic tasks… ▽ More

    Submitted 10 May, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted for LREC 2022. 5 pages, 1 figure. Originally EEML 2021 project

    MSC Class: 68T50

  46. arXiv:2106.03668  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Recovery Analysis for Plug-and-Play Priors using the Restricted Eigenvalue Condition

    Authors: Jiaming Liu, M. Salman Asif, Brendt Wohlberg, Ulugbek S. Kamilov

    Abstract: The plug-and-play priors (PnP) and regularization by denoising (RED) methods have become widely used for solving inverse problems by leveraging pre-trained deep denoisers as image priors. While the empirical imaging performance and the theoretical convergence properties of these algorithms have been widely investigated, their recovery properties have not previously been theoretically analyzed. We… ▽ More

    Submitted 26 October, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: 27 pages, 13 figures

  47. arXiv:2105.06371  [pdf, other

    cs.LG stat.ML

    Provably Convergent Algorithms for Solving Inverse Problems Using Generative Models

    Authors: Viraj Shah, Rakib Hyder, M. Salman Asif, Chinmay Hegde

    Abstract: The traditional approach of hand-crafting priors (such as sparsity) for solving inverse problems is slowly being replaced by the use of richer learned priors (such as those modeled by deep generative networks). In this work, we study the algorithmic aspects of such a learning-based approach from a theoretical perspective. For certain generative network architectures, we establish a simple non-conv… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1810.03587, arXiv:1802.08406

  48. arXiv:2102.05755  [pdf

    cs.LG

    Development of Crop Yield Estimation Model using Soil and Environmental Parameters

    Authors: Nisar Ahmed, Hafiz Muhammad Shahzad Asif, Gulshan Saleem, Muhammad Usman Younus

    Abstract: Crop yield is affected by various soil and environmental parameters and can vary significantly. Therefore, a crop yield estimation model which can predict pre-harvest yield is required for food security. The study is conducted on tea forms operating under National Tea Research Institute, Pakistan. The data is recorded on monthly basis for ten years period. The parameters collected are minimum and… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: crop yield forecasting, regression, data mining, artificial neural network, ensemble learning

    Journal ref: Journal of Agricultural Research, 2021

  49. arXiv:2102.04515  [pdf

    cs.CV

    Leaf Image-based Plant Disease Identification using Color and Texture Features

    Authors: Nisar Ahmed, Hafiz Muhammad Shahzad Asif, Gulshan Saleem

    Abstract: Identification of plant disease is usually done through visual inspection or during laboratory examination which causes delays resulting in yield loss by the time identification is complete. On the other hand, complex deep learning models perform the task with reasonable performance but due to their large size and high computational requirements, they are not suited to mobile and handheld devices.… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  50. arXiv:2007.14621  [pdf, other

    eess.IV cs.CV

    Solving Phase Retrieval with a Learned Reference

    Authors: Rakib Hyder, Zikui Cai, M. Salman Asif

    Abstract: Fourier phase retrieval is a classical problem that deals with the recovery of an image from the amplitude measurements of its Fourier coefficients. Conventional methods solve this problem via iterative (alternating) minimization by leveraging some prior knowledge about the structure of the unknown image. The inherent ambiguities about shift and flip in the Fourier measurements make this problem e… ▽ More

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Accepted to ECCV 2020. Code is available at https://github.com/CSIPlab/learnPR_reference