Skip to main content

Showing 1–20 of 20 results for author: Bandara, W G C

.
  1. arXiv:2403.06978  [pdf, other

    cs.CV

    Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling

    Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: In this paper, we introduce Attention Prompt Tuning (APT) - a computationally efficient variant of prompt tuning for video-based applications such as action recognition. Prompt tuning approaches involve injecting a set of learnable prompts along with data tokens during fine-tuning while kee** the backbone frozen. This approach greatly reduces the number of learnable parameters compared to full t… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted at 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG'24) Code available at: https://github.com/wgcban/apt 12 pages, 8 figures, 6 tables

  2. arXiv:2312.02151  [pdf, other

    cs.CV cs.AI cs.LG

    Guarding Barlow Twins Against Overfitting with Mixed Samples

    Authors: Wele Gedara Chaminda Bandara, Celso M. De Melo, Vishal M. Patel

    Abstract: Self-supervised Learning (SSL) aims to learn transferable feature representations for downstream applications without relying on labeled data. The Barlow Twins algorithm, renowned for its widespread adoption and straightforward implementation compared to its counterparts like contrastive learning methods, minimizes feature redundancy while maximizing invariance to common corruptions. Optimizing fo… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Code and checkpoints are available at: https://github.com/wgcban/mix-bt.git

  3. arXiv:2303.12790  [pdf, other

    cs.CV

    $CrowdDiff$: Multi-hypothesis Crowd Density Estimation using Diffusion Models

    Authors: Yasiru Ranasinghe, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Crowd counting is a fundamental problem in crowd analysis which is typically accomplished by estimating a crowd density map and summing over the density values. However, this approach suffers from background noise accumulation and loss of density due to the use of broad Gaussian kernels to create the ground truth density maps. This issue can be overcome by narrowing the Gaussian kernel. However, e… ▽ More

    Submitted 4 April, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR'24. The project is available at https://dylran.github.io/crowddiff.github.io

  4. arXiv:2303.09536  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Metric Learning for Unsupervised Remote Sensing Change Detection

    Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Remote Sensing Change Detection (RS-CD) aims to detect relevant changes from Multi-Temporal Remote Sensing Images (MT-RSIs), which aids in various RS applications such as land cover, land use, human development analysis, and disaster response. The performance of existing RS-CD methods is attributed to training on large annotated datasets. Furthermore, most of these models are less transferable in… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Code available at https://github.com/wgcban/Metric-CD

  5. arXiv:2212.00793  [pdf, other

    cs.CV

    Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models

    Authors: Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Generating photos satisfying multiple constraints find broad utility in the content creation industry. A key hurdle to accomplishing this task is the need for paired data consisting of all modalities (i.e., constraints) and their corresponding output. Moreover, existing methods need retraining using paired data across all modalities to introduce a new condition. This paper proposes a solution to t… ▽ More

    Submitted 20 April, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Accepted at CVPR 2023

  6. arXiv:2211.09120  [pdf, other

    cs.CV cs.AI

    AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders

    Authors: Wele Gedara Chaminda Bandara, Naman Patel, Ali Gholami, Mehdi Nikkhah, Motilal Agrawal, Vishal M. Patel

    Abstract: Masked Autoencoders (MAEs) learn generalizable representations for image, text, audio, video, etc., by reconstructing masked input data from tokens of the visible data. Current MAE approaches for videos rely on random patch, tube, or frame-based masking strategies to select these tokens. This paper proposes AdaMAE, an adaptive masking strategy for MAEs that is end-to-end trainable. Our adaptive ma… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Code available at: https://github.com/wgcban/adamae

  7. arXiv:2206.11892  [pdf, other

    cs.CV cs.LG

    DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection

    Authors: Wele Gedara Chaminda Bandara, Nithin Gopalakrishnan Nair, Vishal M. Patel

    Abstract: Remote sensing change detection is crucial for understanding the dynamics of our planet's surface, facilitating the monitoring of environmental changes, evaluating human impact, predicting future trends, and supporting decision-making. In this work, we introduce a novel approach for change detection that can leverage off-the-shelf, unlabeled remote sensing images in the training process by pre-tra… ▽ More

    Submitted 12 January, 2024; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: Code available at: https://github.com/wgcban/ddpm-cd

  8. arXiv:2206.08481  [pdf, other

    eess.IV cs.CV

    Orientation-guided Graph Convolutional Network for Bone Surface Segmentation

    Authors: Aimon Rahman, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel

    Abstract: Due to imaging artifacts and low signal-to-noise ratio in ultrasound images, automatic bone surface segmentation networks often produce fragmented predictions that can hinder the success of ultrasound-guided computer-assisted surgical procedures. Existing pixel-wise predictions often fail to capture the accurate topology of bone tissues due to a lack of supervision to enforce connectivity. In this… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted at MICCAI 2022

  9. arXiv:2206.05039  [pdf, other

    cs.CV

    Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models

    Authors: Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M Patel

    Abstract: Image synthesis under multi-modal priors is a useful and challenging task that has received increasing attention in recent years. A major challenge in using generative models to accomplish this task is the lack of paired data containing all modalities (i.e. priors) and corresponding outputs. In recent work, a variational auto-encoder (VAE) model was trained in a weakly supervised manner to address… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  10. SAR Despeckling using a Denoising Diffusion Probabilistic Model

    Authors: Malsha V. Perera, Nithin Gopalakrishnan Nair, Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Speckle is a multiplicative noise which affects all coherent imaging modalities including Synthetic Aperture Radar (SAR) images. The presence of speckle degrades the image quality and adversely affects the performance of SAR image understanding applications such as automatic target recognition and change detection. Thus, SAR despeckling is an important problem in remote sensing. In this paper, we… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Our code is available at https://github.com/malshaV/SAR_DDPM

  11. arXiv:2205.15906  [pdf, ps, other

    cs.CV eess.IV

    SAR Despeckling Using Overcomplete Convolutional Networks

    Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Synthetic Aperture Radar (SAR) despeckling is an important problem in remote sensing as speckle degrades SAR images, affecting downstream tasks like detection and segmentation. Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods. Traditional CNNs try to increase the receptive field size as the network goes deeper, thus extracting global features. H… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at https://github.com/malshaV/sar_overcomplete

  12. arXiv:2204.08454  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Revisiting Consistency Regularization for Semi-supervised Change Detection in Remote Sensing Images

    Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Remote-sensing (RS) Change Detection (CD) aims to detect "changes of interest" from co-registered bi-temporal images. The performance of existing deep supervised CD methods is attributed to the large amounts of annotated data used to train the networks. However, annotating large amounts of remote sensing images is labor-intensive and expensive, particularly with bi-temporal images, as it requires… ▽ More

    Submitted 21 April, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: Code available at https://github.com/wgcban/SemiCD 36 pages

  13. arXiv:2203.02503  [pdf, other

    cs.CV eess.IV

    HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening

    Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: Pansharpening aims to fuse a registered high-resolution panchromatic image (PAN) with a low-resolution hyperspectral image (LR-HSI) to generate an enhanced HSI with high spectral and spatial resolution. Existing pansharpening approaches neglect using an attention mechanism to transfer HR texture features from PAN to LR-HSI features, resulting in spatial and spectral distortions. In this paper, we… ▽ More

    Submitted 28 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR'22. Project page: https://www.wgcban.com/research#h.ar24vwqlm021 Code available at: https://github.com/wgcban/HyperTransformer

  14. arXiv:2201.09355  [pdf, ps, other

    cs.CV eess.IV

    Transformer-based SAR Image Despeckling

    Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Synthetic Aperture Radar (SAR) images are usually degraded by a multiplicative noise known as speckle which makes processing and interpretation of SAR images difficult. In this paper, we introduce a transformer-based network for SAR image despeckling. The proposed despeckling network comprises of a transformer-based encoder which allows the network to learn global dependencies between different im… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: Submitted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at https://github.com/malshaV/sar_transformer

  15. arXiv:2201.01293  [pdf, other

    cs.CV

    A Transformer-Based Siamese Network for Change Detection

    Authors: Wele Gedara Chaminda Bandara, Vishal M. Patel

    Abstract: This paper presents a transformer-based Siamese network architecture (abbreviated by ChangeFormer) for Change Detection (CD) from a pair of co-registered remote sensing images. Different from recent CD frameworks, which are based on fully convolutional networks (ConvNets), the proposed method unifies hierarchically structured transformer encoder with Multi-Layer Perception (MLP) decoder in a Siame… ▽ More

    Submitted 1 September, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

    Comments: Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. 4 pages, 2 figures. Code & trained models are available at https://github.com/wgcban/ChangeFormer

  16. arXiv:2109.07701  [pdf, other

    cs.CV cs.LG cs.RO

    SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving

    Authors: Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Road extraction is an essential step in building autonomous navigation systems. Detecting road segments is challenging as they are of varying widths, bifurcated throughout the image, and are often occluded by terrain, cloud, or other weather conditions. Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Code available at: https://github.com/wgcban/SPIN_RoadMapper.git

    Journal ref: IEEE Conference of Robotics and Automation (ICRA) 2022

  17. arXiv:2108.10387  [pdf, ps, other

    eess.SY eess.SP

    A Sensitivity Matrix Approach Using Two-Stage Optimization for Voltage Regulation of LV Networks with High PV Penetration

    Authors: A. S. Jameel Hassan, Umar Marikkar, G. W. Kasun Prabhath, Aranee Balachandran, W. G. Chaminda Bandara, Parakrama B. Ekanayake, Roshan I. Godaliyadda, Janaka B. Ekanayake

    Abstract: The occurrence of voltage violations are a major deterrent for absorbing more roof-top solar power to smart Low Voltage Distribution Grids (LVDG). Recent studies have focused on decentralized control methods to solve this problem due to the high computational time in performing load flows in centralized control techniques. To address this issue a novel sensitivity matrix is developed to estimate v… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  18. arXiv:2107.02630  [pdf, other

    cs.CV cs.LG eess.IV

    Hyperspectral Pansharpening Based on Improved Deep Image Prior and Residual Reconstruction

    Authors: Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Hyperspectral pansharpening aims to synthesize a low-resolution hyperspectral image (LR-HSI) with a registered panchromatic image (PAN) to generate an enhanced HSI with high spectral and spatial resolution. Recently proposed HS pansharpening methods have obtained remarkable results using deep convolutional networks (ConvNets), which typically consist of three steps: (1) up-sampling the LR-HSI, (2)… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  19. arXiv:2011.14644  [pdf

    eess.SP eess.IV

    Quantitative Assessment of Adulteration and Reuse of Coconut Oil Using Transmittance Multispectral Imaging

    Authors: S. Herath, H. K. Weerasooriya, D. Y. L. Ranasinghe, W. G. C. Bandara, H. M. V. R. Herath, G. M. R. I. Godaliyadda, M. P. B. Ekanayake, Terrence Madhujith

    Abstract: Coconut oil known for its wide range of uses is often adulterated with other edible oils. Repeated use of coconut oil in food preparation could lead to many health issues. Existing methods available for evaluating quality of oil are laborious and time consuming. Therefore, we propose an imaging system hardware and image processing-based algorithm to estimate the adulteration of coconut oil with pa… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

    Comments: 10 pages, 11 figures, journal

  20. Coordinated PV re-phasing: a novel method to maximize renewable energy integration in LV networks by mitigating network unbalances

    Authors: W. G. Chaminda Bandara, G. M. R. I. Godaliyadda, M. P. B. Ekanayake, J. B. Ekanayake

    Abstract: As combating climate change has become a top priority and as many countries are taking steps to make their power generation sustainable, there is a marked increase in the use of renewable energy sources (RESs) for electricity generation. Among these RESs, solar photovoltaics (PV) is one of the most popular sources of energy connected to LV distribution networks. With the greater integration of sol… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.