Skip to main content

Showing 1–9 of 9 results for author: Samarasekera, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.16979  [pdf, other

    cs.CV cs.LG

    Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement

    Authors: Xingchen Zhao, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi, Han-Pang Chiu, Supun Samarasekera

    Abstract: Deep learning-based solutions for semantic segmentation suffer from significant performance degradation when tested on data with different characteristics than what was used during the training. Adapting the models using annotated data from the new domain is not always practical. Unsupervised Domain Adaptation (UDA) approaches are crucial in deploying these models in the actual operating condition… ▽ More

    Submitted 24 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: WACV 2024

  2. arXiv:2303.17132  [pdf, other

    cs.CV

    C-SFDA: A Curriculum Learning Aided Self-Training Framework for Efficient Source Free Domain Adaptation

    Authors: Nazmul Karim, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi, Han-pang Chiu, Supun Samarasekera, Nazanin Rahnavard

    Abstract: Unsupervised domain adaptation (UDA) approaches focus on adapting models trained on a labeled source domain to an unlabeled target domain. UDA methods have a strong assumption that the source data is accessible during adaptation, which may not be feasible in many real-world scenarios due to privacy concerns and resource constraints of devices. In this regard, source-free domain adaptation (SFDA) e… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  3. Cross-View Visual Geo-Localization for Outdoor Augmented Reality

    Authors: Niluthpol Chowdhury Mithun, Kshitij Minhas, Han-Pang Chiu, Taragay Oskiper, Mikhail Sizintsev, Supun Samarasekera, Rakesh Kumar

    Abstract: Precise estimation of global orientation and location is critical to ensure a compelling outdoor Augmented Reality (AR) experience. We address the problem of geo-pose estimation by cross-view matching of query ground images to a geo-referenced aerial satellite image database. Recently, neural network-based methods have shown state-of-the-art performance in cross-view matching. However, most of the… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: IEEE VR 2023

  4. arXiv:2205.08325  [pdf, other

    cs.CV

    GraphMapper: Efficient Visual Navigation by Scene Graph Generation

    Authors: Zachary Seymour, Niluthpol Chowdhury Mithun, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

    Abstract: Understanding the geometric relationships between objects in a scene is a core capability in enabling both humans and autonomous agents to navigate in new environments. A sparse, unified representation of the scene topology will allow agents to act efficiently to move through their environment, communicate the environment state with others, and utilize the representation for diverse downstream tas… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: ICPR 2022

  5. arXiv:2108.11945  [pdf, other

    cs.RO cs.CL cs.CV

    SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments

    Authors: Muhammad Zubair Irshad, Niluthpol Chowdhury Mithun, Zachary Seymour, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

    Abstract: This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments, which requires an autonomous agent to follow natural language instructions in unseen environments. Existing end-to-end learning-based VLN methods struggle at this task as they focus mostly on utilizing raw visual observations and lack the semantic spatio-temporal reasoning capabili… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: 10 pages, 4 figures

  6. arXiv:2103.11374  [pdf, other

    cs.CV cs.RO

    MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation

    Authors: Zachary Seymour, Kowshik Thopalli, Niluthpol Mithun, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

    Abstract: Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this task; however, they come at a significantly increased computational load. Through this work, we design a novel approach that focuses on performing better or comp… ▽ More

    Submitted 21 March, 2021; originally announced March 2021.

    Comments: 6 pages, 5 figures, accepted at ICRA 2021

  7. RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

    Authors: Niluthpol Chowdhury Mithun, Karan Sikka, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

    Abstract: We study an important, yet largely unexplored problem of large-scale cross-modal visual localization by matching ground RGB images to a geo-referenced aerial LIDAR 3D point cloud (rendered as depth images). Prior works were demonstrated on small datasets and did not lend themselves to scaling up for large-scale applications. To enable large-scale evaluation, we introduce a new dataset containing o… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: ACM Multimedia 2020

  8. arXiv:1812.03402  [pdf, other

    cs.CV

    Semantically-Aware Attentive Neural Embeddings for Image-based Visual Localization

    Authors: Zachary Seymour, Karan Sikka, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

    Abstract: We present an approach that combines appearance and semantic information for 2D image-based localization (2D-VL) across large perceptual changes and time lags. Compared to appearance features, the semantic layout of a scene is generally more invariant to appearance variations. We use this intuition and propose a novel end-to-end deep attention-based framework that utilizes multimodal cues to gener… ▽ More

    Submitted 2 July, 2019; v1 submitted 8 December, 2018; originally announced December 2018.

    Comments: Appearing in BMVC 2019

  9. arXiv:1801.00858  [pdf, other

    cs.CV

    Utilizing Semantic Visual Landmarks for Precise Vehicle Navigation

    Authors: Varun Murali, Han-Pang Chiu, Supun Samarasekera, Rakesh, Kumar

    Abstract: This paper presents a new approach for integrating semantic information for vision-based vehicle navigation. Although vision-based vehicle navigation systems using pre-mapped visual landmarks are capable of achieving submeter level accuracy in large-scale urban environment, a typical error source in this type of systems comes from the presence of visual landmarks or features from temporal objects… ▽ More

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Published at IEEE ITSC 2017