Skip to main content

Showing 1–12 of 12 results for author: Chawla, H

Searching in archive cs. Search in all archives.
.
  1. Transformers in Unsupervised Structure-from-Motion

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Transformers have revolutionized deep learning based computer vision with improved performance as well as robustness to natural corruptions and adversarial attacks. Transformers are used predominantly for 2D vision tasks, including image classification, semantic segmentation, and object detection. However, robots and advanced driver assistance systems also require 3D scene understanding for decisi… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: International Joint Conference on Computer Vision, Imaging and Computer Graphics. Cham: Springer Nature Switzerland, 2022. Published at "Communications in Computer and Information Science, vol 1815. Springer Nature". arXiv admin note: text overlap with arXiv:2202.03131

  2. arXiv:2311.02393  [pdf, other

    cs.CV cs.AI

    Continual Learning of Unsupervised Monocular Depth from Videos

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Spatial scene understanding, including monocular depth estimation, is an important problem in various applications, such as robotics and autonomous driving. While improvements in unsupervised monocular depth estimation have potentially allowed models to be trained on diverse crowdsourced videos, this remains underexplored as most methods utilize the standard training protocol, wherein the models a… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  3. arXiv:2210.03570  [pdf

    cs.CV

    AI-Driven Road Maintenance Inspection v2: Reducing Data Dependency & Quantifying Road Damage

    Authors: Haris Iqbal, Hemang Chawla, Arnav Varma, Terence Brouns, Ahmed Badar, Elahe Arani, Bahram Zonooz

    Abstract: Road infrastructure maintenance inspection is typically a labor-intensive and critical task to ensure the safety of all road users. Existing state-of-the-art techniques in Artificial Intelligence (AI) for object detection and segmentation help automate a huge chunk of this task given adequate annotated data. However, annotating videos from scratch is cost-prohibitive. For instance, it can take an… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: Accepted at IRF Global R2T Conference & Exhibition 2022

  4. arXiv:2210.02357  [pdf, other

    cs.CV

    Image Masking for Robust Self-Supervised Monocular Depth Estimation

    Authors: Hemang Chawla, Kishaan Jeeveswaran, Elahe Arani, Bahram Zonooz

    Abstract: Self-supervised monocular depth estimation is a salient task for 3D scene understanding. Learned jointly with monocular ego-motion estimation, several methods have been proposed to predict accurate pixel-wise depth without using labeled data. Nevertheless, these methods focus on improving performance under ideal conditions without natural or digital corruptions. The general absence of occlusions i… ▽ More

    Submitted 1 February, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at 2023 IEEE International Conference on Robotics and Automation (ICRA)

  5. Adversarial Attacks on Monocular Pose Estimation

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Advances in deep learning have resulted in steady progress in computer vision with improved accuracy on tasks such as object detection and semantic segmentation. Nevertheless, deep neural networks are vulnerable to adversarial attacks, thus presenting a challenge in reliable deployment. Two of the prominent tasks in 3D scene-understanding for robotics and advanced drive assistance systems are mono… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  6. Transformers in Self-Supervised Monocular Depth Estimation with Unknown Camera Intrinsics

    Authors: Arnav Varma, Hemang Chawla, Bahram Zonooz, Elahe Arani

    Abstract: The advent of autonomous driving and advanced driver assistance systems necessitates continuous developments in computer vision for 3D scene understanding. Self-supervised monocular depth estimation, a method for pixel-wise distance estimation of objects from a single camera without the use of ground truth labels, is an important task in 3D scene understanding. However, existing methods for this t… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: Published in 17th International Conference on Computer Vision Theory and Applications (VISAP, 2022)

  7. Attack of the Knights: A Non Uniform Cache Side-Channel Attack

    Authors: Farabi Mahmud, Sungkeun Kim, Harpreet Singh Chawla, Chia-Che Tsai, Eun Jung Kim, Abdullah Muzahid

    Abstract: For a distributed last-level cache (LLC) in a large multicore chip, the access time to one LLC bank can significantly differ from that to another due to the difference in physical distance. In this paper, we successfully demonstrated a new distance-based side-channel attack by timing the AES decryption operation and extracting part of an AES secret key on an Intel Knights Landing CPU. We introduce… ▽ More

    Submitted 31 May, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Journal ref: Annual Computer Security Applications Conference ACSAC 2023

  8. Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation

    Authors: Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz

    Abstract: Dense depth estimation is essential to scene-understanding for autonomous driving. However, recent self-supervised approaches on monocular videos suffer from scale-inconsistency across long sequences. Utilizing data from the ubiquitously copresent global positioning systems (GPS), we tackle this challenge by proposing a dynamically-weighted GPS-to-Scale (g2s) loss to complement the appearance-base… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: Accepted at 2021 IEEE International Conference on Robotics and Automation (ICRA)

  9. Practical Auto-Calibration for Spatial Scene-Understanding from Crowdsourced Dashcamera Videos

    Authors: Hemang Chawla, Matti Jukola, Shabbir Marzban, Elahe Arani, Bahram Zonooz

    Abstract: Spatial scene-understanding, including dense depth and ego-motion estimation, is an important problem in computer vision for autonomous vehicles and advanced driver assistance systems. Thus, it is beneficial to design perception modules that can utilize crowdsourced videos collected from arbitrary vehicular onboard or dashboard cameras. However, the intrinsic parameters corresponding to such camer… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: Accepted at 16th International Conference on Computer Vision Theory and Applications (VISAP, 2021)

  10. Crowdsourced 3D Map**: A Combined Multi-View Geometry and Self-Supervised Learning Approach

    Authors: Hemang Chawla, Matti Jukola, Terence Brouns, Elahe Arani, Bahram Zonooz

    Abstract: The ability to efficiently utilize crowdsourced visual data carries immense potential for the domains of large scale dynamic map** and autonomous driving. However, state-of-the-art methods for crowdsourced 3D map** assume prior knowledge of camera intrinsics. In this work, we propose a framework that estimates the 3D positions of semantically meaningful landmarks such as traffic signs without… ▽ More

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: Accepted at 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  11. Monocular Vision based Crowdsourced 3D Traffic Sign Positioning with Unknown Camera Intrinsics and Distortion Coefficients

    Authors: Hemang Chawla, Matti Jukola, Elahe Arani, Bahram Zonooz

    Abstract: Autonomous vehicles and driver assistance systems utilize maps of 3D semantic landmarks for improved decision making. However, scaling the map** process as well as regularly updating such maps come with a huge cost. Crowdsourced map** of these landmarks such as traffic sign positions provides an appealing alternative. The state-of-the-art approaches to crowdsourced map** use ground truth cam… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted at 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)

  12. arXiv:1601.01398  [pdf

    cs.NI

    A Proof-of-Concept Device-to-Device Communication Testbed

    Authors: Vibhutesh Kumar Singh, Hardik Chawla, Vivek Ashok Bohara

    Abstract: This paper presents the design and development of proof-of-concept Device-to-Device (D2D) Communication testbed. This testbed also seeks to address the design issues involved in the implementation of a D2D network in a realistic scenario. The performance of this testbed has been validated by emulating a Cellular network consisting of a Base Staion (BTS) and many D2D devices in its proximity. The d… ▽ More

    Submitted 6 January, 2016; originally announced January 2016.

    Comments: 8th International Conference on COMmunication Systems & NETworkS (COMSNETS 2016), Demos & Exhibits Session