Showing 1–3 of 3 results for author: Chidananda, P

  1. arXiv:2312.14115  [pdf, other]

    cs.RO cs.AI cs.CV

    LingoQA: Video Question Answering for Autonomous Driving

    Authors: Ana-Maria Marcu, Long Chen, Jan Hünermann, Alice Karnsund, Benoit Hanotte, Prajwal Chidananda, Saurabh Nair, Vijay Badrinarayanan, Alex Kendall, Jamie Shotton, Elahe Arani, Oleg Sinavski

    Abstract: Autonomous driving has long faced a challenge with public acceptance due to the lack of explainability in the decision-making process. Video question-answering (QA) in natural language provides the opportunity for bridging this gap. Nonetheless, evaluating the performance of Video QA models has proved particularly tough due to the absence of comprehensive benchmarks. To fill this gap, we introduce…

    Submitted 19 March, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Benchmark and dataset are available at https://github.com/wayveai/LingoQA/

  2. arXiv:2209.03910  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    PixTrack: Precise 6DoF Object Pose Tracking using NeRF Templates and Feature-metric Alignment

    Authors: Prajwal Chidananda, Saurabh Nair, Douglas Lee, Adrian Kaehler

    Abstract: We present PixTrack, a vision based object pose tracking framework using novel view synthesis and deep feature-metric alignment. We follow an SfM-based relocalization paradigm where we use a Neural Radiance Field to canonically represent the tracked object. Our evaluations demonstrate that our method produces highly accurate, robust, and jitter-free 6DoF pose estimates of objects in both monocular…

    Submitted 14 February, 2024; v1 submitted 8 September, 2022; originally announced September 2022.

  3. arXiv:1909.05897  [pdf, other]

    cs.CV cs.HC

    Efficient 2.5D Hand Pose Estimation via Auxiliary Multi-Task Training for Embedded Devices

    Authors: Prajwal Chidananda, Ayan Sinha, Adithya Rao, Douglas Lee, Andrew Rabinovich

    Abstract: 2D Key-point estimation is an important precursor to 3D pose estimation problems for human body and hands. In this work, we discuss the data, architecture, and training procedure necessary to deploy extremely efficient 2.5D hand pose estimation on embedded devices with highly constrained memory and compute envelope, such as AR/VR wearables. Our 2.5D hand pose estimation consists of 2D key-point es…

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: CVPR Workshop on Computer Vision for Augmented and Virtual Reality, Long Beach, CA, 2019