Showing 1–7 of 7 results for author: Khurana, T

  1. arXiv:2404.11554  [pdf, other]

    cs.CV

    Predicting Long-horizon Futures by Conditioning on Geometry and Time

    Authors: Tarasha Khurana, Deva Ramanan

    Abstract: Our work explores the task of generating future sensor observations conditioned on the past. We are motivated by 'predictive coding' concepts from neuroscience as well as robotic applications such as self-driving vehicles. Predictive video modeling is challenging because the future may be multi-modal and learning at scale remains computationally expensive for video processing. To address both chal…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Project page: http://www.cs.cmu.edu/~tkhurana/depthforecasting/

  2. arXiv:2312.12433  [pdf, other]

    cs.CV cs.AI cs.LG

    TAO-Amodal: A Benchmark for Tracking Any Object Amodally

    Authors: Cheng-Yen Hsieh, Kaihua Chen, Achal Dave, Tarasha Khurana, Deva Ramanan

    Abstract: Amodal perception, the ability to comprehend complete object structures from partial visibility, is a fundamental skill, even for infants. Its significance extends to applications like autonomous driving, where a clear understanding of heavily occluded objects is essential. However, modern detection and tracking algorithms often overlook this critical capability, perhaps due to the prevalence of \…

    Submitted 2 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Project Page: https://tao-amodal.github.io

  3. arXiv:2302.13130  [pdf, other]

    cs.CV eess.SP

    Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting

    Authors: Tarasha Khurana, Peiyun Hu, David Held, Deva Ramanan

    Abstract: Predicting how the world can evolve in the future is crucial for motion planning in autonomous systems. Classical methods are limited because they rely on costly human annotations in the form of semantic class labels, bounding boxes, and tracks or HD maps of cities to plan their motion and thus are difficult to scale to large unlabeled datasets. One promising self-supervised task is 3D point cloud…

    Submitted 30 April, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

    Comments: CVPR 2023. Project page: https://www.cs.cmu.edu/~tkhurana/ff4d/index.html. Code: https://github.com/tarashakhurana/4d-occ-forecasting

  4. arXiv:2210.01917  [pdf, other]

    cs.CV cs.RO

    Differentiable Raycasting for Self-supervised Occupancy Forecasting

    Authors: Tarasha Khurana, Peiyun Hu, Achal Dave, Jason Ziglar, David Held, Deva Ramanan

    Abstract: Motion planning for safe autonomous driving requires learning how the environment around an ego-vehicle evolves with time. Ego-centric perception of driveable regions in a scene not only changes with the motion of actors in the environment, but also with the movement of the ego-vehicle itself. Self-supervised representations proposed for large-scale planning, such as ego-centric freespace, confoun…

    Submitted 18 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: ECCV 2022. Code available at https://github.com/tarashakhurana/emergent-occ-forecasting

  5. arXiv:2209.12118  [pdf, other]

    cs.CV

    BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video

    Authors: Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan

    Abstract: Multiple existing benchmarks involve tracking and segmenting objects in video e.g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate benchmark datasets and metrics (e.g. J&F, mAP, sMOTSA). As a result, published works usually target a particular benchmark, and are not easily comparable to eac…

    Submitted 22 November, 2022; v1 submitted 24 September, 2022; originally announced September 2022.

  6. arXiv:2012.08419  [pdf, other]

    cs.CV

    Detecting Invisible People

    Authors: Tarasha Khurana, Achal Dave, Deva Ramanan

    Abstract: Monocular object detection and tracking have improved drastically in recent years, but rely on a key assumption: that objects are visible to the camera. Many offline tracking approaches reason about occluded objects post-hoc, by linking together tracklets after the object re-appears, making use of reidentification (ReID). However, online tracking in embodied robotic agents (such as a self-driving…

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: Project page: http://www.cs.cmu.edu/~tkhurana/invisible.htm

  7. arXiv:2005.10356  [pdf, other]

    cs.CV

    TAO: A Large-Scale Benchmark for Tracking Any Object

    Authors: Achal Dave, Tarasha Khurana, Pavel Tokmakov, Cordelia Schmid, Deva Ramanan

    Abstract: For many years, multi-object tracking benchmarks have focused on a handful of categories. Motivated primarily by surveillance and self-driving applications, these datasets provide tracks for people, vehicles, and animals, ignoring the vast majority of objects in the world. By contrast, in the related field of object detection, the introduction of large-scale, diverse datasets (e.g., COCO) have fos…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: Project page: http://taodataset.org/