Skip to main content

Showing 1–3 of 3 results for author: Shenoi, A

.
  1. arXiv:2002.08945  [pdf, other

    cs.CV

    Spatiotemporal Relationship Reasoning for Pedestrian Intent Prediction

    Authors: Bingbin Liu, Ehsan Adeli, Zhangjie Cao, Kuan-Hui Lee, Abhijeet Shenoi, Adrien Gaidon, Juan Carlos Niebles

    Abstract: Reasoning over visual data is a desirable capability for robotics and vision-based applications. Such reasoning enables forecasting of the next events or actions in videos. In recent years, various models have been developed based on convolution operations for prediction or forecasting, but they lack the ability to reason over spatiotemporal data and infer the relationships of different objects in… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: Accepted at ICRA 2020 and IEEE Robotics and Automation Letters

  2. arXiv:2002.08397  [pdf, other

    cs.CV cs.RO

    JRMOT: A Real-Time 3D Multi-Object Tracker and a New Large-Scale Dataset

    Authors: Abhijeet Shenoi, Mihir Patel, JunYoung Gwak, Patrick Goebel, Amir Sadeghian, Hamid Rezatofighi, Roberto Martín-Martín, Silvio Savarese

    Abstract: Robots navigating autonomously need to perceive and track the motion of objects and other agents in its surroundings. This information enables planning and executing robust and safe trajectories. To facilitate these processes, the motion should be perceived in 3D Cartesian space. However, most recent multi-object tracking (MOT) research has focused on tracking people and moving objects in 2D RGB v… ▽ More

    Submitted 22 July, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 8 pages, 5 figures, 2 tables; Accepted at IROS 2020

  3. JRDB: A Dataset and Benchmark of Egocentric Robot Visual Perception of Humans in Built Environments

    Authors: Roberto Martín-Martín, Mihir Patel, Hamid Rezatofighi, Abhijeet Shenoi, JunYoung Gwak, Eric Frankel, Amir Sadeghian, Silvio Savarese

    Abstract: We present JRDB, a novel egocentric dataset collected from our social mobile manipulator JackRabbot. The dataset includes 64 minutes of annotated multimodal sensor data including stereo cylindrical 360$^\circ$ RGB video at 15 fps, 3D point clouds from two Velodyne 16 Lidars, line 3D point clouds from two Sick Lidars, audio signal, RGB-D video at 30 fps, 360$^\circ$ spherical image from a fisheye c… ▽ More

    Submitted 24 April, 2021; v1 submitted 25 October, 2019; originally announced October 2019.