Skip to main content

Showing 1–14 of 14 results for author: Dariush, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.06597  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning

    Authors: Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel Kochenderfer, Chiho Choi, Behzad Dariush

    Abstract: The widespread adoption of commercial autonomous vehicles (AVs) and advanced driver assistance systems (ADAS) may largely depend on their acceptance by society, for which their perceived trustworthiness and interpretability to riders are crucial. In general, this task is challenging because modern autonomous systems software relies heavily on black-box artificial intelligence models. Towards this… ▽ More

    Submitted 8 November, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

  2. arXiv:2203.13309  [pdf, other

    cs.CV

    Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos

    Authors: Reza Ghoddoosian, Isht Dwivedi, Nakul Agarwal, Chiho Choi, Behzad Dariush

    Abstract: This paper addresses a new problem of weakly-supervised online action segmentation in instructional videos. We present a framework to segment streaming videos online at test time using Dynamic Programming and show its advantages over greedy sliding window approach. We improve our framework by introducing the Online-Offline Discrepancy Loss (OODL) to encourage the segmentation results to have a hig… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted CVPR 2022

  3. arXiv:2011.04853  [pdf, other

    cs.CV cs.AI

    Social-STAGE: Spatio-Temporal Multi-Modal Future Trajectory Forecast

    Authors: Srikanth Malla, Chiho Choi, Behzad Dariush

    Abstract: This paper considers the problem of multi-modal future trajectory forecast with ranking. Here, multi-modality and ranking refer to the multiple plausible path predictions and the confidence in those predictions, respectively. We propose Social-STAGE, Social interaction-aware Spatio-Temporal multi-Attention Graph convolution network with novel Evaluation for multi-modality. Our main contributions i… ▽ More

    Submitted 24 March, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: ICRA 2021

  4. arXiv:2010.09211  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation for Spatio-Temporal Action Localization

    Authors: Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang

    Abstract: Spatio-temporal action localization is an important problem in computer vision that involves detecting where and when activities occur, and therefore requires modeling of both spatial and temporal features. This problem is typically formulated in the context of supervised learning, where the learned classifiers operate on the premise that both training and test data are sampled from the same under… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted in BMVC 2020

  5. Recognition and 3D Localization of Pedestrian Actions from Monocular Video

    Authors: Jun Hayakawa, Behzad Dariush

    Abstract: Understanding and predicting pedestrian behavior is an important and challenging area of research for realizing safe and effective navigation strategies in automated and advanced driver assistance technologies in urban scenes. This paper focuses on monocular pedestrian action recognition and 3D localization from an egocentric view for the purpose of predicting intention and forecasting future traj… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

    Journal ref: IEEE Intelligent Transportation Systems Conference (ITSC) 2020

  6. Ego-motion and Surrounding Vehicle State Estimation Using a Monocular Camera

    Authors: Jun Hayakawa, Behzad Dariush

    Abstract: Understanding ego-motion and surrounding vehicle state is essential to enable automated driving and advanced driving assistance technologies. Typical approaches to solve this problem use fusion of multiple sensors such as LiDAR, camera, and radar to recognize surrounding vehicle state, including position, velocity, and orientation. Such sensing modalities are overly complex and costly for producti… ▽ More

    Submitted 5 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Journal ref: 2019 IEEE Intelligent Vehicles Symposium (IV)

  7. arXiv:2004.05846  [pdf, other

    cs.CV cs.LG cs.RO

    SSP: Single Shot Future Trajectory Prediction

    Authors: Isht Dwivedi, Srikanth Malla, Behzad Dariush, Chiho Choi

    Abstract: We propose a robust solution to future trajectory forecast, which can be practically applicable to autonomous agents in highly crowded environments. For this, three aspects are particularly addressed in this paper. First, we use composite fields to predict future locations of all road agents in a single-shot, which results in a constant time complexity, regardless of the number of agents in the sc… ▽ More

    Submitted 8 November, 2020; v1 submitted 13 April, 2020; originally announced April 2020.

    Comments: Accepted at IROS 2020

  8. arXiv:2003.13886  [pdf, other

    cs.CV cs.LG cs.RO

    TITAN: Future Forecast using Action Priors

    Authors: Srikanth Malla, Behzad Dariush, Chiho Choi

    Abstract: We consider the problem of predicting the future trajectory of scene agents from egocentric views obtained from a moving platform. This problem is important in a variety of domains, particularly for autonomous systems making reactive or strategic decisions in navigation. In an attempt to address this problem, we introduce TITAN (Trajectory Inference using Targeted Action priors Network), a new mod… ▽ More

    Submitted 6 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: CVPR 2020 [oral], dataset url: https://usa.honda-ri.com/titan

  9. arXiv:1912.03442  [pdf, other

    cs.CV cs.LG

    Spatio-Temporal Pyramid Graph Convolutions for Human Action Recognition and Postural Assessment

    Authors: Behnoosh Parsa, Athma Narayanan, Behzad Dariush

    Abstract: Recognition of human actions and associated interactions with objects and the environment is an important problem in computer vision due to its potential applications in a variety of domains. The most versatile methods can generalize to various environments and deal with cluttered backgrounds, occlusions, and viewpoint variations. Among them, methods based on graph convolutional networks that extr… ▽ More

    Submitted 7 December, 2019; originally announced December 2019.

  10. arXiv:1910.00628  [pdf, other

    cs.CV cs.LG

    Sensor Fusion: Gated Recurrent Fusion to Learn Driving Behavior from Temporal Multimodal Data

    Authors: Athma Narayanan, Avinash Siravuru, Behzad Dariush

    Abstract: The Tactical Driver Behavior modeling problem requires understanding of driver actions in complicated urban scenarios from a rich multi modal signals including video, LiDAR and CAN bus data streams. However, the majority of deep learning research is focused either on learning the vehicle/environment state (sensor fusion) or the driver policy (from temporal data), but not both. Learning both tasks… ▽ More

    Submitted 21 January, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: Accepted to Robotics and Automation Letters 2020

  11. arXiv:1909.08150  [pdf, other

    cs.CV cs.RO eess.IV

    NEMO: Future Object Localization Using Noisy Ego Priors

    Authors: Srikanth Malla, Isht Dwivedi, Behzad Dariush, Chiho Choi

    Abstract: Predicting the future trajectory of agents from visual observations is an important problem for realization of safe and effective navigation of autonomous systems in dynamic environments. This paper focuses on two important aspects of future trajectory forecast which are particularly relevant for mobile platforms: 1) modeling uncertainty of the predictions, particularly from egocentric views, wher… ▽ More

    Submitted 22 July, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

  12. arXiv:1905.12708  [pdf, other

    cs.CV eess.IV

    Dynamic Traffic Scene Classification with Space-Time Coherence

    Authors: Athma Narayanan, Isht Dwivedi, Behzad Dariush

    Abstract: This paper examines the problem of dynamic traffic scene classification under space-time variations in viewpoint that arise from video captured on-board a moving vehicle. Solutions to this problem are important for realization of effective driving assistance technologies required to interpret or predict road user behavior. Currently, dynamic traffic scene classification has not been adequately add… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: accpeted in (International Conference on Robotics and Automation)ICRA 2019

  13. arXiv:1905.08855  [pdf, other

    cs.CV

    Looking to Relations for Future Trajectory Forecast

    Authors: Chiho Choi, Behzad Dariush

    Abstract: Inferring relational behavior between road users as well as road users and their surrounding physical space is an important step toward effective modeling and prediction of navigation strategies adopted by participants in road scenes. To this end, we propose a relation-aware framework for future trajectory forecast. Our system aims to infer relational information from the interactions of road user… ▽ More

    Submitted 27 August, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: ICCV 2019

  14. arXiv:1809.07408  [pdf, other

    cs.CV cs.RO

    Egocentric Vision-based Future Vehicle Localization for Intelligent Driving Assistance Systems

    Authors: Yu Yao, Mingze Xu, Chiho Choi, David J. Crandall, Ella M. Atkins, Behzad Dariush

    Abstract: Predicting the future location of vehicles is essential for safety-critical applications such as advanced driver assistance systems (ADAS) and autonomous driving. This paper introduces a novel approach to simultaneously predict both the location and scale of target vehicles in the first-person (egocentric) view of an ego-vehicle. We present a multi-stream recurrent neural network (RNN) encoder-dec… ▽ More

    Submitted 3 March, 2019; v1 submitted 19 September, 2018; originally announced September 2018.

    Comments: To appear on ICRA 2019