Search | arXiv e-print repository

arXiv:1903.05766 [pdf, other]

Simulating Emergent Properties of Human Driving Behavior Using Multi-Agent Reward Augmented Imitation Learning

Authors: Raunak P. Bhattacharyya, Derek J. Phillips, Changliu Liu, Jayesh K. Gupta, Katherine Driggs-Campbell, Mykel J. Kochenderfer

Abstract: Recent developments in multi-agent imitation learning have shown promising results for modeling the behavior of human drivers. However, it is challenging to capture emergent traffic behaviors that are observed in real-world datasets. Such behaviors arise due to the many local interactions between agents that are not commonly accounted for in imitation learning. This paper proposes Reward Augmented… ▽ More Recent developments in multi-agent imitation learning have shown promising results for modeling the behavior of human drivers. However, it is challenging to capture emergent traffic behaviors that are observed in real-world datasets. Such behaviors arise due to the many local interactions between agents that are not commonly accounted for in imitation learning. This paper proposes Reward Augmented Imitation Learning (RAIL), which integrates reward augmentation into the multi-agent imitation learning framework and allows the designer to specify prior knowledge in a principled fashion. We prove that convergence guarantees for the imitation learning process are preserved under the application of reward augmentation. This method is validated in a driving scenario, where an entire traffic scene is controlled by driving policies learned using our proposed algorithm. Further, we demonstrate improved performance in comparison to traditional imitation learning algorithms both in terms of the local actions of a single agent and the behavior of emergent properties in complex, multi-agent settings. △ Less

Submitted 13 March, 2019; originally announced March 2019.

Comments: Accepted for publication at ICRA 2019

arXiv:1902.01293 [pdf, other]

Real-time Prediction of Automotive Collision Risk from Monocular Video

Authors: Derek J. Phillips, Juan Carlos Aragon, Anjali Roychowdhury, Regina Madigan, Sunil Chintakindi, Mykel J. Kochenderfer

Abstract: Many automotive applications, such as Advanced Driver Assistance Systems (ADAS) for collision avoidance and warnings, require estimating the future automotive risk of a driving scene. We present a low-cost system that predicts the collision risk over an intermediate time horizon from a monocular video source, such as a dashboard-mounted camera. The modular system includes components for object det… ▽ More Many automotive applications, such as Advanced Driver Assistance Systems (ADAS) for collision avoidance and warnings, require estimating the future automotive risk of a driving scene. We present a low-cost system that predicts the collision risk over an intermediate time horizon from a monocular video source, such as a dashboard-mounted camera. The modular system includes components for object detection, object tracking, and state estimation. We introduce solutions to the object tracking and distance estimation problems. Advanced approaches to the other tasks are used to produce real-time predictions of the automotive risk for the next 10 s at over 5 Hz. The system is designed such that alternative components can be substituted with minimal effort. It is demonstrated on common physical hardware, specifically an off-the-shelf gaming laptop and a webcam. We extend the framework to support absolute speed estimation and more advanced risk estimation techniques. △ Less

Submitted 4 February, 2019; originally announced February 2019.

Comments: Submitted to IV2019. 7 pages, 4 figures, 3 tables

arXiv:1803.01044 [pdf, other]

Multi-Agent Imitation Learning for Driving Simulation

Authors: Raunak P. Bhattacharyya, Derek J. Phillips, Blake Wulfe, Jeremy Morton, Alex Kuefler, Mykel J. Kochenderfer

Abstract: Simulation is an appealing option for validating the safety of autonomous vehicles. Generative Adversarial Imitation Learning (GAIL) has recently been shown to learn representative human driver models. These human driver models were learned through training in single-agent environments, but they have difficulty in generalizing to multi-agent driving scenarios. We argue these difficulties arise bec… ▽ More Simulation is an appealing option for validating the safety of autonomous vehicles. Generative Adversarial Imitation Learning (GAIL) has recently been shown to learn representative human driver models. These human driver models were learned through training in single-agent environments, but they have difficulty in generalizing to multi-agent driving scenarios. We argue these difficulties arise because observations at training and test time are sampled from different distributions. This difference makes such models unsuitable for the simulation of driving scenes, where multiple agents must interact realistically over long time horizons. We extend GAIL to address these shortcomings through a parameter-sharing approach grounded in curriculum learning. Compared with single-agent GAIL policies, policies generated by our PS-GAIL method prove superior at interacting stably in a multi-agent setting and capturing the emergent behavior of human drivers. △ Less

Submitted 2 March, 2018; originally announced March 2018.

Comments: 6 pages, 3 figures, 1 table

arXiv:cs/0109098 [pdf]

The Influence of Policy Regimes on the Development and Social Implications of Privacy Enhancing Technologies

Authors: David J. Phillips

Abstract: As privacy issues have gained social salience, entrepreneurs have begun to offer privacy enhancing technologies (PETs) and the U.S. has begun to enact privacy legislation. But "privacy" is an ambiguous notion. In the liberal tradition, it is an individualistic value protecting citizens from intrusion into a realm of autonomy. A feminist critique suggests that the social utility of privacy is t… ▽ More As privacy issues have gained social salience, entrepreneurs have begun to offer privacy enhancing technologies (PETs) and the U.S. has begun to enact privacy legislation. But "privacy" is an ambiguous notion. In the liberal tradition, it is an individualistic value protecting citizens from intrusion into a realm of autonomy. A feminist critique suggests that the social utility of privacy is to exclude certain issues from the public realm. Sociologists suggest that privacy is about identity management, while political economists suggest that the most salient privacy issue is the use of personal information to normalize and rationalize populations according to the needs of capital. While PETs have been developed for use by individual consumers, recently developers are focusing on the business to business market, where demand is stoked by the existence of new privacy regulations. These new laws tend to operationalize privacy in terms of "personally identifiable information." The new generation of PETs reflect and reify that definition. This, in turn, has implications for the everyday understandings of privacy and the constitution of identity and social life. In particular, this socio-technical practice may strengthen the ability of data holders to rationalize populations and create self-serving social categories. At the same time, they may permit individuals to negotiate these categories outside of panoptic vision. They may also encourage public discussion and awareness of these created social categories. △ Less

Submitted 23 October, 2001; v1 submitted 24 September, 2001; originally announced September 2001.

Comments: 29th TPRC Conference, 2001

Report number: TPRC-2001-067 ACM Class: K.4.m Miscellaneous

Showing 1–4 of 4 results for author: Phillips, D J