Showing 1–2 of 2 results for author: Tang, Y C

Search v0.5.6 released 2020-02-24

arXiv:1912.08444 [pdf, other]

cs.LG cs.AI cs.CV cs.RO stat.ML

Relational Mimic for Visual Adversarial Imitation Learning

Authors: Lionel Blondé, Yichuan Charlie Tang, Jian Zhang, Russ Webb

Abstract: In this work, we introduce a new method for imitation learning from video demonstrations. Our method, Relational Mimic (RM), improves on previous visual imitation learning methods by combining generative adversarial networks and relational learning. RM is flexible and can be used in conjunction with other recent advances in generative adversarial imitation learning to better address the need for m… ▽ More In this work, we introduce a new method for imitation learning from video demonstrations. Our method, Relational Mimic (RM), improves on previous visual imitation learning methods by combining generative adversarial networks and relational learning. RM is flexible and can be used in conjunction with other recent advances in generative adversarial imitation learning to better address the need for more robust and sample-efficient approaches. In addition, we introduce a new neural network architecture that improves upon the previous state-of-the-art in reinforcement learning and illustrate how increasing the relational reasoning capabilities of the agent enables the latter to achieve increasingly higher performance in a challenging locomotion task with pixel inputs. Finally, we study the effects and contributions of relational learning in policy evaluation, policy improvement and reward learning through ablation studies. △ Less

Submitted 18 December, 2019; originally announced December 2019.
arXiv:1911.00997 [pdf, other]

cs.LG cs.CV cs.MA cs.RO stat.ML

Multiple Futures Prediction

Authors: Yichuan Charlie Tang, Ruslan Salakhutdinov

Abstract: Temporal prediction is critical for making intelligent and robust decisions in complex dynamic environments. Motion prediction needs to model the inherently uncertain future which often contains multiple potential outcomes, due to multi-agent interactions and the latent goals of others. Towards these goals, we introduce a probabilistic framework that efficiently learns latent variables to jointly… ▽ More Temporal prediction is critical for making intelligent and robust decisions in complex dynamic environments. Motion prediction needs to model the inherently uncertain future which often contains multiple potential outcomes, due to multi-agent interactions and the latent goals of others. Towards these goals, we introduce a probabilistic framework that efficiently learns latent variables to jointly model the multi-step future motions of agents in a scene. Our framework is data-driven and learns semantically meaningful latent variables to represent the multimodal future, without requiring explicit labels. Using a dynamic attention-based state encoder, we learn to encode the past as well as the future interactions among agents, efficiently scaling to any number of agents. Finally, our model can be used for planning via computing a conditional probability density over the trajectories of other agents given a hypothetical rollout of the 'self' agent. We demonstrate our algorithms by predicting vehicle trajectories of both simulated and real data, demonstrating the state-of-the-art results on several vehicle trajectory datasets. △ Less

Submitted 6 December, 2019; v1 submitted 3 November, 2019; originally announced November 2019.

Comments: In proceedings of NeurIPS 2019, Vancouver, British Columbia, Canada

Search v0.5.6 released 2020-02-24