Skip to main content

Showing 1–20 of 20 results for author: Bayen, A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.17050  [pdf, other

    eess.SY cs.RO

    Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test

    Authors: Kathy Jang, Nathan Lichtlé, Eugene Vinitsky, Adit Shah, Matthew Bunting, Matthew Nice, Benedetto Piccoli, Benjamin Seibold, Daniel B. Work, Maria Laura Delle Monache, Jonathan Sprinkle, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: In this article, we explore the technical details of the reinforcement learning (RL) algorithms that were deployed in the largest field test of automated vehicles designed to smooth traffic flow in history as of 2023, uncovering the challenges and breakthroughs that come with develo** RL controllers for automated vehicles. We delve into the fundamental concepts behind RL algorithms and their app… ▽ More

    Submitted 14 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  2. arXiv:2401.09666  [pdf, other

    eess.SY cs.AI cs.MA

    Traffic Smoothing Controllers for Autonomous Vehicles Using Deep Reinforcement Learning and Real-World Trajectory Data

    Authors: Nathan Lichtlé, Kathy Jang, Adit Shah, Eugene Vinitsky, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: Designing traffic-smoothing cruise controllers that can be deployed onto autonomous vehicles is a key step towards improving traffic flow, reducing congestion, and enhancing fuel efficiency in mixed autonomy traffic. We bypass the common issue of having to carefully fine-tune a large traffic microsimulator by leveraging real-world trajectory data from the I-24 highway in Tennessee, replayed in a o… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to be published as part of the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023, Bilbao, Spain, September 24-28, 2023

  3. arXiv:2208.12534  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Learning energy-efficient driving behaviors by imitating experts

    Authors: Abdul Rahman Kreidieh, Zhe Fu, Alexandre M. Bayen

    Abstract: The rise of vehicle automation has generated significant interest in the potential role of future automated vehicles (AVs). In particular, in highly dense traffic settings, AVs are expected to serve as congestion-dampeners, mitigating the presence of instabilities that arise from various sources. However, in many applications, such maneuvers rely heavily on non-local sensing or coordination by int… ▽ More

    Submitted 28 June, 2022; originally announced August 2022.

  4. Unified Automatic Control of Vehicular Systems with Reinforcement Learning

    Authors: Zhongxia Yan, Abdul Rahman Kreidieh, Eugene Vinitsky, Alexandre M. Bayen, Cathy Wu

    Abstract: Emerging vehicular systems with increasing proportions of automated components present opportunities for optimal control to mitigate congestion and increase efficiency. There has been a recent interest in applying deep reinforcement learning (DRL) to these nonlinear dynamical systems for the automatic design of effective control strategies. Despite conceptual advantages of DRL being model-free, st… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 16 pages, 14 figures, IEEE Transactions on Automation Science and Engineering (T-ASE), 2022

  5. arXiv:2112.14345  [pdf, other

    eess.SY cs.MA cs.RO

    Reachability Analysis for FollowerStopper: Safety Analysis and Experimental Results

    Authors: Fang-Chieh Chou, Marsalis Gibson, Rahul Bhadani, Alexandre M. Bayen, Jonathan Sprinkle

    Abstract: Motivated by earlier work and the developer of a new algorithm, the FollowerStopper, this article uses reachability analysis to verify the safety of the FollowerStopper algorithm, which is a controller designed for dampening stop- and-go traffic waves. With more than 1100 miles of driving data collected by our physical platform, we validate our analysis results by comparing it to human driving beh… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

    Comments: 6 pages; 10 figures; ICRA publication

  6. arXiv:2110.11943  [pdf, other

    math.DS cs.MA cs.NI eess.SY math.OC

    Solving N-player dynamic routing games with congestion: a mean field approach

    Authors: Theophile Cabannes, Mathieu Lauriere, Julien Perolat, Raphael Marinier, Sertan Girgin, Sarah Perrin, Olivier Pietquin, Alexandre M. Bayen, Eric Goubault, Romuald Elie

    Abstract: The recent emergence of navigational tools has changed traffic patterns and has now enabled new types of congestion-aware routing control like dynamic road pricing. Using the fundamental diagram of traffic flows - applied in macroscopic and mesoscopic traffic modeling - the article introduces a new N-player dynamic routing game with explicit congestion dynamics. The model is well-posed and can rep… ▽ More

    Submitted 27 October, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

  7. arXiv:2002.07386  [pdf, other

    cs.LG stat.ML

    ResiliNet: Failure-Resilient Inference in Distributed Neural Networks

    Authors: Ashkan Yousefpour, Brian Q. Nguyen, Siddartha Devic, Guanhua Wang, Aboudy Kreidieh, Hans Lobel, Alexandre M. Bayen, Jason P. Jue

    Abstract: Federated Learning aims to train distributed deep models without sharing the raw data with the centralized server. Similarly, in distributed inference of neural networks, by partitioning the network and distributing it across several physical nodes, activations and gradients are exchanged between physical nodes, rather than raw data. Nevertheless, when a neural network is partitioned and distribut… ▽ More

    Submitted 19 December, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: Accepted in FL-ICML 2020 (International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2020). Added FAQ to the end of the paper

  8. arXiv:1912.02368  [pdf, other

    cs.LG cs.AI stat.ML

    Inter-Level Cooperation in Hierarchical Reinforcement Learning

    Authors: Abdul Rahman Kreidieh, Glen Berseth, Brandon Trabucco, Samyak Parajuli, Sergey Levine, Alexandre M. Bayen

    Abstract: Hierarchies of temporally decoupled policies present a promising approach for enabling structured exploration in complex long-term planning problems. To fully achieve this approach an end-to-end training paradigm is needed. However, training these multi-level policies has had limited success due to challenges arising from interactions between the goal-assigning and goal-achieving levels within a h… ▽ More

    Submitted 17 November, 2021; v1 submitted 4 December, 2019; originally announced December 2019.

  9. Guardians of the Deep Fog: Failure-Resilient DNN Inference from Edge to Cloud

    Authors: Ashkan Yousefpour, Siddartha Devic, Brian Q. Nguyen, Aboudy Kreidieh, Alan Liao, Alexandre M. Bayen, Jason P. Jue

    Abstract: Partitioning and distributing deep neural networks (DNNs) over physical nodes such as edge, fog, or cloud nodes, could enhance sensor fusion, and reduce bandwidth and inference latency. However, when a DNN is distributed over physical nodes, failure of the physical nodes causes the failure of the DNN units that are placed on these nodes. The performance of the inference task will be unpredictable,… ▽ More

    Submitted 21 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: Accepted to ACM AIChallengeIoT 2019

  10. arXiv:1908.03821  [pdf, other

    cs.CY cs.MA

    BISTRO: Berkeley Integrated System for Transportation Optimization

    Authors: Sidney A. Feygin, Jessica R. Lazarus, Edward H. Forscher, Valentine Golfier-Vetterli, Jonathan W. Lee, Abhishek Gupta, Rashid A. Waraich, Colin J. R. Sheppard, Alexandre M. Bayen

    Abstract: This article introduces BISTRO, a new open source transportation planning decision support system that uses an agent-based simulation and optimization approach to anticipate and develop adaptive plans for possible technological disruptions and growth scenarios. The new framework was evaluated in the context of a machine learning competition hosted within Uber Technologies, Inc., in which over 400… ▽ More

    Submitted 22 January, 2020; v1 submitted 10 August, 2019; originally announced August 2019.

  11. arXiv:1803.07246  [pdf, other

    cs.LG cs.AI stat.ML

    Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines

    Authors: Cathy Wu, Aravind Rajeswaran, Yan Duan, Vikash Kumar, Alexandre M Bayen, Sham Kakade, Igor Mordatch, Pieter Abbeel

    Abstract: Policy gradient methods have enjoyed great success in deep reinforcement learning but suffer from high variance of gradient estimates. The high variance problem is particularly exasperated in problems with long horizons or high-dimensional action spaces. To mitigate this issue, we derive a bias-free action-dependent baseline for variance reduction which fully exploits the structural form of the st… ▽ More

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Accepted to ICLR 2018, Oral (2%)

  12. arXiv:1710.05465  [pdf, other

    cs.AI cs.RO eess.SY

    Flow: A Modular Learning Framework for Mixed Autonomy Traffic

    Authors: Cathy Wu, Aboudy Kreidieh, Kanaad Parvate, Eugene Vinitsky, Alexandre M Bayen

    Abstract: The rapid development of autonomous vehicles (AVs) holds vast potential for transportation systems through improved safety, efficiency, and access to mobility. However, the progression of these impacts, as AVs are adopted, is not well understood. Numerous technical challenges arise from the goal of analyzing the partial adoption of autonomy: partial control and observation, multi-vehicle interacti… ▽ More

    Submitted 30 December, 2021; v1 submitted 15 October, 2017; originally announced October 2017.

    Comments: 17 pages, 8 figures, 5 tables. 2021 IEEE Transactions on Robotics (T-RO)

  13. arXiv:1701.08832  [pdf, other

    cs.AI

    Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning

    Authors: Francois Belletti, Daniel Haziza, Gabriel Gomes, Alexandre M. Bayen

    Abstract: This article shows how the recent breakthroughs in Reinforcement Learning (RL) that have enabled robots to learn to play arcade video games, walk or assemble colored bricks, can be used to perform other tasks that are currently at the core of engineering cyberphysical systems. We present the first use of RL for the control of systems modeled by discretized non-linear Partial Differential Equations… ▽ More

    Submitted 30 January, 2017; originally announced January 2017.

  14. arXiv:1603.03336  [pdf, other

    cs.LG stat.ME

    Scalable Linear Causal Inference for Irregularly Sampled Time Series with Long Range Dependencies

    Authors: Francois W. Belletti, Evan R. Sparks, Michael J. Franklin, Alexandre M. Bayen, Joseph E. Gonzalez

    Abstract: Linear causal analysis is central to a wide range of important application spanning finance, the physical sciences, and engineering. Much of the existing literature in linear causal analysis operates in the time domain. Unfortunately, the direct application of time domain linear causal analysis to many real-world time series presents three critical challenges: irregular temporal sampling, long ran… ▽ More

    Submitted 10 March, 2016; originally announced March 2016.

  15. Differential Privacy of Populations in Routing Games

    Authors: Roy Dong, Walid Krichene, Alexandre M. Bayen, S. Shankar Sastry

    Abstract: As our ground transportation infrastructure modernizes, the large amount of data being measured, transmitted, and stored motivates an analysis of the privacy aspect of these emerging cyber-physical technologies. In this paper, we consider privacy in the routing game, where the origins and destinations of drivers are considered private. This is motivated by the fact that this spatiotemporal informa… ▽ More

    Submitted 15 January, 2016; originally announced January 2016.

    Comments: Extended draft of paper that appears in 2015 IEEE CDC

  16. arXiv:1511.06493  [pdf, other

    cs.DC

    Embarrassingly Parallel Time Series Analysis for Large Scale Weak Memory Systems

    Authors: Francois Belletti, Evan Sparks, Michael Franklin, Alexandre M. Bayen

    Abstract: Second order stationary models in time series analysis are based on the analysis of essential statistics whose computations follow a common pattern. In particular, with a map-reduce nomenclature, most of these operations can be modeled as map** a kernel that only depends on short windows of consecutive data and reducing the results produced by each computation. This computational pattern stems f… ▽ More

    Submitted 20 November, 2015; originally announced November 2015.

    MSC Class: 68M14; 37M10; 62M10

  17. arXiv:1408.0017  [pdf, other

    cs.LG cs.GT

    Learning Nash Equilibria in Congestion Games

    Authors: Walid Krichene, Benjamin Drighès, Alexandre M. Bayen

    Abstract: We study the repeated congestion game, in which multiple populations of players share resources, and make, at each iteration, a decentralized decision on which resources to utilize. We investigate the following question: given a model of how individual players update their strategies, does the resulting dynamics of strategy profiles converge to the set of Nash equilibria of the one-shot game? We c… ▽ More

    Submitted 31 July, 2014; originally announced August 2014.

  18. Environmental Sensing by Wearable Device for Indoor Activity and Location Estimation

    Authors: Ming **, Han Zou, Kevin Weekly, Ruoxi Jia, Alexandre M. Bayen, Costas J. Spanos

    Abstract: We present results from a set of experiments in this pilot study to investigate the causal influence of user activity on various environmental parameters monitored by occupant carried multi-purpose sensors. Hypotheses with respect to each type of measurements are verified, including temperature, humidity, and light level collected during eight typical activities: sitting in lab / cubicle, indoor w… ▽ More

    Submitted 22 June, 2014; originally announced June 2014.

    Comments: submitted to the 40th Annual Conference of the IEEE Industrial Electronics Society (IECON)

  19. arXiv:1312.1075  [pdf, ps, other

    cs.GT eess.SY math.OC

    A Necessary and Sufficient Condition for the Existence of Potential Functions for Heterogeneous Routing Games

    Authors: Farhad Farokhi, Walid Krichene, Alexandre M. Bayen, Karl H. Johansson

    Abstract: We study a heterogeneous routing game in which vehicles might belong to more than one type. The type determines the cost of traveling along an edge as a function of the flow of various types of vehicles over that edge. We relax the assumptions needed for the existence of a Nash equilibrium in this heterogeneous routing game. We extend the available results to present necessary and sufficient condi… ▽ More

    Submitted 3 February, 2014; v1 submitted 4 December, 2013; originally announced December 2013.

    Comments: Improved Literature Review; Updated Introduction

  20. arXiv:1212.3393  [pdf, other

    cs.RO cs.SE

    Large Scale Estimation in Cyberphysical Systems using Streaming Data: a Case Study with Smartphone Traces

    Authors: Timothy Hunter, Tathagata Das, Matei Zaharia, Pieter Abbeel, Alexandre M. Bayen

    Abstract: Controlling and analyzing cyberphysical and robotics systems is increasingly becoming a Big Data challenge. Pushing this data to, and processing in the cloud is more efficient than on-board processing. However, current cloud-based solutions are not suitable for the latency requirements of these applications. We present a new concept, Discretized Streams or D-Streams, that enables massively scalabl… ▽ More

    Submitted 14 December, 2012; originally announced December 2012.