Skip to main content

Showing 1–22 of 22 results for author: Sapp, B

.
  1. arXiv:2310.08710  [pdf, other

    cs.RO cs.LG

    Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

    Authors: Cole Gulino, Justin Fu, Wenjie Luo, George Tucker, Eli Bronstein, Yiren Lu, Jean Harb, Xinlei Pan, Yan Wang, Xiangyu Chen, John D. Co-Reyes, Rishabh Agarwal, Rebecca Roelofs, Yao Lu, Nico Montali, Paul Mougin, Zoey Yang, Brandyn White, Aleksandra Faust, Rowan McAllister, Dragomir Anguelov, Benjamin Sapp

    Abstract: Simulation is an essential tool to develop and benchmark autonomous vehicle planning software in a safe and cost-effective manner. However, realistic simulation requires accurate modeling of nuanced and complex multi-agent interactive behaviors. To address these challenges, we introduce Waymax, a new data-driven simulator for autonomous driving in multi-agent scenes, designed for large-scale simul… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  2. arXiv:2309.16534  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    MotionLM: Multi-Agent Motion Forecasting as Language Modeling

    Authors: Ari Seff, Brian Cera, Dian Chen, Mason Ng, Aurick Zhou, Nigamaa Nayakanti, Khaled S. Refaat, Rami Al-Rfou, Benjamin Sapp

    Abstract: Reliable forecasting of the future behavior of road agents is a critical component to safe planning in autonomous vehicles. Here, we represent continuous trajectories as sequences of discrete motion tokens and cast multi-agent motion prediction as a language modeling task over this domain. Our model, MotionLM, provides several advantages: First, it does not require anchors or explicit latent varia… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: To appear at the International Conference on Computer Vision (ICCV) 2023

  3. arXiv:2306.03083  [pdf, other

    cs.RO cs.AI

    MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion

    Authors: Chiyu Max Jiang, Andre Cornman, Cheolho Park, Ben Sapp, Yin Zhou, Dragomir Anguelov

    Abstract: We present MotionDiffuser, a diffusion based representation for the joint distribution of future trajectories over multiple agents. Such representation has several key advantages: first, our model learns a highly multimodal distribution that captures diverse future outcomes. Second, the simple predictor design requires only a single L2 loss training objective, and does not depend on trajectory anc… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted as a highlight paper in CVPR 2023. Walkthrough video: https://youtu.be/IfGTZwm1abg

  4. arXiv:2212.11419  [pdf, other

    cs.AI cs.RO

    Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

    Authors: Yiren Lu, Justin Fu, George Tucker, Xinlei Pan, Eli Bronstein, Rebecca Roelofs, Benjamin Sapp, Brandyn White, Aleksandra Faust, Shimon Whiteson, Dragomir Anguelov, Sergey Levine

    Abstract: Imitation learning (IL) is a simple and powerful way to use high-quality human driving data, which can be collected at scale, to produce human-like behavior. However, policies based on imitation learning alone often fail to sufficiently account for safety and reliability concerns. In this paper, we show how imitation learning combined with reinforcement learning using simple rewards can substantia… ▽ More

    Submitted 10 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    ACM Class: I.2.9; I.2.6

  5. arXiv:2212.08710  [pdf, other

    cs.MA cs.LG cs.RO

    JFP: Joint Future Prediction with Interactive Multi-Agent Modeling for Autonomous Driving

    Authors: Wenjie Luo, Cheolho Park, Andre Cornman, Benjamin Sapp, Dragomir Anguelov

    Abstract: We propose JFP, a Joint Future Prediction model that can learn to generate accurate and consistent multi-agent future trajectories. For this task, many different methods have been proposed to capture social interactions in the encoding part of the model, however, considerably less focus has been placed on representing interactions in the decoder and output stages. As a result, the predicted trajec… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  6. arXiv:2207.05844  [pdf, other

    cs.CV

    Wayformer: Motion Forecasting via Simple & Efficient Attention Networks

    Authors: Nigamaa Nayakanti, Rami Al-Rfou, Aurick Zhou, Kratarth Goel, Khaled S. Refaat, Benjamin Sapp

    Abstract: Motion forecasting for autonomous driving is a challenging task because complex driving scenarios result in a heterogeneous mix of static and dynamic inputs. It is an open problem how best to represent and fuse information about road geometry, lane connectivity, time-varying traffic light state, and history of a dynamic set of agents and their interactions into an effective encoding. To model this… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  7. The Science Performance of JWST as Characterized in Commissioning

    Authors: Jane Rigby, Marshall Perrin, Michael McElwain, Randy Kimble, Scott Friedman, Matt Lallo, René Doyon, Lee Feinberg, Pierre Ferruit, Alistair Glasse, Marcia Rieke, George Rieke, Gillian Wright, Chris Willott, Knicole Colon, Stefanie Milam, Susan Neff, Christopher Stark, Jeff Valenti, Jim Abell, Faith Abney, Yasin Abul-Huda, D. Scott Acton, Evan Adams, David Adler , et al. (601 additional authors not shown)

    Abstract: This paper characterizes the actual science performance of the James Webb Space Telescope (JWST), as determined from the six month commissioning period. We summarize the performance of the spacecraft, telescope, science instruments, and ground system, with an emphasis on differences from pre-launch expectations. Commissioning has made clear that JWST is fully capable of achieving the discoveries f… ▽ More

    Submitted 10 April, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 5th version as accepted to PASP; 31 pages, 18 figures; https://iopscience.iop.org/article/10.1088/1538-3873/acb293

    Journal ref: PASP 135 048001 (2023)

  8. arXiv:2207.03586  [pdf, other

    cs.LG cs.AI cs.RO

    CausalAgents: A Robustness Benchmark for Motion Forecasting using Causal Relationships

    Authors: Rebecca Roelofs, Liting Sun, Ben Caine, Khaled S. Refaat, Ben Sapp, Scott Ettinger, Wei Chai

    Abstract: As machine learning models become increasingly prevalent in motion forecasting for autonomous vehicles (AVs), it is critical to ensure that model predictions are safe and reliable. However, exhaustively collecting and labeling the data necessary to fully test the long tail of rare and challenging scenarios is difficult and expensive. In this work, we construct a new benchmark for evaluating and im… ▽ More

    Submitted 6 October, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Rebecca Roelofs and Liting Sun are equally contributed to the work

  9. arXiv:2206.04176  [pdf, other

    cs.CV cs.LG cs.RO

    VN-Transformer: Rotation-Equivariant Attention for Vector Neurons

    Authors: Serge Assaad, Carlton Downey, Rami Al-Rfou, Nigamaa Nayakanti, Ben Sapp

    Abstract: Rotation equivariance is a desirable property in many practical applications such as motion forecasting and 3D perception, where it can offer benefits like sample efficiency, better generalization, and robustness to input perturbations. Vector Neurons (VN) is a recently developed framework offering a simple yet effective approach for deriving rotation-equivariant analogs of standard machine learni… ▽ More

    Submitted 24 January, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2023; Previous version appeared in Workshop on Machine Learning for Autonomous Driving, Conference on Neural Information Processing Systems (NeurIPS), 2022

  10. arXiv:2206.03970  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Narrowing the Coordinate-frame Gap in Behavior Prediction Models: Distillation for Efficient and Accurate Scene-centric Motion Forecasting

    Authors: DiJia Su, Bertrand Douillard, Rami Al-Rfou, Cheolho Park, Benjamin Sapp

    Abstract: Behavior prediction models have proliferated in recent years, especially in the popular real-world robotics application of autonomous driving, where representing the distribution over possible futures of moving agents is essential for safe and comfortable motion planning. In these models, the choice of coordinate frames to represent inputs and outputs has crucial trade offs which broadly fall into… ▽ More

    Submitted 10 June, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at ICRA 2022

  11. arXiv:2206.00991  [pdf, ps, other

    cs.RO cs.CV

    StopNet: Scalable Trajectory and Occupancy Prediction for Urban Autonomous Driving

    Authors: **kyu Kim, Reza Mahjourian, Scott Ettinger, Mayank Bansal, Brandyn White, Ben Sapp, Dragomir Anguelov

    Abstract: We introduce a motion forecasting (behavior prediction) method that meets the latency requirements for autonomous driving in dense urban environments without sacrificing accuracy. A whole-scene sparse input representation allows StopNet to scale to predicting trajectories for hundreds of road agents with reliable latency. In addition to predicting trajectories, our scene encoder lends itself to pr… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Journal ref: IEEE International Conference on Robotics and Automation 2022

  12. Occupancy Flow Fields for Motion Forecasting in Autonomous Driving

    Authors: Reza Mahjourian, **kyu Kim, Yuning Chai, Mingxing Tan, Ben Sapp, Dragomir Anguelov

    Abstract: We propose Occupancy Flow Fields, a new representation for motion forecasting of multiple agents, an important task in autonomous driving. Our representation is a spatio-temporal grid with each grid cell containing both the probability of the cell being occupied by any agent, and a two-dimensional flow vector representing the direction and magnitude of the motion in that cell. Our method successfu… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Journal ref: IEEE Robotics and Automation Letters

  13. arXiv:2111.14973  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction

    Authors: Balakrishnan Varadarajan, Ahmed Hefny, Avikalp Srivastava, Khaled S. Refaat, Nigamaa Nayakanti, Andre Cornman, Kan Chen, Bertrand Douillard, Chi Pang Lam, Dragomir Anguelov, Benjamin Sapp

    Abstract: Predicting the future behavior of road users is one of the most challenging and important problems in autonomous driving. Applying deep learning to this problem requires fusing heterogeneous world state in the form of rich perception signals and map information, and inferring highly multi-modal distributions over possible futures. In this paper, we present MultiPath++, a future prediction model th… ▽ More

    Submitted 21 December, 2021; v1 submitted 29 November, 2021; originally announced November 2021.

  14. arXiv:2106.08417  [pdf, other

    cs.CV cs.LG cs.RO

    Scene Transformer: A unified architecture for predicting multiple agent trajectories

    Authors: Jiquan Ngiam, Benjamin Caine, Vijay Vasudevan, Zhengdong Zhang, Hao-Tien Lewis Chiang, Jeffrey Ling, Rebecca Roelofs, Alex Bewley, Chenxi Liu, Ashish Venugopal, David Weiss, Ben Sapp, Zhifeng Chen, Jonathon Shlens

    Abstract: Predicting the motion of multiple agents is necessary for planning in dynamic environments. This task is challenging for autonomous driving since agents (e.g. vehicles and pedestrians) and their associated behaviors may be diverse and influence one another. Most prior work have focused on predicting independent futures for each agent based on all past motion, and planning against these independent… ▽ More

    Submitted 4 March, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  15. arXiv:2104.10133  [pdf, other

    cs.CV cs.LG cs.RO

    Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset

    Authors: Scott Ettinger, Shuyang Cheng, Benjamin Caine, Chenxi Liu, Hang Zhao, Sabeek Pradhan, Yuning Chai, Ben Sapp, Charles Qi, Yin Zhou, Zoey Yang, Aurelien Chouard, Pei Sun, Jiquan Ngiam, Vijay Vasudevan, Alexander McCauley, Jonathon Shlens, Dragomir Anguelov

    Abstract: As autonomous driving systems mature, motion forecasting has received increasing attention as a critical requirement for planning. Of particular importance are interactive situations such as merges, unprotected turns, etc., where predicting individual object motion is not sufficient. Joint predictions of multiple objects are required for effective route planning. There has been a critical need for… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: 15 pages, 10 figures

  16. arXiv:2104.09959  [pdf, other

    cs.RO

    Identifying Driver Interactions via Conditional Behavior Prediction

    Authors: Ekaterina Tolstaya, Reza Mahjourian, Carlton Downey, Balakrishnan Varadarajan, Benjamin Sapp, Dragomir Anguelov

    Abstract: Interactive driving scenarios, such as lane changes, merges and unprotected turns, are some of the most challenging situations for autonomous driving. Planning in interactive scenarios requires accurately modeling the reactions of other agents to different future actions of the ego agent. We develop end-to-end models for conditional behavior prediction (CBP) that take as an input a query future tr… ▽ More

    Submitted 1 June, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  17. arXiv:2008.08294  [pdf, other

    cs.CV cs.RO

    TNT: Target-driveN Trajectory Prediction

    Authors: Hang Zhao, Jiyang Gao, Tian Lan, Chen Sun, Benjamin Sapp, Balakrishnan Varadarajan, Yue Shen, Yi Shen, Yuning Chai, Cordelia Schmid, Congcong Li, Dragomir Anguelov

    Abstract: Predicting the future behavior of moving agents is essential for real world applications. It is challenging as the intent of the agent and the corresponding behavior is unknown and intrinsically multimodal. Our key insight is that for prediction within a moderate time horizon, the future modes can be effectively captured by a set of target states. This leads to our target-driven trajectory predict… ▽ More

    Submitted 21 August, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

  18. arXiv:1910.05449  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction

    Authors: Yuning Chai, Benjamin Sapp, Mayank Bansal, Dragomir Anguelov

    Abstract: Predicting human behavior is a difficult and crucial task required for motion planning. It is challenging in large part due to the highly uncertain and multi-modal set of possible outcomes in real-world domains such as autonomous driving. Beyond single MAP trajectory prediction, obtaining an accurate probability distribution of the future is an area of active interest. We present MultiPath, which… ▽ More

    Submitted 11 October, 2019; originally announced October 2019.

    Comments: Appears in CoRL 2019

  19. arXiv:1906.08945  [pdf, other

    cs.CV cs.LG cs.RO

    Rules of the Road: Predicting Driving Behavior with a Convolutional Model of Semantic Interactions

    Authors: Joey Hong, Benjamin Sapp, James Philbin

    Abstract: We focus on the problem of predicting future states of entities in complex, real-world driving scenarios. Previous research has used low-level signals to predict short time horizons, and has not addressed how to leverage key assets relied upon heavily by industry self-driving systems: (1) large 3D perception efforts which provide highly accurate 3D states of agents with rich attributes, and (2) de… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

    Comments: Accepted at CVPR 2019

    Journal ref: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 8454-8462

  20. arXiv:1902.08501  [pdf, other

    physics.app-ph quant-ph

    Development of transmon qubits solely from optical lithography on 300mm wafers

    Authors: N. Foroozani, C. Hobbs, C. C. Hung, S. Olson, D. Ashworth, E. Holland, M. Malloy, P. Kearney, B. O'Brien, B. Bunday, D. DiPaola, W. Advocate, T. Murray, P. Hansen, S. Novak, S. Bennett, M. Rodgers, B. Baker-O'Neal, B. Sapp, E. Barth, J. Hedrick, R. Goldblatt, S. S. Papa Rao, K. D. Osborn

    Abstract: Qubit information processors are increasing in footprint but currently rely on e-beam lithography for patterning the required Josephson junctions (JJs). Advanced optical lithography is an alternative patterning method, and we report on the development of transmon qubits patterned solely with optical lithography. The lithography uses 193 nm wavelength exposure and 300-mm large silicon wafers. Qubit… ▽ More

    Submitted 22 February, 2019; originally announced February 2019.

    Comments: 7 pages, 4 figures, submitted to Quantum Science and Technology

    Journal ref: Journal of Quantum Science and Technology, 2019

  21. arXiv:1511.06789  [pdf, other

    cs.CV

    The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

    Authors: Jonathan Krause, Benjamin Sapp, Andrew Howard, Howard Zhou, Alexander Toshev, Tom Duerig, James Philbin, Li Fei-Fei

    Abstract: Current approaches for fine-grained recognition do the following: First, recruit experts to annotate a dataset of images, optionally also collecting more structured data in the form of part annotations and bounding boxes. Second, train a model utilizing this data. Toward the goal of solving fine-grained recognition, we introduce an alternative approach, leveraging free, noisy data from the web and… ▽ More

    Submitted 18 October, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    Comments: ECCV 2016, data is released

  22. arXiv:1208.3279  [pdf, other

    stat.ML cs.LG

    Structured Prediction Cascades

    Authors: David Weiss, Benjamin Sapp, Ben Taskar

    Abstract: Structured prediction tasks pose a fundamental trade-off between the need for model complexity to increase predictive power and the limited computational resources for inference in the exponentially-sized output spaces such models require. We formulate and develop the Structured Prediction Cascade architecture: a sequence of increasingly complex models that progressively filter the space of possib… ▽ More

    Submitted 6 August, 2012; originally announced August 2012.

    Comments: 32 pages, in submission