Skip to main content

Showing 1–17 of 17 results for author: Roelofs, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.08710  [pdf, other

    cs.RO cs.LG

    Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

    Authors: Cole Gulino, Justin Fu, Wenjie Luo, George Tucker, Eli Bronstein, Yiren Lu, Jean Harb, Xinlei Pan, Yan Wang, Xiangyu Chen, John D. Co-Reyes, Rishabh Agarwal, Rebecca Roelofs, Yao Lu, Nico Montali, Paul Mougin, Zoey Yang, Brandyn White, Aleksandra Faust, Rowan McAllister, Dragomir Anguelov, Benjamin Sapp

    Abstract: Simulation is an essential tool to develop and benchmark autonomous vehicle planning software in a safe and cost-effective manner. However, realistic simulation requires accurate modeling of nuanced and complex multi-agent interactive behaviors. To address these challenges, we introduce Waymax, a new data-driven simulator for autonomous driving in multi-agent scenes, designed for large-scale simul… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  2. arXiv:2212.11419  [pdf, other

    cs.AI cs.RO

    Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

    Authors: Yiren Lu, Justin Fu, George Tucker, Xinlei Pan, Eli Bronstein, Rebecca Roelofs, Benjamin Sapp, Brandyn White, Aleksandra Faust, Shimon Whiteson, Dragomir Anguelov, Sergey Levine

    Abstract: Imitation learning (IL) is a simple and powerful way to use high-quality human driving data, which can be collected at scale, to produce human-like behavior. However, policies based on imitation learning alone often fail to sufficiently account for safety and reliability concerns. In this paper, we show how imitation learning combined with reinforcement learning using simple rewards can substantia… ▽ More

    Submitted 10 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    ACM Class: I.2.9; I.2.6

  3. arXiv:2207.03586  [pdf, other

    cs.LG cs.AI cs.RO

    CausalAgents: A Robustness Benchmark for Motion Forecasting using Causal Relationships

    Authors: Rebecca Roelofs, Liting Sun, Ben Caine, Khaled S. Refaat, Ben Sapp, Scott Ettinger, Wei Chai

    Abstract: As machine learning models become increasingly prevalent in motion forecasting for autonomous vehicles (AVs), it is critical to ensure that model predictions are safe and reliable. However, exhaustively collecting and labeling the data necessary to fully test the long tail of rare and challenging scenarios is difficult and expensive. In this work, we construct a new benchmark for evaluating and im… ▽ More

    Submitted 6 October, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Rebecca Roelofs and Liting Sun are equally contributed to the work

  4. arXiv:2205.04596  [pdf, other

    cs.CV

    When does dough become a bagel? Analyzing the remaining mistakes on ImageNet

    Authors: Vijay Vasudevan, Benjamin Caine, Raphael Gontijo-Lopes, Sara Fridovich-Keil, Rebecca Roelofs

    Abstract: Image classification accuracy on the ImageNet dataset has been a barometer for progress in computer vision over the last decade. Several recent papers have questioned the degree to which the benchmark remains useful to the community, yet innovations continue to contribute gains to performance, with today's largest models achieving 90%+ top-1 accuracy. To help contextualize progress on ImageNet and… ▽ More

    Submitted 25 May, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Data and analysis available at https://github.com/google-research/imagenet-mistakes

  5. arXiv:2203.05482  [pdf, other

    cs.LG cs.CL cs.CV

    Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

    Authors: Mitchell Wortsman, Gabriel Ilharco, Samir Yitzhak Gadre, Rebecca Roelofs, Raphael Gontijo-Lopes, Ari S. Morcos, Hongseok Namkoong, Ali Farhadi, Yair Carmon, Simon Kornblith, Ludwig Schmidt

    Abstract: The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-out validation set, discarding the remainder. In this paper, we revisit the second step of this procedure in the context of fine-tuning large pre-trained models, where fine-tuned models often appear to lie in a single low… ▽ More

    Submitted 1 July, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: ICML 2022. The last three authors contributed equally

  6. arXiv:2110.02424  [pdf, other

    cs.LG

    Spectral Bias in Practice: The Role of Function Frequency in Generalization

    Authors: Sara Fridovich-Keil, Raphael Gontijo-Lopes, Rebecca Roelofs

    Abstract: Despite their ability to represent highly expressive functions, deep learning models seem to find simple solutions that generalize surprisingly well. Spectral bias -- the tendency of neural networks to prioritize learning low frequency functions -- is one possible explanation for this phenomenon, but so far spectral bias has primarily been observed in theoretical models and simplified experiments.… ▽ More

    Submitted 28 September, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

  7. arXiv:2109.01903  [pdf, other

    cs.CV cs.LG

    Robust fine-tuning of zero-shot models

    Authors: Mitchell Wortsman, Gabriel Ilharco, Jong Wook Kim, Mike Li, Simon Kornblith, Rebecca Roelofs, Raphael Gontijo-Lopes, Hannaneh Hajishirzi, Ali Farhadi, Hongseok Namkoong, Ludwig Schmidt

    Abstract: Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple a… ▽ More

    Submitted 21 June, 2022; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: CVPR 2022

  8. arXiv:2106.15831  [pdf, other

    cs.LG cs.AI cs.CV

    The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning

    Authors: Anders Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs

    Abstract: Although machine learning models typically experience a drop in performance on out-of-distribution data, accuracies on in- versus out-of-distribution data are widely observed to follow a single linear trend when evaluated across a testbed of models. Models that are more accurate on the out-of-distribution data relative to this baseline exhibit "effective robustness" and are exceedingly rare. Ident… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: 27 pages, 25 figures

  9. arXiv:2106.08417  [pdf, other

    cs.CV cs.LG cs.RO

    Scene Transformer: A unified architecture for predicting multiple agent trajectories

    Authors: Jiquan Ngiam, Benjamin Caine, Vijay Vasudevan, Zhengdong Zhang, Hao-Tien Lewis Chiang, Jeffrey Ling, Rebecca Roelofs, Alex Bewley, Chenxi Liu, Ashish Venugopal, David Weiss, Ben Sapp, Zhifeng Chen, Jonathon Shlens

    Abstract: Predicting the motion of multiple agents is necessary for planning in dynamic environments. This task is challenging for autonomous driving since agents (e.g. vehicles and pedestrians) and their associated behaviors may be diverse and influence one another. Most prior work have focused on predicting independent futures for each agent based on all past motion, and planning against these independent… ▽ More

    Submitted 4 March, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: ICLR 2022

  10. arXiv:2106.04732  [pdf, other

    cs.LG cs.AI cs.CV

    AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation

    Authors: David Berthelot, Rebecca Roelofs, Kihyuk Sohn, Nicholas Carlini, Alex Kurakin

    Abstract: We extend semi-supervised learning to the problem of domain adaptation to learn significantly higher-accuracy models that train on one data distribution and test on a different one. With the goal of generality, we introduce AdaMatch, a method that unifies the tasks of unsupervised domain adaptation (UDA), semi-supervised learning (SSL), and semi-supervised domain adaptation (SSDA). In an extensive… ▽ More

    Submitted 15 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022

  11. arXiv:2103.02093  [pdf, other

    cs.CV cs.LG

    Pseudo-labeling for Scalable 3D Object Detection

    Authors: Benjamin Caine, Rebecca Roelofs, Vijay Vasudevan, Jiquan Ngiam, Yuning Chai, Zhifeng Chen, Jonathon Shlens

    Abstract: To safely deploy autonomous vehicles, onboard perception systems must work reliably at high accuracy across a diverse set of environments and geographies. One of the most common techniques to improve the efficacy of such systems in new domains involves collecting large labeled datasets, but such datasets can be extremely costly to obtain, especially if each new deployment geography requires additi… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

  12. arXiv:2012.08668  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Mitigating Bias in Calibration Error Estimation

    Authors: Rebecca Roelofs, Nicholas Cain, Jonathon Shlens, Michael C. Mozer

    Abstract: For an AI system to be reliable, the confidence it expresses in its decisions must match its accuracy. To assess the degree of match, examples are typically binned by confidence and the per-bin mean confidence and accuracy are compared. Most research in calibration focuses on techniques to reduce this empirical measure of calibration error, ECE_bin. We instead focus on assessing statistical bias i… ▽ More

    Submitted 10 February, 2022; v1 submitted 15 December, 2020; originally announced December 2020.

    Comments: To be published in AISTATS 2022. Code is available https://github.com/google-research/google-research/tree/master/caltrain

  13. arXiv:1906.02168  [pdf, other

    cs.LG cs.CV stat.ML

    Do Image Classifiers Generalize Across Time?

    Authors: Vaishaal Shankar, Achal Dave, Rebecca Roelofs, Deva Ramanan, Benjamin Recht, Ludwig Schmidt

    Abstract: We study the robustness of image classifiers to temporal perturbations derived from videos. As part of this study, we construct two datasets, ImageNet-Vid-Robust and YTBB-Robust , containing a total 57,897 images grouped into 3,139 sets of perceptually similar images. Our datasets were derived from ImageNet-Vid and Youtube-BB respectively and thoroughly re-annotated by human experts for image simi… ▽ More

    Submitted 9 December, 2019; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: 23 pages, 11 tables, 11 figures. Paper Website: https://modestyachts.github.io/natural-perturbations-website/

  14. arXiv:1902.10811  [pdf, other

    cs.CV cs.LG stat.ML

    Do ImageNet Classifiers Generalize to ImageNet?

    Authors: Benjamin Recht, Rebecca Roelofs, Ludwig Schmidt, Vaishaal Shankar

    Abstract: We build new test sets for the CIFAR-10 and ImageNet datasets. Both benchmarks have been the focus of intense research for almost a decade, raising the danger of overfitting to excessively re-used test sets. By closely following the original dataset creation processes, we test to what extent current classification models generalize to new data. We evaluate a broad range of models and find accuracy… ▽ More

    Submitted 12 June, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

  15. arXiv:1806.00451  [pdf, other

    cs.LG stat.ML

    Do CIFAR-10 Classifiers Generalize to CIFAR-10?

    Authors: Benjamin Recht, Rebecca Roelofs, Ludwig Schmidt, Vaishaal Shankar

    Abstract: Machine learning is currently dominated by largely experimental work focused on improvements in a few key tasks. However, the impressive accuracy numbers of the best performing models are questionable because the same test sets have been used to select these models for multiple years now. To understand the danger of overfitting, we measure the accuracy of CIFAR-10 classifiers by creating a new tes… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  16. arXiv:1705.08292  [pdf, other

    stat.ML cs.LG

    The Marginal Value of Adaptive Gradient Methods in Machine Learning

    Authors: Ashia C. Wilson, Rebecca Roelofs, Mitchell Stern, Nathan Srebro, Benjamin Recht

    Abstract: Adaptive optimization methods, which perform local optimization with a metric constructed from the history of iterates, are becoming increasingly popular for training deep neural networks. Examples include AdaGrad, RMSProp, and Adam. We show that for simple overparameterized problems, adaptive methods often find drastically different solutions than gradient descent (GD) or stochastic gradient desc… ▽ More

    Submitted 21 May, 2018; v1 submitted 23 May, 2017; originally announced May 2017.

  17. arXiv:1602.05310  [pdf, other

    cs.LG math.OC stat.ML

    Large Scale Kernel Learning using Block Coordinate Descent

    Authors: Stephen Tu, Rebecca Roelofs, Shivaram Venkataraman, Benjamin Recht

    Abstract: We demonstrate that distributed block coordinate descent can quickly solve kernel regression and classification problems with millions of data points. Armed with this capability, we conduct a thorough comparison between the full kernel, the Nyström method, and random features on three large classification tasks from various domains. Our results suggest that the Nyström method generally achieves be… ▽ More

    Submitted 17 February, 2016; originally announced February 2016.