Skip to main content

Showing 1–37 of 37 results for author: Cubuk, E D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.08197  [pdf, other

    cs.CV

    Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

    Authors: Zichao Li, Cihang Xie, Ekin Dogus Cubuk

    Abstract: This paper investigates the performance of the Contrastive Language-Image Pre-training (CLIP) when scaled down to limited computation budgets. We explore CLIP along three dimensions: data, architecture, and training strategies. With regards to data, we demonstrate the significance of high-quality training data and show that a smaller dataset of high-quality data can outperform a larger dataset wit… ▽ More

    Submitted 15 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2312.08472  [pdf, other

    cs.NE cs.LG math.NA

    AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

    Authors: Esteban Real, Yao Chen, Mirko Rossini, Connal de Souza, Manav Garg, Akhil Verghese, Moritz Firsching, Quoc V. Le, Ekin Dogus Cubuk, David H. Park

    Abstract: Computers calculate transcendental functions by approximating them through the composition of a few limited-precision instructions. For example, an exponential can be calculated with a Taylor series. These approximation methods were developed over the centuries by mathematicians, who emphasized the attainability of arbitrary precision. Computers, however, operate on few limited precision types, su… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    ACM Class: I.2.2; I.2.6; G.1.2

  3. arXiv:2311.17894  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cs.LG

    Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

    Authors: Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron Courville, Marc G. Bellemare, Sergei Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore

    Abstract: We introduce a machine learning approach to determine the transition dynamics of silicon atoms on a single layer of carbon atoms, when stimulated by the electron beam of a scanning transmission electron microscope (STEM). Our method is data-centric, leveraging data collected on a STEM. The data samples are processed and filtered to produce symbolic representations, which we use to train a neural n… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  4. arXiv:2311.09235  [pdf, other

    cs.LG cs.AI

    Scalable Diffusion for Materials Generation

    Authors: Sherry Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk

    Abstract: Generative models trained on internet-scale data are capable of generating novel and realistic texts, images, and videos. A natural next question is whether these models can advance science, for example by generating novel stable materials. Traditionally, models with explicit structures (e.g., graphs) have been used in modeling structural relationships in scientific data (e.g., atoms and bonds in… ▽ More

    Submitted 3 June, 2024; v1 submitted 18 October, 2023; originally announced November 2023.

    Comments: https://unified-materials.github.io/

  5. arXiv:2310.01117  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn cs.LG physics.comp-ph

    Predicting emergence of crystals from amorphous matter with deep learning

    Authors: Muratahan Aykol, Amil Merchant, Simon Batzner, Jennifer N. Wei, Ekin Dogus Cubuk

    Abstract: Crystallization of the amorphous phases into metastable crystals plays a fundamental role in the formation of new matter, from geological to biological processes in nature to synthesis and development of new materials in the laboratory. Predicting the outcome of such phase transitions reliably would enable new research directions in these areas, but has remained beyond reach with molecular modelin… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 5 main figures, 4 supplementary figures

  6. arXiv:2305.13520  [pdf, other

    cs.CV cs.AI cs.LG

    Tied-Augment: Controlling Representation Similarity Improves Data Augmentation

    Authors: Emirhan Kurtulus, Zichao Li, Yann Dauphin, Ekin Dogus Cubuk

    Abstract: Data augmentation methods have played an important role in the recent advance of deep learning models, and have become an indispensable component of state-of-the-art models in semi-supervised, self-supervised, and supervised training for vision. Despite incurring no additional latency at test time, data augmentation often requires more epochs of training to be effective. For example, even the simp… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 14 pages, 2 figures, ICML 2023

  7. arXiv:2305.06925  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph physics.comp-ph

    Accurate Surface and Finite Temperature Bulk Properties of Lithium Metal at Large Scales using Machine Learning Interaction Potentials

    Authors: Mgcini Keith Phuthi, Archie Mingze Yao, Simon Batzner, Albert Musaelian, Boris Kozinsky, Ekin Dogus Cubuk, Venkatasubramanian Viswanathan

    Abstract: The properties of lithium metal are key parameters in the design of lithium ion and lithium metal batteries. They are difficult to probe experimentally due to the high reactivity and low melting point of lithium as well as the microscopic scales at which lithium exists in batteries where it is found to have enhanced strength, with implications for dendrite suppression strategies. Computationally,… ▽ More

    Submitted 22 May, 2023; v1 submitted 24 April, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, 3 pages of Supporting Information

  8. arXiv:2210.13488  [pdf, other

    cs.CV

    LidarAugment: Searching for Scalable 3D LiDAR Data Augmentations

    Authors: Zhaoqi Leng, Guowang Li, Chenxi Liu, Ekin Dogus Cubuk, Pei Sun, Tong He, Dragomir Anguelov, Mingxing Tan

    Abstract: Data augmentations are important in training high-performance 3D object detectors for point clouds. Despite recent efforts on designing new data augmentations, perhaps surprisingly, most state-of-the-art 3D detectors only use a few simple data augmentations. In particular, different from 2D image data augmentations, 3D data augmentations need to account for different representations of input data… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  9. arXiv:2210.10879  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR

    Authors: Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park

    Abstract: Data augmentation is a ubiquitous technique used to provide robustness to automatic speech recognition (ASR) training. However, even as so much of the ASR training process has become automated and more "end-to-end", the data augmentation policy (what augmentation functions to use, and how to apply them) remains hand-crafted. We present Graph-Augment, a technique to define the augmentation space as… ▽ More

    Submitted 24 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 6 pages, accepted at SLT 2022. Updated with copyright

  10. arXiv:2210.05546  [pdf, other

    cs.LG cs.CV

    What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries

    Authors: Stanislav Fort, Ekin Dogus Cubuk, Surya Ganguli, Samuel S. Schoenholz

    Abstract: Deep neural network classifiers partition input space into high confidence regions for each class. The geometry of these class manifolds (CMs) is widely studied and intimately related to model performance; for example, the margin depends on CM boundaries. We exploit the notions of Gaussian width and Gordon's escape theorem to tractably estimate the effective dimension of CMs and their boundaries t… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: An extended version of /Slice, Dice, and Optimize: Measuring the Dimension of Neural Network Class Manifolds/

  11. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, AdriĆ  Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2204.12511  [pdf, other

    cs.CV

    PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions

    Authors: Zhaoqi Leng, Mingxing Tan, Chenxi Liu, Ekin Dogus Cubuk, Xiaojie Shi, Shuyang Cheng, Dragomir Anguelov

    Abstract: Cross-entropy loss and focal loss are the most common choices when training deep neural networks for classification problems. Generally speaking, however, a good loss function can take on much more flexible forms, and should be tailored for different tasks and datasets. Motivated by how functions can be approximated via Taylor expansion, we propose a simple framework, named PolyLoss, to view and d… ▽ More

    Submitted 10 May, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Add ablation studies on COCO detection using RetinaNet (Section 8)

    Journal ref: International Conference on Learning Representations. 2021

  13. arXiv:2203.04946  [pdf, other

    cs.CV

    Do better ImageNet classifiers assess perceptual similarity better?

    Authors: Manoj Kumar, Neil Houlsby, Nal Kalchbrenner, Ekin D. Cubuk

    Abstract: Perceptual distances between images, as measured in the space of pre-trained deep features, have outperformed prior low-level, pixel-based metrics on assessing perceptual similarity. While the capabilities of older and less accurate models such as AlexNet and VGG to capture perceptual similarity are well known, modern and more accurate models are less studied. In this paper, we present a large-sca… ▽ More

    Submitted 29 October, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: TMLR 2022 (https://openreview.net/forum?id=qrGKGZZvH0)

  14. arXiv:2110.12899  [pdf, other

    cs.LG

    No One Representation to Rule Them All: Overlap** Features of Training Methods

    Authors: Raphael Gontijo-Lopes, Yann Dauphin, Ekin D. Cubuk

    Abstract: Despite being able to capture a range of features of the data, high accuracy models trained with supervision tend to make similar predictions. This seemingly implies that high-performing models share similar biases regardless of training methodology, which would limit ensembling benefits and render low-accuracy models as having little practical use. Against this backdrop, recent work has developed… ▽ More

    Submitted 25 April, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Journal ref: International Conference on Learning Representations (ICLR) 2022

  15. arXiv:2108.11353  [pdf, other

    cs.CV

    Multi-Task Self-Training for Learning General Representations

    Authors: Golnaz Ghiasi, Barret Zoph, Ekin D. Cubuk, Quoc V. Le, Tsung-Yi Lin

    Abstract: Despite the fast progress in training specialized models for various tasks, learning a single general model that works well for many tasks is still challenging for computer vision. Here we introduce multi-task self-training (MuST), which harnesses the knowledge in independent specialized teacher models (e.g., ImageNet model on classification) to train a single general student model. Our approach h… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  16. arXiv:2107.09661  [pdf, other

    cs.LG

    Learn2Hop: Learned Optimization on Rough Landscapes

    Authors: Amil Merchant, Luke Metz, Sam Schoenholz, Ekin Dogus Cubuk

    Abstract: Optimization of non-convex loss surfaces containing many local minima remains a critical problem in a variety of domains, including operations research, informatics, and material design. Yet, current techniques either require extremely high iteration counts or a large number of random restarts for good performance. In this work, we propose adapting recent developments in meta-learning to these man… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: ICML 2021, 16 pages, 8 figures, 6 tables

  17. arXiv:2103.07579  [pdf, other

    cs.CV

    Revisiting ResNets: Improved Training and Scaling Strategies

    Authors: Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph

    Abstract: Novel computer vision architectures monopolize the spotlight, but the impact of the model architecture is often conflated with simultaneous changes to training methodology and scaling strategies. Our work revisits the canonical ResNet (He et al., 2015) and studies these three aspects in an effort to disentangle them. Perhaps surprisingly, we find that training and scaling strategies may matter mor… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  18. arXiv:2012.07177  [pdf, other

    cs.CV

    Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

    Authors: Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph

    Abstract: Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation ([13, 12]) for instance segmentation where we randomly paste objects onto an image. Prior studies… ▽ More

    Submitted 23 June, 2021; v1 submitted 13 December, 2020; originally announced December 2020.

    Comments: Accepted at CVPR 2021 (Oral)

  19. arXiv:2012.02920  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Dataset of Random Relaxations for Crystal Structure Search of Li-Si System

    Authors: Gowoon Cheon, Lusann Yang, Kevin McCloskey, Evan J. Reed, Ekin D. Cubuk

    Abstract: Crystal structure search is a long-standing challenge in materials design. We present a dataset of more than 100,000 structural relaxations of potential battery anode materials from randomized structures using density functional theory calculations. We illustrate the usage of the dataset by training graph neural networks to predict structural relaxations from randomly generated structures. Our mod… ▽ More

    Submitted 8 March, 2023; v1 submitted 4 December, 2020; originally announced December 2020.

  20. arXiv:2010.07810  [pdf, other

    cs.CV cs.LG

    Does Data Augmentation Benefit from Split BatchNorms

    Authors: Amil Merchant, Barret Zoph, Ekin Dogus Cubuk

    Abstract: Data augmentation has emerged as a powerful technique for improving the performance of deep neural networks and led to state-of-the-art results in computer vision. However, state-of-the-art data augmentation strongly distorts training images, leading to a disparity between examples seen during training and inference. In this work, we explore a recently proposed training paradigm in order to correc… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 9 pages (+ 3 for references)

  21. arXiv:2009.08551  [pdf, other

    physics.comp-ph cs.LG

    Kohn-Sham equations as regularizer: building prior knowledge into machine-learned physics

    Authors: Li Li, Stephan Hoyer, Ryan Pederson, Ruoxi Sun, Ekin D. Cubuk, Patrick Riley, Kieron Burke

    Abstract: Including prior knowledge is important for effective machine learning models in physics, and is usually achieved by explicitly adding loss terms or constraints on model architectures. Prior knowledge embedded in the physics computation itself rarely draws attention. We show that solving the Kohn-Sham equations when training neural networks for the exchange-correlation functional provides an implic… ▽ More

    Submitted 17 November, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Journal ref: Phys. Rev. Lett. 126, 036401 (2021)

  22. arXiv:2006.06882  [pdf, other

    cs.CV cs.LG stat.ML

    Rethinking Pre-training and Self-training

    Authors: Barret Zoph, Golnaz Ghiasi, Tsung-Yi Lin, Yin Cui, Hanxiao Liu, Ekin D. Cubuk, Quoc V. Le

    Abstract: Pre-training is a dominant paradigm in computer vision. For example, supervised ImageNet pre-training is commonly used to initialize the backbones of object detection and segmentation models. He et al., however, show a surprising result that ImageNet pre-training has limited impact on COCO object detection. Here we investigate self-training as another method to utilize additional data on the same… ▽ More

    Submitted 15 November, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at the Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020)

  23. arXiv:2005.10266  [pdf, other

    cs.CV

    Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

    Authors: Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng, Maxwell D. Collins, Ekin D. Cubuk, Barret Zoph, Hartwig Adam, Jonathon Shlens

    Abstract: Supervised learning in large discriminative models is a mainstay for modern computer vision. Such an approach necessitates investing in large-scale human-annotated datasets for achieving state-of-the-art results. In turn, the efficacy of supervised learning may be limited by the size of the human annotated dataset. This limitation is particularly notable for image segmentation tasks, where the exp… ▽ More

    Submitted 19 July, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted to ECCV 2020

  24. arXiv:2004.00831  [pdf, other

    cs.CV

    Improving 3D Object Detection through Progressive Population Based Augmentation

    Authors: Shuyang Cheng, Zhaoqi Leng, Ekin Dogus Cubuk, Barret Zoph, Chunyan Bai, Jiquan Ngiam, Yang Song, Benjamin Caine, Vijay Vasudevan, Congcong Li, Quoc V. Le, Jonathon Shlens, Dragomir Anguelov

    Abstract: Data augmentation has been widely adopted for object detection in 3D point clouds. However, all previous related efforts have focused on manually designing specific data augmentation methods for individual architectures. In this work, we present the first attempt to automate the design of data augmentation policies for 3D object detection. We introduce the Progressive Population Based Augmentation… ▽ More

    Submitted 16 July, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted at ECCV 2020

  25. arXiv:2002.08973  [pdf, other

    cs.LG cs.CV stat.ML

    Affinity and Diversity: Quantifying Mechanisms of Data Augmentation

    Authors: Raphael Gontijo-Lopes, Sylvia J. Smullin, Ekin D. Cubuk, Ethan Dyer

    Abstract: Though data augmentation has become a standard component of deep neural network training, the underlying mechanism behind the effectiveness of these techniques remains poorly understood. In practice, augmentation policies are often chosen using heuristics of either distribution shift or augmentation diversity. Inspired by these, we seek to quantify how data augmentation improves model generalizati… ▽ More

    Submitted 4 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: 10 pages, 7 figures

  26. arXiv:2001.07685  [pdf

    cs.LG cs.CV stat.ML

    FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

    Authors: Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel

    Abstract: Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance. In this paper, we demonstrate the power of a simple combination of two common SSL methods: consistency regularization and pseudo-labeling. Our algorithm, FixMatch, first generates pseudo-labels using the model's predictions on weakly-augmented unlabeled images. For a given imag… ▽ More

    Submitted 25 November, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: Published at NeurIPS 2020 as a conference paper

  27. arXiv:1912.02781  [pdf, other

    stat.ML cs.CV cs.LG

    AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

    Authors: Dan Hendrycks, Norman Mu, Ekin D. Cubuk, Barret Zoph, Justin Gilmer, Balaji Lakshminarayanan

    Abstract: Modern deep neural networks can achieve high accuracy when the training distribution and test distribution are identically distributed, but this assumption is frequently violated in practice. When the train and test distributions are mismatched, accuracy can plummet. Currently there are few techniques that improve robustness to unforeseen data shifts encountered during deployment. In this work, we… ▽ More

    Submitted 17 February, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Code available at https://github.com/google-research/augmix

  28. arXiv:1911.09785  [pdf, other

    cs.LG cs.CV stat.ML

    ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring

    Authors: David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel

    Abstract: We improve the recently-proposed "MixMatch" semi-supervised learning algorithm by introducing two new techniques: distribution alignment and augmentation anchoring. Distribution alignment encourages the marginal distribution of predictions on unlabeled data to be close to the marginal distribution of ground-truth labels. Augmentation anchoring feeds multiple strongly augmented versions of an input… ▽ More

    Submitted 13 February, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

  29. arXiv:1909.13719  [pdf, other

    cs.CV

    RandAugment: Practical automated data augmentation with a reduced search space

    Authors: Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, Quoc V. Le

    Abstract: Recent work has shown that data augmentation has the potential to significantly improve the generalization of deep learning models. Recently, automated augmentation strategies have led to state-of-the-art results in image classification and object detection. While these strategies were optimized for improving validation accuracy, they also led to state-of-the-art results in semi-supervised learnin… ▽ More

    Submitted 13 November, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: Added ablation experiments

  30. arXiv:1906.11172  [pdf, other

    cs.CV cs.LG

    Learning Data Augmentation Strategies for Object Detection

    Authors: Barret Zoph, Ekin D. Cubuk, Golnaz Ghiasi, Tsung-Yi Lin, Jonathon Shlens, Quoc V. Le

    Abstract: Data augmentation is a critical component of training deep learning models. Although data augmentation has been shown to significantly improve image classification, its potential has not been thoroughly investigated for object detection. Given the additional cost for annotating images for object detection, data augmentation may be of even greater importance for this computer vision task. In this w… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

  31. arXiv:1906.08988  [pdf, other

    cs.LG cs.CV stat.ML

    A Fourier Perspective on Model Robustness in Computer Vision

    Authors: Dong Yin, Raphael Gontijo Lopes, Jonathon Shlens, Ekin D. Cubuk, Justin Gilmer

    Abstract: Achieving robustness to distributional shift is a longstanding and challenging goal of computer vision. Data augmentation is a commonly used approach for improving robustness, however robustness gains are typically not uniform across corruption types. Indeed increasing performance in the presence of random noise is often met with reduced performance on other corruptions such as contrast change. Un… ▽ More

    Submitted 16 September, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  32. arXiv:1906.03367  [pdf, other

    cs.LG stat.ML

    Using learned optimizers to make models robust to input noise

    Authors: Luke Metz, Niru Maheswaranathan, Jonathon Shlens, Jascha Sohl-Dickstein, Ekin D. Cubuk

    Abstract: State-of-the art vision models can achieve superhuman performance on image classification tasks when testing and training data come from the same distribution. However, when models are tested on corrupted images (e.g. due to scale changes, translations, or shifts in brightness or contrast), performance degrades significantly. Here, we explore the possibility of meta-training a learned optimizer th… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  33. arXiv:1906.02611  [pdf, other

    cs.LG cs.CV stat.ML

    Improving Robustness Without Sacrificing Accuracy with Patch Gaussian Augmentation

    Authors: Raphael Gontijo Lopes, Dong Yin, Ben Poole, Justin Gilmer, Ekin D. Cubuk

    Abstract: Deploying machine learning systems in the real world requires both high accuracy on clean data and robustness to naturally occurring corruptions. While architectural advances have led to improved accuracy, building robust models remains challenging. Prior work has argued that there is an inherent trade-off between robustness and accuracy, which is exemplified by standard data augment techniques su… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  34. arXiv:1904.08779  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

    Authors: Daniel S. Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. Le

    Abstract: We present SpecAugment, a simple data augmentation method for speech recognition. SpecAugment is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients). The augmentation policy consists of war** the features, masking blocks of frequency channels, and masking blocks of time steps. We apply SpecAugment on Listen, Attend and Spell networks for end-to-end speech… ▽ More

    Submitted 3 December, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: 5 pages, 3 figures, 6 tables; v3: references added

    Journal ref: Proc. Interspeech 2019, 2613-2617

  35. arXiv:1805.09501  [pdf, other

    cs.CV cs.LG stat.ML

    AutoAugment: Learning Augmentation Policies from Data

    Authors: Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, Quoc V. Le

    Abstract: Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment to automatically search for improved data augmentation policies. In our implementation, we have designed a search space where a policy consists of many sub-polic… ▽ More

    Submitted 11 April, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: CVPR 2019

  36. arXiv:1804.09170  [pdf, other

    cs.LG stat.ML

    Realistic Evaluation of Deep Semi-Supervised Learning Algorithms

    Authors: Avital Oliver, Augustus Odena, Colin Raffel, Ekin D. Cubuk, Ian J. Goodfellow

    Abstract: Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark tasks. However, we argue that these benchmarks fail to address many issues that these algorithms would face in real-world applications. After creating a unified r… ▽ More

    Submitted 17 June, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

    Journal ref: NeurIPS 2018 Proceedings

  37. arXiv:1711.02846  [pdf, other

    stat.ML cs.LG

    Intriguing Properties of Adversarial Examples

    Authors: Ekin D. Cubuk, Barret Zoph, Samuel S. Schoenholz, Quoc V. Le

    Abstract: It is becoming increasingly clear that many machine learning classifiers are vulnerable to adversarial examples. In attempting to explain the origin of adversarial examples, previous studies have typically focused on the fact that neural networks operate on high dimensional data, they overfit, or they are too linear. Here we argue that the origin of adversarial examples is primarily due to an inhe… ▽ More

    Submitted 8 November, 2017; originally announced November 2017.

    Comments: 17 pages