Skip to main content

Showing 1–50 of 51 results for author: Cubuk, E D

.
  1. arXiv:2404.08197  [pdf, other

    cs.CV

    Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

    Authors: Zichao Li, Cihang Xie, Ekin Dogus Cubuk

    Abstract: This paper investigates the performance of the Contrastive Language-Image Pre-training (CLIP) when scaled down to limited computation budgets. We explore CLIP along three dimensions: data, architecture, and training strategies. With regards to data, we demonstrate the significance of high-quality training data and show that a smaller dataset of high-quality data can outperform a larger dataset wit… ▽ More

    Submitted 15 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2312.08472  [pdf, other

    cs.NE cs.LG math.NA

    AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

    Authors: Esteban Real, Yao Chen, Mirko Rossini, Connal de Souza, Manav Garg, Akhil Verghese, Moritz Firsching, Quoc V. Le, Ekin Dogus Cubuk, David H. Park

    Abstract: Computers calculate transcendental functions by approximating them through the composition of a few limited-precision instructions. For example, an exponential can be calculated with a Taylor series. These approximation methods were developed over the centuries by mathematicians, who emphasized the attainability of arbitrary precision. Computers, however, operate on few limited precision types, su… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    ACM Class: I.2.2; I.2.6; G.1.2

  3. arXiv:2311.17894  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci cs.LG

    Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

    Authors: Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron Courville, Marc G. Bellemare, Sergei Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore

    Abstract: We introduce a machine learning approach to determine the transition dynamics of silicon atoms on a single layer of carbon atoms, when stimulated by the electron beam of a scanning transmission electron microscope (STEM). Our method is data-centric, leveraging data collected on a STEM. The data samples are processed and filtered to produce symbolic representations, which we use to train a neural n… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  4. arXiv:2311.13778  [pdf

    cond-mat.mtrl-sci

    Accurate Prediction of Experimental Band Gaps from Large Language Model-Based Data Extraction

    Authors: Samuel J. Yang, Shutong Li, Subhashini Venugopalan, Vahe Tshitoyan, Muratahan Aykol, Amil Merchant, Ekin Dogus Cubuk, Gowoon Cheon

    Abstract: Machine learning is transforming materials discovery by providing rapid predictions of material properties, which enables large-scale screening for target materials. However, such models require training data. While automated data extraction from scientific literature has potential, current auto-generated datasets often lack sufficient accuracy and critical structural and processing details of mat… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  5. arXiv:2311.09235  [pdf, other

    cs.LG cs.AI

    Scalable Diffusion for Materials Generation

    Authors: Sherry Yang, KwangHwan Cho, Amil Merchant, Pieter Abbeel, Dale Schuurmans, Igor Mordatch, Ekin Dogus Cubuk

    Abstract: Generative models trained on internet-scale data are capable of generating novel and realistic texts, images, and videos. A natural next question is whether these models can advance science, for example by generating novel stable materials. Traditionally, models with explicit structures (e.g., graphs) have been used in modeling structural relationships in scientific data (e.g., atoms and bonds in… ▽ More

    Submitted 3 June, 2024; v1 submitted 18 October, 2023; originally announced November 2023.

    Comments: https://unified-materials.github.io/

  6. arXiv:2310.01117  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn cs.LG physics.comp-ph

    Predicting emergence of crystals from amorphous matter with deep learning

    Authors: Muratahan Aykol, Amil Merchant, Simon Batzner, Jennifer N. Wei, Ekin Dogus Cubuk

    Abstract: Crystallization of the amorphous phases into metastable crystals plays a fundamental role in the formation of new matter, from geological to biological processes in nature to synthesis and development of new materials in the laboratory. Predicting the outcome of such phase transitions reliably would enable new research directions in these areas, but has remained beyond reach with molecular modelin… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 5 main figures, 4 supplementary figures

  7. arXiv:2305.13520  [pdf, other

    cs.CV cs.AI cs.LG

    Tied-Augment: Controlling Representation Similarity Improves Data Augmentation

    Authors: Emirhan Kurtulus, Zichao Li, Yann Dauphin, Ekin Dogus Cubuk

    Abstract: Data augmentation methods have played an important role in the recent advance of deep learning models, and have become an indispensable component of state-of-the-art models in semi-supervised, self-supervised, and supervised training for vision. Despite incurring no additional latency at test time, data augmentation often requires more epochs of training to be effective. For example, even the simp… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 14 pages, 2 figures, ICML 2023

  8. arXiv:2305.06925  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.chem-ph physics.comp-ph

    Accurate Surface and Finite Temperature Bulk Properties of Lithium Metal at Large Scales using Machine Learning Interaction Potentials

    Authors: Mgcini Keith Phuthi, Archie Mingze Yao, Simon Batzner, Albert Musaelian, Boris Kozinsky, Ekin Dogus Cubuk, Venkatasubramanian Viswanathan

    Abstract: The properties of lithium metal are key parameters in the design of lithium ion and lithium metal batteries. They are difficult to probe experimentally due to the high reactivity and low melting point of lithium as well as the microscopic scales at which lithium exists in batteries where it is found to have enhanced strength, with implications for dendrite suppression strategies. Computationally,… ▽ More

    Submitted 22 May, 2023; v1 submitted 24 April, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, 3 pages of Supporting Information

  9. arXiv:2210.13488  [pdf, other

    cs.CV

    LidarAugment: Searching for Scalable 3D LiDAR Data Augmentations

    Authors: Zhaoqi Leng, Guowang Li, Chenxi Liu, Ekin Dogus Cubuk, Pei Sun, Tong He, Dragomir Anguelov, Mingxing Tan

    Abstract: Data augmentations are important in training high-performance 3D object detectors for point clouds. Despite recent efforts on designing new data augmentations, perhaps surprisingly, most state-of-the-art 3D detectors only use a few simple data augmentations. In particular, different from 2D image data augmentations, 3D data augmentations need to account for different representations of input data… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  10. arXiv:2210.10879  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR

    Authors: Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park

    Abstract: Data augmentation is a ubiquitous technique used to provide robustness to automatic speech recognition (ASR) training. However, even as so much of the ASR training process has become automated and more "end-to-end", the data augmentation policy (what augmentation functions to use, and how to apply them) remains hand-crafted. We present Graph-Augment, a technique to define the augmentation space as… ▽ More

    Submitted 24 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 6 pages, accepted at SLT 2022. Updated with copyright

  11. arXiv:2210.05546  [pdf, other

    cs.LG cs.CV

    What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries

    Authors: Stanislav Fort, Ekin Dogus Cubuk, Surya Ganguli, Samuel S. Schoenholz

    Abstract: Deep neural network classifiers partition input space into high confidence regions for each class. The geometry of these class manifolds (CMs) is widely studied and intimately related to model performance; for example, the margin depends on CM boundaries. We exploit the notions of Gaussian width and Gordon's escape theorem to tractably estimate the effective dimension of CMs and their boundaries t… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: An extended version of /Slice, Dice, and Optimize: Measuring the Dimension of Neural Network Class Manifolds/

  12. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  13. arXiv:2204.12511  [pdf, other

    cs.CV

    PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions

    Authors: Zhaoqi Leng, Mingxing Tan, Chenxi Liu, Ekin Dogus Cubuk, Xiaojie Shi, Shuyang Cheng, Dragomir Anguelov

    Abstract: Cross-entropy loss and focal loss are the most common choices when training deep neural networks for classification problems. Generally speaking, however, a good loss function can take on much more flexible forms, and should be tailored for different tasks and datasets. Motivated by how functions can be approximated via Taylor expansion, we propose a simple framework, named PolyLoss, to view and d… ▽ More

    Submitted 10 May, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: Add ablation studies on COCO detection using RetinaNet (Section 8)

    Journal ref: International Conference on Learning Representations. 2021

  14. arXiv:2203.04946  [pdf, other

    cs.CV

    Do better ImageNet classifiers assess perceptual similarity better?

    Authors: Manoj Kumar, Neil Houlsby, Nal Kalchbrenner, Ekin D. Cubuk

    Abstract: Perceptual distances between images, as measured in the space of pre-trained deep features, have outperformed prior low-level, pixel-based metrics on assessing perceptual similarity. While the capabilities of older and less accurate models such as AlexNet and VGG to capture perceptual similarity are well known, modern and more accurate models are less studied. In this paper, we present a large-sca… ▽ More

    Submitted 29 October, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: TMLR 2022 (https://openreview.net/forum?id=qrGKGZZvH0)

  15. arXiv:2110.12899  [pdf, other

    cs.LG

    No One Representation to Rule Them All: Overlap** Features of Training Methods

    Authors: Raphael Gontijo-Lopes, Yann Dauphin, Ekin D. Cubuk

    Abstract: Despite being able to capture a range of features of the data, high accuracy models trained with supervision tend to make similar predictions. This seemingly implies that high-performing models share similar biases regardless of training methodology, which would limit ensembling benefits and render low-accuracy models as having little practical use. Against this backdrop, recent work has developed… ▽ More

    Submitted 25 April, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Journal ref: International Conference on Learning Representations (ICLR) 2022

  16. arXiv:2108.11353  [pdf, other

    cs.CV

    Multi-Task Self-Training for Learning General Representations

    Authors: Golnaz Ghiasi, Barret Zoph, Ekin D. Cubuk, Quoc V. Le, Tsung-Yi Lin

    Abstract: Despite the fast progress in training specialized models for various tasks, learning a single general model that works well for many tasks is still challenging for computer vision. Here we introduce multi-task self-training (MuST), which harnesses the knowledge in independent specialized teacher models (e.g., ImageNet model on classification) to train a single general student model. Our approach h… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  17. arXiv:2107.09661  [pdf, other

    cs.LG

    Learn2Hop: Learned Optimization on Rough Landscapes

    Authors: Amil Merchant, Luke Metz, Sam Schoenholz, Ekin Dogus Cubuk

    Abstract: Optimization of non-convex loss surfaces containing many local minima remains a critical problem in a variety of domains, including operations research, informatics, and material design. Yet, current techniques either require extremely high iteration counts or a large number of random restarts for good performance. In this work, we propose adapting recent developments in meta-learning to these man… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: ICML 2021, 16 pages, 8 figures, 6 tables

  18. dPV: An End-to-End Differentiable Solar-Cell Simulator

    Authors: Sean Mann, Eric Fadel, Samuel S. Schoenholz, Ekin D. Cubuk, Steven G. Johnson, Giuseppe Romano

    Abstract: We introduce dPV, an end-to-end differentiable photovoltaic (PV) cell simulator based on the drift-diffusion model and Beer-Lambert law for optical absorption. dPV is programmed in Python using JAX, an automatic differentiation (AD) library for scientific computing. Using AD coupled with the implicit function theorem, dPV computes the power conversion efficiency (PCE) of an input PV design as well… ▽ More

    Submitted 9 December, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

  19. arXiv:2103.07579  [pdf, other

    cs.CV

    Revisiting ResNets: Improved Training and Scaling Strategies

    Authors: Irwan Bello, William Fedus, Xianzhi Du, Ekin D. Cubuk, Aravind Srinivas, Tsung-Yi Lin, Jonathon Shlens, Barret Zoph

    Abstract: Novel computer vision architectures monopolize the spotlight, but the impact of the model architecture is often conflated with simultaneous changes to training methodology and scaling strategies. Our work revisits the canonical ResNet (He et al., 2015) and studies these three aspects in an effort to disentangle them. Perhaps surprisingly, we find that training and scaling strategies may matter mor… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  20. arXiv:2012.07177  [pdf, other

    cs.CV

    Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

    Authors: Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph

    Abstract: Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation ([13, 12]) for instance segmentation where we randomly paste objects onto an image. Prior studies… ▽ More

    Submitted 23 June, 2021; v1 submitted 13 December, 2020; originally announced December 2020.

    Comments: Accepted at CVPR 2021 (Oral)

  21. arXiv:2012.02920  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Dataset of Random Relaxations for Crystal Structure Search of Li-Si System

    Authors: Gowoon Cheon, Lusann Yang, Kevin McCloskey, Evan J. Reed, Ekin D. Cubuk

    Abstract: Crystal structure search is a long-standing challenge in materials design. We present a dataset of more than 100,000 structural relaxations of potential battery anode materials from randomized structures using density functional theory calculations. We illustrate the usage of the dataset by training graph neural networks to predict structural relaxations from randomly generated structures. Our mod… ▽ More

    Submitted 8 March, 2023; v1 submitted 4 December, 2020; originally announced December 2020.

  22. arXiv:2010.15175  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci

    Self-assembling kinetics: Accessing a new design space via differentiable statistical-physics models

    Authors: Carl P. Goodrich, Ella M. King, Samuel S. Schoenholz, Ekin D. Cubuk, Michael Brenner

    Abstract: The inverse problem of designing component interactions to target emergent structure is fundamental to numerous applications in biotechnology, materials science, and statistical physics. Equally important is the inverse problem of designing emergent kinetics, but this has received considerably less attention. Using recent advances in automatic differentiation, we show how kinetic pathways can be p… ▽ More

    Submitted 18 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: 5 figures

  23. arXiv:2010.07810  [pdf, other

    cs.CV cs.LG

    Does Data Augmentation Benefit from Split BatchNorms

    Authors: Amil Merchant, Barret Zoph, Ekin Dogus Cubuk

    Abstract: Data augmentation has emerged as a powerful technique for improving the performance of deep neural networks and led to state-of-the-art results in computer vision. However, state-of-the-art data augmentation strongly distorts training images, leading to a disparity between examples seen during training and inference. In this work, we explore a recently proposed training paradigm in order to correc… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 9 pages (+ 3 for references)

  24. arXiv:2009.08551  [pdf, other

    physics.comp-ph cs.LG

    Kohn-Sham equations as regularizer: building prior knowledge into machine-learned physics

    Authors: Li Li, Stephan Hoyer, Ryan Pederson, Ruoxi Sun, Ekin D. Cubuk, Patrick Riley, Kieron Burke

    Abstract: Including prior knowledge is important for effective machine learning models in physics, and is usually achieved by explicitly adding loss terms or constraints on model architectures. Prior knowledge embedded in the physics computation itself rarely draws attention. We show that solving the Kohn-Sham equations when training neural networks for the exchange-correlation functional provides an implic… ▽ More

    Submitted 17 November, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Journal ref: Phys. Rev. Lett. 126, 036401 (2021)

  25. arXiv:2008.09681  [pdf, other

    cond-mat.soft cond-mat.dis-nn cond-mat.mtrl-sci

    Unifying framework for strong and fragile liquids via machine learning: a study of liquid silica

    Authors: Ekin D. Cubuk, Andrea J. Liu, Efthimios Kaxiras, Samuel S. Schoenholz

    Abstract: The fragility of a glassforming liquid characterizes how rapidly its relaxation dynamics slow down with cooling. The viscosity of strong liquids follows an Arrhenius law with a temperature-independent barrier height to rearrangements responsible for relaxation, whereas fragile liquids experience a much faster increase in their dynamics, suggesting a barrier height that increases with decreasing te… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: 6 pages, 4 figures

  26. arXiv:2008.05298  [pdf, other

    cond-mat.mtrl-sci cond-mat.soft

    Forward and Inverse Design of Kirigami via Supervised Autoencoder

    Authors: Paul Z. Hanakata, Ekin D. Cubuk, David K. Campbell, Harold S. Park

    Abstract: Machine learning (ML) methods have recently been used as forward solvers to predict the mechanical properties of composite materials. Here, we use a supervised-autoencoder (sAE) to perform inverse design of graphene kirigami, where predicting the ultimate stress or strain under tensile loading is known to be difficult due to nonlinear effects arising from the out-of-plane buckling. Unlike the stan… ▽ More

    Submitted 31 July, 2020; originally announced August 2020.

    Journal ref: Phys. Rev. Research 2, 042006 (2020)

  27. arXiv:2006.06882  [pdf, other

    cs.CV cs.LG stat.ML

    Rethinking Pre-training and Self-training

    Authors: Barret Zoph, Golnaz Ghiasi, Tsung-Yi Lin, Yin Cui, Hanxiao Liu, Ekin D. Cubuk, Quoc V. Le

    Abstract: Pre-training is a dominant paradigm in computer vision. For example, supervised ImageNet pre-training is commonly used to initialize the backbones of object detection and segmentation models. He et al., however, show a surprising result that ImageNet pre-training has limited impact on COCO object detection. Here we investigate self-training as another method to utilize additional data on the same… ▽ More

    Submitted 15 November, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at the Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020)

  28. arXiv:2005.10266  [pdf, other

    cs.CV

    Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

    Authors: Liang-Chieh Chen, Raphael Gontijo Lopes, Bowen Cheng, Maxwell D. Collins, Ekin D. Cubuk, Barret Zoph, Hartwig Adam, Jonathon Shlens

    Abstract: Supervised learning in large discriminative models is a mainstay for modern computer vision. Such an approach necessitates investing in large-scale human-annotated datasets for achieving state-of-the-art results. In turn, the efficacy of supervised learning may be limited by the size of the human annotated dataset. This limitation is particularly notable for image segmentation tasks, where the exp… ▽ More

    Submitted 19 July, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted to ECCV 2020

  29. arXiv:2004.00831  [pdf, other

    cs.CV

    Improving 3D Object Detection through Progressive Population Based Augmentation

    Authors: Shuyang Cheng, Zhaoqi Leng, Ekin Dogus Cubuk, Barret Zoph, Chunyan Bai, Jiquan Ngiam, Yang Song, Benjamin Caine, Vijay Vasudevan, Congcong Li, Quoc V. Le, Jonathon Shlens, Dragomir Anguelov

    Abstract: Data augmentation has been widely adopted for object detection in 3D point clouds. However, all previous related efforts have focused on manually designing specific data augmentation methods for individual architectures. In this work, we present the first attempt to automate the design of data augmentation policies for 3D object detection. We introduce the Progressive Population Based Augmentation… ▽ More

    Submitted 16 July, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted at ECCV 2020

  30. arXiv:2002.08973  [pdf, other

    cs.LG cs.CV stat.ML

    Affinity and Diversity: Quantifying Mechanisms of Data Augmentation

    Authors: Raphael Gontijo-Lopes, Sylvia J. Smullin, Ekin D. Cubuk, Ethan Dyer

    Abstract: Though data augmentation has become a standard component of deep neural network training, the underlying mechanism behind the effectiveness of these techniques remains poorly understood. In practice, augmentation policies are often chosen using heuristics of either distribution shift or augmentation diversity. Inspired by these, we seek to quantify how data augmentation improves model generalizati… ▽ More

    Submitted 4 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: 10 pages, 7 figures

  31. arXiv:2001.07685  [pdf

    cs.LG cs.CV stat.ML

    FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence

    Authors: Kihyuk Sohn, David Berthelot, Chun-Liang Li, Zizhao Zhang, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Han Zhang, Colin Raffel

    Abstract: Semi-supervised learning (SSL) provides an effective means of leveraging unlabeled data to improve a model's performance. In this paper, we demonstrate the power of a simple combination of two common SSL methods: consistency regularization and pseudo-labeling. Our algorithm, FixMatch, first generates pseudo-labels using the model's predictions on weakly-augmented unlabeled images. For a given imag… ▽ More

    Submitted 25 November, 2020; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: Published at NeurIPS 2020 as a conference paper

  32. arXiv:1912.04232  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cond-mat.soft stat.ML

    JAX, M.D.: A Framework for Differentiable Physics

    Authors: Samuel S. Schoenholz, Ekin D. Cubuk

    Abstract: We introduce JAX MD, a software package for performing differentiable physics simulations with a focus on molecular dynamics. JAX MD includes a number of physics simulation environments, as well as interaction potentials and neural networks that can be integrated into these environments without writing any additional code. Since the simulations themselves are differentiable functions, entire traje… ▽ More

    Submitted 3 December, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  33. arXiv:1912.02781  [pdf, other

    stat.ML cs.CV cs.LG

    AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

    Authors: Dan Hendrycks, Norman Mu, Ekin D. Cubuk, Barret Zoph, Justin Gilmer, Balaji Lakshminarayanan

    Abstract: Modern deep neural networks can achieve high accuracy when the training distribution and test distribution are identically distributed, but this assumption is frequently violated in practice. When the train and test distributions are mismatched, accuracy can plummet. Currently there are few techniques that improve robustness to unforeseen data shifts encountered during deployment. In this work, we… ▽ More

    Submitted 17 February, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Code available at https://github.com/google-research/augmix

  34. arXiv:1911.09785  [pdf, other

    cs.LG cs.CV stat.ML

    ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring

    Authors: David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel

    Abstract: We improve the recently-proposed "MixMatch" semi-supervised learning algorithm by introducing two new techniques: distribution alignment and augmentation anchoring. Distribution alignment encourages the marginal distribution of predictions on unlabeled data to be close to the marginal distribution of ground-truth labels. Augmentation anchoring feeds multiple strongly augmented versions of an input… ▽ More

    Submitted 13 February, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

  35. arXiv:1909.13719  [pdf, other

    cs.CV

    RandAugment: Practical automated data augmentation with a reduced search space

    Authors: Ekin D. Cubuk, Barret Zoph, Jonathon Shlens, Quoc V. Le

    Abstract: Recent work has shown that data augmentation has the potential to significantly improve the generalization of deep learning models. Recently, automated augmentation strategies have led to state-of-the-art results in image classification and object detection. While these strategies were optimized for improving validation accuracy, they also led to state-of-the-art results in semi-supervised learnin… ▽ More

    Submitted 13 November, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: Added ablation experiments

  36. arXiv:1906.11172  [pdf, other

    cs.CV cs.LG

    Learning Data Augmentation Strategies for Object Detection

    Authors: Barret Zoph, Ekin D. Cubuk, Golnaz Ghiasi, Tsung-Yi Lin, Jonathon Shlens, Quoc V. Le

    Abstract: Data augmentation is a critical component of training deep learning models. Although data augmentation has been shown to significantly improve image classification, its potential has not been thoroughly investigated for object detection. Given the additional cost for annotating images for object detection, data augmentation may be of even greater importance for this computer vision task. In this w… ▽ More

    Submitted 26 June, 2019; originally announced June 2019.

  37. arXiv:1906.08988  [pdf, other

    cs.LG cs.CV stat.ML

    A Fourier Perspective on Model Robustness in Computer Vision

    Authors: Dong Yin, Raphael Gontijo Lopes, Jonathon Shlens, Ekin D. Cubuk, Justin Gilmer

    Abstract: Achieving robustness to distributional shift is a longstanding and challenging goal of computer vision. Data augmentation is a commonly used approach for improving robustness, however robustness gains are typically not uniform across corruption types. Indeed increasing performance in the presence of random noise is often met with reduced performance on other corruptions such as contrast change. Un… ▽ More

    Submitted 16 September, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  38. arXiv:1906.03367  [pdf, other

    cs.LG stat.ML

    Using learned optimizers to make models robust to input noise

    Authors: Luke Metz, Niru Maheswaranathan, Jonathon Shlens, Jascha Sohl-Dickstein, Ekin D. Cubuk

    Abstract: State-of-the art vision models can achieve superhuman performance on image classification tasks when testing and training data come from the same distribution. However, when models are tested on corrupted images (e.g. due to scale changes, translations, or shifts in brightness or contrast), performance degrades significantly. Here, we explore the possibility of meta-training a learned optimizer th… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  39. arXiv:1906.02611  [pdf, other

    cs.LG cs.CV stat.ML

    Improving Robustness Without Sacrificing Accuracy with Patch Gaussian Augmentation

    Authors: Raphael Gontijo Lopes, Dong Yin, Ben Poole, Justin Gilmer, Ekin D. Cubuk

    Abstract: Deploying machine learning systems in the real world requires both high accuracy on clean data and robustness to naturally occurring corruptions. While architectural advances have led to improved accuracy, building robust models remains challenging. Prior work has argued that there is an inherent trade-off between robustness and accuracy, which is exemplified by standard data augment techniques su… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

  40. arXiv:1904.08779  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

    Authors: Daniel S. Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. Le

    Abstract: We present SpecAugment, a simple data augmentation method for speech recognition. SpecAugment is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients). The augmentation policy consists of war** the features, masking blocks of frequency channels, and masking blocks of time steps. We apply SpecAugment on Listen, Attend and Spell networks for end-to-end speech… ▽ More

    Submitted 3 December, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: 5 pages, 3 figures, 6 tables; v3: references added

    Journal ref: Proc. Interspeech 2019, 2613-2617

  41. arXiv:1808.06111  [pdf, other

    physics.comp-ph cond-mat.dis-nn

    Accelerated search and design of stretchable graphene kirigami using machine learning

    Authors: Paul Z. Hanakata, Ekin D. Cubuk, David K. Campbell, Harold S. Park

    Abstract: Making kirigami-inspired cuts into a sheet has been shown to be an effective way of designing stretchable materials with metamorphic properties where the 2D shape can transform into complex 3D shapes. However, finding the optimal solutions is not straightforward as the number of possible cutting patterns grows exponentially with system size. Here, we report on how machine learning (ML) can be used… ▽ More

    Submitted 18 August, 2018; originally announced August 2018.

    Journal ref: Phys. Rev. Lett. 121, 255304 (2018)

  42. arXiv:1808.02470  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Machine learning-assisted discovery of many new solid Li-ion conducting materials

    Authors: Austin D. Sendek, Ekin D. Cubuk, Evan R. Antoniuk, Gowoon Cheon, Yi Cui, Evan J. Reed

    Abstract: We discover many new crystalline solid materials with fast single crystal Li ion conductivity at room temperature, discovered through density functional theory simulations guided by machine learning-based methods. The discovery of new solid Li superionic conductors is of critical importance to the development of safe all-solid-state Li-ion batteries. With a predictive universal structure-property… ▽ More

    Submitted 7 August, 2018; originally announced August 2018.

    Comments: 32 page main text with 3 tables and 4 figures; 4 page Supporting Information with 1 table appended to end of document

  43. arXiv:1805.09501  [pdf, other

    cs.CV cs.LG stat.ML

    AutoAugment: Learning Augmentation Policies from Data

    Authors: Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, Quoc V. Le

    Abstract: Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment to automatically search for improved data augmentation policies. In our implementation, we have designed a search space where a policy consists of many sub-polic… ▽ More

    Submitted 11 April, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: CVPR 2019

  44. arXiv:1804.09170  [pdf, other

    cs.LG stat.ML

    Realistic Evaluation of Deep Semi-Supervised Learning Algorithms

    Authors: Avital Oliver, Augustus Odena, Colin Raffel, Ekin D. Cubuk, Ian J. Goodfellow

    Abstract: Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark tasks. However, we argue that these benchmarks fail to address many issues that these algorithms would face in real-world applications. After creating a unified r… ▽ More

    Submitted 17 June, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

    Journal ref: NeurIPS 2018 Proceedings

  45. arXiv:1803.01416  [pdf, other

    cond-mat.mtrl-sci

    Machine learning determination of atomic dynamics at grain boundaries

    Authors: Tristan A. Sharp, Spencer L. Thomas, Ekin D. Cubuk, Samuel S. Schoenholz, David J. Srolovitz, Andrea J. Liu

    Abstract: In polycrystalline materials, grain boundaries are sites of enhanced atomic motion, but the complexity of the atomic structures within a grain boundary network makes it difficult to link the structure and atomic dynamics. Here we use a machine learning technique to establish a connection between local structure and dynamics of these materials. Following previous work on bulk glassy materials, we d… ▽ More

    Submitted 11 September, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

  46. arXiv:1711.02846  [pdf, other

    stat.ML cs.LG

    Intriguing Properties of Adversarial Examples

    Authors: Ekin D. Cubuk, Barret Zoph, Samuel S. Schoenholz, Quoc V. Le

    Abstract: It is becoming increasingly clear that many machine learning classifiers are vulnerable to adversarial examples. In attempting to explain the origin of adversarial examples, previous studies have typically focused on the fact that neural networks operate on high dimensional data, they overfit, or they are too linear. Here we argue that the origin of adversarial examples is primarily due to an inhe… ▽ More

    Submitted 8 November, 2017; originally announced November 2017.

    Comments: 17 pages

  47. Disconnecting structure and dynamics in glassy thin films

    Authors: Daniel M. Sussman, Samuel S. Schoenholz, Ekin D. Cubuk, Andrea J. Liu

    Abstract: Nanometrically thin glassy films depart strikingly from the behavior of their bulk counterparts. We investigate whether the dynamical differences between bulk and thin film glasses can be understood by differences in local microscopic structure. We employ machine-learning methods that have previously identified strong correlations between local structure and particle rearrangement dynamics in bulk… ▽ More

    Submitted 11 October, 2016; originally announced October 2016.

    Comments: 8 pages, 7 figures

  48. The Relationship Between Local Structure and Relaxation in Out-of-Equilibrium Glassy Systems

    Authors: Samuel S. Schoenholz, Ekin D. Cubuk, Efthimios Kaxiras, Andrea J. Liu

    Abstract: The dynamical glass transition is typically taken to be the temperature at which a glassy liquid is no longer able to equilibrate on experimental timescales. Consequently, the physical properties of these systems just above or below the dynamical glass transition, such as viscosity, can change by many orders of magnitude over long periods of time following external perturbation. During this progre… ▽ More

    Submitted 23 July, 2016; originally announced July 2016.

  49. High-Temperature Quantum Anomalous Hall Effect in n-p Codoped Topological Insulators

    Authors: Shifei Qi, Zhenhua Qiao, Xinzhou Deng, Ekin D. Cubuk, Hua Chen, Wenguang Zhu, Efthimios Kaxiras, S. B. Zhang, Xiaohong Xu, Zhenyu Zhang

    Abstract: The quantum anomalous Hall effect (QAHE) is a fundamental quantum transport phenomenon that manifests as a quantized transverse conductance in response to a longitudinally applied electric field in the absence of an external magnetic field, and promises to have immense application potentials in future dissipation-less quantum electronics. Here we present a novel kinetic pathway to realize the QAHE… ▽ More

    Submitted 12 July, 2015; originally announced July 2015.

    Comments: 5 pages, 3 figures

    Journal ref: Phys. Rev. Lett. 117, 056804 (2016)

  50. arXiv:1506.07772  [pdf, other

    cond-mat.soft

    A structural approach to relaxation in glassy liquids

    Authors: Samuel S. Schoenholz, Ekin D. Cubuk, Daniel M. Sussman, Efthimios Kaxiras, Andrea J Liu

    Abstract: When a liquid freezes, a change in the local atomic structure marks the transition to the crystal. When a liquid is cooled to form a glass, however, no noticeable structural change marks the glass transition. Indeed, characteristic features of glassy dynamics that appear below an onset temperature, T_0, are qualitatively captured by mean field theory, which assumes uniform local structure at all t… ▽ More

    Submitted 22 November, 2015; v1 submitted 25 June, 2015; originally announced June 2015.