Showing 1–45 of 45 results for author: Schoenholz, S S

  1. arXiv:2406.01522  [pdf, other]

    cond-mat.soft physics.data-an

    Inferring interaction potentials from stochastic particle trajectories

    Authors: Ella M. King, Megan C. Engel, Caroline Martin, Alp M. Sunol, Qian-Ze Zhu, Sam S. Schoenholz, Vinothan N. Manoharan, Michael P. Brenner

    Abstract: Accurate interaction potentials between microscopic components such as colloidal particles or cells are crucial to understanding a range of processes, including colloidal crystallization, bacterial colony formation, and cancer metastasis. Even in systems where the precise interaction mechanisms are unknown, effective interactions can be measured to inform simulation and design. However, these meas…

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2312.05360  [pdf, other]

    cond-mat.soft physics.comp-ph

    Programmable patchy particles for materials design

    Authors: Ella M. King, Chrisy Xiyu Du, Qian-Ze Zhu, Samuel S. Schoenholz, Michael P. Brenner

    Abstract: Direct design of complex functional materials would revolutionize technologies ranging from printable organs to novel clean energy devices. However, even incremental steps towards designing functional materials have proven challenging. If the material is constructed from highly complex components, the design space of materials properties rapidly becomes too computationally expensive to search. On…

    Submitted 8 December, 2023; originally announced December 2023.

  3. arXiv:2210.05546  [pdf, other]

    cs.LG cs.CV

    What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries

    Authors: Stanislav Fort, Ekin Dogus Cubuk, Surya Ganguli, Samuel S. Schoenholz

    Abstract: Deep neural network classifiers partition input space into high confidence regions for each class. The geometry of these class manifolds (CMs) is widely studied and intimately related to model performance; for example, the margin depends on CM boundaries. We exploit the notions of Gaussian width and Gordon's escape theorem to tractably estimate the effective dimension of CMs and their boundaries t…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: An extended version of "Slice, Dice, and Optimize: Measuring the Dimension of Neural Network Class Manifolds"

  4. arXiv:2207.09432  [pdf, other]

    cs.LG

    Deep equilibrium networks are sensitive to initialization statistics

    Authors: Atish Agarwala, Samuel S. Schoenholz

    Abstract: Deep equilibrium networks (DEQs) are a promising way to construct models which trade off memory for compute. However, theoretical understanding of these models is still lacking compared to traditional networks, in part because of the repeated application of a single set of weights. We show that DEQs are sensitive to the higher order statistics of the matrix families from which they are initialized…

    Submitted 19 July, 2022; originally announced July 2022.

  5. arXiv:2206.08720  [pdf, other]

    cs.LG cs.AI stat.ML

    Fast Finite Width Neural Tangent Kernel

    Authors: Roman Novak, Jascha Sohl-Dickstein, Samuel S. Schoenholz

    Abstract: The Neural Tangent Kernel (NTK), defined as $Θ_θ^f(x_1, x_2) = \left[\partial f(θ, x_1)\big/\partial θ\right] \left[\partial f(θ, x_2)\big/\partial θ\right]^T$ where $\left[\partial f(θ, \cdot)\big/\partial θ\right]$ is a neural network (NN) Jacobian, has emerged as a central object of study in deep learning. In the infinite width limit, the NTK can sometimes be computed analytically and is useful…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Published as a conference paper at ICML 2022

  6. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  7. arXiv:2111.05803  [pdf, other]

    cs.LG stat.ML

    Gradients are Not All You Need

    Authors: Luke Metz, C. Daniel Freeman, Samuel S. Schoenholz, Tal Kachman

    Abstract: Differentiable programming techniques are widely used in the community and are responsible for the machine learning renaissance of the past several decades. While these methods are powerful, they have limits. In this short report, we discuss a common chaos based failure mode which appears in a variety of differentiable circumstances, ranging from recurrent neural networks and numerical physics sim…

    Submitted 20 January, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

  8. arXiv:2110.01765  [pdf, other]

    cs.LG cs.AI cs.NE

    Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping

    Authors: James Martens, Andy Ballard, Guillaume Desjardins, Grzegorz Swirszcz, Valentin Dalibard, Jascha Sohl-Dickstein, Samuel S. Schoenholz

    Abstract: Using an extended and formalized version of the Q/C map analysis of Poole et al. (2016), along with Neural Tangent Kernel theory, we identify the main pathologies present in deep networks that prevent them from training fast and generalizing to unseen data, and show how these can be avoided by carefully controlling the "shape" of the network's initialization-time kernel function. We then develop a…

    Submitted 4 October, 2021; originally announced October 2021.

  9. dPV: An End-to-End Differentiable Solar-Cell Simulator

    Authors: Sean Mann, Eric Fadel, Samuel S. Schoenholz, Ekin D. Cubuk, Steven G. Johnson, Giuseppe Romano

    Abstract: We introduce dPV, an end-to-end differentiable photovoltaic (PV) cell simulator based on the drift-diffusion model and Beer-Lambert law for optical absorption. dPV is programmed in Python using JAX, an automatic differentiation (AD) library for scientific computing. Using AD coupled with the implicit function theorem, dPV computes the power conversion efficiency (PCE) of an input PV design as well…

    Submitted 9 December, 2021; v1 submitted 13 May, 2021; originally announced May 2021.

  10. arXiv:2102.03793  [pdf, other]

    cs.LG cond-mat.soft stat.ML

    Tilting the playing field: Dynamical loss functions for machine learning

    Authors: Miguel Ruiz-Garcia, Ge Zhang, Samuel S. Schoenholz, Andrea J. Liu

    Abstract: We show that learning can be improved by using loss functions that evolve cyclically during training to emphasize one class at a time. In underparameterized networks, such dynamical loss functions can lead to successful training for networks that fail to find a deep minimum of the standard cross-entropy loss. In overparameterized networks, dynamical loss functions can lead to better generalization.…

    Submitted 23 June, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

  11. arXiv:2010.15175  [pdf, other]

    physics.comp-ph cond-mat.mtrl-sci

    Self-assembling kinetics: Accessing a new design space via differentiable statistical-physics models

    Authors: Carl P. Goodrich, Ella M. King, Samuel S. Schoenholz, Ekin D. Cubuk, Michael Brenner

    Abstract: The inverse problem of designing component interactions to target emergent structure is fundamental to numerous applications in biotechnology, materials science, and statistical physics. Equally important is the inverse problem of designing emergent kinetics, but this has received considerably less attention. Using recent advances in automatic differentiation, we show how kinetic pathways can be p…

    Submitted 18 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: 5 figures

  12. arXiv:2008.09681  [pdf, other]

    cond-mat.soft cond-mat.dis-nn cond-mat.mtrl-sci

    Unifying framework for strong and fragile liquids via machine learning: a study of liquid silica

    Authors: Ekin D. Cubuk, Andrea J. Liu, Efthimios Kaxiras, Samuel S. Schoenholz

    Abstract: The fragility of a glassforming liquid characterizes how rapidly its relaxation dynamics slow down with cooling. The viscosity of strong liquids follows an Arrhenius law with a temperature-independent barrier height to rearrangements responsible for relaxation, whereas fragile liquids experience a much faster increase in their dynamics, suggesting a barrier height that increases with decreasing te…

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: 6 pages, 4 figures

  13. arXiv:2008.07545  [pdf, other]

    cs.LG stat.ML

    Whitening and second order optimization both make information in the dataset unusable during training, and can reduce or prevent generalization

    Authors: Neha S. Wadia, Daniel Duckworth, Samuel S. Schoenholz, Ethan Dyer, Jascha Sohl-Dickstein

    Abstract: Machine learning is predicated on the concept of generalization: a model achieving low error on a sufficiently large training set should also perform well on novel samples from the same distribution. We show that both data whitening and second order optimization can harm or entirely prevent generalization. In general, model training harnesses information contained in the sample-sample second momen…

    Submitted 19 July, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: 13+10 pages, 10 figures; minor textual changes and some reorganization, one new figure and a new proof of main theorem added

  14. arXiv:2007.15801  [pdf, other]

    cs.LG stat.ML

    Finite Versus Infinite Neural Networks: an Empirical Study

    Authors: Jaehoon Lee, Samuel S. Schoenholz, Jeffrey Pennington, Ben Adlam, Lechao Xiao, Roman Novak, Jascha Sohl-Dickstein

    Abstract: We perform a careful, thorough, and large scale empirical study of the correspondence between wide neural networks and kernel methods. By doing so, we resolve a variety of open questions related to the study of infinitely wide neural networks. Our experimental results include: kernel methods outperform fully-connected finite-width networks, but underperform convolutional finite width networks; neu…

    Submitted 8 September, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: 17+11 pages; v2 references added, minor improvements

  15. arXiv:2001.07301  [pdf, other]

    cs.LG stat.ML

    On the infinite width limit of neural networks with a standard parameterization

    Authors: Jascha Sohl-Dickstein, Roman Novak, Samuel S. Schoenholz, Jaehoon Lee

    Abstract: There are currently two parameterizations used to derive fixed kernels corresponding to infinite width neural networks, the NTK (Neural Tangent Kernel) parameterization and the naive standard parameterization. However, the extrapolation of both of these parameterizations to infinite width is problematic. The standard parameterization leads to a divergent neural tangent kernel while the NTK paramet…

    Submitted 18 April, 2020; v1 submitted 20 January, 2020; originally announced January 2020.

  16. arXiv:1912.13053  [pdf, other]

    cs.LG stat.ML

    Disentangling Trainability and Generalization in Deep Neural Networks

    Authors: Lechao Xiao, Jeffrey Pennington, Samuel S. Schoenholz

    Abstract: A longstanding goal in the theory of deep learning is to characterize the conditions under which a given neural network architecture will be trainable, and if so, how well it might generalize to unseen data. In this work, we provide such a characterization in the limit of very wide and very deep networks, for which the analysis simplifies considerably. For wide networks, the trajectory under gradi…

    Submitted 13 July, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: 22 pages, 3 figures, ICML 2020. Associated Colab notebook at https://colab.research.google.com/github/google/neural-tangents/blob/master/notebooks/Disentangling_Trainability_and_Generalization.ipynb

  17. arXiv:1912.04232  [pdf, other]

    physics.comp-ph cond-mat.mtrl-sci cond-mat.soft stat.ML

    JAX, M.D.: A Framework for Differentiable Physics

    Authors: Samuel S. Schoenholz, Ekin D. Cubuk

    Abstract: We introduce JAX MD, a software package for performing differentiable physics simulations with a focus on molecular dynamics. JAX MD includes a number of physics simulation environments, as well as interaction potentials and neural networks that can be integrated into these environments without writing any additional code. Since the simulations themselves are differentiable functions, entire traje…

    Submitted 3 December, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  18. arXiv:1912.02803  [pdf, other]

    stat.ML cs.LG

    Neural Tangents: Fast and Easy Infinite Neural Networks in Python

    Authors: Roman Novak, Lechao Xiao, Jiri Hron, Jaehoon Lee, Alexander A. Alemi, Jascha Sohl-Dickstein, Samuel S. Schoenholz

    Abstract: Neural Tangents is a library designed to enable research into infinite-width neural networks. It provides a high-level API for specifying complex and hierarchical neural network architectures. These networks can then be trained and evaluated either at finite-width as usual or in their infinite-width limit. Infinite-width networks can be trained analytically using exact Bayesian inference or using…

    Submitted 5 December, 2019; originally announced December 2019.

  19. arXiv:1902.08129  [pdf, other]

    cs.NE cond-mat.dis-nn cs.LG math.DS

    A Mean Field Theory of Batch Normalization

    Authors: Greg Yang, Jeffrey Pennington, Vinay Rao, Jascha Sohl-Dickstein, Samuel S. Schoenholz

    Abstract: We develop a mean field theory for batch normalization in fully-connected feedforward neural networks. In so doing, we provide a precise characterization of signal propagation and gradient backpropagation in wide batch-normalized networks at initialization. Our theory shows that gradient signals grow exponentially in depth and that these exploding gradients cannot be eliminated by tuning the initi…

    Submitted 5 March, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: To appear in ICLR 2019

  20. Wide Neural Networks of Any Depth Evolve as Linear Models Under Gradient Descent

    Authors: Jaehoon Lee, Lechao Xiao, Samuel S. Schoenholz, Yasaman Bahri, Roman Novak, Jascha Sohl-Dickstein, Jeffrey Pennington

    Abstract: A longstanding goal in deep learning research has been to precisely characterize training and generalization. However, the often complex loss landscapes of neural networks have made a theory of learning dynamics elusive. In this work, we show that for wide neural networks the learning dynamics simplify considerably and that, in the infinite width limit, they are governed by a linear model obtained…

    Submitted 8 December, 2019; v1 submitted 18 February, 2019; originally announced February 2019.

    Comments: 12+16 pages; open-source code available at https://github.com/google/neural-tangents; accepted to NeurIPS 2019

  21. arXiv:1901.08987  [pdf, other]

    cs.LG stat.ML

    Dynamical Isometry and a Mean Field Theory of LSTMs and GRUs

    Authors: Dar Gilboa, Bo Chang, Minmin Chen, Greg Yang, Samuel S. Schoenholz, Ed H. Chi, Jeffrey Pennington

    Abstract: Training recurrent neural networks (RNNs) on long sequence tasks is plagued with difficulties arising from the exponential explosion or vanishing of signals as they propagate forward or backward through the network. Many techniques have been proposed to ameliorate these issues, including various algorithmic and architectural modifications. Two of the most successful RNN architectures, the LSTM and…

    Submitted 23 May, 2019; v1 submitted 25 January, 2019; originally announced January 2019.

  22. arXiv:1808.06576  [pdf, other]

    q-bio.QM stat.ML

    Peptide-Spectra Matching from Weak Supervision

    Authors: Samuel S. Schoenholz, Sean Hackett, Laura Deming, Eugene Melamud, Navdeep Jaitly, Fiona McAllister, Jonathon O'Brien, George Dahl, Bryson Bennett, Andrew M. Dai, Daphne Koller

    Abstract: As in many other scientific domains, we face a fundamental problem when using machine learning to identify proteins from mass spectrometry data: large ground truth datasets mapping inputs to correct outputs are extremely difficult to obtain. Instead, we have access to imperfect hand-coded models crafted by domain experts. In this paper, we apply deep neural networks to an important step of the pro…

    Submitted 22 August, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

  23. arXiv:1806.05394  [pdf, other]

    stat.ML cs.LG

    Dynamical Isometry and a Mean Field Theory of RNNs: Gating Enables Signal Propagation in Recurrent Neural Networks

    Authors: Minmin Chen, Jeffrey Pennington, Samuel S. Schoenholz

    Abstract: Recurrent neural networks have gained widespread use in modeling sequence data across various domains. While many successful recurrent architectures employ a notion of gating, the exact mechanism that enables such remarkable performance is not well understood. We develop a theory for signal propagation in recurrent networks after random initialization using a combination of mean field theory and r…

    Submitted 15 August, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: ICML 2018 Conference Proceedings

  24. arXiv:1806.05393  [pdf, other]

    stat.ML cs.LG

    Dynamical Isometry and a Mean Field Theory of CNNs: How to Train 10,000-Layer Vanilla Convolutional Neural Networks

    Authors: Lechao Xiao, Yasaman Bahri, Jascha Sohl-Dickstein, Samuel S. Schoenholz, Jeffrey Pennington

    Abstract: In recent years, state-of-the-art methods in computer vision have utilized increasingly deep convolutional neural network architectures (CNNs), with some of the most successful models employing hundreds or even thousands of layers. A variety of pathologies such as vanishing/exploding gradients make training such deep networks challenging. While residual connections and batch normalization do enabl…

    Submitted 10 July, 2018; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: ICML 2018 Conference Proceedings

  25. arXiv:1803.01416  [pdf, other]

    cond-mat.mtrl-sci

    Machine learning determination of atomic dynamics at grain boundaries

    Authors: Tristan A. Sharp, Spencer L. Thomas, Ekin D. Cubuk, Samuel S. Schoenholz, David J. Srolovitz, Andrea J. Liu

    Abstract: In polycrystalline materials, grain boundaries are sites of enhanced atomic motion, but the complexity of the atomic structures within a grain boundary network makes it difficult to link the structure and atomic dynamics. Here we use a machine learning technique to establish a connection between local structure and dynamics of these materials. Following previous work on bulk glassy materials, we d…

    Submitted 11 September, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

  26. arXiv:1802.09979  [pdf, other]

    stat.ML cs.LG

    The Emergence of Spectral Universality in Deep Networks

    Authors: Jeffrey Pennington, Samuel S. Schoenholz, Surya Ganguli

    Abstract: Recent work has shown that tight concentration of the entire spectrum of singular values of a deep network's input-output Jacobian around one at initialization can speed up learning by orders of magnitude. Therefore, to guide important design choices, it is important to build a full theoretical understanding of the spectra of Jacobians at initialization. To this end, we leverage powerful tools fro…

    Submitted 27 February, 2018; originally announced February 2018.

    Comments: 17 pages, 4 figures. Appearing at the 21st International Conference on Artificial Intelligence and Statistics (AISTATS) 2018

  27. arXiv:1801.02774  [pdf, other]

    cs.CV

    Adversarial Spheres

    Authors: Justin Gilmer, Luke Metz, Fartash Faghri, Samuel S. Schoenholz, Maithra Raghu, Martin Wattenberg, Ian Goodfellow

    Abstract: State of the art computer vision models have been shown to be vulnerable to small adversarial perturbations of the input. In other words, most images in the data distribution are both correctly classified by the model and are very close to a visually similar misclassified image. Despite substantial research interest, the cause of the phenomenon is still poorly understood and remains unsolved. We h…

    Submitted 10 September, 2018; v1 submitted 8 January, 2018; originally announced January 2018.

    MSC Class: 68T45 ACM Class: I.2.6

  28. arXiv:1712.08969  [pdf, other]

    cs.NE cond-mat.dis-nn cs.LG math.DS nlin.CD

    Mean Field Residual Networks: On the Edge of Chaos

    Authors: Greg Yang, Samuel S. Schoenholz

    Abstract: We study randomly initialized residual networks using mean field theory and the theory of difference equations. Classical feedforward neural networks, such as those with tanh activations, exhibit exponential behavior on the average when propagating inputs forward or gradients backward. The exponential forward dynamics causes rapid collapsing of the input space geometry, while the exponential backw…

    Submitted 24 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  29. arXiv:1711.04735  [pdf, other]

    cs.LG stat.ML

    Resurrecting the sigmoid in deep learning through dynamical isometry: theory and practice

    Authors: Jeffrey Pennington, Samuel S. Schoenholz, Surya Ganguli

    Abstract: It is well known that the initialization of weights in deep neural networks can have a dramatic impact on learning speed. For example, ensuring the mean squared singular value of a network's input-output Jacobian is $O(1)$ is essential for avoiding the exponential vanishing or explosion of gradients. The stronger condition that all singular values of the Jacobian concentrate near $1$ is a property…

    Submitted 13 November, 2017; originally announced November 2017.

    Comments: 13 pages, 6 figures. Appearing at the 31st Conference on Neural Information Processing Systems (NIPS 2017)

  30. arXiv:1711.02846  [pdf, other]

    stat.ML cs.LG

    Intriguing Properties of Adversarial Examples

    Authors: Ekin D. Cubuk, Barret Zoph, Samuel S. Schoenholz, Quoc V. Le

    Abstract: It is becoming increasingly clear that many machine learning classifiers are vulnerable to adversarial examples. In attempting to explain the origin of adversarial examples, previous studies have typically focused on the fact that neural networks operate on high dimensional data, they overfit, or they are too linear. Here we argue that the origin of adversarial examples is primarily due to an inhe…

    Submitted 8 November, 2017; originally announced November 2017.

    Comments: 17 pages

  31. arXiv:1711.00165  [pdf, other]

    stat.ML cs.LG

    Deep Neural Networks as Gaussian Processes

    Authors: Jaehoon Lee, Yasaman Bahri, Roman Novak, Samuel S. Schoenholz, Jeffrey Pennington, Jascha Sohl-Dickstein

    Abstract: It has long been known that a single-layer fully-connected neural network with an i.i.d. prior over its parameters is equivalent to a Gaussian process (GP), in the limit of infinite network width. This correspondence enables exact Bayesian inference for infinite width neural networks on regression tasks by means of evaluating the corresponding GP. Recently, kernel functions which mimic multi-layer…

    Submitted 2 March, 2018; v1 submitted 31 October, 2017; originally announced November 2017.

    Comments: Published version in ICLR 2018. 10 pages + appendix

  32. arXiv:1710.06570  [pdf, other]

    stat.ML cond-mat.dis-nn cs.LG

    A Correspondence Between Random Neural Networks and Statistical Field Theory

    Authors: Samuel S. Schoenholz, Jeffrey Pennington, Jascha Sohl-Dickstein

    Abstract: A number of recent papers have provided evidence that practical design questions about neural networks may be tackled theoretically by studying the behavior of random networks. However, until now the tools available for analyzing random neural networks have been relatively ad-hoc. In this work, we show that the distribution of pre-activations in random neural networks can be exactly mapped onto la…

    Submitted 17 October, 2017; originally announced October 2017.

  33. arXiv:1709.08015  [pdf, other]

    stat.ML cond-mat.soft cond-mat.stat-mech

    Combining Machine Learning and Physics to Understand Glassy Systems

    Authors: Samuel S. Schoenholz

    Abstract: Our understanding of supercooled liquids and glasses has lagged significantly behind that of simple liquids and crystalline solids. This is in part due to the many possibly relevant degrees of freedom that are present due to the disorder inherent to these systems and in part to non-equilibrium effects which are difficult to treat in the standard context of statistical physics. Together these issue…

    Submitted 23 September, 2017; originally announced September 2017.

  34. arXiv:1704.01212  [pdf, other]

    cs.LG

    Neural Message Passing for Quantum Chemistry

    Authors: Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, George E. Dahl

    Abstract: Supervised learning on molecules has incredible potential to be useful in chemistry, drug discovery, and materials science. Luckily, several promising and closely related neural network models invariant to molecular symmetries have already been described in the literature. These models learn a message passing algorithm and aggregation procedure to compute a function of their entire input graph. At…

    Submitted 12 June, 2017; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: 14 pages

    ACM Class: I.2.6

  35. Machine learning prediction errors better than DFT accuracy

    Authors: Felix A. Faber, Luke Hutchison, Bing Huang, Justin Gilmer, Samuel S. Schoenholz, George E. Dahl, Oriol Vinyals, Steven Kearnes, Patrick F. Riley, O. Anatole von Lilienfeld

    Abstract: We investigate the impact of choosing regressors and molecular representations for the construction of fast machine learning (ML) models of thirteen electronic ground-state properties of organic molecules. The performance of each regressor/representation/property combination is assessed using learning curves which report out-of-sample errors as a function of training set size with up to $\sim$117k…

    Submitted 4 June, 2017; v1 submitted 17 February, 2017; originally announced February 2017.

  36. arXiv:1611.01232  [pdf, other]

    stat.ML cs.LG

    Deep Information Propagation

    Authors: Samuel S. Schoenholz, Justin Gilmer, Surya Ganguli, Jascha Sohl-Dickstein

    Abstract: We study the behavior of untrained neural networks whose weights and biases are randomly distributed using mean field theory. We show the existence of depth scales that naturally limit the maximum depth of signal propagation through these random networks. Our main practical result is to show that random networks may be trained precisely when information can travel through them. Thus, the depth sca…

    Submitted 4 April, 2017; v1 submitted 3 November, 2016; originally announced November 2016.

  37. Disconnecting structure and dynamics in glassy thin films

    Authors: Daniel M. Sussman, Samuel S. Schoenholz, Ekin D. Cubuk, Andrea J. Liu

    Abstract: Nanometrically thin glassy films depart strikingly from the behavior of their bulk counterparts. We investigate whether the dynamical differences between bulk and thin film glasses can be understood by differences in local microscopic structure. We employ machine-learning methods that have previously identified strong correlations between local structure and particle rearrangement dynamics in bulk…

    Submitted 11 October, 2016; originally announced October 2016.

    Comments: 8 pages, 7 figures

  38. The Relationship Between Local Structure and Relaxation in Out-of-Equilibrium Glassy Systems

    Authors: Samuel S. Schoenholz, Ekin D. Cubuk, Efthimios Kaxiras, Andrea J. Liu

    Abstract: The dynamical glass transition is typically taken to be the temperature at which a glassy liquid is no longer able to equilibrate on experimental timescales. Consequently, the physical properties of these systems just above or below the dynamical glass transition, such as viscosity, can change by many orders of magnitude over long periods of time following external perturbation. During this progre…

    Submitted 23 July, 2016; originally announced July 2016.

  39. arXiv:1510.02129  [pdf, other]

    hep-th cond-mat.stat-mech hep-lat

    Nonlinear Sigma Models with Compact Hyperbolic Target Spaces

    Authors: Steven Gubser, Zain H. Saleem, Samuel S. Schoenholz, Bogdan Stoica, James Stokes

    Abstract: We explore the phase structure of nonlinear sigma models with target spaces corresponding to compact quotients of hyperbolic space, focusing on the case of a hyperbolic genus-2 Riemann surface. The continuum theory of these models can be approximated by a lattice spin system which we simulate using Monte Carlo methods. The target space possesses interesting geometric and topological properties whi…

    Submitted 7 October, 2015; originally announced October 2015.

    Comments: 14 pages, 6 Figures

    Report number: CALT-TH 2015-019, PUPT-2487

  40. arXiv:1506.07772  [pdf, other]

    cond-mat.soft

    A structural approach to relaxation in glassy liquids

    Authors: Samuel S. Schoenholz, Ekin D. Cubuk, Daniel M. Sussman, Efthimios Kaxiras, Andrea J Liu

    Abstract: When a liquid freezes, a change in the local atomic structure marks the transition to the crystal. When a liquid is cooled to form a glass, however, no noticeable structural change marks the glass transition. Indeed, characteristic features of glassy dynamics that appear below an onset temperature, T_0, are qualitatively captured by mean field theory, which assumes uniform local structure at all t…

    Submitted 22 November, 2015; v1 submitted 25 June, 2015; originally announced June 2015.

  41. Strain fluctuations and elastic moduli in disordered solids

    Authors: Daniel M. Sussman, Samuel S. Schoenholz, Ye Xu, Tim Still, A. G. Yodh, Andrea J. Liu

    Abstract: Recently there has been a surge in interest in using video-microscopy techniques to infer the local mechanical properties of disordered solids. One common approach is to minimize the difference between particle vibrational displacements in a local coarse-graining volume and the displacements that would result from a best-fit affine deformation. Effective moduli are then inferred under the assum…

    Submitted 24 August, 2015; v1 submitted 25 March, 2015; originally announced March 2015.

    Comments: 12 pages, 8 figures

    Journal ref: Phys. Rev. E 92, 022307 (2015)

  42. Identifying structural flow defects in disordered solids using machine learning methods

    Authors: Ekin D. Cubuk, Samuel S. Schoenholz, Jennifer M. Rieser, Brad D. Malone, Joerg Rottler, Douglas J. Durian, Efthimios Kaxiras, Andrea J. Liu

    Abstract: We use machine learning methods on local structure to identify flow defects - or regions susceptible to rearrangement - in jammed and glassy systems. We apply this method successfully to two disparate systems: a two dimensional experimental realization of a granular pillar under compression, and a Lennard-Jones glass in both two and three dimensions above and below its glass transition temperature…

    Submitted 24 September, 2014; originally announced September 2014.

    Comments: 4 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 114, 108001 (2015)

  43. arXiv:1404.1403  [pdf, other]

    cond-mat.soft

    Understanding plastic deformation in thermal glasses from single-soft-spot dynamics

    Authors: Samuel S. Schoenholz, Andrea J. Liu, Robert A. Riggleman, Joerg Rottler

    Abstract: By considering the low-frequency vibrational modes of amorphous solids, Manning and Liu [Phys. Rev. Lett. 107, 108302 (2011)] showed that a population of "soft spots" can be identified that are intimately related to plasticity at zero temperature under quasistatic shear. In this work we track individual soft spots with time in a two-dimensional sheared thermal Lennard Jones glass at temperatures r…

    Submitted 4 April, 2014; originally announced April 2014.

    Comments: 9 pages, 6 figures

  44. arXiv:1403.0922  [pdf, other]

    cond-mat.soft cond-mat.mtrl-sci

    Predicting plasticity with soft vibrational modes: from dislocations to glasses

    Authors: Joerg Rottler, Samuel S. Schoenholz, Andrea J. Liu

    Abstract: We show that quasi localized low-frequency modes in the vibrational spectrum can be used to construct soft spots, or regions vulnerable to rearrangement, which serve as a universal tool for the identification of flow defects in solids. We show that soft spots not only encode spatial information, via their location, but also directional information, via directors for particles within each soft spot…

    Submitted 4 March, 2014; originally announced March 2014.

  45. arXiv:1301.6982  [pdf, other]

    cond-mat.soft cond-mat.stat-mech

    Stability of jammed packings II: the transverse length scale

    Authors: Samuel S. Schoenholz, Carl P. Goodrich, Oleg Kogan, Andrea J. Liu, Sidney R. Nagel

    Abstract: As a function of packing fraction at zero temperature and applied stress, an amorphous packing of spheres exhibits a jamming transition where the system is sensitive to boundary conditions even in the thermodynamic limit. Upon further compression, the system should become insensitive to boundary conditions provided it is sufficiently large. Here we explore the linear response to a large class of b…

    Submitted 22 November, 2013; v1 submitted 29 January, 2013; originally announced January 2013.

    Comments: 8 pages, 5 figures

    Journal ref: Soft Matter, 2013, 9, 11000-11006