Skip to main content

Showing 1–6 of 6 results for author: Canatar, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.12821  [pdf, other

    q-bio.NC cs.AI

    A Spectral Theory of Neural Prediction and Alignment

    Authors: Abdulkadir Canatar, Jenelle Feather, Albert Wakhloo, SueYeon Chung

    Abstract: The representations of neural networks are often compared to those of biological systems by performing regression between the neural network responses and those measured from biological systems. Many different state-of-the-art deep neural networks yield similar neural predictions, but it remains unclear how to differentiate among models that perform equally well at predicting neural responses. To… ▽ More

    Submitted 11 December, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: First two authors contributed equally. NeurIPS 2023

  2. arXiv:2206.06686  [pdf, other

    quant-ph cs.LG

    Bandwidth Enables Generalization in Quantum Kernel Models

    Authors: Abdulkadir Canatar, Evan Peters, Cengiz Pehlevan, Stefan M. Wild, Ruslan Shaydulin

    Abstract: Quantum computers are known to provide speedups over classical state-of-the-art machine learning methods in some specialized settings. For example, quantum kernel methods have been shown to provide an exponential speedup on a learning version of the discrete logarithm problem. Understanding the generalization of quantum models is essential to realizing similar speedups on problems of practical int… ▽ More

    Submitted 18 June, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted version

  3. arXiv:2106.02261  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Out-of-Distribution Generalization in Kernel Regression

    Authors: Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

    Abstract: In real word applications, data generating process for training a machine learning model often differs from what the model encounters in the test stage. Understanding how and whether machine learning models generalize under such distributional shifts have been a theoretical challenge. Here, we study generalization in kernel regression when the training and test distributions are different using me… ▽ More

    Submitted 4 February, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Eq. (SI.1.59) corrected

    Journal ref: Neural Information Processing Systems (NeurIPS), 2021

  4. arXiv:2106.00651  [pdf, other

    cs.LG cond-mat.dis-nn stat.ML

    Asymptotics of representation learning in finite Bayesian neural networks

    Authors: Jacob A. Zavatone-Veth, Abdulkadir Canatar, Benjamin S. Ruben, Cengiz Pehlevan

    Abstract: Recent works have suggested that finite Bayesian neural networks may sometimes outperform their infinite cousins because finite networks can flexibly adapt their internal representations. However, our theoretical understanding of how the learned hidden layer representations of finite networks differ from the fixed representations of infinite networks remains incomplete. Perturbative finite-width c… ▽ More

    Submitted 8 February, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 13+28 pages, 4 figures; v3: extensive revision with improved exposition and new section on CNNs, accepted to NeurIPS 2021; v4: minor updates to supplement; v5: post-NeurIPS update, minor typos fixed

    Journal ref: Advances in Neural Information Processing Systems 34 (2021); JSTAT 114008 (2022)

  5. arXiv:2006.13198  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Spectral Bias and Task-Model Alignment Explain Generalization in Kernel Regression and Infinitely Wide Neural Networks

    Authors: Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

    Abstract: Generalization beyond a training dataset is a main goal of machine learning, but theoretical understanding of generalization remains an open problem for many models. The need for a new theory is exacerbated by recent observations in deep neural networks where overparameterization leads to better performance, contradicting the conventional wisdom from classical statistics. In this paper, we investi… ▽ More

    Submitted 4 February, 2022; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in Nature Communications. SI Eq.71 is corrected

  6. arXiv:2002.02561  [pdf, other

    cs.LG stat.ML

    Spectrum Dependent Learning Curves in Kernel Regression and Wide Neural Networks

    Authors: Blake Bordelon, Abdulkadir Canatar, Cengiz Pehlevan

    Abstract: We derive analytical expressions for the generalization performance of kernel regression as a function of the number of training samples using theoretical methods from Gaussian processes and statistical physics. Our expressions apply to wide neural networks due to an equivalence between training them and kernel regression with the Neural Tangent Kernel (NTK). By computing the decomposition of the… ▽ More

    Submitted 25 February, 2021; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: ICML 2020 Update: Updated section on asymptotics generalization error for power law spectra, finding agreement with Spigler, Geiger, Wyart 2019 arXiv:1905.10843. Added a section on Discrete measures and an MNIST Experiment. Eigenvalue problem can be approximated by Kernel PCA. Typo fixed on 2/25/2021

    Journal ref: Proceedings of the 37th International Conference on Machine Learning, PMLR 119:1024-1034, 2020