Showing 1–2 of 2 results for author: Alexander, Y

  1. arXiv:2402.07875  [pdf, other]

    cs.LG cs.AI eess.SY stat.ML

    Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States

    Authors: Noam Razin, Yotam Alexander, Edo Cohen-Karlik, Raja Giryes, Amir Globerson, Nadav Cohen

    Abstract: In modern machine learning, models can often fit training data in numerous ways, some of which perform well on unseen (test) data, while others do not. Remarkably, in such cases gradient descent frequently exhibits an implicit bias that leads to excellent performance on unseen data. This implicit bias was extensively studied in supervised learning, but is far less understood in optimal control (re…

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2303.11249  [pdf, other]

    cs.LG cs.AI quant-ph

    What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement

    Authors: Yotam Alexander, Nimrod De La Vega, Noam Razin, Nadav Cohen

    Abstract: The question of what makes a data distribution suitable for deep learning is a fundamental open problem. Focusing on locally connected neural networks (a prevalent family of architectures that includes convolutional and recurrent neural networks as well as local self-attention models), we address this problem by adopting theoretical tools from quantum physics. Our main theoretical result states th…

    Submitted 21 January, 2024; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Accepted to NeurIPS 2023