Skip to main content

Showing 1–2 of 2 results for author: Zhu, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11741  [pdf, other

    cs.LG cs.AI

    Transcendence: Generative Models Can Outperform The Experts That Train Them

    Authors: Edwin Zhang, Vincent Zhu, Naomi Saphra, Anat Kleiman, Benjamin L. Edelman, Milind Tambe, Sham M. Kakade, Eran Malach

    Abstract: Generative models are trained with the simple objective of imitating the conditional probability distribution induced by the data they are trained on. Therefore, when trained on data generated by humans, we may not expect the artificial model to outperform the humans on their original objectives. In this work, we study the phenomenon of transcendence: when a generative model achieves capabilities… ▽ More

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Code, models, and data at https://transcendence.eddie.win

  2. arXiv:2307.06732  [pdf, other

    q-bio.NC cs.NE

    Learning fixed points of recurrent neural networks by reparameterizing the network model

    Authors: Vicky Zhu, Robert Rosenbaum

    Abstract: In computational neuroscience, fixed points of recurrent neural networks are commonly used to model neural responses to static or slowly changing stimuli. These applications raise the question of how to train the weights in a recurrent neural network to minimize a loss function evaluated on fixed points. A natural approach is to use gradient descent on the Euclidean space of synaptic weights. We s… ▽ More

    Submitted 27 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.