Showing 1–2 of 2 results for author: Li, M Y

  1. arXiv:2406.03707

    cs.LG cs.AI cs.CL stat.ML

    What Should Embeddings Embed? Autoregressive Models Represent Latent Generating Distributions

    Authors: Liyi Zhang, Michael Y. Li, Thomas L. Griffiths

    Abstract: Autoregressive language models have demonstrated a remarkable ability to extract latent structure from text. The embeddings from large language models have been shown to capture aspects of the syntax and semantics of language. But what should embeddings represent? We connect the autoregressive prediction objective to the idea of constructing predictive sufficient statistics to summarize the…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures

    ACM Class: I.2; I.5

  2. arXiv:2208.06028

    cs.LG stat.ML

    Gaussian Process Surrogate Models for Neural Networks

    Authors: Michael Y. Li, Erin Grant, Thomas L. Griffiths

    Abstract: Not being able to understand and predict the behavior of deep learning systems makes it hard to decide what architecture and algorithm to use for a given problem. In science and engineering, modeling is a methodology used to understand complex systems whose internal processes are opaque. Modeling replaces a complex system with a simpler, more interpretable surrogate. Drawing inspiration from this,…

    Submitted 14 September, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: Proceedings of UAI 2023