Showing 1–1 of 1 results for author: Hopkins, A K

  1. arXiv:2210.13382  [pdf, other]

    cs.LG cs.AI cs.CL

    Emergent World Representations: Exploring a Sequence Model Trained on a Synthetic Task

    Authors: Kenneth Li, Aspen K. Hopkins, David Bau, Fernanda Viégas, Hanspeter Pfister, Martin Wattenberg

    Abstract: Language models show a surprising range of capabilities, but the source of their apparent competence is unclear. Do these networks just memorize a collection of surface statistics, or do they rely on internal representations of the process that generates the sequences they see? We investigate this question by applying a variant of the GPT model to the task of predicting legal moves in a simple boa…

    Submitted 26 June, 2024; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 oral (notable-top-5%): https://openreview.net/forum?id=DeG07_TcZvT ; code: https://github.com/likenneth/othello_world