
Showing 1–5 of 5 results for author: Moskvichev, A

  1. arXiv:2311.09247  [pdf, other]

    cs.AI cs.LG

    Comparing Humans, GPT-4, and GPT-4V On Abstraction and Reasoning Tasks

    Authors: Melanie Mitchell, Alessandro B. Palmarini, Arseny Moskvichev

    Abstract: We explore the abstract reasoning abilities of text-only and multimodal versions of GPT-4, using the ConceptARC benchmark [10], which is designed to evaluate robust understanding and reasoning with core-knowledge concepts. We extend the work of Moskvichev et al. [10] by evaluating GPT-4 on more detailed, one-shot prompting (rather than simple, zero-shot prompts) with text versions of ConceptARC ta…

    Submitted 11 December, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Corrected Figure 3 (extra spaces were replaced by commas, which were lost in original formatting)

    Journal ref: Proceedings of the LLM-CP Workshop, AAAI 2024

  2. arXiv:2305.13877  [pdf, other]

    cs.CL cs.AI

    NarrativeXL: A Large-scale Dataset For Long-Term Memory Models

    Authors: Arseny Moskvichev, Ky-Vinh Mai

    Abstract: We propose a new large-scale (nearly a million questions) ultra-long-context (more than 50,000 words average document length) reading comprehension dataset. Using GPT 3.5, we summarized each scene in 1,500 hand-curated fiction books from Project Gutenberg, which resulted in approximately 150 scene-level summaries per book. After that, we created a number of reading comprehension questions based on…

    Submitted 7 December, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    ACM Class: I.2.7; I.2.6

  3. arXiv:2305.07141  [pdf, other]

    cs.LG cs.AI

    The ConceptARC Benchmark: Evaluating Understanding and Generalization in the ARC Domain

    Authors: Arseny Moskvichev, Victor Vikram Odouard, Melanie Mitchell

    Abstract: The abilities to form and abstract concepts is key to human intelligence, but such abilities remain lacking in state-of-the-art AI systems. There has been substantial research on conceptual abstraction in AI, particularly using idealized domains such as Raven's Progressive Matrices and Bongard problems, but even when AI systems succeed on such problems, the systems are rarely evaluated in depth to…

    Submitted 11 May, 2023; originally announced May 2023.

    Journal ref: Transactions on Machine Learning Research, 8/2023

  4. arXiv:2104.05500  [pdf, other]

    cs.CL cs.AI cs.LG

    Updater-Extractor Architecture for Inductive World State Representations

    Authors: Arseny Moskvichev, James A. Liu

    Abstract: Developing NLP models traditionally involves two stages - training and application. Retention of information acquired after training (at application time) is architecturally limited by the size of the model's context window (in the case of transformers), or by the practical difficulties associated with long sequences (in the case of RNNs). In this paper, we propose a novel transformer-based Update…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 15 pages (12 main content, 3 references and appendix), 4 figures

  5. arXiv:2007.09820  [pdf, other]

    cs.AI cs.CL cs.LG cs.MA

    Reinforcement Communication Learning in Different Social Network Structures

    Authors: Marina Dubova, Arseny Moskvichev, Robert Goldstone

    Abstract: Social network structure is one of the key determinants of human language evolution. Previous work has shown that the network of social interactions shapes decentralized learning in human groups, leading to the emergence of different kinds of communicative conventions. We examined the effects of social network organization on the properties of communication systems emerging in decentralized, multi…

    Submitted 19 July, 2020; originally announced July 2020.

    Journal ref: 1st Workshop on Language in Reinforcement Learning, ICML 2020