Showing 1–20 of 20 results for author: Momennejad, I

  1. arXiv:2407.05377  [pdf, other]

    cs.AI

    Collective Innovation in Groups of Large Language Models

    Authors: Eleni Nisioti, Sebastian Risi, Ida Momennejad, Pierre-Yves Oudeyer, Clément Moulin-Frier

    Abstract: Human culture relies on collective innovation: our ability to continuously explore how existing elements in our environment can be combined to create new ones. Language is hypothesized to play a key role in human culture, driving individual cognitive capacities and shaping communication. Yet the majority of models of collective innovation assign no cognitive capacities or language abilities to age…

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2402.03575  [pdf, other]

    cs.AI cs.HC

    Toward Human-AI Alignment in Large-Scale Multi-Player Games

    Authors: Sugandha Sharma, Guy Davidson, Khimya Khetarpal, Anssi Kanervisto, Udit Arora, Katja Hofmann, Ida Momennejad

    Abstract: Achieving human-AI alignment in complex multi-agent games is crucial for creating trustworthy AI agents that enhance gameplay. We propose a method to evaluate this alignment using an interpretable task-sets framework, focusing on high-level behavioral tasks instead of low-level policies. Our approach has three components. First, we analyze extensive human gameplay data from Xbox's Bleeding Edge (1…

    Submitted 18 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2401.09491  [pdf]

    cs.AI

    Memory, Space, and Planning: Multiscale Predictive Representations

    Authors: Ida Momennejad

    Abstract: Memory is inherently entangled with prediction and planning. Flexible behavior in biological and artificial agents depends on the interplay of learning from the past and predicting the future in ever-changing environments. This chapter reviews computational, behavioral, and neural evidence suggesting these processes rely on learning the relational structure of experiences, known as cognitive maps,…

    Submitted 19 February, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: To be published as a chapter in an edited volume by Oxford University Press (Editors: Sara Aronowitz and Lynn Nadel)

  4. arXiv:2310.00313  [pdf, other]

    cs.CL

    Decoding In-Context Learning: Neuroscience-inspired Analysis of Representations in Large Language Models

    Authors: Safoora Yousefi, Leo Betthauser, Hosein Hasanbeig, Raphaël Millière, Ida Momennejad

    Abstract: Large language models (LLMs) exhibit remarkable performance improvement through in-context learning (ICL) by leveraging task-specific examples in the input. However, the mechanisms behind this improvement remain elusive. In this work, we investigate how LLM embeddings and attention representations change following in-context-learning, and how these changes mediate improvement in behavior. We emplo…

    Submitted 21 February, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  5. arXiv:2310.00194  [pdf, other]

    cs.AI cs.NE

    A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

    Authors: Taylor Webb, Shanka Subhra Mondal, Chi Wang, Brian Krabach, Ida Momennejad

    Abstract: Large language models (LLMs) demonstrate impressive performance on a wide variety of tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this, we take inspiration from the human brain, in which planning is accomplished via the recurrent interaction of specialized modules in the prefrontal cortex (PFC). These modules perform functions su…

    Submitted 5 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

  6. arXiv:2309.15129  [pdf, other]

    cs.AI cs.CL cs.LG

    Evaluating Cognitive Maps and Planning in Large Language Models with CogEval

    Authors: Ida Momennejad, Hosein Hasanbeig, Felipe Vieira, Hiteshi Sharma, Robert Osazuwa Ness, Nebojsa Jojic, Hamid Palangi, Jonathan Larson

    Abstract: Recently an influx of studies claim emergent cognitive abilities in large language models (LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack systematic evaluation involving multiple tasks, control conditions, multiple iterations, and statistical robustness tests. Here we make two major contributions. First, we propose CogEval, a cognitive science-inspired protoco…

    Submitted 24 September, 2023; originally announced September 2023.

  7. arXiv:2309.13701  [pdf, other]

    cs.CL cs.AI cs.HC

    ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning

    Authors: Hosein Hasanbeig, Hiteshi Sharma, Leo Betthauser, Felipe Vieira Frujeri, Ida Momennejad

    Abstract: From grading papers to summarizing medical documents, large language models (LLMs) are evermore used for evaluation of text generated by humans and AI alike. However, despite their extensive utility, LLMs exhibit distinct failure modes, necessitating a thorough audit and improvement of their text evaluation capabilities. Here we introduce ALLURE, a systematic approach to Auditing Large Language Mo…

    Submitted 26 September, 2023; v1 submitted 24 September, 2023; originally announced September 2023.

  8. arXiv:2303.08690  [pdf, other]

    cs.LG cs.AI

    Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning

    Authors: Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen, Sarath Chandar

    Abstract: One of the key behavioral characteristics used in neuroscience to determine whether the subject of study -- be it a rodent or a human -- exhibits model-based learning is effective adaptation to local changes in the environment, a particular form of adaptivity that is the focus of this work. In reinforcement learning, however, recent work has shown that modern deep model-based reinforcement-learnin…

    Submitted 27 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  9. arXiv:2303.02160  [pdf, other]

    cs.HC cs.LG cs.RO

    Navigates Like Me: Understanding How People Evaluate Human-Like AI in Video Games

    Authors: Stephanie Milani, Arthur Juliani, Ida Momennejad, Raluca Georgescu, Jaroslaw Rzepecki, Alison Shaw, Gavin Costello, Fei Fang, Sam Devlin, Katja Hofmann

    Abstract: We aim to understand how people assess human likeness in navigation produced by people and artificially intelligent (AI) agents in a video game. To this end, we propose a novel AI agent with the goal of generating more human-like behavior. We collect hundreds of crowd-sourced assessments comparing the human-likeness of navigation behavior generated by our agent and baseline AI agents with human-ge…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 18 pages; accepted at CHI 2023

  10. arXiv:2301.10677  [pdf, other]

    cs.AI cs.LG stat.ML

    Imitating Human Behaviour with Diffusion Models

    Authors: Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

    Abstract: Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex…

    Submitted 3 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Published in ICLR 2023

    Journal ref: ICLR 2023

  11. arXiv:2212.04401  [pdf]

    cs.AI

    A Rubric for Human-like Agents and NeuroAI

    Authors: Ida Momennejad

    Abstract: Researchers across cognitive, neuro-, and computer sciences increasingly reference human-like artificial intelligence and neuroAI. However, the scope and use of the terms are often inconsistent. Contributed research ranges widely from mimicking behaviour, to testing machine learning methods as neurally plausible hypotheses at the cellular or functional levels, or solving engineering problems. Howe…

    Submitted 8 December, 2022; originally announced December 2022.

  12. arXiv:2210.14077  [pdf, other]

    cs.LG

    Eigen Memory Trees

    Authors: Mark Rucker, Jordan T. Ash, John Langford, Paul Mineiro, Ida Momennejad

    Abstract: This work introduces the Eigen Memory Tree (EMT), a novel online memory model for sequential learning scenarios. EMTs store data at the leaves of a binary tree and route new samples through the structure using the principal components of previous experiences, facilitating efficient (logarithmic) access to relevant memories. We demonstrate that EMT outperforms existing online memory approaches, and…

    Submitted 31 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: corrected an author name; corrected title plurality

  13. arXiv:2206.08364  [pdf, other]

    cs.LG cs.AI cs.HC stat.ML

    Interaction-Grounded Learning with Action-inclusive Feedback

    Authors: Tengyang Xie, Akanksha Saran, Dylan J. Foster, Lekan Molu, Ida Momennejad, Nan Jiang, Paul Mineiro, John Langford

    Abstract: Consider the problem setting of Interaction-Grounded Learning (IGL), in which a learner's goal is to optimally interact with the environment with no explicit reward to ground its policies. The agent observes a context vector, takes an action, and receives a feedback vector, using this information to effectively optimize a policy with respect to a latent reward function. Prior analyzed approaches f…

    Submitted 12 October, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Published in NeurIPS 2022

  14. arXiv:2206.05060  [pdf, other]

    cs.AI cs.MA cs.SI

    Social Network Structure Shapes Innovation: Experience-sharing in RL with SAPIENS

    Authors: Eleni Nisioti, Mateo Mahaut, Pierre-Yves Oudeyer, Ida Momennejad, Clément Moulin-Frier

    Abstract: Human culture relies on innovation: our ability to continuously explore how existing elements can be combined to create new ones. Innovation is not solitary, it relies on collective search and accumulation. Reinforcement learning (RL) approaches commonly assume that fully-connected groups are best suited for innovation. However, human laboratory and field studies have shown that hierarchical innov…

    Submitted 18 November, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

  15. arXiv:2206.03312  [pdf, other]

    cs.NE cs.AI cs.LG

    Neuro-Nav: A Library for Neurally-Plausible Reinforcement Learning

    Authors: Arthur Juliani, Samuel Barnett, Brandon Davis, Margaret Sereno, Ida Momennejad

    Abstract: In this work we propose Neuro-Nav, an open-source library for neurally plausible reinforcement learning (RL). RL is among the most common modeling frameworks for studying decision making, learning, and navigation in biological organisms. In utilizing RL, cognitive scientists often handcraft environments and agents to meet the needs of their particular studies. On the other hand, artificial intelli…

    Submitted 6 June, 2022; originally announced June 2022.

  16. arXiv:2204.11464  [pdf, other]

    cs.LG cs.AI

    Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods

    Authors: Yi Wan, Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Sarath Chandar, Harm van Seijen

    Abstract: In recent years, a growing number of deep model-based reinforcement learning (RL) methods have been introduced. The interest in deep model-based RL is not surprising, given its many potential benefits, such as higher sample efficiency and the potential for fast adaption to changes in the environment. However, we demonstrate, using an improved version of the recently introduced Local Change Adaptat…

    Submitted 25 June, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

  17. arXiv:2203.04806  [pdf, other]

    cs.CL

    One-Shot Learning from a Demonstration with Hierarchical Latent Language

    Authors: Nathaniel Weir, Xingdi Yuan, Marc-Alexandre Côté, Matthew Hausknecht, Romain Laroche, Ida Momennejad, Harm Van Seijen, Benjamin Van Durme

    Abstract: Humans have the capability, aided by the expressive compositionality of their language, to learn quickly by demonstration. They are able to describe unseen task-performing procedures and generalize their execution to other contexts. In this work, we introduce DescribeWorld, an environment designed to test this sort of generalization skill in grounded agents, where tasks are linguistically and proc…

    Submitted 9 March, 2022; originally announced March 2022.

  18. arXiv:2106.04887  [pdf, other]

    cs.LG cs.AI stat.ML

    Interaction-Grounded Learning

    Authors: Tengyang Xie, John Langford, Paul Mineiro, Ida Momennejad

    Abstract: Consider a prosthetic arm, learning to adapt to its user's control signals. We propose Interaction-Grounded Learning for this novel setting, in which a learner's goal is to interact with the environment with no grounding or explicit reward to optimize its policies. Such a problem evades common RL solutions which require an explicit reward. The learning agent observes a multidimensional context vec…

    Submitted 13 July, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Published in ICML 2021

  19. arXiv:2105.09637  [pdf, other]

    cs.AI cs.LG

    Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation

    Authors: Sam Devlin, Raluca Georgescu, Ida Momennejad, Jaroslaw Rzepecki, Evelyn Zuniga, Gavin Costello, Guy Leroy, Ali Shaw, Katja Hofmann

    Abstract: A key challenge on the path to developing agents that learn complex human-like behavior is the need to quickly and accurately quantify human-likeness. While human assessments of such behavior can be highly accurate, speed and scalability are limited. We address these limitations through a novel automated Navigation Turing Test (ANTT) that learns to predict human judgments of human-likeness. We dem…

    Submitted 28 July, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: All data collected throughout this study, plus the code to reproduce our analysis and ANTT are available at https://github.com/microsoft/NTT

    Journal ref: Proceedings of the 38th International Conference on Machine Learning (ICML), 139:2644-2653, 2021

  20. arXiv:1705.07185  [pdf]

    cs.SI

    The Ties that Bind Networks: Weak Ties Facilitate the Emergence of Collective Memories

    Authors: Ida Momennejad, Ajua Duker, Alin Coman

    Abstract: From families to nations, what binds individuals in social groups is the degree to which they share beliefs, norms, and memories. While local clusters of communicating individuals can sustain shared memories and norms, communities characterized by isolated cliques are susceptible to information fragmentation and polarization dynamics. We employ experimental manipulations in lab-created communities…

    Submitted 19 May, 2017; originally announced May 2017.