Skip to main content

Showing 1–7 of 7 results for author: Nottingham, K

.
  1. arXiv:2402.03244  [pdf, other

    cs.LG cs.CL

    Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills

    Authors: Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi Mishra, Sameer Singh, Peter Clark, Roy Fox

    Abstract: Large language models (LLMs) have recently been used for sequential decision making in interactive environments. However, leveraging environment reward signals for continual LLM actor improvement is not straightforward. We propose Skill Set Optimization (SSO) for improving LLM actor performance through constructing and refining sets of transferable skills. SSO constructs skills by extracting commo… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  2. arXiv:2307.11922  [pdf, other

    cs.LG cs.AI cs.CL

    Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

    Authors: Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

    Abstract: Large language models (LLMs) are being applied as actors for sequential decision making tasks in domains such as robotics and games, utilizing their general world knowledge and planning abilities. However, previous work does little to explore what environment state information is provided to LLM actors via language. Exhaustively describing high-dimensional states can impair performance and raise i… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  3. arXiv:2301.12050  [pdf, other

    cs.LG cs.CL

    Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling

    Authors: Kolby Nottingham, Prithviraj Ammanabrolu, Alane Suhr, Ye** Choi, Hannaneh Hajishirzi, Sameer Singh, Roy Fox

    Abstract: Reinforcement learning (RL) agents typically learn tabula rasa, without prior knowledge of the world. However, if initialized with knowledge of high-level subgoals and transitions between subgoals, RL agents could utilize this Abstract World Model (AWM) for planning and exploration. We propose using few-shot large language models (LLMs) to hypothesize an AWM, that will be verified through world ex… ▽ More

    Submitted 27 April, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: in proceedings of ICML 23

  4. arXiv:2205.13079  [pdf, other

    cs.LG

    Learning to Query Internet Text for Informing Reinforcement Learning Agents

    Authors: Kolby Nottingham, Alekhya Pyla, Sameer Singh, Roy Fox

    Abstract: Generalization to out of distribution tasks in reinforcement learning is a challenging problem. One successful approach improves generalization by conditioning policies on task or environment descriptions that provide information about the current transition or reward functions. Previously, these descriptions were often expressed as generated or crowd sourced text. In this work, we begin to tackle… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  5. arXiv:2109.02631  [pdf, other

    cs.LG

    Guiding Global Placement With Reinforcement Learning

    Authors: Robert Kirby, Kolby Nottingham, Rajarshi Roy, Saad Godil, Bryan Catanzaro

    Abstract: Recent advances in GPU accelerated global and detail placement have reduced the time to solution by an order of magnitude. This advancement allows us to leverage data driven optimization (such as Reinforcement Learning) in an effort to improve the final quality of placement results. In this work we augment state-of-the-art, force-based global placement solvers with a reinforcement learning agent t… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

    ACM Class: B.7.2

  6. arXiv:2109.02161  [pdf, other

    cs.AI

    Modular Framework for Visuomotor Language Grounding

    Authors: Kolby Nottingham, Litian Liang, Daeyun Shin, Charless C. Fowlkes, Roy Fox, Sameer Singh

    Abstract: Natural language instruction following tasks serve as a valuable test-bed for grounded language and robotics research. However, data collection for these tasks is expensive and end-to-end approaches suffer from data inefficiency. We propose the structuring of language, acting, and visual tasks into separate modules that can be trained independently. Using a Language, Action, and Vision (LAV) frame… ▽ More

    Submitted 5 September, 2021; originally announced September 2021.

  7. arXiv:1910.01723  [pdf, other

    cs.LG cs.AI stat.ML

    Using Logical Specifications of Objectives in Multi-Objective Reinforcement Learning

    Authors: Kolby Nottingham, Anand Balakrishnan, Jyotirmoy Deshmukh, David Wingate

    Abstract: It is notoriously difficult to control the behavior of reinforcement learning agents. Agents often learn to exploit the environment or reward signal and need to be retrained multiple times. The multi-objective reinforcement learning (MORL) framework separates a reward function into several objectives. An ideal MORL agent learns to generalize to novel combinations of objectives allowing for better… ▽ More

    Submitted 5 September, 2021; v1 submitted 3 October, 2019; originally announced October 2019.