Skip to main content

Showing 1–14 of 14 results for author: Sumers, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04302  [pdf, other

    cs.LG

    Representational Alignment Supports Effective Machine Teaching

    Authors: Ilia Sucholutsky, Katherine M. Collins, Maya Malaviya, Nori Jacoby, Weiyang Liu, Theodore R. Sumers, Michalis Korakakis, Umang Bhatt, Mark Ho, Joshua B. Tenenbaum, Brad Love, Zachary A. Pardos, Adrian Weller, Thomas L. Griffiths

    Abstract: A good teacher should not only be knowledgeable; but should be able to communicate in a way that the student understands -- to share the student's representation of the world. In this work, we integrate insights from machine teaching and pragmatic communication with the burgeoning literature on representational alignment to characterize a utility curve defining a relationship between representatio… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint

  2. arXiv:2402.18759  [pdf, other

    cs.RO cs.AI cs.LG

    Learning with Language-Guided State Abstractions

    Authors: Andi Peng, Ilia Sucholutsky, Belinda Z. Li, Theodore R. Sumers, Thomas L. Griffiths, Jacob Andreas, Julie A. Shah

    Abstract: We describe a framework for using natural language to design state abstractions for imitation learning. Generalizable policy learning in high-dimensional observation spaces is facilitated by well-designed state representations, which can surface important features of an environment and hide irrelevant ones. These state representations are typically manually specified, or derived from other labor-i… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  3. arXiv:2402.07282  [pdf, other

    cs.CL cs.AI cs.LG

    How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

    Authors: Ryan Liu, Theodore R. Sumers, Ishita Dasgupta, Thomas L. Griffiths

    Abstract: In day-to-day communication, people often approximate the truth - for example, rounding the time or omitting details - in order to be maximally helpful to the listener. How do large language models (LLMs) handle such nuanced trade-offs? To address this question, we use psychological models and experiments designed to characterize human behavior to analyze LLMs. We test a range of LLMs and explore… ▽ More

    Submitted 13 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  4. arXiv:2402.03081  [pdf, other

    cs.RO cs.AI cs.LG

    Preference-Conditioned Language-Guided Abstraction

    Authors: Andi Peng, Andreea Bobu, Belinda Z. Li, Theodore R. Sumers, Ilia Sucholutsky, Nishanth Kumar, Thomas L. Griffiths, Julie A. Shah

    Abstract: Learning from demonstrations is a common way for users to teach robots, but it is prone to spurious feature correlations. Recent work constructs state abstractions, i.e. visual representations containing task-relevant features, from language as a way to perform more generalizable learning. However, these abstractions also depend on a user's preference for what matters in a task, which may be hard… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: HRI 2024

  5. arXiv:2312.14226  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Deep de Finetti: Recovering Topic Distributions from Large Language Models

    Authors: Liyi Zhang, R. Thomas McCoy, Theodore R. Sumers, Jian-Qiao Zhu, Thomas L. Griffiths

    Abstract: Large language models (LLMs) can produce long, coherent passages of text, suggesting that LLMs, although trained on next-word prediction, must represent the latent structure that characterizes a document. Prior work has found that internal representations of LLMs encode one aspect of latent structure, namely syntax; here we investigate a complementary aspect, namely the document's topic structure.… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 13 pages, 4 figures

    ACM Class: I.2.6; I.2.7

  6. arXiv:2309.02427  [pdf, other

    cs.AI cs.CL cs.LG cs.SC

    Cognitive Architectures for Language Agents

    Authors: Theodore R. Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths

    Abstract: Recent efforts have augmented large language models (LLMs) with external resources (e.g., the Internet) or internal control flows (e.g., prompt chaining) for tasks requiring grounding or reasoning, leading to a new class of language agents. While these agents have achieved substantial empirical success, we lack a systematic framework to organize existing agents and plan future developments. In thi… ▽ More

    Submitted 15 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: v3 is TMLR camera ready version. 19 pages of main content, 5 figures. The first two authors contributed equally, order decided by coin flip. A CoALA-based repo of recent work on language agents: https://github.com/ysymyth/awesome-language-agents

  7. arXiv:2301.12507  [pdf, other

    cs.AI

    Distilling Internet-Scale Vision-Language Models into Embodied Agents

    Authors: Theodore Sumers, Kenneth Marino, Arun Ahuja, Rob Fergus, Ishita Dasgupta

    Abstract: Instruction-following agents must ground language into their observation and action spaces. Learning to ground language is challenging, typically requiring domain-specific engineering or large quantities of human interaction data. To address this challenge, we propose using pretrained vision-language models (VLMs) to supervise embodied agents. We combine ideas from model distillation and hindsight… ▽ More

    Submitted 14 June, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: 9 pages, 7 figures. Presented at ICML 2023

  8. arXiv:2206.07870  [pdf, other

    cs.AI

    How to talk so AI will learn: Instructions, descriptions, and autonomy

    Authors: Theodore R Sumers, Robert D Hawkins, Mark K Ho, Thomas L Griffiths, Dylan Hadfield-Menell

    Abstract: From the earliest years of our lives, humans use language to express our beliefs and desires. Being able to talk to artificial agents about our preferences would thus fulfill a central goal of value alignment. Yet today, we lack computational models explaining such language use. To address this challenge, we formalize learning from language in a contextual bandit setting and ask how a human might… ▽ More

    Submitted 10 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: 10 pages, 5 figures. Published as a conference paper at NeurIPS 2022

  9. arXiv:2206.04105  [pdf, other

    cs.CL cs.LG stat.ML

    Words are all you need? Language as an approximation for human similarity judgments

    Authors: Raja Marjieh, Pol van Rijn, Ilia Sucholutsky, Theodore R. Sumers, Harin Lee, Thomas L. Griffiths, Nori Jacoby

    Abstract: Human similarity judgments are a powerful supervision signal for machine learning applications based on techniques such as contrastive learning, information retrieval, and model alignment, but classical methods for collecting human similarity judgments are too expensive to be used at scale. Recent methods propose using pre-trained deep neural networks (DNNs) to approximate human similarity, but pr… ▽ More

    Submitted 23 February, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted to ICLR 2023, final revision. https://openreview.net/forum?id=O-G91-4cMdv

  10. arXiv:2204.05091  [pdf, other

    cs.AI cs.CL

    Linguistic communication as (inverse) reward design

    Authors: Theodore R. Sumers, Robert D. Hawkins, Mark K. Ho, Thomas L. Griffiths, Dylan Hadfield-Menell

    Abstract: Natural language is an intuitive and expressive way to communicate reward information to autonomous agents. It encompasses everything from concrete instructions to abstract descriptions of the world. Despite this, natural language is often challenging to learn from: it is difficult for machine learning methods to make appropriate inferences from such a wide range of input. This paper proposes a ge… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 6 pages, 3 figures. Accepted at Learning from Natural Language Supervision workshop (ACL 2022)

  11. arXiv:2202.04728  [pdf, other

    cs.LG cs.CL

    Predicting Human Similarity Judgments Using Large Language Models

    Authors: Raja Marjieh, Ilia Sucholutsky, Theodore R. Sumers, Nori Jacoby, Thomas L. Griffiths

    Abstract: Similarity judgments provide a well-established method for accessing mental representations, with applications in psychology, neuroscience and machine learning. However, collecting similarity judgments can be prohibitively expensive for naturalistic datasets as the number of comparisons grows quadratically in the number of stimuli. One way to tackle this problem is to construct approximation proce… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: 7 pages, 6 figures

  12. arXiv:2105.11950  [pdf, other

    cs.CL

    Extending rational models of communication from beliefs to actions

    Authors: Theodore R. Sumers, Robert D. Hawkins, Mark K. Ho, Thomas L. Griffiths

    Abstract: Speakers communicate to influence their partner's beliefs and shape their actions. Belief- and action-based objectives have been explored independently in recent computational models, but it has been challenging to explicitly compare or integrate them. Indeed, we find that they are conflated in standard referential communication tasks. To distinguish these accounts, we introduce a new paradigm cal… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: 7 pages, 4 figures. Proceedings for the 43rd Annual Meeting of the Cognitive Science Society

  13. arXiv:2012.09035  [pdf, other

    cs.CL

    Show or Tell? Demonstration is More Robust to Changes in Shared Perception than Explanation

    Authors: Theodore R. Sumers, Mark K. Ho, Thomas L. Griffiths

    Abstract: Successful teaching entails a complex interaction between a teacher and a learner. The teacher must select and convey information based on what they think the learner perceives and believes. Teaching always involves misaligned beliefs, but studies of pedagogy often focus on situations where teachers and learners share perceptions. Nonetheless, a teacher and learner may not always experience or att… ▽ More

    Submitted 16 December, 2020; originally announced December 2020.

    Comments: 7 pages, 4 figures. Proceedings for the 42nd Annual Meeting of the Cognitive Science Society

  14. arXiv:2009.14715  [pdf, other

    cs.AI

    Learning Rewards from Linguistic Feedback

    Authors: Theodore R. Sumers, Mark K. Ho, Robert D. Hawkins, Karthik Narasimhan, Thomas L. Griffiths

    Abstract: We explore unconstrained natural language feedback as a learning signal for artificial agents. Humans use rich and varied language to teach, yet most prior work on interactive learning from language assumes a particular form of input (e.g., commands). We propose a general framework which does not make this assumption, using aspect-based sentiment analysis to decompose feedback into sentiment about… ▽ More

    Submitted 3 July, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

    Comments: 9 pages, 4 figures. AAAI '21