Showing 1–23 of 23 results for author: Wingate, D

  1. arXiv:2306.02177  [pdf, other]

    cs.AI

    Towards Coding Social Science Datasets with Language Models

    Authors: Christopher Michael Rytting, Taylor Sorensen, Lisa Argyle, Ethan Busby, Nancy Fulda, Joshua Gubler, David Wingate

    Abstract: Researchers often rely on humans to code (label, annotate, etc.) large sets of texts. This kind of human coding forms an important part of social science research, yet the coding process is both resource intensive and highly variable from application to application. In some cases, efforts to automate this process have achieved human-level accuracies, but to achieve this, these attempts frequently…

    Submitted 3 June, 2023; originally announced June 2023.

  2. arXiv:2302.07268  [pdf, other]

    cs.HC cs.AI cs.CL

    AI Chat Assistants can Improve Conversations about Divisive Topics

    Authors: Lisa P. Argyle, Ethan Busby, Joshua Gubler, Chris Bail, Thomas Howe, Christopher Rytting, David Wingate

    Abstract: A rapidly increasing amount of human conversation occurs online. But divisiveness and conflict can fester in text-based interactions on social media platforms, in messaging apps, and on other digital forums. Such toxicity increases polarization and, importantly, corrodes the capacity of diverse societies to develop efficient solutions to complex social problems that impact everyone. Scholars and c…

    Submitted 20 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  3. arXiv:2210.12353  [pdf, other]

    cs.CL cs.LG

    Leveraging Large Language Models for Multiple Choice Question Answering

    Authors: Joshua Robinson, Christopher Michael Rytting, David Wingate

    Abstract: While large language models (LLMs) like GPT-3 have achieved impressive results on multiple choice question answering (MCQA) tasks in the zero, one, and few-shot settings, they generally lag behind the MCQA state of the art (SOTA). MCQA tasks have traditionally been presented to LLMs like cloze tasks. An LLM is conditioned on a question (without the associated answer options) and its chosen option…

    Submitted 16 March, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

    Comments: Accepted for ICLR 2023

  4. arXiv:2210.03162  [pdf, other]

    cs.CL cs.AI cs.LG

    Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models

    Authors: David Wingate, Mohammad Shoeybi, Taylor Sorensen

    Abstract: We explore the idea of compressing the prompts used to condition language models, and show that compressed prompts can retain a substantive amount of information about the original prompt. For severely compressed prompts, while fine-grained information is lost, abstract information and general sentiments can be retained with surprisingly few parameters, which can be useful in the context of decode…

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Empirical Methods in Natural Language Processing, 2022 (Main-Long Paper)

  5. Out of One, Many: Using Language Models to Simulate Human Samples

    Authors: Lisa P. Argyle, Ethan C. Busby, Nancy Fulda, Joshua Gubler, Christopher Rytting, David Wingate

    Abstract: We propose and explore the possibility that language models can be studied as effective proxies for specific human sub-populations in social science research. Practical and research applications of artificial intelligence tools have sometimes been limited by problematic biases (such as racism or sexism), which are often treated as uniform properties of the models. We show that the "algorithmic bia…

    Submitted 14 September, 2022; originally announced September 2022.

  6. An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels

    Authors: Taylor Sorensen, Joshua Robinson, Christopher Michael Rytting, Alexander Glenn Shaw, Kyle Jeffrey Rogers, Alexia Pauline Delorey, Mahmoud Khalil, Nancy Fulda, David Wingate

    Abstract: Pre-trained language models derive substantial linguistic and factual knowledge from the massive corpora on which they are trained, and prompt engineering seeks to align these models to specific tasks. Unfortunately, existing prompt engineering methods require significant amounts of labeled data, access to model parameters, or both. We introduce a new method for selecting prompt templates …

    Submitted 21 March, 2022; originally announced March 2022.

  7. arXiv:2110.02370  [pdf, other]

    cs.CL cs.AI

    Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning

    Authors: Christopher Michael Rytting, David Wingate

    Abstract: Large natural language models (such as GPT-3 or T5) demonstrate impressive abilities across a range of general NLP tasks. Here, we show that the knowledge embedded in such models provides a useful inductive bias, not just on traditional NLP tasks, but also in the nontraditional task of training a symbolic reasoning engine. We observe that these engines learn quickly and generalize in a natural way…

    Submitted 5 October, 2021; originally announced October 2021.

  8. arXiv:2012.05983  [pdf, other]

    cs.CL cs.AI

    Towards Neural Programming Interfaces

    Authors: Zachary C. Brown, Nathaniel Robinson, David Wingate, Nancy Fulda

    Abstract: It is notoriously difficult to control the behavior of artificial neural networks such as generative neural language models. We recast the problem of controlling natural language generation as that of learning to interface with a pretrained language model, just as Application Programming Interfaces (APIs) control the behavior of programs by altering hyperparameters. In this new paradigm, a special…

    Submitted 17 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: 24 pages total (13 for main paper and references, 11 for Appendix 1), accepted for publication in Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

    Journal ref: Neural Information Processing Systems 33 (2020) 17416-17428

  9. arXiv:2001.00991  [pdf, other]

    cs.RO cs.AI cs.HC cs.LG eess.SY

    Human-robot co-manipulation of extended objects: Data-driven models and control from analysis of human-human dyads

    Authors: Erich Mielke, Eric Townsend, David Wingate, Marc D. Killpack

    Abstract: Human teams are able to easily perform collaborative manipulation tasks. However, for a robot and human to simultaneously manipulate an extended object is a difficult task using existing methods from the literature. Our approach in this paper is to use data from human-human dyad experiments to determine motion intent which we use for a physical human-robot co-manipulation task. We first present an…

    Submitted 3 January, 2020; originally announced January 2020.

    Comments: Paper has been in submission to IJRR since November 2018

  10. arXiv:1910.01723  [pdf, other]

    cs.LG cs.AI stat.ML

    Using Logical Specifications of Objectives in Multi-Objective Reinforcement Learning

    Authors: Kolby Nottingham, Anand Balakrishnan, Jyotirmoy Deshmukh, David Wingate

    Abstract: It is notoriously difficult to control the behavior of reinforcement learning agents. Agents often learn to exploit the environment or reward signal and need to be retrained multiple times. The multi-objective reinforcement learning (MORL) framework separates a reward function into several objectives. An ideal MORL agent learns to generalize to novel combinations of objectives allowing for better…

    Submitted 5 September, 2021; v1 submitted 3 October, 2019; originally announced October 2019.

  11. arXiv:1910.00668  [pdf, other]

    cs.LG stat.ML

    Wasserstein Neural Processes

    Authors: Andrew Carr, Jared Nielsen, David Wingate

    Abstract: Neural Processes (NPs) are a class of models that learn a mapping from a context set of input-output pairs to a distribution over functions. They are traditionally trained using maximum likelihood with a KL divergence regularization term. We show that there are desirable classes of problems where NPs, with this loss, fail to learn any reasonable distribution. We also show that this drawback is sol…

    Submitted 9 January, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

  12. arXiv:1903.00133  [pdf, other]

    cs.CV

    Video Extrapolation with an Invertible Linear Embedding

    Authors: Robert Pottorff, Jared Nielsen, David Wingate

    Abstract: We predict future video frames from complex dynamic scenes, using an invertible neural network as the encoder of a nonlinear dynamic system with latent linear state evolution. Our invertible linear embedding (ILE) demonstrates successful learning, prediction and latent state inference. In contrast to other approaches, ILE does not use any explicit reconstruction loss or simplistic pixel-space assu…

    Submitted 28 February, 2019; originally announced March 2019.

  13. arXiv:1902.10042  [pdf, other]

    cs.LG stat.ML

    Graph Neural Processes: Towards Bayesian Graph Neural Networks

    Authors: Andrew Carr, David Wingate

    Abstract: We introduce Graph Neural Processes (GNP), inspired by the recent work in conditional and latent neural processes. A Graph Neural Process is defined as a Conditional Neural Process that operates on arbitrary graph data. It takes features of sparsely observed context points as input, and outputs a distribution over target points. We demonstrate graph neural processes in edge imputation and discuss…

    Submitted 1 October, 2019; v1 submitted 26 February, 2019; originally announced February 2019.

  14. arXiv:1812.01569  [pdf, other]

    cs.AI

    Nested Reasoning About Autonomous Agents Using Probabilistic Programs

    Authors: Iris Rubi Seaman, Jan-Willem van de Meent, David Wingate

    Abstract: As autonomous agents become more ubiquitous, they will eventually have to reason about the plans of other agents, which is known as theory of mind reasoning. We develop a planning-as-inference framework in which agents perform nested simulation to reason about the behavior of other agents in an online manner. As a concrete application of this framework, we use probabilistic programs to model a hig…

    Submitted 4 March, 2020; v1 submitted 4 December, 2018; originally announced December 2018.

  15. arXiv:1809.09203  [pdf, other]

    cond-mat.mtrl-sci physics.comp-ph

    Machine-learned multi-system surrogate models for materials prediction

    Authors: Chandramouli Nyshadham, Matthias Rupp, Brayden Bekker, Alexander V. Shapeev, Tim Mueller, Conrad W. Rosenbrock, Gábor Csányi, David W. Wingate, Gus L. W. Hart

    Abstract: Surrogate machine-learning models are transforming computational materials science by predicting properties of materials with the accuracy of ab initio methods at a fraction of the computational cost. We demonstrate surrogate models that simultaneously interpolate energies of different materials on a dataset of 10 binary alloys (AgCu, AlFe, AlMg, AlNi, AlTi, CoNi, CuFe, CuNi, FeV, NbNi) with 10 di…

    Submitted 20 May, 2019; v1 submitted 24 September, 2018; originally announced September 2018.

    Comments: 12 pages, 7 figures

    Journal ref: npj Computational Materials 5.1 (2019): 51

  16. arXiv:1808.04891  [pdf, other]

    cs.CL

    Embedding Grammars

    Authors: David Wingate, William Myers, Nancy Fulda, Tyler Etchart

    Abstract: Classic grammars and regular expressions can be used for a variety of purposes, including parsing, intent detection, and matching. However, the comparisons are performed at a structural level, with constituent elements (words or characters) matched exactly. Recent advances in word embeddings show that semantically related words share common features in a vector-space representation, suggesting the…

    Submitted 14 August, 2018; originally announced August 2018.

  17. arXiv:1705.10851  [pdf, other]

    cs.RO

    Estimating Human Intent for Physical Human-Robot Co-Manipulation

    Authors: Eric C. Townsend, Erich A Mielke, David Wingate, Marc D. Killpack

    Abstract: Human teams can be exceptionally efficient at adapting and collaborating during manipulation tasks using shared mental models. However, the same shared mental models that can be used by humans to perform robust low-level force and motion control during collaborative manipulation tasks are non-existent for robots. For robots to perform collaborative tasks with people naturally and efficiently, unde…

    Submitted 30 May, 2017; originally announced May 2017.

  18. arXiv:1704.04977  [pdf, other]

    cs.AI

    Probabilistic programs for inferring the goals of autonomous agents

    Authors: Marco F. Cusumano-Towner, Alexey Radul, David Wingate, Vikash K. Mansinghka

    Abstract: Intelligent systems sometimes need to infer the probable goals of people, cars, and robots, based on partial observations of their motion. This paper introduces a class of probabilistic programs for formulating and solving these problems. The formulation uses randomized path planning algorithms as the basis for probabilistic models of the process by which autonomous agents plan to achieve their go…

    Submitted 18 April, 2017; v1 submitted 17 April, 2017; originally announced April 2017.

  19. arXiv:1703.03429  [pdf, other]

    cs.AI cs.CL

    What can you do with a rock? Affordance extraction via word embeddings

    Authors: Nancy Fulda, Daniel Ricks, Ben Murdoch, David Wingate

    Abstract: Autonomous agents must often detect affordances: the set of behaviors enabled by a situation. Affordance detection is particularly helpful in domains with large action spaces, allowing the agent to prune its search space by avoiding futile behaviors. This paper presents a method for affordance extraction via word embeddings trained on a Wikipedia corpus. The resulting word vectors are treated as a…

    Submitted 9 March, 2017; originally announced March 2017.

    Comments: 7 pages, 7 figures, 2 algorithms, data runs were performed using the Autoplay learning environment for interactive fiction

    Journal ref: Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI), Pages 1039-1045, 2017

  20. arXiv:1301.1299  [pdf, ps, other]

    stat.ML cs.AI cs.LG

    Automated Variational Inference in Probabilistic Programming

    Authors: David Wingate, Theophane Weber

    Abstract: We present a new algorithm for approximate inference in probabilistic programs, based on a stochastic gradient for variational programs. This method is efficient without restrictions on the probabilistic program; it is particularly practical for distributions which are not analytically tractable, including highly structured distributions that arise in probabilistic programs. We show how to automat…

    Submitted 7 January, 2013; originally announced January 2013.

  21. arXiv:1207.1416  [pdf]

    cs.AI

    Predictive Linear-Gaussian Models of Stochastic Dynamical Systems

    Authors: Matthew Rudary, Satinder Singh, David Wingate

    Abstract: Models of dynamical systems based on predictive state representations (PSRs) are defined strictly in terms of observable quantities, in contrast with traditional models (such as Hidden Markov Models) that use latent variables or statespace representations. In addition, PSRs have an effectively infinite memory, allowing them to model some systems that finite memory-based models cannot. Thus far, PS…

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-501-508

  22. arXiv:1205.2664  [pdf]

    cs.LG

    A Bayesian Sampling Approach to Exploration in Reinforcement Learning

    Authors: John Asmuth, Lihong Li, Michael L. Littman, Ali Nouri, David Wingate

    Abstract: We present a modular approach to reinforcement learning that uses a Bayesian representation of the uncertainty over models. The approach, BOSS (Best of Sampled Set), drives exploration by sampling multiple models from the posterior and selecting actions optimistically. It extends previous work by providing a rule for deciding when to resample and how to combine the models. We show that our algorit…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-19-26

  23. arXiv:1205.2604  [pdf]

    stat.ML cs.LG

    The Infinite Latent Events Model

    Authors: David Wingate, Noah Goodman, Daniel Roy, Joshua Tenenbaum

    Abstract: We present the Infinite Latent Events Model, a nonparametric hierarchical Bayesian distribution over infinite dimensional Dynamic Bayesian Networks with binary state representations and noisy-OR-like transitions. The distribution can be used to learn structure in discrete timeseries data by simultaneously inferring a set of latent events, which events fired at each timestep, and how those events a…

    Submitted 9 May, 2012; originally announced May 2012.

    Comments: Appears in Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI2009)

    Report number: UAI-P-2009-PG-607-614