Skip to main content

Showing 1–14 of 14 results for author: Trott, A

.
  1. arXiv:2312.17482  [pdf, other

    cs.CL cs.LG

    MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining

    Authors: Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle

    Abstract: Although BERT-style encoder models are heavily used in NLP research, many researchers do not pretrain their own BERTs from scratch due to the high cost of training. In the past half-decade since BERT first rose to prominence, many advances have been made with other transformer architectures and training configurations that have yet to be systematically incorporated into BERT. Here, we introduce Mo… ▽ More

    Submitted 16 January, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures in main text. 25 pages total

    Journal ref: NeurIPS 2023

  2. arXiv:2311.13133  [pdf, other

    cs.LG cs.AI cs.CL

    LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms

    Authors: Aditi Jha, Sam Havens, Jeremy Dohmann, Alex Trott, Jacob Portes

    Abstract: Large Language Models are traditionally finetuned on large instruction datasets. However recent studies suggest that small, high-quality datasets can suffice for general purpose instruction following. This lack of consensus surrounding finetuning best practices is in part due to rapidly diverging approaches to LLM evaluation. In this study, we ask whether a small amount of diverse finetuning sampl… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 36 pages, 12 figures, NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following

  3. arXiv:2203.13395  [pdf, other

    cs.MA cs.GT

    Platform Behavior under Market Shocks: A Simulation Framework and Reinforcement-Learning Based Study

    Authors: Xintong Wang, Gary Qiurui Ma, Alon Eden, Clara Li, Alexander Trott, Stephan Zheng, David C. Parkes

    Abstract: We study the behavior of an economic platform (e.g., Amazon, Uber Eats, Instacart) under shocks, such as COVID-19 lockdowns, and the effect of different regulation considerations imposed on a platform. To this end, we develop a multi-agent Gym environment of a platform economy in a dynamic, multi-period setting, with the possible occurrence of economic shocks. Buyers and sellers are modeled as eco… ▽ More

    Submitted 4 January, 2023; v1 submitted 24 March, 2022; originally announced March 2022.

  4. arXiv:2202.01691  [pdf, other

    cs.MA cs.AI

    Solving Dynamic Principal-Agent Problems with a Rationally Inattentive Principal

    Authors: Tong Mu, Stephan Zheng, Alexander Trott

    Abstract: Principal-Agent (PA) problems describe a broad class of economic relationships characterized by misaligned incentives and asymmetric information. The Principal's problem is to find optimal incentives given the available information, e.g., a manager setting optimal wages for its employees. Whereas the Principal is often assumed rational, comparatively little is known about solutions when the Princi… ▽ More

    Submitted 17 February, 2022; v1 submitted 18 January, 2022; originally announced February 2022.

    Comments: 22 pages, 8 figures, including appendix

  5. arXiv:2201.01163  [pdf, other

    cs.GT cs.AI cs.LG econ.GN

    Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning

    Authors: Michael Curry, Alexander Trott, Soham Phade, Yu Bai, Stephan Zheng

    Abstract: Real economies can be modeled as a sequential imperfect-information game with many heterogeneous agents, such as consumers, firms, and governments. Dynamic general equilibrium (DGE) models are often used for macroeconomic analysis in this setting. However, finding general equilibria is challenging using existing theoretical or computational methods, especially when using microfoundations to model… ▽ More

    Submitted 23 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

  6. arXiv:2108.02904  [pdf, other

    cs.LG cs.AI cs.MA econ.EM econ.GN

    Building a Foundation for Data-Driven, Interpretable, and Robust Policy Design using the AI Economist

    Authors: Alexander Trott, Sunil Srinivasa, Douwe van der Wal, Sebastien Haneuse, Stephan Zheng

    Abstract: Optimizing economic and public policy is critical to address socioeconomic issues and trade-offs, e.g., improving equality, productivity, or wellness, and poses a complex mechanism design problem. A policy designer needs to consider multiple objectives, policy levers, and behavioral responses from strategic actors who optimize for their individual objectives. Moreover, real-world policies should b… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 41 pages, 14 figures. AT, SS, and SZ contributed equally

  7. arXiv:2108.02755  [pdf, other

    cs.LG econ.GN

    The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning

    Authors: Stephan Zheng, Alexander Trott, Sunil Srinivasa, David C. Parkes, Richard Socher

    Abstract: AI and reinforcement learning (RL) have improved many areas, but are not yet widely adopted in economic policy design, mechanism design, or economics at large. At the same time, current economic methodology is limited by a lack of counterfactual data, simplistic behavioral models, and limited opportunities to experiment with policies and evaluate behavioral responses. Here we show that machine-lea… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: Substantial Extension of arXiv:2004.13332. SZ and AT contributed equally

  8. arXiv:2106.05492  [pdf, other

    cs.LG

    Learning to Play General-Sum Games Against Multiple Boundedly Rational Agents

    Authors: Eric Zhao, Alexander R. Trott, Caiming Xiong, Stephan Zheng

    Abstract: We study the problem of training a principal in a multi-agent general-sum game using reinforcement learning (RL). Learning a robust principal policy requires anticipating the worst possible strategic responses of other agents, which is generally NP-hard. However, we show that no-regret dynamics can identify these worst-case responses in poly-time in smooth games. We propose a framework that uses t… ▽ More

    Submitted 19 December, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 15 pages, 6 figures. Appearing at the Thirty-seventh AAAI Conference on Artificial Intelligence (AAAI 2023)

  9. arXiv:2004.13332  [pdf, other

    econ.GN cs.LG stat.ML

    The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies

    Authors: Stephan Zheng, Alexander Trott, Sunil Srinivasa, Nikhil Naik, Melvin Gruesbeck, David C. Parkes, Richard Socher

    Abstract: Tackling real-world socio-economic challenges requires designing and testing economic policies. However, this is hard in practice, due to a lack of appropriate (micro-level) economic data and limited opportunity to experiment. In this work, we train social planners that discover tax policies in dynamic economies that can effectively trade-off economic equality and productivity. We propose a two-le… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: 46 pages, 21 figures

  10. arXiv:2002.03647  [pdf, other

    cs.LG cs.AI stat.ML

    Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills

    Authors: Víctor Campos, Alexander Trott, Caiming Xiong, Richard Socher, Xavier Giro-i-Nieto, Jordi Torres

    Abstract: Acquiring abilities in the absence of a task-oriented reward function is at the frontier of reinforcement learning research. This problem has been studied through the lens of empowerment, which draws a connection between option discovery and information theory. Information-theoretic skill discovery methods have garnered much interest from the community, but little research has been conducted in un… ▽ More

    Submitted 3 August, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 17 pages, 11 figures. Code is publicly available at https://github.com/victorcampos7/edl

  11. arXiv:1911.01417  [pdf, other

    cs.AI

    Kee** Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards

    Authors: Alexander Trott, Stephan Zheng, Caiming Xiong, Richard Socher

    Abstract: While using shaped rewards can be beneficial when solving sparse reward tasks, their successful application often requires careful engineering and is problem specific. For instance, in tasks where the agent must achieve some goal state, simple distance-to-goal reward sha** often fails, as it renders learning vulnerable to local optima. We introduce a simple and effective model-free method to lea… ▽ More

    Submitted 4 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019

  12. arXiv:1907.00664  [pdf, other

    cs.LG stat.ML

    Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

    Authors: Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher

    Abstract: In many real-world scenarios, an autonomous agent often encounters various tasks within a single complex environment. We propose to build a graph abstraction over the environment structure to accelerate the learning of these tasks. Here, nodes are important points of interest (pivotal states) and edges represent feasible traversals between them. Our approach has two stages. First, we jointly train… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

  13. arXiv:1902.00528  [pdf, other

    cs.LG stat.ML

    Competitive Experience Replay

    Authors: Hao Liu, Alexander Trott, Richard Socher, Caiming Xiong

    Abstract: Deep learning has achieved remarkable successes in solving challenging reinforcement learning (RL) problems when dense reward function is provided. However, in sparse reward environment it still often suffers from the need to carefully shape reward function to guide policy optimization. This limits the applicability of RL in the real world since both reinforcement learning and domain-specific know… ▽ More

    Submitted 16 February, 2019; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: Published as a conference paper at Seventh International Conference on Learning Representations(ICLR 2019)

  14. arXiv:1712.08697  [pdf, other

    cs.AI cs.CL cs.CV

    Interpretable Counting for Visual Question Answering

    Authors: Alexander Trott, Caiming Xiong, Richard Socher

    Abstract: Questions that require counting a variety of objects in images remain a major challenge in visual question answering (VQA). The most common approaches to VQA involve either classifying answers based on fixed length representations of both the image and question or summing fractional counts estimated from each section of the image. In contrast, we treat counting as a sequential decision process and… ▽ More

    Submitted 1 March, 2018; v1 submitted 22 December, 2017; originally announced December 2017.

    Comments: ICLR 2018