Skip to main content

Showing 1–15 of 15 results for author: Moskovitz, T

.
  1. arXiv:2404.07129  [pdf, other

    cs.LG

    What needs to go right for an induction head? A mechanistic study of in-context learning circuits and their formation

    Authors: Aaditya K. Singh, Ted Moskovitz, Felix Hill, Stephanie C. Y. Chan, Andrew M. Saxe

    Abstract: In-context learning is a powerful emergent ability in transformer models. Prior work in mechanistic interpretability has identified a circuit element that may be critical for in-context learning -- the induction head (IH), which performs a match-and-copy operation. During training of large transformers on natural language data, IHs emerge around the same time as a notable phase change in the loss.… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 26 pages, 18 figures

  2. arXiv:2311.08360  [pdf, other

    cs.LG cs.AI cs.CL

    The Transient Nature of Emergent In-Context Learning in Transformers

    Authors: Aaditya K. Singh, Stephanie C. Y. Chan, Ted Moskovitz, Erin Grant, Andrew M. Saxe, Felix Hill

    Abstract: Transformer neural networks can exhibit a surprising capacity for in-context learning (ICL) despite not being explicitly trained for it. Prior work has provided a deeper understanding of how ICL emerges in transformers, e.g. through the lens of mechanistic interpretability, Bayesian inference, or by examining the distributional properties of training data. However, in each of these cases, ICL is t… ▽ More

    Submitted 11 December, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 19 pages, 16 figures

  3. arXiv:2310.04373  [pdf, other

    cs.LG cs.AI

    Confronting Reward Model Overoptimization with Constrained RLHF

    Authors: Ted Moskovitz, Aaditya K. Singh, DJ Strouse, Tuomas Sandholm, Ruslan Salakhutdinov, Anca D. Dragan, Stephen McAleer

    Abstract: Large language models are typically aligned with human preferences by optimizing $\textit{reward models}$ (RMs) fitted to human feedback. However, human preferences are multi-faceted, and it is increasingly common to derive reward from a composition of simpler reward models which each capture a different aspect of language quality. This itself presents a challenge, as it is difficult to appropriat… ▽ More

    Submitted 10 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  4. arXiv:2309.03710  [pdf, other

    cs.LG

    A State Representation for Diminishing Rewards

    Authors: Ted Moskovitz, Samo Hromadka, Ahmed Touati, Diana Borsa, Maneesh Sahani

    Abstract: A common setting in multitask reinforcement learning (RL) demands that an agent rapidly adapt to various stationary reward functions randomly sampled from a fixed distribution. In such situations, the successor representation (SR) is a popular framework which supports rapid policy evaluation by decoupling a policy's expected discounted, cumulative state occupancies from a specific reward function.… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  5. arXiv:2302.01275  [pdf, other

    cs.LG

    ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs

    Authors: Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy

    Abstract: In recent years, Reinforcement Learning (RL) has been applied to real-world problems with increasing success. Such applications often require to put constraints on the agent's behavior. Existing algorithms for constrained RL (CRL) rely on gradient descent-ascent, but this approach comes with a caveat. While these algorithms are guaranteed to converge on average, they do not guarantee last-iterate… ▽ More

    Submitted 5 March, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  6. arXiv:2211.14469  [pdf, other

    cs.LG cs.AI

    Transfer RL via the Undo Maps Formalism

    Authors: Abhi Gupta, Ted Moskovitz, David Alvarez-Melis, Aldo Pacchiano

    Abstract: Transferring knowledge across domains is one of the most fundamental problems in machine learning, but doing so effectively in the context of reinforcement learning remains largely an open problem. Current methods make strong assumptions on the specifics of the task, often lack principled objectives, and -- crucially -- modify individual policies, which might be sub-optimal when the domains differ… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 8 main pages, 3 appendix

  7. arXiv:2211.07036  [pdf, other

    q-bio.NC

    A Unified Theory of Dual-Process Control

    Authors: Ted Moskovitz, Kevin Miller, Maneesh Sahani, Matthew M. Botvinick

    Abstract: Dual-process theories play a central role in both psychology and neuroscience, figuring prominently in fields ranging from executive control to reward-based learning to judgment and decision making. In each of these domains, two mechanisms appear to operate concurrently, one relatively high in computational complexity, the other relatively simple. Why is neural information processing organized in… ▽ More

    Submitted 10 October, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

  8. arXiv:2207.08258  [pdf, other

    cs.LG

    Minimum Description Length Control

    Authors: Ted Moskovitz, Ta-Chu Kao, Maneesh Sahani, Matthew M. Botvinick

    Abstract: We propose a novel framework for multitask reinforcement learning based on the minimum description length (MDL) principle. In this approach, which we term MDL-control (MDL-C), the agent learns the common structure among the tasks with which it is faced and then distills it into a simpler representation which facilitates faster convergence and generalization to new tasks. In doing so, MDL-C natural… ▽ More

    Submitted 24 July, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

  9. arXiv:2111.02994  [pdf, other

    cs.LG

    Towards an Understanding of Default Policies in Multitask Policy Optimization

    Authors: Ted Moskovitz, Michael Arbel, Jack Parker-Holder, Aldo Pacchiano

    Abstract: Much of the recent success of deep reinforcement learning has been driven by regularized policy optimization (RPO) algorithms with strong performance across multiple domains. In this family of methods, agents are trained to maximize cumulative reward while penalizing deviation in behavior from some reference, or default policy. In addition to empirical success, there is a strong theoretical founda… ▽ More

    Submitted 23 March, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

  10. arXiv:2109.13863  [pdf, other

    cs.LG cs.AI

    A First-Occupancy Representation for Reinforcement Learning

    Authors: Ted Moskovitz, Spencer R. Wilson, Maneesh Sahani

    Abstract: Both animals and artificial agents benefit from state representations that support rapid transfer of learning across tasks and which enable them to efficiently traverse their environments to reach rewarding states. The successor representation (SR), which measures the expected cumulative, discounted state occupancy under a fixed policy, enables efficient transfer to different reward structures in… ▽ More

    Submitted 6 November, 2021; v1 submitted 28 September, 2021; originally announced September 2021.

  11. arXiv:2102.03765  [pdf, other

    cs.LG

    Tactical Optimism and Pessimism for Deep Reinforcement Learning

    Authors: Ted Moskovitz, Jack Parker-Holder, Aldo Pacchiano, Michael Arbel, Michael I. Jordan

    Abstract: In recent years, deep off-policy actor-critic algorithms have become a dominant approach to reinforcement learning for continuous control. One of the primary drivers of this improved performance is the use of pessimistic value updates to address function approximation errors, which previously led to disappointing performance. However, a direct consequence of pessimism is reduced exploration, runni… ▽ More

    Submitted 6 April, 2022; v1 submitted 7 February, 2021; originally announced February 2021.

  12. arXiv:2010.05380  [pdf, other

    cs.LG

    Efficient Wasserstein Natural Gradients for Reinforcement Learning

    Authors: Ted Moskovitz, Michael Arbel, Ferenc Huszar, Arthur Gretton

    Abstract: A novel optimization approach is proposed for application to policy gradient methods and evolution strategies for reinforcement learning (RL). The procedure uses a computationally efficient Wasserstein natural gradient (WNG) descent that takes advantage of the geometry induced by a Wasserstein penalty to speed optimization. This method follows the recent theme in RL of including a divergence penal… ▽ More

    Submitted 18 March, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

  13. arXiv:2002.09737  [pdf, other

    stat.ML cs.LG

    Amortised Learning by Wake-Sleep

    Authors: Li K. Wenliang, Theodore Moskovitz, Heishiro Kanagawa, Maneesh Sahani

    Abstract: Models that employ latent variables to capture structure in observed data lie at the heart of many current unsupervised learning algorithms, but exact maximum-likelihood learning for powerful and flexible latent-variable models is almost always intractable. Thus, state-of-the-art approaches either abandon the maximum-likelihood framework entirely, or else rely on a variety of variational approxima… ▽ More

    Submitted 15 August, 2020; v1 submitted 22 February, 2020; originally announced February 2020.

  14. arXiv:1910.08461  [pdf, other

    cs.LG stat.ML

    First-Order Preconditioning via Hypergradient Descent

    Authors: Ted Moskovitz, Rui Wang, Janice Lan, Sanyam Kapoor, Thomas Miconi, Jason Yosinski, Aditya Rawal

    Abstract: Standard gradient descent methods are susceptible to a range of issues that can impede training, such as high correlations and different scaling in parameter space.These difficulties can be addressed by second-order approaches that apply a pre-conditioning matrix to the gradient to improve convergence. Unfortunately, such algorithms typically struggle to scale to high-dimensional problems, in part… ▽ More

    Submitted 27 April, 2020; v1 submitted 18 October, 2019; originally announced October 2019.

  15. arXiv:1812.06488  [pdf, other

    cs.NE cs.LG stat.ML

    Feedback alignment in deep convolutional networks

    Authors: Theodore H. Moskovitz, Ashok Litwin-Kumar, L. F. Abbott

    Abstract: Ongoing studies have identified similarities between neural representations in biological networks and in deep artificial neural networks. This has led to renewed interest in develo** analogies between the backpropagation learning algorithm used to train artificial networks and the synaptic plasticity rules operative in the brain. These efforts are challenged by biologically implausible features… ▽ More

    Submitted 10 June, 2019; v1 submitted 12 December, 2018; originally announced December 2018.