Showing 1–7 of 7 results for author: Nikishin, E

  1. arXiv:2405.04342  [pdf, other]

    cs.LG

    The Curse of Diversity in Ensemble-Based Exploration

    Authors: Zhixuan Lin, Pierluca D'Oro, Evgenii Nikishin, Aaron Courville

    Abstract: We uncover a surprising phenomenon in deep reinforcement learning: training a diverse ensemble of data-sharing agents -- a well-established exploration strategy -- can significantly impair the performance of the individual ensemble members when compared to standard single-agent training. Through careful analysis, we attribute the degradation in performance to the low proportion of self-generated d…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Published as a conference paper at ICLR 2024

  2. arXiv:2403.07688  [pdf, other]

    cs.LG cs.AI

    Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons

    Authors: Simon Dufort-Labbé, Pierluca D'Oro, Evgenii Nikishin, Razvan Pascanu, Pierre-Luc Bacon, Aristide Baratin

    Abstract: When training deep neural networks, the phenomenon of dying neurons (units that become inactive or saturated, output zero during training) has traditionally been viewed as undesirable, linked with optimization challenges, and contributing to plasticity loss in continual learning scenarios. In this paper, we reassess this phenomenon, focusing on sparsity a…

    Submitted 12 March, 2024; originally announced March 2024.

  3. arXiv:2305.15555  [pdf, other]

    cs.LG cs.AI

    Deep Reinforcement Learning with Plasticity Injection

    Authors: Evgenii Nikishin, Junhyuk Oh, Georg Ostrovski, Clare Lyle, Razvan Pascanu, Will Dabney, André Barreto

    Abstract: A growing body of evidence suggests that neural networks employed in deep reinforcement learning (RL) gradually lose their plasticity, the ability to learn from new data; however, the analysis and mitigation of this phenomenon is hampered by the complex relationship between plasticity, exploration, and performance in RL. This paper introduces plasticity injection, a minimalistic intervention that…

    Submitted 3 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready

  4. arXiv:2303.01486  [pdf, other]

    cs.LG

    Understanding plasticity in neural networks

    Authors: Clare Lyle, Zeyu Zheng, Evgenii Nikishin, Bernardo Avila Pires, Razvan Pascanu, Will Dabney

    Abstract: Plasticity, the ability of a neural network to quickly change its predictions in response to new information, is essential for the adaptability and robustness of deep reinforcement learning systems. Deep neural networks are known to lose plasticity over the course of training even in relatively simple learning problems, but the mechanisms driving this phenomenon are still poorly understood. This p…

    Submitted 27 November, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted to ICML 2023 (oral presentation)

  5. arXiv:2205.07802  [pdf, other]

    cs.LG cs.AI stat.ML

    The Primacy Bias in Deep Reinforcement Learning

    Authors: Evgenii Nikishin, Max Schwarzer, Pierluca D'Oro, Pierre-Luc Bacon, Aaron Courville

    Abstract: This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a tendency to rely on early interactions and ignore useful evidence encountered later. Because of training on progressively growing datasets, deep RL agents incur a risk of overfitting to earlier experiences, negatively affecting the rest of the learning process. Inspired by cognitive science, we refer to this effec…

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: ICML 2022; code at https://github.com/evgenii-nikishin/rl_with_resets

  6. arXiv:2112.06276  [pdf, other]

    cs.LG cs.CR

    Quantifying and Understanding Adversarial Examples in Discrete Input Spaces

    Authors: Volodymyr Kuleshov, Evgenii Nikishin, Shantanu Thakoor, Tingfung Lau, Stefano Ermon

    Abstract: Modern classification algorithms are susceptible to adversarial examples--perturbations to inputs that cause the algorithm to produce undesirable behavior. In this work, we seek to understand and extend adversarial examples across domains in which inputs are discrete, particularly across new domains, such as computational biology. As a step towards this goal, we formalize a notion of synonymous ad…

    Submitted 12 December, 2021; originally announced December 2021.

  7. arXiv:2106.03273  [pdf, other]

    cs.LG cs.AI stat.ML

    Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation

    Authors: Evgenii Nikishin, Romina Abachi, Rishabh Agarwal, Pierre-Luc Bacon

    Abstract: The shortcomings of maximum likelihood estimation in the context of model-based reinforcement learning have been highlighted by an increasing number of papers. When the model class is misspecified or has a limited representational capacity, model parameters with high likelihood might not necessarily result in high performance of the agent on a downstream control task. To alleviate this problem, we…

    Submitted 6 June, 2021; originally announced June 2021.

    Comments: Code at https://github.com/evgenii-nikishin/omd