Skip to main content

Showing 1–6 of 6 results for author: Kozakowski, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05713  [pdf, other

    cs.LG

    tsGT: Stochastic Time Series Modeling With Transformer

    Authors: Łukasz Kuciński, Witold Drzewakowski, Mateusz Olko, Piotr Kozakowski, Łukasz Maziarka, Marta Emilia Nowakowska, Łukasz Kaiser, Piotr Miłoś

    Abstract: Time series methods are of fundamental importance in virtually any field of science that deals with temporally structured data. Recently, there has been a surge of deterministic transformer models with time series-specific architectural biases. In this paper, we go in a different direction by introducing tsGT, a stochastic time series model built on a general-purpose transformer architecture. We f… ▽ More

    Submitted 3 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2310.07313  [pdf, other

    cs.LG stat.ML

    Molecule-Edit Templates for Efficient and Accurate Retrosynthesis Prediction

    Authors: Mikołaj Sacha, Michał Sadowski, Piotr Kozakowski, Ruard van Workum, Stanisław Jastrzębski

    Abstract: Retrosynthesis involves determining a sequence of reactions to synthesize complex molecules from simpler precursors. As this poses a challenge in organic chemistry, machine learning has offered solutions, particularly for predicting possible reaction substrates for a given target molecule. These solutions mainly fall into template-based and template-free categories. The former is efficient but rel… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    ACM Class: I.2.1; I.5.1

  3. Planning and Learning Using Adaptive Entropy Tree Search

    Authors: Piotr Kozakowski, Mikołaj Pacek, Piotr Miłoś

    Abstract: Recent breakthroughs in Artificial Intelligence have shown that the combination of tree-based planning with deep learning can lead to superior performance. We present Adaptive Entropy Tree Search (ANTS) - a novel algorithm combining planning and learning in the maximum entropy paradigm. Through a comprehensive suite of experiments on the Atari benchmark we show that ANTS significantly outperforms… ▽ More

    Submitted 14 March, 2023; v1 submitted 12 February, 2021; originally announced February 2021.

  4. arXiv:2102.06782  [pdf, other

    cs.LG

    Q-Value Weighted Regression: Reinforcement Learning with Limited Data

    Authors: Piotr Kozakowski, Łukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kańska

    Abstract: Sample efficiency and performance in the offline setting have emerged as significant challenges of deep reinforcement learning. We introduce Q-Value Weighted Regression (QWR), a simple RL algorithm that excels in these aspects. QWR is an extension of Advantage Weighted Regression (AWR), an off-policy actor-critic algorithm that performs very well on continuous control tasks, also in the offline se… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  5. arXiv:1912.09996  [pdf, other

    cs.LG cs.AI stat.ML

    Uncertainty-sensitive Learning and Planning with Ensembles

    Authors: Piotr Miłoś, Łukasz Kuciński, Konrad Czechowski, Piotr Kozakowski, Maciek Klimek

    Abstract: We propose a reinforcement learning framework for discrete environments in which an agent makes both strategic and tactical decisions. The former manifests itself through the use of value function, while the latter is powered by a tree search planner. These tools complement each other. The planning module performs a local \textit{what-if} analysis, which allows to avoid tactical pitfalls and boost… ▽ More

    Submitted 4 March, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

  6. arXiv:1903.00374  [pdf, other

    cs.LG stat.ML

    Model-Based Reinforcement Learning for Atari

    Authors: Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski

    Abstract: Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more, in fact, than a human would need to learn the same games. How can people learn so quickly? Part of the answer may be that people can learn how the game works and… ▽ More

    Submitted 3 April, 2024; v1 submitted 1 March, 2019; originally announced March 2019.