Showing 1–5 of 5 results for author: Czechowski, K

  1. arXiv:2206.00702  [pdf, other]

    cs.AI cs.LG

    Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search

    Authors: Michał Zawalski, Michał Tyrolski, Konrad Czechowski, Tomasz Odrzygóźdź, Damian Stachura, Piotr Piękos, Yuhuai Wu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Complex reasoning problems contain states that vary in the computational cost required to determine a good action plan. Taking advantage of this property, we propose Adaptive Subgoal Search (AdaSubS), a search method that adaptively adjusts the planning horizon. To this end, AdaSubS generates diverse sets of subgoals at different distances. A verification mechanism is employed to filter out unreac…

    Submitted 25 May, 2024; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: ICLR 2023 (notable-top-5%) website: https://sites.google.com/view/adaptivesubgoalsearch/

    ACM Class: I.2.8; I.2.6

  2. arXiv:2205.10893  [pdf, other]

    cs.AI

    Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers

    Authors: Albert Q. Jiang, Wenda Li, Szymon Tworkowski, Konrad Czechowski, Tomasz Odrzygóźdź, Piotr Miłoś, Yuhuai Wu, Mateja Jamnik

    Abstract: In theorem proving, the task of selecting useful premises from a large library to unlock the proof of a given conjecture is crucially important. This presents a challenge for all theorem provers, especially the ones based on language models, due to their relative inability to reason over huge volumes of premises in text form. This paper introduces Thor, a framework integrating language models and…

    Submitted 22 May, 2022; originally announced May 2022.

  3. arXiv:2108.11204  [pdf, other]

    cs.AI cs.LG

    Subgoal Search For Complex Reasoning Tasks

    Authors: Konrad Czechowski, Tomasz Odrzygóźdź, Marek Zbysiński, Michał Zawalski, Krzysztof Olejnik, Yuhuai Wu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Humans excel in solving complex reasoning tasks through a mental process of moving from one idea to a related one. Inspired by this, we propose Subgoal Search (kSubS) method. Its key component is a learned subgoal generator that produces a diversity of subgoals that are both achievable and closer to the solution. Using subgoals reduces the search space and induces a high-level search graph suitabl…

    Submitted 3 April, 2024; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: NeurIPS 2021

  4. arXiv:1912.09996  [pdf, other]

    cs.LG cs.AI stat.ML

    Uncertainty-sensitive Learning and Planning with Ensembles

    Authors: Piotr Miłoś, Łukasz Kuciński, Konrad Czechowski, Piotr Kozakowski, Maciek Klimek

    Abstract: We propose a reinforcement learning framework for discrete environments in which an agent makes both strategic and tactical decisions. The former manifests itself through the use of value function, while the latter is powered by a tree search planner. These tools complement each other. The planning module performs a local what-if analysis, which allows to avoid tactical pitfalls and boost…

    Submitted 4 March, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

  5. arXiv:1903.00374  [pdf, other]

    cs.LG stat.ML

    Model-Based Reinforcement Learning for Atari

    Authors: Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski

    Abstract: Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more, in fact, than a human would need to learn the same games. How can people learn so quickly? Part of the answer may be that people can learn how the game works and…

    Submitted 3 April, 2024; v1 submitted 1 March, 2019; originally announced March 2019.