Skip to main content

Showing 1–15 of 15 results for author: Kuciński, Ł

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03361  [pdf, other

    cs.LG

    What Matters in Hierarchical Search for Combinatorial Reasoning Problems?

    Authors: Michał Zawalski, Gracjan Góral, Michał Tyrolski, Emilia Wiśnios, Franciszek Budrowski, Łukasz Kuciński, Piotr Miłoś

    Abstract: Efficiently tackling combinatorial reasoning problems, particularly the notorious NP-hard tasks, remains a significant challenge for AI research. Recent efforts have sought to enhance planning by incorporating hierarchical high-level search strategies, known as subgoal methods. While promising, their performance against traditional low-level planners is inconsistent, raising questions about their… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted for Generative Models for Decision Making Workshop at ICLR 2024

  2. arXiv:2403.05713  [pdf, other

    cs.LG

    tsGT: Stochastic Time Series Modeling With Transformer

    Authors: Łukasz Kuciński, Witold Drzewakowski, Mateusz Olko, Piotr Kozakowski, Łukasz Maziarka, Marta Emilia Nowakowska, Łukasz Kaiser, Piotr Miłoś

    Abstract: Time series methods are of fundamental importance in virtually any field of science that deals with temporally structured data. Recently, there has been a surge of deterministic transformer models with time series-specific architectural biases. In this paper, we go in a different direction by introducing tsGT, a stochastic time series model built on a general-purpose transformer architecture. We f… ▽ More

    Submitted 3 April, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2403.03938  [pdf, other

    cs.LG

    GUIDE: Guidance-based Incremental Learning with Diffusion Models

    Authors: Bartosz Cywiński, Kamil Deja, Tomasz Trzciński, Bartłomiej Twardowski, Łukasz Kuciński

    Abstract: We introduce GUIDE, a novel continual learning approach that directs diffusion models to rehearse samples at risk of being forgotten. Existing generative strategies combat catastrophic forgetting by randomly sampling rehearsal examples from a generative model. Such an approach contradicts buffer-based approaches where sampling strategy plays an important role. We propose to bridge this gap by inco… ▽ More

    Submitted 31 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  4. arXiv:2402.02868  [pdf, other

    cs.LG

    Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

    Authors: Maciej Wołczyk, Bartłomiej Cupiał, Mateusz Ostaszewski, Michał Bortkiewicz, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained capabilities, as recently showcased by the successful applications of foundation models. However, fine-tuning reinforcement learning (RL) models remains a challenge. This work conceptualizes one specific cause of poor transfer, accentuated in the RL setting by the interplay between actions and observations: for… ▽ More

    Submitted 12 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  5. arXiv:2312.17296  [pdf, other

    cs.CL

    Structured Packing in LLM Training Improves Long Context Utilization

    Authors: Konrad Staniszewski, Szymon Tworkowski, Sebastian Jaszczur, Yu Zhao, Henryk Michalewski, Łukasz Kuciński, Piotr Miłoś

    Abstract: Recent advancements in long-context large language models have attracted significant attention, yet their practical applications often suffer from suboptimal context utilization. This study investigates structuring training data to enhance semantic interdependence, demonstrating that this approach effectively improves context utilization. To this end, we introduce the Structured Packing for Long C… ▽ More

    Submitted 24 June, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: new experiments with a 13B model

  6. arXiv:2303.04488  [pdf, other

    cs.LG cs.AI cs.LO

    Magnushammer: A Transformer-Based Approach to Premise Selection

    Authors: Maciej Mikuła, Szymon Tworkowski, Szymon Antoniak, Bartosz Piotrowski, Albert Qiaochu Jiang, ** Peng Zhou, Christian Szegedy, Łukasz Kuciński, Piotr Miłoś, Yuhuai Wu

    Abstract: This paper presents a novel approach to premise selection, a crucial reasoning task in automated theorem proving. Traditionally, symbolic methods that rely on extensive domain knowledge and engineering effort are applied to this task. In contrast, this work demonstrates that contrastive training with the transformer architecture can achieve higher-quality retrieval of relevant premises, without th… ▽ More

    Submitted 18 March, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: ICLR 2024

  7. arXiv:2211.13715  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Trust Your $\nabla$: Gradient-based Intervention Targeting for Causal Discovery

    Authors: Mateusz Olko, Michał Zając, Aleksandra Nowak, Nino Scherrer, Yashas Annadani, Stefan Bauer, Łukasz Kuciński, Piotr Miłoś

    Abstract: Inferring causal structure from data is a challenging task of fundamental importance in science. Observational data are often insufficient to identify a system's causal structure uniquely. While conducting interventions (i.e., experiments) can improve the identifiability, such samples are usually challenging and expensive to obtain. Hence, experimental design approaches for causal discovery aim to… ▽ More

    Submitted 3 April, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted to 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  8. arXiv:2209.13900  [pdf, other

    cs.LG

    Disentangling Transfer in Continual Reinforcement Learning

    Authors: Maciej Wołczyk, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

    Abstract: The ability of continual learning systems to transfer knowledge from previously seen tasks in order to maximize performance on new tasks is a significant challenge for the field, limiting the applicability of continual learning solutions to realistic scenarios. Consequently, this study aims to broaden our understanding of transfer and its driving forces in the specific case of continual reinforcem… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: Accepted at NeurIPS 2022

  9. arXiv:2206.00702  [pdf, other

    cs.AI cs.LG

    Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search

    Authors: Michał Zawalski, Michał Tyrolski, Konrad Czechowski, Tomasz Odrzygóźdź, Damian Stachura, Piotr Piękos, Yuhuai Wu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Complex reasoning problems contain states that vary in the computational cost required to determine a good action plan. Taking advantage of this property, we propose Adaptive Subgoal Search (AdaSubS), a search method that adaptively adjusts the planning horizon. To this end, AdaSubS generates diverse sets of subgoals at different distances. A verification mechanism is employed to filter out unreac… ▽ More

    Submitted 25 May, 2024; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: ICLR 2023 (notable-top-5%) website: https://sites.google.com/view/adaptivesubgoalsearch/

    ACM Class: I.2.8; I.2.6

  10. arXiv:2111.15382  [pdf, other

    cs.LG cs.AI

    Continuous Control With Ensemble Deep Deterministic Policy Gradients

    Authors: Piotr Januszewski, Mateusz Olko, Michał Królikowski, Jakub Świątkowski, Marcin Andrychowicz, Łukasz Kuciński, Piotr Miłoś

    Abstract: The growth of deep reinforcement learning (RL) has brought multiple exciting tools and methods to the field. This rapid expansion makes it important to understand the interplay between individual elements of the RL toolbox. We approach this task from an empirical perspective by conducting a study in the continuous control setting. We present multiple insights of fundamental nature, including: an a… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

  11. arXiv:2111.06464  [pdf, other

    cs.LG cs.AI cs.CL

    Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication

    Authors: Łukasz Kuciński, Tomasz Korbak, Paweł Kołodziej, Piotr Miłoś

    Abstract: Communication is compositional if complex signals can be represented as a combination of simpler subparts. In this paper, we theoretically show that inductive biases on both the training framework and the data are needed to develop a compositional communication. Moreover, we prove that compositionality spontaneously arises in the signaling games, where agents communicate over a noisy channel. We e… ▽ More

    Submitted 3 April, 2024; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

  12. arXiv:2108.11204  [pdf, other

    cs.AI cs.LG

    Subgoal Search For Complex Reasoning Tasks

    Authors: Konrad Czechowski, Tomasz Odrzygóźdź, Marek Zbysiński, Michał Zawalski, Krzysztof Olejnik, Yuhuai Wu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Humans excel in solving complex reasoning tasks through a mental process of moving from one idea to a related one. Inspired by this, we propose Subgoal Search (kSubS) method. Its key component is a learned subgoal generator that produces a diversity of subgoals that are both achievable and closer to the solution. Using subgoals reduces the search space and induces a high-level search graph suitabl… ▽ More

    Submitted 3 April, 2024; v1 submitted 25 August, 2021; originally announced August 2021.

    Comments: NeurIPS 2021

  13. arXiv:2105.10919  [pdf, other

    cs.LG cs.AI cs.RO

    Continual World: A Robotic Benchmark For Continual Reinforcement Learning

    Authors: Maciej Wołczyk, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Continual learning (CL) -- the ability to continuously learn, building on previously acquired knowledge -- is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks.… ▽ More

    Submitted 28 October, 2021; v1 submitted 23 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021

  14. arXiv:1912.09996  [pdf, other

    cs.LG cs.AI stat.ML

    Uncertainty-sensitive Learning and Planning with Ensembles

    Authors: Piotr Miłoś, Łukasz Kuciński, Konrad Czechowski, Piotr Kozakowski, Maciek Klimek

    Abstract: We propose a reinforcement learning framework for discrete environments in which an agent makes both strategic and tactical decisions. The former manifests itself through the use of value function, while the latter is powered by a tree search planner. These tools complement each other. The planning module performs a local \textit{what-if} analysis, which allows to avoid tactical pitfalls and boost… ▽ More

    Submitted 4 March, 2020; v1 submitted 19 December, 2019; originally announced December 2019.

  15. arXiv:1910.06079  [pdf, other

    cs.LG cs.AI cs.MA

    Developmentally motivated emergence of compositional communication via template transfer

    Authors: Tomasz Korbak, Julian Zubek, Łukasz Kuciński, Piotr Miłoś, Joanna Rączaszek-Leonardi

    Abstract: This paper explores a novel approach to achieving emergent compositional communication in multi-agent systems. We propose a training regime implementing template transfer, the idea of carrying over learned biases across contexts. In our method, a sender-receiver pair is first trained with disentangled loss functions and then the receiver is transferred to train a new sender with a standard loss. U… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: Accepted for NeurIPS 2019 workshop Emergent Communication: Towards Natural Language