
Showing 1–17 of 17 results for author: Ostaszewski, M

Searching in archive cs.
  1. arXiv:2405.16158  [pdf, other]

    cs.LG

    Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

    Authors: Michal Nauman, Mateusz Ostaszewski, Krzysztof Jankowski, Piotr Miłoś, Marek Cygan

    Abstract: Sample efficiency in Reinforcement Learning (RL) has traditionally been driven by algorithmic enhancements. In this work, we demonstrate that scaling can also lead to substantial improvements. We conduct a thorough investigation into the interplay of scaling model capacity and domain-specific RL enhancements. These empirical findings inform the design choices underlying our proposed BRO (Bigger, R…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Preprint

  2. arXiv:2403.01014  [pdf, other]

    cs.LG

    A Case for Validation Buffer in Pessimistic Actor-Critic

    Authors: Michal Nauman, Mateusz Ostaszewski, Marek Cygan

    Abstract: In this paper, we investigate the issue of error accumulation in critic networks updated via pessimistic temporal difference objectives. We show that the critic approximation error can be approximated via a recursive fixed-point model similar to that of the Bellman value. We use such recursive definition to retrieve the conditions under which the pessimistic critic is unbiased. Building on these i…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Preprint

  3. arXiv:2403.00514  [pdf, other]

    cs.LG

    Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

    Authors: Michal Nauman, Michał Bortkiewicz, Piotr Miłoś, Tomasz Trzciński, Mateusz Ostaszewski, Marek Cygan

    Abstract: Recent advancements in off-policy Reinforcement Learning (RL) have significantly improved sample efficiency, primarily due to the incorporation of various forms of regularization that enable more gradient update steps than traditional agents. However, many of these techniques have been tested in limited settings, often on tasks from single simulation benchmarks and against well-known algorithms ra…

    Submitted 19 June, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  4. arXiv:2402.03500  [pdf, other]

    quant-ph cs.AI cs.LG

    Curriculum reinforcement learning for quantum architecture search under hardware errors

    Authors: Yash J. Patel, Akash Kundu, Mateusz Ostaszewski, Xavier Bonet-Monroig, Vedran Dunjko, Onur Danaci

    Abstract: The key challenge in the noisy intermediate-scale quantum era is finding useful circuits compatible with current device limitations. Variational quantum algorithms (VQAs) offer a potential solution by fixing the circuit architecture and optimizing individual gate parameters in an external loop. However, parameter optimization can become intractable, and the overall performance of the algorithm dep…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 32 pages, 11 figures, 6 tables. Accepted at ICLR 2024

  5. arXiv:2402.02868  [pdf, other]

    cs.LG

    Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem

    Authors: Maciej Wołczyk, Bartłomiej Cupiał, Mateusz Ostaszewski, Michał Bortkiewicz, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś

    Abstract: Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained capabilities, as recently showcased by the successful applications of foundation models. However, fine-tuning reinforcement learning (RL) models remains a challenge. This work conceptualizes one specific cause of poor transfer, accentuated in the RL setting by the interplay between actions and observations: for…

    Submitted 12 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  6. arXiv:2310.19537  [pdf, other]

    cs.LG

    On consequences of finetuning on data with highly discriminative features

    Authors: Wojciech Masarczyk, Tomasz Trzciński, Mateusz Ostaszewski

    Abstract: In the era of transfer learning, training neural networks from scratch is becoming obsolete. Transfer learning leverages prior knowledge for new tasks, conserving computational resources. While its advantages are well-documented, we uncover a notable drawback: networks tend to prioritize basic data patterns, forsaking valuable pre-learned features. We term this behavior "feature erosion" and analy…

    Submitted 15 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 -- UniReps Workshop

  7. arXiv:2306.11086  [pdf, other]

    quant-ph cs.AI cs.LG

    Enhancing variational quantum state diagonalization using reinforcement learning techniques

    Authors: Akash Kundu, Przemysław Bedełek, Mateusz Ostaszewski, Onur Danaci, Yash J. Patel, Vedran Dunjko, Jarosław A. Miszczak

    Abstract: The variational quantum algorithms are crucial for the application of NISQ computers. Such algorithms require short quantum circuits, which are more amenable to implementation on near-term hardware, and many such methods have been developed. One of particular interest is the so-called variational quantum state diagonalization method, which constitutes an important algorithmic subroutine and can be…

    Submitted 11 January, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 24 pages with 13 figures, accepted in the New Journal of Physics, code available at https://github.com/iitis/RL_for_VQSD_ansatz_optimization

    Journal ref: New Journal of Physics, 26, 013034 (2024)

  8. arXiv:2305.19753  [pdf, other]

    cs.LG cs.CV

    The Tunnel Effect: Building Data Representations in Deep Neural Networks

    Authors: Wojciech Masarczyk, Mateusz Ostaszewski, Ehsan Imani, Razvan Pascanu, Piotr Miłoś, Tomasz Trzciński

    Abstract: Deep neural networks are widely known for their remarkable effectiveness across various tasks, with the consensus that deeper networks implicitly learn more complex data representations. This paper shows that sufficiently deep networks trained for supervised image classification split into two distinct parts that contribute to the resulting data representations differently. The initial layers crea…

    Submitted 30 October, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  9. arXiv:2211.15944  [pdf, other]

    cs.LG cs.AI

    The Effectiveness of World Models for Continual Reinforcement Learning

    Authors: Samuel Kessler, Mateusz Ostaszewski, Michał Bortkiewicz, Mateusz Żarski, Maciej Wołczyk, Jack Parker-Holder, Stephen J. Roberts, Piotr Miłoś

    Abstract: World models power some of the most efficient reinforcement learning algorithms. In this work, we showcase that they can be harnessed for continual learning - a situation when the agent faces changing environments. World models typically employ a replay buffer for training, which can be naturally extended to continual learning. We systematically study how different selective experience replay meth…

    Submitted 12 July, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted at CoLLAs 2023, 21 pages, 15 figures

  10. arXiv:2211.06351  [pdf, other]

    cs.LG cs.AI cs.MA

    Emergency action termination for immediate reaction in hierarchical reinforcement learning

    Authors: Michał Bortkiewicz, Jakub Łyskawa, Paweł Wawrzyński, Mateusz Ostaszewski, Artur Grudkowski, Tomasz Trzciński

    Abstract: Hierarchical decomposition of control is unavoidable in large dynamical systems. In reinforcement learning (RL), it is usually solved with subgoals defined at higher policy levels and achieved at lower policy levels. Reaching these goals can take a substantial amount of time, during which it is not verified whether they are still worth pursuing. However, due to the randomness of the environment, t…

    Submitted 11 November, 2022; originally announced November 2022.

  11. arXiv:2208.00156  [pdf, other]

    cs.LG

    Reinforcement learning with experience replay and adaptation of action dispersion

    Authors: Paweł Wawrzyński, Wojciech Masarczyk, Mateusz Ostaszewski

    Abstract: Effective reinforcement learning requires a proper balance of exploration and exploitation defined by the dispersion of action distribution. However, this balance depends on the task, the current stage of the learning process, and the current environment state. Existing methods that designate the action distribution dispersion require problem-dependent hyperparameters. In this paper, we propose to…

    Submitted 30 July, 2022; originally announced August 2022.

    ACM Class: I.2.6

  12. arXiv:2103.16089  [pdf, other]

    quant-ph cs.AI

    Reinforcement learning for optimization of variational quantum circuit architectures

    Authors: Mateusz Ostaszewski, Lea M. Trenkwalder, Wojciech Masarczyk, Eleanor Scerri, Vedran Dunjko

    Abstract: The study of Variational Quantum Eigensolvers (VQEs) has been in the spotlight in recent times as they may lead to real-world applications of near-term quantum devices. However, their performance depends on the structure of the used variational ansatz, which requires balancing the depth and expressivity of the corresponding circuit. In recent years, various methods for VQE structure optimization h…

    Submitted 30 March, 2021; originally announced March 2021.

  13. arXiv:1909.05507  [pdf, other]

    cs.NE cs.CV eess.IV

    Effective training of deep convolutional neural networks for hyperspectral image classification through artificial labeling

    Authors: Wojciech Masarczyk, Przemysław Głomb, Bartosz Grabowski, Mateusz Ostaszewski

    Abstract: Hyperspectral imaging is a rich source of data, allowing for multitude of effective applications. However, such imaging remains challenging because of large data dimension and, typically, small pool of available training examples. While deep learning approaches have been shown to be successful in providing effective classification solutions, especially for high dimensional problems, unfortunately…

    Submitted 22 October, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Journal ref: Remote Sens. 2020, 12, 2653

  14. Approximation of quantum control correction scheme using deep neural networks

    Authors: M. Ostaszewski, J. A. Miszczak, P. Sadowski, L. Banchi

    Abstract: We study the functional relationship between quantum control pulses in the idealized case and the pulses in the presence of an unwanted drift. We show that a class of artificial neural networks called LSTM is able to model this functional relationship with high efficiency, and hence the correction scheme required to counterbalance the effect of the drift. Our solution allows studying the mapping f…

    Submitted 28 March, 2019; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: 6 pages, 3 figures, Python code available upon request. arXiv admin note: text overlap with arXiv:1803.05169

    Journal ref: Quantum Inf Process (2019), 18:126

  15. Geometrical versus time-series representation of data in quantum control learning

    Authors: M. Ostaszewski, J. A. Miszczak, P. Sadowski

    Abstract: Recently machine learning techniques have become popular for analysing physical systems and solving problems occurring in quantum computing. In this paper we focus on using such techniques for finding the sequence of physical operations implementing the given quantum logical operation. In this context we analyse the flexibility of the data representation and compare the applicability of two machin…

    Submitted 29 April, 2020; v1 submitted 14 March, 2018; originally announced March 2018.

    Comments: 12 pages, 14 figures, Python code available upon request

    Journal ref: Journal of Physics A: Mathematical and Theoretical, Volume 53, Number 19 (2020)

  16. Lively quantum walks on cycles

    Authors: Przemysław Sadowski, Jarosław Adam Miszczak, Mateusz Ostaszewski

    Abstract: We introduce a family of quantum walks on cycles parametrized by their liveliness, defined by the ability to execute a long-range move. We investigate the behaviour of the probability distribution and time-averaged probability distribution. We show that the liveliness parameter, controlling the magnitude of the additional long-range move, has a direct impact on the periodicity of the limiting dist…

    Submitted 8 February, 2017; v1 submitted 9 December, 2015; originally announced December 2015.

    Comments: 13 pages

    MSC Class: 81P45; 94A05; 05C81 ACM Class: C.2.1; I.6.5

    Journal ref: J. Phys. A: Math. Theor. 49 375302 (2016)

  17. arXiv:1504.00580  [pdf, ps, other]

    quant-ph cs.CV cs.LG

    Quantum image classification using principal component analysis

    Authors: Mateusz Ostaszewski, Przemysław Sadowski, Piotr Gawron

    Abstract: We present a novel quantum algorithm for classification of images. The algorithm is constructed using principal component analysis and von Neumann quantum measurements. In order to apply the algorithm we present a new quantum representation of grayscale images.

    Submitted 2 April, 2015; originally announced April 2015.

    Comments: 9 pages

    Journal ref: Theoretical and Applied Informatics, Vol. 27, No. 1, pp. 1-12 (2015)