Skip to main content

Showing 1–6 of 6 results for author: Pásztor, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16745  [pdf, other

    cs.LG cs.AI cs.GT stat.ML

    Bandits with Preference Feedback: A Stackelberg Game Perspective

    Authors: Barna Pásztor, Parnian Kassraie, Andreas Krause

    Abstract: Bandits with preference feedback present a powerful tool for optimizing unknown target functions when only pairwise comparisons are allowed instead of direct value queries. This model allows for incorporating human feedback into online inference and optimization and has been employed in systems for fine-tuning large language models. The problem is well understood in simplified settings with linear… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 30 pages, 8 figures

  2. arXiv:2406.01575  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Stochastic Bilevel Optimization with Lower-Level Contextual Markov Decision Processes

    Authors: Vinzenz Thoma, Barna Pasztor, Andreas Krause, Giorgia Ramponi, Yifan Hu

    Abstract: In various applications, the optimal policy in a strategic decision-making problem depends both on the environmental configuration and exogenous events. For these settings, we introduce Bilevel Optimization with Contextual Markov Decision Processes (BO-CMDP), a stochastic bilevel decision-making model, where the lower level consists of solving a contextual Markov Decision Process (CMDP). BO-CMDP c… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 54 pages, 18 Figures

  3. arXiv:2306.17052  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Safe Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Matej Jusup, Barna Pásztor, Tadeusz Janik, Kenan Zhang, Francesco Corman, Andreas Krause, Ilija Bogunovic

    Abstract: Many applications, e.g., in shared mobility, require coordinating a large number of agents. Mean-field reinforcement learning addresses the resulting scalability challenge by optimizing the policy of a representative agent interacting with the infinite population of identical agents instead of considering individual pairwise interactions. In this paper, we address an important generalization where… ▽ More

    Submitted 27 December, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: 23 pages, 26 figures, 6 tables

  4. arXiv:2107.04050  [pdf, other

    stat.ML cs.LG cs.MA

    Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning

    Authors: Barna Pásztor, Ilija Bogunovic, Andreas Krause

    Abstract: Learning in multi-agent systems is highly challenging due to several factors including the non-stationarity introduced by agents' interactions and the combinatorial nature of their state and action spaces. In particular, we consider the Mean-Field Control (MFC) problem which assumes an asymptotically infinite population of identical agents that aim to collaboratively maximize the collective reward… ▽ More

    Submitted 9 May, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Journal ref: Pásztor, B., Krause, A., & Bogunovic, I. (2023). Efficient Model-Based Multi-Agent Mean-Field Reinforcement Learning. Transactions on Machine Learning Research

  5. arXiv:2010.12002  [pdf, other

    q-fin.ST cs.LG physics.soc-ph q-fin.TR

    On the impact of publicly available news and information transfer to financial markets

    Authors: Metod Jazbec, Barna Pásztor, Felix Faltings, Nino Antulov-Fantulin, Petter N. Kolm

    Abstract: We quantify the propagation and absorption of large-scale publicly available news articles from the World Wide Web to financial markets. To extract publicly available information, we use the news archives from the Common Crawl, a nonprofit organization that crawls a large part of the web. We develop a processing pipeline to identify news articles associated with the constituent companies in the S\… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  6. arXiv:2008.10376  [pdf, other

    cs.CG cs.SI stat.ML

    Stochastic Gradient Descent Works Really Well for Stress Minimization

    Authors: Katharina Börsig, Ulrik Brandes, Barna Pasztor

    Abstract: Stress minimization is among the best studied force-directed graph layout methods because it reliably yields high-quality layouts. It thus comes as a surprise that a novel approach based on stochastic gradient descent (Zheng, Pawar and Goodman, TVCG 2019) is claimed to improve on state-of-the-art approaches based on majorization. We present experimental evidence that the new approach does not actu… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: Appears in the Proceedings of the 28th International Symposium on Graph Drawing and Network Visualization (GD 2020)