Showing 1–7 of 7 results for author: Blüml, J

  1. arXiv:2406.16748  [pdf, other]

    cs.LG cs.CL

    OCALM: Object-Centric Assessment with Language Models

    Authors: Timo Kaufmann, Jannis Blüml, Antonia Wüst, Quentin Delfosse, Kristian Kersting, Eyke Hüllermeier

    Abstract: Properly defining a reward signal to efficiently train a reinforcement learning (RL) agent is a challenging task. Designing balanced objective functions from which a desired behavior can emerge requires expert knowledge, especially for complex environments. Learning rewards from human feedback or using large language models (LLMs) to directly provide rewards are promising alternatives, allowing no…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted at the RLBRew Workshop at RLC 2024

  2. arXiv:2406.03997  [pdf, other]

    cs.AI cs.LG

    HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning

    Authors: Quentin Delfosse, Jannis Blüml, Bjarne Gregori, Kristian Kersting

    Abstract: Artificial agents' adaptability to novelty and alignment with intended behavior is crucial for their effective deployment. Reinforcement learning (RL) leverages novelty as a means of exploration, yet agents often struggle to handle novel situations, hindering generalization. To address these issues, we propose HackAtari, a framework introducing controlled novelty to the most common RL benchmark, t…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 9 main pages, 4 pages references, 19 pages of appendix

  3. arXiv:2402.08511  [pdf, other]

    cs.AI

    Amplifying Exploration in Monte-Carlo Tree Search by Focusing on the Unknown

    Authors: Cedric Derstroff, Jannis Brugger, Jannis Blüml, Mira Mezini, Stefan Kramer, Kristian Kersting

    Abstract: Monte-Carlo tree search (MCTS) is an effective anytime algorithm with a vast amount of applications. It strategically allocates computational resources to focus on promising segments of the search tree, making it a very attractive search algorithm in large search spaces. However, it often expends its limited resources on reevaluating previously explored regions when they remain the most promising…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 10 pages, 7 figures

  4. arXiv:2401.16852  [pdf, other]

    cs.LG

    Checkmating One, by Using Many: Combining Mixture of Experts with MCTS to Improve in Chess

    Authors: Felix Helfenstein, Jannis Blüml, Johannes Czech, Kristian Kersting

    Abstract: This paper presents a new approach that integrates deep learning with computational chess, using both the Mixture of Experts (MoE) method and Monte-Carlo Tree Search (MCTS). Our methodology employs a suite of specialized models, each designed to respond to specific changes in the game's input data. This results in a framework with sparsely activated models, which provides significant computational…

    Submitted 10 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Code available under https://github.com/HelpstoneX/CrazyAra

  5. arXiv:2311.13414  [pdf, other]

    cs.LG cs.AI cs.GT

    From Images to Connections: Can DQN with GNNs learn the Strategic Game of Hex?

    Authors: Yannik Keller, Jannis Blüml, Gopika Sudhakaran, Kristian Kersting

    Abstract: The gameplay of strategic board games such as chess, Go and Hex is often characterized by combinatorial, relational structures -- capturing distinct interactions and non-local patterns -- and not just images. Nonetheless, most common self-play reinforcement learning (RL) approaches simply approximate policy and value functions using convolutional neural networks (CNN). A key feature of CNNs is the…

    Submitted 22 November, 2023; originally announced November 2023.

  6. arXiv:2306.08649  [pdf, other]

    cs.LG cs.AI cs.CV

    OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments

    Authors: Quentin Delfosse, Jannis Blüml, Bjarne Gregori, Sebastian Sztwiertnia, Kristian Kersting

    Abstract: Cognitive science and psychology suggest that object-centric representations of complex scenes are a promising step towards enabling efficient abstract reasoning from low-level perceptual features. Yet, most deep reinforcement learning approaches only rely on pixel-based representations that do not capture the compositional properties of natural scenes. For this, we need environments and datasets…

    Submitted 27 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 26 pages, 8 main paper pages, 36 appendix pages. In main paper: 4 figures, 3 tables

  7. arXiv:2304.14918  [pdf, other]

    cs.AI

    Representation Matters: The Game of Chess Poses a Challenge to Vision Transformers

    Authors: Johannes Czech, Jannis Blüml, Kristian Kersting

    Abstract: While transformers have gained the reputation as the "Swiss army knife of AI", no one has challenged them to master the game of chess, one of the classical AI benchmarks. Simply using vision transformers (ViTs) within AlphaZero does not master the game of chess, mainly because ViTs are too slow. Even making them more efficient using a combination of MobileNet and NextViT does not beat what actuall…

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 11 pages, 5 figures, 8 tables