Skip to main content

Showing 1–6 of 6 results for author: Lanzendörfer, L A

.
  1. arXiv:2407.00401  [pdf, other

    cs.LG cs.AI

    PUZZLES: A Benchmark for Neural Algorithmic Reasoning

    Authors: Benjamin Estermann, Luca A. Lanzendörfer, Yannick Niedermayr, Roger Wattenhofer

    Abstract: Algorithmic reasoning is a fundamental cognitive ability that plays a pivotal role in problem-solving and decision-making processes. Reinforcement Learning (RL) has demonstrated remarkable proficiency in tasks such as motor control, handling perceptual input, and managing stochastic environments. These advancements have been enabled in part by the availability of benchmarks. In this work we introd… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  2. arXiv:2406.01631  [pdf, other

    cs.IR cs.LG

    An LLM-based Recommender System Environment

    Authors: Nathan Corecco, Giorgio Piatti, Luca A. Lanzendörfer, Flint Xiaofeng Fan, Roger Wattenhofer

    Abstract: Reinforcement learning (RL) has gained popularity in the realm of recommender systems due to its ability to optimize long-term rewards and guide users in discovering relevant content. However, the successful implementation of RL in recommender systems is challenging because of several factors, including the limited availability of online data for training on-policy methods. This scarcity requires… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  3. arXiv:2403.20156  [pdf, other

    cs.LG cs.AI

    CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

    Authors: Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer

    Abstract: In this study, we delve into Federated Reinforcement Learning (FedRL) in the context of value-based agents operating across diverse Markov Decision Processes (MDPs). Existing FedRL methods typically aggregate agents' learning by averaging the value functions across them to improve their performance. However, this aggregation strategy is suboptimal in heterogeneous environments where agents converg… ▽ More

    Submitted 16 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  4. arXiv:2306.13512  [pdf, other

    cs.SD cs.LG eess.AS

    DISCO-10M: A Large-Scale Music Dataset

    Authors: Luca A. Lanzendörfer, Florian Grötschla, Emil Funke, Roger Wattenhofer

    Abstract: Music datasets play a crucial role in advancing research in machine learning for music. However, existing music datasets suffer from limited size, accessibility, and lack of audio resources. To address these shortcomings, we present DISCO-10M, a novel and extensive music dataset that surpasses the largest previously available music dataset by an order of magnitude. To ensure high-quality data, we… ▽ More

    Submitted 5 October, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Track on Datasets and Benchmarks

  5. arXiv:2306.12957  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Siamese SIREN: Audio Compression with Implicit Neural Representations

    Authors: Luca A. Lanzendörfer, Roger Wattenhofer

    Abstract: Implicit Neural Representations (INRs) have emerged as a promising method for representing diverse data modalities, including 3D shapes, images, and audio. While recent research has demonstrated successful applications of INRs in image and 3D shape compression, their potential for audio compression remains largely unexplored. Motivated by this, we present a preliminary investigation into the use o… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Published as a workshop paper at ICML 2023 neural compression workshop

  6. arXiv:2306.01009  [pdf, other

    cs.CL cs.AI cs.LG

    Examining the Emergence of Deductive Reasoning in Generative Language Models

    Authors: Peter Belcak, Luca A. Lanzendörfer, Roger Wattenhofer

    Abstract: We conduct a preliminary inquiry into the ability of generative transformer models to deductively reason from premises provided. We observe notable differences in the performance of models coming from different training setups and find that the deductive reasoning ability increases with scale. Further, we discover that the performance generally does not decrease with the length of the deductive ch… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted to the 1st Natural Language Reasoning and Structured Explanations Workshop (NLRSE@ACL'23). 8 pages, 4 figures, 3 tables