Skip to main content

Showing 1–50 of 113 results for author: Neumann, G

.
  1. arXiv:2406.15131  [pdf, other

    cs.LG stat.ML

    KalMamba: Towards Efficient Probabilistic State Space Models for RL under Uncertainty

    Authors: Philipp Becker, Niklas Freymuth, Gerhard Neumann

    Abstract: Probabilistic State Space Models (SSMs) are essential for Reinforcement Learning (RL) from high-dimensional, partial information as they provide concise representations for control. Yet, they lack the computational efficiency of their recent deterministic counterparts such as S4 or Mamba. We propose KalMamba, an efficient architecture to learn representations for RL that combines the strengths of… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.14161  [pdf, other

    cs.LG

    Iterative Sizing Field Prediction for Adaptive Mesh Generation From Expert Demonstrations

    Authors: Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Philipp Becker, Aleksandar Taranovic, Onno Grönheim, Luise Kärger, Gerhard Neumann

    Abstract: Many engineering systems require accurate simulations of complex physical systems. Yet, analytical solutions are only available for simple problems, necessitating numerical approximations such as the Finite Element Method (FEM). The cost and accuracy of the FEM scale with the resolution of the underlying computational mesh. To balance computational speed and accuracy meshes with adaptive resolutio… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted as a workshop paper in AI4Science@ICML 2024

  3. arXiv:2406.12538  [pdf, other

    cs.LG cs.AI cs.RO

    Variational Distillation of Diffusion Policies into Mixture of Experts

    Authors: Hongyi Zhou, Denis Blessing, Ge Li, Onur Celik, Xiaogang Jia, Gerhard Neumann, Rudolf Lioutikov

    Abstract: This work introduces Variational Diffusion Distillation (VDD), a novel method that distills denoising diffusion policies into Mixtures of Experts (MoE) through variational inference. Diffusion Models are the current state-of-the-art in generative modeling due to their exceptional ability to accurately learn and represent complex, multi-modal distributions. This ability allows Diffusion Models to r… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2406.08440  [pdf, other

    cs.LG cs.MA

    Adaptive Swarm Mesh Refinement using Deep Reinforcement Learning with Local Rewards

    Authors: Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Simon Reisch, Luise Kärger, Gerhard Neumann

    Abstract: Simulating physical systems is essential in engineering, but analytical solutions are limited to straightforward problems. Consequently, numerical methods like the Finite Element Method (FEM) are widely used. However, the FEM becomes computationally expensive as problem complexity and accuracy demands increase. Adaptive Mesh Refinement (AMR) improves the FEM by dynamically allocating mesh elements… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Submitted to Journal of Machine Learning Research (JMLR)

  5. arXiv:2406.08234  [pdf, other

    cs.LG cs.RO

    MaIL: Improving Imitation Learning with Mamba

    Authors: Xiaogang Jia, Qian Wang, Atalay Donat, Bowen Xing, Ge Li, Hongyi Zhou, Onur Celik, Denis Blessing, Rudolf Lioutikov, Gerhard Neumann

    Abstract: This work introduces Mamba Imitation Learning (MaIL), a novel imitation learning (IL) architecture that offers a computationally efficient alternative to state-of-the-art (SoTA) Transformer policies. Transformer-based policies have achieved remarkable results due to their ability in handling human-recorded data with inherently non-Markovian behavior. However, their high performance comes with the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.07423  [pdf, other

    cs.LG cs.AI stat.ML

    Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling

    Authors: Denis Blessing, Xiaogang Jia, Johannes Esslinger, Francisco Vargas, Gerhard Neumann

    Abstract: Monte Carlo methods, Variational Inference, and their combinations play a pivotal role in sampling from intractable probability distributions. However, current studies lack a unified evaluation framework, relying on disparate performance measures and limited method comparisons across diverse tasks, complicating the assessment of progress and hindering the decision-making of practitioners. In respo… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  7. arXiv:2403.06966  [pdf, other

    cs.LG cs.RO

    Acquiring Diverse Skills using Curriculum Reinforcement Learning with Mixture of Experts

    Authors: Onur Celik, Aleksandar Taranovic, Gerhard Neumann

    Abstract: Reinforcement learning (RL) is a powerful approach for acquiring a good-performing policy. However, learning diverse skills is challenging in RL due to the commonly used Gaussian policy parameterization. We propose \textbf{Di}verse \textbf{Skil}l \textbf{L}earning (Di-SkilL\footnote{Videos and code are available on the project webpage: \url{https://alrhub.github.io/di-skill-website/}}), an RL meth… ▽ More

    Submitted 10 June, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: International conference on machine learning (ICML)

  8. arXiv:2403.04453  [pdf, other

    cs.LG

    Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation

    Authors: Fabian Otto, Philipp Becker, Ngo Anh Vien, Gerhard Neumann

    Abstract: Existing off-policy reinforcement learning algorithms often rely on an explicit state-action-value function representation, which can be problematic in high-dimensional action spaces due to the curse of dimensionality. This reliance results in data inefficiency as maintaining a state-action-value function in such spaces is challenging. We present an efficient approach that utilizes only a state-va… ▽ More

    Submitted 20 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  9. arXiv:2402.14606  [pdf, other

    cs.RO

    Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations

    Authors: Xiaogang Jia, Denis Blessing, Xinkai Jiang, Moritz Reuss, Atalay Donat, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Imitation learning with human data has demonstrated remarkable success in teaching robots in a wide range of skills. However, the inherent diversity in human behavior leads to the emergence of multi-modal data distributions, thereby presenting a formidable challenge for existing imitation learning algorithms. Quantifying a model's capacity to capture and replicate this diversity effectively is sti… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  10. arXiv:2402.10681  [pdf, other

    cs.LG cs.AI cs.CE

    Physics-informed MeshGraphNets (PI-MGNs): Neural finite element solvers for non-stationary and nonlinear simulations on arbitrary meshes

    Authors: Tobias Würth, Niklas Freymuth, Clemens Zimmerling, Gerhard Neumann, Luise Kärger

    Abstract: Engineering components must meet increasing technological demands in ever shorter development cycles. To face these challenges, a holistic approach is essential that allows for the concurrent development of part design, material system and manufacturing process. Current approaches employ numerical simulations, which however quickly becomes computation-intensive, especially for iterative optimizati… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Submitted to CMAME

  11. arXiv:2401.11437  [pdf, other

    cs.LG cs.RO

    Open the Black Box: Step-based Policy Updates for Temporally-Correlated Episodic Reinforcement Learning

    Authors: Ge Li, Hongyi Zhou, Dominik Roth, Serge Thilges, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Current advancements in reinforcement learning (RL) have predominantly focused on learning step-based policies that generate actions for each perceived state. While these methods efficiently leverage step information from environmental interaction, they often ignore the temporal correlation between actions, resulting in inefficient exploration and unsmooth trajectories that are challenging to impl… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: Codebase, see: https://github.com/BruceGeLi/TCE_RL

  12. arXiv:2401.09352  [pdf, other

    cs.RO cs.AI cs.LG

    Neural Contractive Dynamical Systems

    Authors: Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Nadia Figueroa, Gerhard Neumann, Leonel Rozo

    Abstract: Stability guarantees are crucial when ensuring a fully autonomous robot does not take undesirable or potentially harmful actions. Unfortunately, global stability guarantees are hard to provide in dynamical systems learned from data, especially when the learned dynamics are governed by neural networks. We propose a novel methodology to learn neural contractive dynamical systems, where our neural ar… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  13. arXiv:2312.13905  [pdf, ps, other

    cs.RO cs.AI cs.CL cs.HC cs.LG

    Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming

    Authors: Benjamin Alt, Urs Keßner, Aleksandar Taranovic, Darko Katic, Andreas Hermann, Rainer Jäkel, Gerhard Neumann

    Abstract: Industrial robots are applied in a widening range of industries, but robot programming mostly remains a task limited to programming experts. We propose a natural language-based assistant for programming of advanced, industrial robotic applications and investigate strategies for domain-specific fine-tuning of foundation models with limited data and compute.

    Submitted 21 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages, 1 figure, presented at the 2024 European Robotics Forum in Rimini, Italy

    MSC Class: 68T40 ACM Class: I.2.9; I.2.5; I.2.6; I.2.7

  14. arXiv:2312.10008  [pdf, ps, other

    cs.RO cs.AI cs.LG

    Movement Primitive Diffusion: Learning Gentle Robotic Manipulation of Deformable Objects

    Authors: Paul Maria Scheikl, Nicolas Schreiber, Christoph Haas, Niklas Freymuth, Gerhard Neumann, Rudolf Lioutikov, Franziska Mathis-Ullrich

    Abstract: Policy learning in robot-assisted surgery (RAS) lacks data efficient and versatile methods that exhibit the desired motion quality for delicate surgical interventions. To this end, we introduce Movement Primitive Diffusion (MPD), a novel method for imitation learning (IL) in RAS that focuses on gentle manipulation of deformable objects. The approach combines the versatility of diffusion-based imit… ▽ More

    Submitted 10 June, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Journal ref: IEEE Robotics and Automation Letters 9 (2024) 5338-5345

  15. arXiv:2311.08240  [pdf, other

    cs.CL cs.AI

    Investigating the Encoding of Words in BERT's Neurons using Feature Textualization

    Authors: Tanja Baeumel, Soniya Vijayakumar, Josef van Genabith, Guenter Neumann, Simon Ostermann

    Abstract: Pretrained language models (PLMs) form the basis of most state-of-the-art NLP technologies. Nevertheless, they are essentially black boxes: Humans do not have a clear understanding of what knowledge is encoded in different parts of the models, especially in individual neurons. The situation is different in computer vision, where feature visualization provides a decompositional interpretability tec… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: To be published in 'BlackboxNLP 2023: The 6th Workshop on Analysing and Interpreting Neural Networks for NLP'. Camera-ready version

  16. arXiv:2311.07357  [pdf, other

    cs.CV

    Registered and Segmented Deformable Object Reconstruction from a Single View Point Cloud

    Authors: Pit Henrich, Balázs Gyenes, Paul Maria Scheikl, Gerhard Neumann, Franziska Mathis-Ullrich

    Abstract: In deformable object manipulation, we often want to interact with specific segments of an object that are only defined in non-deformed models of the object. We thus require a system that can recognize and locate these segments in sensor data of deformed real world objects. This is normally done using deformable object registration, which is problem specific and complex to tune. Recent methods util… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted at WACV 2024

  17. arXiv:2311.05256  [pdf, other

    cs.LG

    Latent Task-Specific Graph Network Simulators

    Authors: Philipp Dahlinger, Niklas Freymuth, Michael Volpp, Tai Hoang, Gerhard Neumann

    Abstract: Simulating dynamic physical interactions is a critical challenge across multiple scientific domains, with applications ranging from robotics to material science. For mesh-based simulations, Graph Network Simulators (GNSs) pose an efficient alternative to traditional physics-based simulators. Their inherent differentiability and speed make them particularly well-suited for inverse design problems.… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  18. arXiv:2310.20574  [pdf, other

    cs.LG

    Information-Theoretic Trust Regions for Stochastic Gradient-Based Optimization

    Authors: Philipp Dahlinger, Philipp Becker, Maximilian Hüttenrauch, Gerhard Neumann

    Abstract: Stochastic gradient-based optimization is crucial to optimize neural networks. While popular approaches heuristically adapt the step size and direction by rescaling gradients, a more principled approach to improve optimizers requires second-order information. Such methods precondition the gradient using the objective's Hessian. Yet, computing the Hessian is usually expensive and effectively using… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  19. arXiv:2310.18534  [pdf, other

    cs.LG cs.AI

    Multi Time Scale World Models

    Authors: Vaisakh Shaj, Saleh Gholam Zadeh, Ozan Demir, Luiz Ricardo Douat, Gerhard Neumann

    Abstract: Intelligent agents use internal world models to reason and make predictions about different courses of their actions at many scales. Devising learning paradigms and architectures that allow machines to learn world models that operate at multiple levels of temporal abstractions while dealing with complex uncertainty predictions is a major technical hurdle. In this work, we propose a probabilistic f… ▽ More

    Submitted 4 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted as spotlight at NeurIPS 2023

  20. ALMA Lensing Cluster Survey: average dust, gas, and star formation properties of cluster and field galaxies from stacking analysis

    Authors: Andrea Guerrero, Neil Nagar, Kotaro Kohno, Seiji Fujimoto, Vasily Kokorev, Gabriel Brammer, Jean-Baptiste Jolly, Kirsten Knudsen, Fengwu Sun, Franz E. Bauer, Gabriel B. Caminha, Karina Caputi, Gerald Neumann, Gustavo Orellana-González, Pierluigi Cerulo, Jorge González-López, Nicolas Laporte, Anton M. Koekemoer, Yi** Ao, Daniel Espada, Alejandra M. Muñoz Arancibia

    Abstract: We develop new tools for continuum and spectral stacking of ALMA data, and apply these to the ALMA Lensing Cluster Survey (ALCS). We derive average dust masses, gas masses and star formation rates (SFR) from the stacked observed 260~GHz continuum of 3402 individually undetected star-forming galaxies, of which 1450 are cluster galaxies and 1952 field galaxies, over three redshift and stellar mass b… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 13 pages, 8 figures, 1 table (+4 pages, 4 figures in Appendix). Accepted for publication in MNRAS

  21. arXiv:2308.16528  [pdf, other

    cs.CV cs.LG cs.RO

    SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects

    Authors: Ning Gao, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

    Abstract: To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects. Most existing approaches have difficulties to extend predictions to scenarios where novel object instances are continuously introduced, especially with heavy occlusions. In this work, we propose a few-shot pose estimation (FSPE) approach called SA6D, which uses a self-adaptive… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Journal ref: Conference on Robot Learning (CoRL), 2023

  22. arXiv:2308.11369  [pdf, other

    cs.CV

    Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization

    Authors: Ning Gao, Bernard Hohmann, Gerhard Neumann

    Abstract: Object-centric representations using slots have shown the advances towards efficient, flexible and interpretable abstraction from low-level perceptual features in a compositional scene. Current approaches randomize the initial state of slots followed by an iterative refinement. As we show in this paper, the random slot initialization significantly affects the accuracy of the final slot prediction.… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Journal ref: The 34th British Machine Vision Conference (BMVC), 2023

  23. arXiv:2308.00456  [pdf, other

    cs.RO cs.AI

    DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes

    Authors: Philipp Blättner, Johannes Brand, Gerhard Neumann, Ngo Anh Vien

    Abstract: Robotic gras** is a fundamental skill required for object manipulation in robotics. Multi-fingered robotic hands, which mimic the structure of the human hand, can potentially perform complex object manipulation. Nevertheless, current techniques for multi-fingered robotic gras** frequently predict only a single grasp for each inference time, limiting computational efficiency and their versatili… ▽ More

    Submitted 16 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Submitted IROS 2023 workshop "Policy Learning in Geometric Spaces"

  24. arXiv:2307.00306  [pdf, other

    cs.CV cs.AI cs.LG

    SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation

    Authors: Fabian Duffhauss, Sebastian Koch, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann

    Abstract: Detecting objects and estimating their 6D poses is essential for automated systems to interact safely with the environment. Most 6D pose estimators, however, rely on a single camera frame and suffer from occlusions and ambiguities due to object symmetries. We overcome this issue by presenting a novel symmetry-aware multi-view 6D pose estimator called SyMFM6D. Our approach efficiently fuses the RGB… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: Accepted at the IEEE Robotics and Automation Letters (RA-L) 2023

  25. arXiv:2306.12729  [pdf, other

    cs.LG cs.AI cs.RO

    MP3: Movement Primitive-Based (Re-)Planning Policy

    Authors: Fabian Otto, Hongyi Zhou, Onur Celik, Ge Li, Rudolf Lioutikov, Gerhard Neumann

    Abstract: We introduce a novel deep reinforcement learning (RL) approach called Movement Primitive-based Planning Policy (MP3). By integrating movement primitives (MPs) into the deep RL framework, MP3 enables the generation of smooth trajectories throughout the whole learning process while effectively learning from sparse and non-Markovian rewards. Additionally, MP3 maintains the capability to adapt to chan… ▽ More

    Submitted 2 July, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: The video demonstration can be accessed at https://intuitive-robots.github.io/mp3_website/. arXiv admin note: text overlap with arXiv:2210.09622

  26. arXiv:2306.12306  [pdf, ps, other

    cs.LG

    Beyond Deep Ensembles: A Large-Scale Evaluation of Bayesian Deep Learning under Distribution Shift

    Authors: Florian Seligmann, Philipp Becker, Michael Volpp, Gerhard Neumann

    Abstract: Bayesian deep learning (BDL) is a promising approach to achieve well-calibrated predictions on distribution-shifted data. Nevertheless, there exists no large-scale survey that evaluates recent SOTA methods on diverse, realistic, and challenging benchmark tasks in a systematic manner. To provide a clear picture of the current state of BDL research, we evaluate modern BDL algorithms on real-world da… ▽ More

    Submitted 24 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: Code at https://github.com/Feuermagier/Beyond_Deep_Ensembles

  27. Curriculum-Based Imitation of Versatile Skills

    Authors: Maximilian Xiling Li, Onur Celik, Philipp Becker, Denis Blessing, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Learning skills by imitation is a promising concept for the intuitive teaching of robots. A common way to learn such skills is to learn a parametric model by maximizing the likelihood given the demonstrations. Yet, human demonstrations are often multi-modal, i.e., the same task is solved in multiple ways which is a major challenge for most imitation learning methods that are based on such a maximu… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA)

  28. arXiv:2304.00818  [pdf, other

    cs.MA cs.LG math.NA

    Swarm Reinforcement Learning For Adaptive Mesh Refinement

    Authors: Niklas Freymuth, Philipp Dahlinger, Tobias Würth, Simon Reisch, Luise Kärger, Gerhard Neumann

    Abstract: Adaptive Mesh Refinement (AMR) enhances the Finite Element Method, an important technique for simulating complex problems in engineering, by dynamically refining mesh regions, enabling a favorable trade-off between computational speed and simulation accuracy. Classical methods for AMR depend on heuristics or expensive error estimators, hindering their use for complex simulations. Recent learning-b… ▽ More

    Submitted 9 October, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS) 2023. Version 1 of this paper is a preliminary version that was accepted as a workshop paper in the International Conference on Learning Representations (ICLR) 2023 Workshop on Physics for Machine Learning

  29. arXiv:2303.15349  [pdf, other

    cs.LG

    Information Maximizing Curriculum: A Curriculum-Based Approach for Imitating Diverse Skills

    Authors: Denis Blessing, Onur Celik, Xiaogang Jia, Moritz Reuss, Maximilian Xiling Li, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Imitation learning uses data for training policies to solve complex tasks. However, when the training data is collected from human demonstrators, it often leads to multimodal distributions because of the variability in human actions. Most imitation learning methods rely on a maximum likelihood (ML) objective to learn a parameterized policy, but this can result in suboptimal or unsafe behavior due… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

  30. arXiv:2302.11864  [pdf, other

    cs.LG cs.RO

    Grounding Graph Network Simulators using Physical Sensor Observations

    Authors: Jonas Linkerhägner, Niklas Freymuth, Paul Maria Scheikl, Franziska Mathis-Ullrich, Gerhard Neumann

    Abstract: Physical simulations that accurately model reality are crucial for many engineering disciplines such as mechanical engineering and robotic motion planning. In recent years, learned Graph Network Simulators produced accurate mesh-based simulations while requiring only a fraction of the computational cost of traditional simulators. Yet, the resulting predictors are confined to learning from data gen… ▽ More

    Submitted 7 March, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted as a poster at the 11th International Conference on Learning Representations (ICLR), 2023

  31. arXiv:2302.09606  [pdf, other

    cs.RO

    LapGym -- An Open Source Framework for Reinforcement Learning in Robot-Assisted Laparoscopic Surgery

    Authors: Paul Maria Scheikl, Balázs Gyenes, Rayan Younis, Christoph Haas, Gerhard Neumann, Martin Wagner, Franziska Mathis-Ullrich

    Abstract: Recent advances in reinforcement learning (RL) have increased the promise of introducing cognitive assistance and automation to robot-assisted laparoscopic surgery (RALS). However, progress in algorithms and methods depends on the availability of standardized learning environments that represent skills relevant to RALS. We present LapGym, a framework for building RL environments for RALS that mode… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

  32. arXiv:2302.05342  [pdf, other

    cs.LG

    Combining Reconstruction and Contrastive Methods for Multimodal Representations in RL

    Authors: Philipp Becker, Sebastian Mossburger, Fabian Otto, Gerhard Neumann

    Abstract: Learning self-supervised representations using reconstruction or contrastive losses improves performance and sample complexity of image-based and multimodal reinforcement learning (RL). Here, different self-supervised loss functions have distinct advantages and limitations depending on the information density of the underlying sensor modality. Reconstruction provides strong learning signals but is… ▽ More

    Submitted 26 June, 2024; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: Published in "Reinforcement Learning Conference (RLC)", August 2024

  33. arXiv:2211.02114  [pdf, ps, other

    math.NT

    $r$-primitive $k$-normal elements in arithmetic progressions over finite fields

    Authors: Josimar J. R. Aguirre, Abílio Lemos, Victor G. L. Neumann, Sávio Ribas

    Abstract: Let $\mathbb{F}_{q^n}$ be a finite field with $q^n$ elements. For a positive divisor $r$ of $q^n-1$, the element $α\in \mathbb{F}_{q^n}^*$ is called \textit{$r$-primitive} if its multiplicative order is $(q^n-1)/r$. Also, for a non-negative integer $k$, the element $α\in \mathbb{F}_{q^n}$ is \textit{$k$-normal} over $\mathbb{F}_q$ if… ▽ More

    Submitted 31 July, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: To appear in Communications in Algebra. arXiv admin note: substantial text overlap with arXiv:2210.11504

    MSC Class: 12E20; 11T23

  34. arXiv:2210.11504  [pdf, ps, other

    math.NT

    Pairs of $r$-primitive and $k$-normal elements in finite fields

    Authors: Josimar J. R. Aguirre, Victor G. L. Neumann

    Abstract: Let $\mathbb{F}_{q^n}$ be a finite field with $q^n$ elements and $r$ be a positive divisor of $q^n-1$. An element $α\in \mathbb{F}_{q^n}^*$ is called $r$-primitive if its multiplicative order is $(q^n-1)/r$. Also, $α\in \mathbb{F}_{q^n}$ is $k$-normal over $\mathbb{F}_q$ if the greatest common divisor of the polynomials $g_α(x) = αx^{n-1}+ α^q x^{n-2} + \ldots + α^{q^{n-2}}x + α^{q^{n-1}}$ and… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    MSC Class: 12E20; 11T23

  35. arXiv:2210.09622  [pdf, other

    cs.LG cs.AI cs.RO

    Deep Black-Box Reinforcement Learning with Movement Primitives

    Authors: Fabian Otto, Onur Celik, Hongyi Zhou, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann

    Abstract: \Episode-based reinforcement learning (ERL) algorithms treat reinforcement learning (RL) as a black-box optimization problem where we learn to select a parameter vector of a controller, often represented as a movement primitive, for a given task descriptor called a context. ERL offers several distinct benefits in comparison to step-based RL. It generates smooth control trajectories, can handle non… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: Accepted at CoRL 2022

  36. arXiv:2210.09256  [pdf, other

    cs.LG

    On Uncertainty in Deep State Space Models for Model-Based Reinforcement Learning

    Authors: Philipp Becker, Gerhard Neumann

    Abstract: Improved state space models, such as Recurrent State Space Models (RSSMs), are a key factor behind recent advances in model-based reinforcement learning (RL). Yet, despite their empirical success, many of the underlying design choices are not well understood. We show that RSSMs use a suboptimal inference scheme and that models trained using this inference overestimate the aleatoric uncertainty of… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Published in TMLR, October 2022

  37. arXiv:2210.08121  [pdf, other

    cs.RO cs.LG

    Inferring Versatile Behavior from Demonstrations by Matching Geometric Descriptors

    Authors: Niklas Freymuth, Nicolas Schreiber, Philipp Becker, Aleksandar Taranovic, Gerhard Neumann

    Abstract: Humans intuitively solve tasks in versatile ways, varying their behavior in terms of trajectory-based planning and for individual steps. Thus, they can easily generalize and adapt to new and changing environments. Current Imitation Learning algorithms often only consider unimodal expert demonstrations and act in a state-action-based setting, making it difficult for them to imitate human behavior i… ▽ More

    Submitted 9 November, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted as a poster at the 6th Conference on Robot Learning (CoRL), 2022

  38. arXiv:2210.01531  [pdf, other

    cs.RO cs.LG

    ProDMPs: A Unified Perspective on Dynamic and Probabilistic Movement Primitives

    Authors: Ge Li, Zeqi **, Michael Volpp, Fabian Otto, Rudolf Lioutikov, Gerhard Neumann

    Abstract: Movement Primitives (MPs) are a well-known concept to represent and generate modular trajectories. MPs can be broadly categorized into two types: (a) dynamics-based approaches that generate smooth trajectories from any initial state, e. g., Dynamic Movement Primitives (DMPs), and (b) probabilistic approaches that capture higher-order statistics of the motion, e. g., Probabilistic Movement Primitiv… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 12 pages, 13 figures

  39. arXiv:2209.11533  [pdf, other

    cs.LG cs.RO stat.ML

    A Unified Perspective on Natural Gradient Variational Inference with Gaussian Mixture Models

    Authors: Oleg Arenz, Philipp Dahlinger, Zihan Ye, Michael Volpp, Gerhard Neumann

    Abstract: Variational inference with Gaussian mixture models (GMMs) enables learning of highly tractable yet multi-modal approximations of intractable target distributions with up to a few hundred dimensions. The two currently most effective methods for GMM-based variational inference, VIPS and iBayes-GMM, both employ independent natural gradient updates for the individual components and their weights. We s… ▽ More

    Submitted 17 July, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: This version corresponds to the camera ready version published at Transactions of Machine Learning Research (TMLR). https://openreview.net/forum?id=tLBjsX4tjs

    Journal ref: Transactions on Machine Learning Research (2023) ISSN: 2835-8856

  40. arXiv:2209.11277  [pdf, other

    cs.CV cs.AI cs.LG

    FusionVAE: A Deep Hierarchical Variational Autoencoder for RGB Image Fusion

    Authors: Fabian Duffhauss, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

    Abstract: Sensor fusion can significantly improve the performance of many computer vision tasks. However, traditional fusion approaches are either not data-driven and cannot exploit prior knowledge nor find regularities in a given dataset or they are restricted to a single application. We overcome this shortcoming by presenting a novel deep hierarchical variational autoencoder called FusionVAE that can serv… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: Accepted at ECCV 2022

  41. arXiv:2208.01172  [pdf, other

    cs.CV cs.AI cs.LG

    MV6D: Multi-View 6D Pose Estimation on RGB-D Frames Using a Deep Point-wise Voting Network

    Authors: Fabian Duffhauss, Tobias Demmler, Gerhard Neumann

    Abstract: Estimating 6D poses of objects is an essential computer vision task. However, most conventional approaches rely on camera data from a single perspective and therefore suffer from occlusions. We overcome this issue with our novel multi-view 6D pose estimation method called MV6D which accurately predicts the 6D poses of all objects in a cluttered scene based on RGB-D images from multiple perspective… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted at IROS 2022

  42. arXiv:2208.00478  [pdf, other

    cs.LG cs.RO

    Robot Policy Learning from Demonstration Using Advantage Weighting and Early Termination

    Authors: Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

    Abstract: Learning robotic tasks in the real world is still highly challenging and effective practical solutions remain to be found. Traditional methods used in this area are imitation learning and reinforcement learning, but they both have limitations when applied to real robots. Combining reinforcement learning with pre-collected demonstrations is a promising approach that can help in learning control pol… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

  43. arXiv:2206.14697  [pdf, other

    cs.LG eess.SY

    Hidden Parameter Recurrent State Space Models For Changing Dynamics Scenarios

    Authors: Vaisakh Shaj, Dieter Buchler, Rohit Sonker, Philipp Becker, Gerhard Neumann

    Abstract: Recurrent State-space models (RSSMs) are highly expressive models for learning patterns in time series data and system identification. However, these models assume that the dynamics are fixed and unchanging, which is rarely the case in real-world scenarios. Many control applications often exhibit tasks with similar but not identical dynamics which can be modeled as a latent variable. We introduce… ▽ More

    Submitted 12 October, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: Published at the International Conference on Learning Representations, ICLR 2022

  44. arXiv:2206.07162  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Category-Agnostic 6D Pose Estimation with Conditional Neural Processes

    Authors: Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann

    Abstract: We present a novel meta-learning approach for 6D pose estimation on unknown objects. In contrast to ``instance-level" and ``category-level" pose estimation methods, our algorithm learns object representation in a category-agnostic way, which endows it with strong generalization capabilities across object categories. Specifically, we employ a neural process-based meta-learning approach to train an… ▽ More

    Submitted 19 October, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted at CVPR2022 workshop: Women in Computer Vision (WiCV)

    Journal ref: CVPR2022 workshop: Women in Computer Vision (WiCV)

  45. arXiv:2206.06090  [pdf, other

    stat.ML cs.LG cs.NE

    Regret-Aware Black-Box Optimization with Natural Gradients, Trust-Regions and Entropy Control

    Authors: Maximilian Hüttenrauch, Gerhard Neumann

    Abstract: Most successful stochastic black-box optimizers, such as CMA-ES, use rankings of the individual samples to obtain a new search distribution. Yet, the use of rankings also introduces several issues such as the underlying optimization objective is often unclear, i.e., we do not optimize the expected fitness. Further, while these algorithms typically produce a high-quality mean estimate of the search… ▽ More

    Submitted 24 May, 2022; originally announced June 2022.

    Comments: 26 pages, 15 figures

  46. arXiv:2206.02852  [pdf, other

    cs.CR

    CompartOS: CHERI Compartmentalization for Embedded Systems

    Authors: Hesham Almatary, Michael Dodson, Jessica Clarke, Peter Rugg, Ivan Gomes, Michal Podhradsky, Peter G. Neumann, Simon W. Moore, Robert N. M. Watson

    Abstract: Existing high-end embedded systems face frequent security attacks. Software compartmentalization is one technique to limit the attacks' effects to the compromised compartment and not the entire system. Unfortunately, the existing state-of-the-art embedded hardware-software solutions do not work well to enforce software compartmentalization for high-end embedded systems. MPUs are not fine-grained a… ▽ More

    Submitted 11 June, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

  47. arXiv:2205.13804  [pdf, other

    cs.RO cs.LG

    End-to-End Learning of Hybrid Inverse Dynamics Models for Precise and Compliant Impedance Control

    Authors: Moritz Reuss, Niels van Duijkeren, Robert Krug, Philipp Becker, Vaisakh Shaj, Gerhard Neumann

    Abstract: It is well-known that inverse dynamics models can improve tracking performance in robot control. These models need to precisely capture the robot dynamics, which consist of well-understood components, e.g., rigid body dynamics, and effects that remain challenging to capture, e.g., stick-slip friction and mechanical flexibilities. Such effects exhibit hysteresis and partial observability, rendering… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at Robotics: Science and System XVIII (RSS), year 2022. Paper length is 13 pages (i.e. 9 pages of technical content, 1 page of the Bibliography/References and 3 pages of Appendix)

  48. arXiv:2205.11110  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Meta-Learning Regras** Strategies for Physical-Agnostic Objects

    Authors: Ning Gao, **gyu Zhang, Ruijie Chen, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

    Abstract: Gras** inhomogeneous objects in real-world applications remains a challenging task due to the unknown physical properties such as mass distribution and coefficient of friction. In this study, we propose a meta-learning algorithm called ConDex, which incorporates Conditional Neural Processes (CNP) with DexNet-2.0 to autonomously discern the underlying physical properties of objects using depth im… ▽ More

    Submitted 14 September, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted as spotlight in ICRA 2022 Workshop: Scaling Robot Learning

  49. arXiv:2204.04783  [pdf, other

    cs.LG cs.AI

    Temporal Knowledge Graph Reasoning with Low-rank and Model-agnostic Representations

    Authors: Ioannis Dikeoulias, Saadullah Amin, Günter Neumann

    Abstract: Temporal knowledge graph completion (TKGC) has become a popular approach for reasoning over the event and temporal knowledge graphs, targeting the completion of knowledge with accurate but missing information. In this context, tensor decomposition has successfully modeled interactions between entities and relations. Their effectiveness in static knowledge graph completion motivates us to introduce… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: Accepted by RepL4NLP'22

  50. arXiv:2204.04779  [pdf, other

    cs.CL cs.LG

    MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction

    Authors: Saadullah Amin, Pasquale Minervini, David Chang, Pontus Stenetorp, Günter Neumann

    Abstract: Relation extraction in the biomedical domain is challenging due to the lack of labeled data and high annotation costs, needing domain experts. Distant supervision is commonly used to tackle the scarcity of annotated data by automatically pairing knowledge graph relationships with raw texts. Such a pipeline is prone to noise and has added challenges to scale for covering a large number of biomedica… ▽ More

    Submitted 13 September, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: Accepted by COLING 2022 (Oral presentation, Main Conference: Long Papers)