Skip to main content

Showing 1–8 of 8 results for author: Hübotter, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.15898  [pdf, other

    cs.LG cs.AI

    Transductive Active Learning: Theory and Applications

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We generalize active learning to address real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such… ▽ More

    Submitted 22 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.15441

  2. arXiv:2402.15441  [pdf, other

    cs.LG cs.AI

    Active Few-Shot Fine-Tuning

    Authors: Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause

    Abstract: We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of classical active learning. We propose ITL, short for information-based transductive learning, an approach which samples adaptively to maximize information gained… ▽ More

    Submitted 21 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2310.19848  [pdf, other

    cs.LG cs.RO math.OC

    Efficient Exploration in Continuous-time Model-based Reinforcement Learning

    Authors: Lenart Treven, Jonas Hübotter, Bhavya Sukhija, Florian Dörfler, Andreas Krause

    Abstract: Reinforcement learning algorithms typically consider discrete-time dynamics, even though the underlying systems are often continuous in time. In this paper, we introduce a model-based reinforcement learning algorithm that represents continuous-time dynamics using nonlinear ordinary differential equations (ODEs). We capture epistemic uncertainty using well-calibrated probabilistic models, and use t… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  4. arXiv:2306.07092  [pdf, other

    cs.RO cs.AI

    Tuning Legged Locomotion Controllers via Safe Bayesian Optimization

    Authors: Daniel Widmer, Dongho Kang, Bhavya Sukhija, Jonas Hübotter, Andreas Krause, Stelian Coros

    Abstract: This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms. Our approach leverages a model-free safe learning algorithm to automate the tuning of control gains, addressing the mismatch between the simplified model used in the control formulation and the real system. This method substantially mitigates the risk of hazardou… ▽ More

    Submitted 25 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted to the 2023 Conference on Robot Learning (CoRL 2023.) The first two authors contributed equally. The supplementary video is available at https://youtu.be/zDBouUgegrU and the code implementation is available at https://github.com/lasgroup/gosafeopt

  5. arXiv:2211.11726  [pdf, ps, other

    cs.DS cs.DC math.CO

    A Cut-Matching Game for Constant-Hop Expanders

    Authors: Bernhard Haeupler, Jonas Huebotter, Mohsen Ghaffari

    Abstract: This paper provides a cut-strategy that produces constant-hop expanders in the well-known cut-matching game framework. Constant-hop expanders strengthen expanders with constant conductance by guaranteeing that any demand can be (obliviously) routed along constant-hop paths - in contrast to the $Ω(\log n)$-hop routes in expanders. Cut-matching games for expanders are key tools for obtaining clo… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  6. arXiv:2209.08033  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Learning Policies for Continuous Control via Transition Models

    Authors: Justus Huebotter, Serge Thill, Marcel van Gerven, Pablo Lanillos

    Abstract: It is doubtful that animals have perfect inverse models of their limbs (e.g., what muscle contraction must be applied to every joint to reach a particular location in space). However, in robot control, moving an arm's end-effector to a target position or along a target trajectory requires accurate forward and inverse models. Here we show that by learning the transition (forward) model from interac… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

  7. arXiv:2109.11045  [pdf, other

    cs.NE cs.LG

    Training Deep Spiking Auto-encoders without Bursting or Dying Neurons through Regularization

    Authors: Justus F. Hübotter, Pablo Lanillos, Jakub M. Tomczak

    Abstract: Spiking neural networks are a promising approach towards next-generation models of the brain in computational neuroscience. Moreover, compared to classic artificial neural networks, they could serve as an energy-efficient deployment of AI by enabling fast computation in specialized neuromorphic hardware. However, training deep spiking neural networks, especially in an unsupervised manner, is chall… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Under review

  8. arXiv:2108.09489  [pdf, other

    cs.DS

    Implementation of Algorithms for Right-Sizing Data Centers

    Authors: Jonas Hübotter

    Abstract: The energy consumption of data centers assumes a significant fraction of the world's overall energy consumption. Most data centers are statically provisioned, leading to a very low average utilization of servers. In this work, we survey uni-dimensional and high-dimensional approaches for dynamically powering up and powering down servers to reduce the energy footprint of data centers while ensuring… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.