Showing 1–16 of 16 results for author: Lang, L

  1. arXiv:2406.15753  [pdf, other]

    cs.LG cs.AI stat.ML

    The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

    Authors: Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

    Abstract: In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error on the training distribution, and yet subsequently produce a policy with large regret. We say that such a reward model has an error-regret mismatch. The main so…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 58 pages, 1 figure

  2. arXiv:2402.17747  [pdf, other]

    cs.LG cs.AI stat.ML

    When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback

    Authors: Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons

    Abstract: Past analyses of reinforcement learning from human feedback (RLHF) assume that the human evaluators fully observe the environment. What happens when human feedback is based only on partial observations? We formally define two failure cases: deceptive inflation and overjustification. Modeling the human as Boltzmann-rational w.r.t. a belief over trajectories, we prove conditions under which RLHF is…

    Submitted 8 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2307.00787  [pdf, ps, other]

    cs.CL cs.AI

    Evaluating Shutdown Avoidance of Language Models in Textual Scenarios

    Authors: Teun van der Weij, Simon Lermen, Leon Lang

    Abstract: Recently, there has been an increase in interest in evaluating large language models for emergent and dangerous capabilities. Importantly, agents could reason that in some scenarios their goal is better achieved if they are not turned off, which can lead to undesirable behaviors. In this paper, we investigate the potential of using toy textual scenarios to evaluate instrumental reasoning and shutd…

    Submitted 3 July, 2023; originally announced July 2023.

  4. arXiv:2210.10725  [pdf, other]

    cs.IR cs.LG

    SML: Enhance the Network Smoothness with Skip Meta Logit for CTR Prediction

    Authors: Wenlong Deng, Lang Lang, Zhen Liu, Bin Liu

    Abstract: In light of the smoothness property brought by skip connections in ResNet, this paper proposed the Skip Logit to introduce the skip connection mechanism that fits arbitrary DNN dimensions and embraces similar properties to ResNet. Meta Tanh Normalization (MTN) is designed to learn variance information and stabilize the training process. With these delicate designs, our Skip Meta Logit (SML) brough…

    Submitted 9 October, 2022; originally announced October 2022.

  5. arXiv:2203.16087  [pdf]

    physics.optics cs.LG

    Polarized deep diffractive neural network for classification, generation, multiplexing and de-multiplexing of orbital angular momentum modes

    Authors: Jiaqi Zhang, Zhiyuan Ye, Jianhua Yin, Liying Lang, Shuming Jiao

    Abstract: The multiplexing and de-multiplexing of orbital angular momentum (OAM) beams are critical issues in optical communication. Optical diffractive neural networks have been introduced to perform classification, generation, multiplexing and de-multiplexing of OAM beams. However, conventional diffractive neural networks cannot handle OAM modes with a varying spatial distribution of polarization directio…

    Submitted 30 March, 2022; originally announced March 2022.

  6. arXiv:2202.09393  [pdf, other]

    cs.IT

    Information Decomposition Diagrams Applied beyond Shannon Entropy: A Generalization of Hu's Theorem

    Authors: Leon Lang, Pierre Baudot, Rick Quax, Patrick Forré

    Abstract: In information theory, one major goal is to find useful functions that summarize the amount of information contained in the interaction of several random variables. Specifically, one can ask how the classical Shannon entropy, mutual information, and higher interaction information relate to each other. This is answered by Hu's theorem, which is widely known in the form of information diagrams: it r…

    Submitted 1 March, 2024; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: 58 pages, 5 figures

  7. Towards Sensor Data Abstraction of Autonomous Vehicle Perception Systems

    Authors: Hannes Reichert, Lukas Lang, Kevin Rösch, Daniel Bogdoll, Konrad Doll, Bernhard Sick, Hans-Christian Reuss, Christoph Stiller, J. Marius Zöllner

    Abstract: Full-stack autonomous driving perception modules usually consist of data-driven models based on multiple sensor modalities. However, these models might be biased to the sensor setup used for data acquisition. This bias can seriously impair the perception models' transferability to new sensor setups, which continuously occur due to the market's competitive nature. We envision sensor data abstractio…

    Submitted 28 September, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: Hannes Reichert, Lukas Lang, Kevin Rösch and Daniel Bogdoll contributed equally. Accepted for publication at ISC2 2021

  8. arXiv:2103.10842  [pdf, ps, other]

    cs.LG eess.IV stat.ML

    Prediction of progressive lens performance from neural network simulations

    Authors: Alexander Leube, Lukas Lang, Gerhard Kelch, Siegfried Wahl

    Abstract: Purpose: The purpose of this study is to present a framework to predict visual acuity (VA) based on a convolutional neural network (CNN) and further to compare PAL designs. Method: A simple two hidden layer CNN was trained to classify the gap orientations of Landolt Cs by combining the feature extraction abilities of a CNN with psychophysical staircase methods. The simulation was validated re…

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: 9 pages, 4 figures

  9. arXiv:2010.10952  [pdf, ps, other]

    cs.LG cs.CV

    A Wigner-Eckart Theorem for Group Equivariant Convolution Kernels

    Authors: Leon Lang, Maurice Weiler

    Abstract: Group equivariant convolutional networks (GCNNs) endow classical convolutional networks with additional symmetry priors, which can lead to a considerably improved performance. Recent advances in the theoretical description of GCNNs revealed that such models can generally be understood as performing convolutions with G-steerable kernels, that is, kernels that satisfy an equivariance constraint them…

    Submitted 21 January, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 100 pages

  10. arXiv:1912.05525  [pdf, other]

    cs.AI cs.CL cs.LG

    Learning to Request Guidance in Emergent Communication

    Authors: Benjamin Kolb, Leon Lang, Henning Bartsch, Arwin Gansekoele, Raymond Koopmanschap, Leonardo Romor, David Speck, Mathijs Mul, Elia Bruni

    Abstract: Previous research into agent communication has shown that a pre-trained guide can speed up the learning process of an imitation learning agent. The guide achieves this by providing the agent with discrete messages in an emerged language about how to solve the task. We extend this one-directional communication by a one-bit communication channel from the learner back to the guide: It is able to ask…

    Submitted 11 December, 2019; originally announced December 2019.

  11. arXiv:1805.01006  [pdf, other]

    math.OC cs.CV

    A Numerical Framework for Efficient Motion Estimation on Evolving Sphere-Like Surfaces based on Brightness and Mass Conservation Laws

    Authors: Lukas F. Lang

    Abstract: In this work we consider brightness and mass conservation laws for motion estimation on evolving Riemannian 2-manifolds that allow for a radial parametrisation from the 2-sphere. While conservation of brightness constitutes the foundation for optical flow methods and has been generalised to said scenario, we formulate in this article the principle of mass conservation for time-varying surfaces whi…

    Submitted 2 May, 2018; originally announced May 2018.

    MSC Class: 35A15; 68U10; 92C55; 33C55; 92C37; 53A05; 65N30; 35L65

  12. arXiv:1703.09161  [pdf, other]

    math.OC cs.CV

    A Dynamic Programming Solution to Bounded Dejittering Problems

    Authors: Lukas F. Lang

    Abstract: We propose a dynamic programming solution to image dejittering problems with bounded displacements and obtain efficient algorithms for the removal of line jitter, line pixel jitter, and pixel jitter.

    Submitted 27 March, 2017; originally announced March 2017.

    Comments: The final publication is available at link.springer.com

  13. arXiv:1506.03358  [pdf, other]

    math.OC cs.CV

    Optical Flow on Evolving Sphere-Like Surfaces

    Authors: Lukas F. Lang, Otmar Scherzer

    Abstract: In this work we consider optical flow on evolving Riemannian 2-manifolds which can be parametrised from the 2-sphere. Our main motivation is to estimate cell motion in time-lapse volumetric microscopy images depicting fluorescently labelled cells of a live zebrafish embryo. We exploit the fact that the recorded cells float on the surface of the embryo and allow for the extraction of an image seque…

    Submitted 10 June, 2015; originally announced June 2015.

  14. Decomposition of Optical Flow on the Sphere

    Authors: Clemens Kirisits, Lukas F. Lang, Otmar Scherzer

    Abstract: We propose a number of variational regularisation methods for the estimation and decomposition of motion fields on the $2$-sphere. While motion estimation is based on the optical flow equation, the presented decomposition models are motivated by recent trends in image analysis. In particular we treat $u+v$ decomposition as well as hierarchical decomposition. Helmholtz decomposition of motion field…

    Submitted 4 March, 2014; v1 submitted 16 December, 2013; originally announced December 2013.

    Comments: The final publication is available at link.springer.com

    MSC Class: 92C55; 92C37; 92C17; 35A15; 68U10; 33C55

  15. Optical Flow on Evolving Surfaces with Space and Time Regularisation

    Authors: Clemens Kirisits, Lukas F. Lang, Otmar Scherzer

    Abstract: We extend the concept of optical flow with spatiotemporal regularisation to a dynamic non-Euclidean setting. Optical flow is traditionally computed from a sequence of flat images. The purpose of this paper is to introduce variational motion estimation for images that are defined on an evolving surface. Volumetric microscopy images depicting a live zebrafish embryo serve as both biological motivati…

    Submitted 25 June, 2014; v1 submitted 1 October, 2013; originally announced October 2013.

    Comments: The final publication is available at Springer via http://dx.doi.org/10.1007/s10851-014-0513-4. This is an extended version of arXiv:1301.1576

  16. Optical Flow on Evolving Surfaces with an Application to the Analysis of 4D Microscopy Data

    Authors: Clemens Kirisits, Lukas F. Lang, Otmar Scherzer

    Abstract: We extend the concept of optical flow to a dynamic non-Euclidean setting. Optical flow is traditionally computed from a sequence of flat images. It is the purpose of this paper to introduce variational motion estimation for images that are defined on an evolving surface. Volumetric microscopy images depicting a live zebrafish embryo serve as both biological motivation and test data.

    Submitted 21 May, 2013; v1 submitted 8 January, 2013; originally announced January 2013.

    Comments: The final publication is available at link.springer.com