Skip to main content

Showing 51–100 of 113 results for author: Neumann, G

.
  1. arXiv:2204.04775  [pdf, other

    cs.CL cs.CR cs.LG

    Few-Shot Cross-lingual Transfer for Coarse-grained De-identification of Code-Mixed Clinical Texts

    Authors: Saadullah Amin, Noon Pokaratsiri Goldstein, Morgan Kelly Wixted, Alejandro García-Rudolph, Catalina Martínez-Costa, Günter Neumann

    Abstract: Despite the advances in digital healthcare systems offering curated structured knowledge, much of the critical information still lies in large volumes of unlabeled and unstructured clinical texts. These texts, which often contain protected health information (PHI), are exposed to information extraction tools for downstream applications, risking patient identification. Existing works in de-identifi… ▽ More

    Submitted 10 April, 2022; originally announced April 2022.

    Comments: Accepted by BioNLP'22

  2. arXiv:2203.07761  [pdf, other

    cs.RO cs.AI cs.LG

    Reactive Motion Generation on Learned Riemannian Manifolds

    Authors: Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

    Abstract: In recent decades, advancements in motion learning have enabled robots to acquire new skills and adapt to unseen conditions in both structured and unstructured environments. In practice, motion learning methods capture relevant patterns and adjust them to new conditions such as dynamic obstacle avoidance or variable targets. In this paper, we investigate the robot motion learning paradigm from a R… ▽ More

    Submitted 17 August, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

  3. arXiv:2203.04905  [pdf, other

    cs.CV cs.AI cs.LG

    What Matters For Meta-Learning Vision Regression Tasks?

    Authors: Ning Gao, Hanna Ziesche, Ngo Anh Vien, Michael Volpp, Gerhard Neumann

    Abstract: Meta-learning is widely used in few-shot classification and function regression due to its ability to quickly adapt to unseen tasks. However, it has not yet been well explored on regression tasks with high dimensional inputs such as images. This paper makes two main contributions that help understand this barely explored area. \emph{First}, we design two new types of cross-category level vision re… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  4. arXiv:2202.13680  [pdf, other

    cs.RO

    Hierarchical Policy Learning for Mechanical Search

    Authors: Oussama Zenkri, Ngo Anh Vien, Gerhard Neumann

    Abstract: Retrieving objects from clutters is a complex task, which requires multiple interactions with the environment until the target object can be extracted. These interactions involve executing action primitives like gras** or pushing as well as setting priorities for the objects to manipulate and the actions to execute. Mechanical Search (MS) is a framework for object retrieval, which uses a heurist… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: ICRA 2022

  5. arXiv:2202.09866  [pdf, ps, other

    math.CO

    Number of $k$-normal elements over a finite field

    Authors: Josimar J. R. Aguirre, Victor G. L. Neumann

    Abstract: An element $α\in \mathbb{F}_{q^n}$ is a normal element over $\mathbb{F}_q$ if the conjugates $α^{q^i}$, $0 \leq i \leq n-1$, are linearly independent over $\mathbb{F}_q$. Hence a normal basis for $\mathbb{F}_{q^n}$ over $\mathbb{F}_q$ is of the form $\{α,α^q, \ldots, α^{q^{n-1}}\}$, where $α\in \mathbb{F}_{q^n}$ is normal over $\mathbb{F}_q$. In 2013, Huczynska, Mullen, Panario and Thomson introdu… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

    MSC Class: 12E20; 11T30

  6. arXiv:2112.13151  [pdf, ps, other

    math.NT

    About $r$- primitive and $k$-normal elements in finite fields

    Authors: Cícero Carvalho, Josimar J. R. Aguirre, Victor G. L. Neumann

    Abstract: In 2013, Huczynska, Mullen, Panario and Thomson introduced the concept of $k$-normal elements: an element $α\in \mathbb{F}_{q^n}$ is $k$-normal over $\mathbb{F}_q$ if the greatest common divisor of the polynomials $g_α(x)= αx^{n-1}+α^qx^{n-2}+\ldots +α^{q^{n-2}}x+α^{q^{n-1}}$ and $x^n-1$ in $\mathbb{F}_{q^n}[x]$ has degree $k$, generalizing the concept of normal elements (normal in the usual sense… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    MSC Class: 12E20; 11T23

  7. arXiv:2112.04216  [pdf, other

    cs.LG cs.RO

    Specializing Versatile Skill Libraries using Local Mixture of Experts

    Authors: Onur Celik, Dongzhuoran Zhou, Ge Li, Philipp Becker, Gerhard Neumann

    Abstract: A long-cherished vision in robotics is to equip robots with skills that match the versatility and precision of humans. For example, when playing table tennis, a robot should be capable of returning the ball in various ways while precisely placing it at the desired location. A common approach to model such versatile behavior is to use a Mixture of Experts (MoE) model, where each expert is a context… ▽ More

    Submitted 10 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: published at CoRL 2021 London

  8. arXiv:2111.14681  [pdf, other

    cs.HC

    Human-machine Symbiosis: A Multivariate Perspective for Physically Coupled Human-machine Systems

    Authors: Jairo Inga, Miriam Ruess, Jan Heinrich Robens, Thomas Nelius, Sean Kille, Philipp Dahlinger, Roland Thomaschke, Gerhard Neumann, Sven Matthiesen, Sören Hohmann, Andrea Kiesel

    Abstract: The notion of symbiosis has been increasingly mentioned in research on physically coupled human-machine systems. Yet, a uniform specification on which aspects constitute human-machine symbiosis is missing. By combining the expertise of different disciplines, we elaborate on a multivariate perspective of symbiosis as the highest form of physically coupled human-machine systems. Four dimensions are… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 28 pages, 7 figures

  9. arXiv:2111.08291  [pdf, other

    cs.LG cs.AI eess.SP

    Switching Recurrent Kalman Networks

    Authors: Giao Nguyen-Quynh, Philipp Becker, Chen Qiu, Maja Rudolph, Gerhard Neumann

    Abstract: Forecasting driving behavior or other sensor measurements is an essential component of autonomous driving systems. Often real-world multivariate time series data is hard to model because the underlying dynamics are nonlinear and the observations are noisy. In addition, driving data can often be multimodal in distribution, meaning that there are distinct predictions that are likely, but averaging c… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Autonomous Driving Workshop at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia

  10. arXiv:2111.07667  [pdf, other

    cs.LG

    Versatile Inverse Reinforcement Learning via Cumulative Rewards

    Authors: Niklas Freymuth, Philipp Becker, Gerhard Neumann

    Abstract: Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert. Current approaches usually do this with generative and uni-modal models, meaning that they encode a single behavior. In the common setting, where there are various solutions to a problem and the experts show versatile behavior this severely limits the gene… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted as a workshop paper in 4th Robot Learning Workshop: Self-Supervised and Lifelong Learning @NeurIPS 2021

    ACM Class: I.2.6

  11. Bugs in our Pockets: The Risks of Client-Side Scanning

    Authors: Hal Abelson, Ross Anderson, Steven M. Bellovin, Josh Benaloh, Matt Blaze, Jon Callas, Whitfield Diffie, Susan Landau, Peter G. Neumann, Ronald L. Rivest, Jeffrey I. Schiller, Bruce Schneier, Vanessa Teague, Carmela Troncoso

    Abstract: Our increasing reliance on digital technology for personal, economic, and government affairs has made it essential to secure the communications and devices of private citizens, businesses, and governments. This has led to pervasive use of cryptography across society. Despite its evident advantages, law enforcement and national security agencies have argued that the spread of cryptography has hinde… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: 46 pages, 3 figures

    Journal ref: Journal of Cybersecurity, 10(1), 2024

  12. Cooperative Assistance in Robotic Surgery through Multi-Agent Reinforcement Learning

    Authors: Paul Maria Scheikl, Balázs Gyenes, Tornike Davitashvili, Rayan Younis, André Schulze, Beat P. Müller-Stich, Gerhard Neumann, Martin Wagner, Franziska Mathis-Ullrich

    Abstract: Cognitive cooperative assistance in robot-assisted surgery holds the potential to increase quality of care in minimally invasive interventions. Automation of surgical tasks promises to reduce the mental exertion and fatigue of surgeons. In this work, multi-agent reinforcement learning is demonstrated to be robust to the distribution shift introduced by pairing a learned policy with a human team me… ▽ More

    Submitted 10 October, 2021; originally announced October 2021.

    Comments: Accepted at the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021)

  13. arXiv:2108.03222  [pdf, other

    cs.RO cs.LG

    A Study on Dense and Sparse (Visual) Rewards in Robot Policy Learning

    Authors: Abdalkarim Mohtasib, Gerhard Neumann, Heriberto Cuayahuitl

    Abstract: Deep Reinforcement Learning (DRL) is a promising approach for teaching robots new behaviour. However, one of its main limitations is the need for carefully hand-coded reward signals by an expert. We argue that it is crucial to automate the reward learning process so that new skills can be taught to robots by their users. To address such automation, we consider task success classifiers using visual… ▽ More

    Submitted 6 August, 2021; originally announced August 2021.

  14. arXiv:2107.13487  [pdf, ps, other

    cs.IT math.AG

    A family of codes with variable locality and availability

    Authors: Cícero Carvalho, Victor G. L. Neumann

    Abstract: In this work we present a class of locally recoverable codes, i.e. codes where an erasure at a position $P$ of a codeword may be recovered from the knowledge of the entries in the positions of a recovery set $R_P$. The codes in the class that we define have availability, meaning that for each position $P$ there are several distinct recovery sets. Also, the entry at position $P$ may be recovered ev… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    MSC Class: 11T71 (Primary) 94B27; 14G50 (Secondary)

  15. Navigate-and-Seek: a Robotics Framework for People Localization in Agricultural Environments

    Authors: Riccardo Polvara, Francesco Del Duchetto, Gerhard Neumann, Marc Hanheide

    Abstract: The agricultural domain offers a working environment where many human laborers are nowadays employed to maintain or harvest crops, with huge potential for productivity gains through the introduction of robotic automation. Detecting and localizing humans reliably and accurately in such an environment, however, is a prerequisite to many services offered by fleets of mobile robots collaborating with… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  16. arXiv:2106.05535  [pdf, other

    cs.RO cs.LG

    Differentiable Robust LQR Layers

    Authors: Ngo Anh Vien, Gerhard Neumann

    Abstract: This paper proposes a differentiable robust LQR layer for reinforcement learning and imitation learning under model uncertainty and stochastic dynamics. The robust LQR layer can exploit the advantages of robust optimal control and model-free learning. It provides a new type of inductive bias for stochasticity and uncertainty modeling in control systems. In particular, we propose an efficient way t… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: 10 pages

  17. arXiv:2106.04315  [pdf, other

    cs.RO cs.AI cs.LG

    Learning Riemannian Manifolds for Geodesic Motion Skills

    Authors: Hadi Beik-Mohammadi, Søren Hauberg, Georgios Arvanitidis, Gerhard Neumann, Leonel Rozo

    Abstract: For robots to work alongside humans and perform in unstructured environments, they must learn new motion skills and adapt them to unseen situations on the fly. This demands learning models that capture relevant motion patterns, while offering enough flexibility to adapt the encoded skills to new requirements, such as dynamic obstacle avoidance. We introduce a Riemannian manifold perspective on thi… ▽ More

    Submitted 1 July, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  18. arXiv:2106.04306  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Residual Feedback Learning for Contact-Rich Manipulation Tasks with Uncertainty

    Authors: Alireza Ranjbar, Ngo Anh Vien, Hanna Ziesche, Joschka Boedecker, Gerhard Neumann

    Abstract: While classic control theory offers state of the art solutions in many problem scenarios, it is often desired to improve beyond the structure of such solutions and surpass their limitations. To this end, residual policy learning (RPL) offers a formulation to improve existing controllers with reinforcement learning (RL) by learning an additive "residual" to the output of a given controller. However… ▽ More

    Submitted 6 August, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

  19. arXiv:2101.09207  [pdf, other

    cs.LG cs.IT

    Differentiable Trust Region Layers for Deep Reinforcement Learning

    Authors: Fabian Otto, Philipp Becker, Ngo Anh Vien, Hanna Carolin Ziesche, Gerhard Neumann

    Abstract: Trust region methods are a popular tool in reinforcement learning as they yield robust policy updates in continuous and discrete action spaces. However, enforcing such trust regions in deep reinforcement learning is difficult. Hence, many approaches, such as Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO), are based on approximations. Due to those approximations, the… ▽ More

    Submitted 9 March, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted at ICLR 2021, camera ready version

  20. arXiv:2101.07629  [pdf, ps, other

    cs.IT math.AG

    A family of codes with locality containing optimal codes

    Authors: Bruno Andrade, Cícero Carvalho, Victor G. L. Neumann, Antônio C. P. Veiga

    Abstract: Locally recoverable codes were introduced by Gopalan et al. in 2012, and in the same year Prakash et al. introduced the concept of codes with locality, which are a type of locally recoverable codes. In this work we introduce a new family of codes with locality, which are subcodes of a certain family of evaluation codes. We determine the dimension of these codes, and also bounds for the minimum dis… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    MSC Class: 11T71; 94B27; 14G50

  21. arXiv:2010.10201  [pdf, other

    cs.RO cs.LG

    Action-Conditional Recurrent Kalman Networks For Forward and Inverse Dynamics Learning

    Authors: Vaisakh Shaj, Philipp Becker, Dieter Buchler, Harit Pandya, Niels van Duijkeren, C. James Taylor, Marc Hanheide, Gerhard Neumann

    Abstract: Estimating accurate forward and inverse dynamics models is a crucial component of model-based control for sophisticated robots such as robots driven by hydraulics, artificial muscles, or robots dealing with different contact situations. Analytic models to such processes are often unavailable or inaccurate due to complex hysteresis effects, unmodelled friction and stiction phenomena,and unknown eff… ▽ More

    Submitted 5 November, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: Accepted to Conference On Robot Learning(CoRL), 2020

  22. arXiv:2009.02073  [pdf, other

    cs.CL

    Linguistically inspired morphological inflection with a sequence to sequence model

    Authors: Eleni Metheniti, Guenter Neumann, Josef van Genabith

    Abstract: Inflection is an essential part of every human language's morphology, yet little effort has been made to unify linguistic theory and computational methods in recent years. Methods of string manipulation are used to infer inflectional changes; our research question is whether a neural network would be capable of learning inflectional morphemes for inflection production in a similar way to a human i… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

    Comments: 13 pages, 6 figures

  23. arXiv:2008.10858  [pdf, other

    cs.LG stat.ML

    LowFER: Low-rank Bilinear Pooling for Link Prediction

    Authors: Saadullah Amin, Stalin Varanasi, Katherine Ann Dunfield, Günter Neumann

    Abstract: Knowledge graphs are incomplete by nature, with only a limited number of observed facts from the world knowledge being represented as structured relations between entities. To partly address this issue, an important task in statistical relational learning is that of link prediction or knowledge graph completion. Both linear and non-linear models have been proposed to solve the problem. Bilinear mo… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: Accepted by ICML'20

  24. arXiv:2008.04007  [pdf, other

    cs.RO cs.AI cs.LG

    Imitation Learning for Autonomous Trajectory Learning of Robot Arms in Space

    Authors: RB Ashith Shyam, Zhou Hao, Umberto Montanaro, Gerhard Neumann

    Abstract: This work adds on to the on-going efforts to provide more autonomy to space robots. Here the concept of programming by demonstration or imitation learning is used for trajectory planning of manipulators mounted on small spacecraft. For greater autonomy in future space missions and minimal human intervention through ground control, a robot arm having 7-Degrees of Freedom (DoF) is envisaged for carr… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

  25. arXiv:2008.03525  [pdf, other

    cs.LG cs.IT cs.RO stat.ML

    Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

    Authors: Oleg Arenz, Gerhard Neumann

    Abstract: Many modern methods for imitation learning and inverse reinforcement learning, such as GAIL or AIRL, are based on an adversarial formulation. These methods apply GANs to match the expert's distribution over states and actions with the implicit state-action distribution induced by the agent's policy. However, by framing imitation learning as a saddle point problem, adversarial methods can suffer fr… ▽ More

    Submitted 8 August, 2020; originally announced August 2020.

  26. arXiv:2007.11169  [pdf, ps, other

    math.NT

    Existence of primitive $2$-normal elements in finite fields

    Authors: Victor G. L. Neumann, Josimar J. R. Aguirre

    Abstract: An element $α\in \mathbb{F}_{q^n}$ is normal over $\mathbb{F}_q$ if $\mathcal{B}=\{α, α^q, α^{q^2}, \cdots, α^{q^{n-1}}\}$ forms a basis of $\mathbb{F}_{q^n}$ as a vector space over $\mathbb{F}_q$. It is well known that $α\in \mathbb{F}_{q^n}$ is normal over $\mathbb{F}_q$ if and only if $g_α(x)=αx^{n-1}+α^q x^{n-2}+ \cdots + α^{q^{n-2}}x+α^{q^{n-1}}$ and $x^n-1$ are relatively prime over… ▽ More

    Submitted 22 December, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    MSC Class: 12E20; 11T23

  27. arXiv:2007.09787  [pdf, ps, other

    math.NT

    On the existence of pairs of primitive and normal elements over finite fields

    Authors: Cícero Carvalho, João Paulo Guardieiro, Victor G. L. Neumann, Guilherme Tizziotti

    Abstract: Let $\mathbb{F}_{q^n}$ be a finite field with $q^n$ elements, and let $m_1$ and $m_2$ be positive integers. Given polynomials $f_1(x), f_2(x) \in \mathbb{F}_q[x]$ with $\textrm{deg}(f_i(x)) \leq m_i$, for $i = 1, 2$, and such that the rational function $f_1(x)/f_2(x)$ belongs to a certain set which we define, we present a sufficient condition for the existence of a primitive element… ▽ More

    Submitted 14 March, 2021; v1 submitted 19 July, 2020; originally announced July 2020.

    MSC Class: 12E20; 11T23

  28. Extended gravitational clock compass: new exact solutions and simulations

    Authors: Gerald Neumann, Dirk Puetzfeld, Guillermo F. Rubilar

    Abstract: By extending the framework of the gravitational clock compass we show how a suitably prepared set of clocks can be used to extract information about the gravitational field in the context of General Relativity. Conceptual differences between the extended and the standard clock compass are highlighted. Particular attention is paid to the influence of kinematic quantities on the measurement process… ▽ More

    Submitted 20 August, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: 20 pages, 18 figures

    Journal ref: Phys. Rev. D 102, 044027 (2020)

  29. arXiv:2005.12565  [pdf, other

    cs.CL cs.LG

    A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction

    Authors: Saadullah Amin, Katherine Ann Dunfield, Anna Vechkaeva, Günter Neumann

    Abstract: Fact triples are a common form of structured knowledge used within the biomedical domain. As the amount of unstructured scientific texts continues to grow, manual annotation of these texts for the task of relation extraction becomes increasingly expensive. Distant supervision offers a viable approach to combat this by quickly producing large amounts of labeled, but considerably noisy, data. We aim… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.

  30. arXiv:2002.11495  [pdf, other

    cs.RO

    Probabilistic approach to physical object disentangling

    Authors: Joni Pajarinen, Oleg Arenz, Jan Peters, Gerhard Neumann

    Abstract: Physically disentangling entangled objects from each other is a problem encountered in waste segregation or in any task that requires disassembly of structures. Often there are no object models, and, especially with cluttered irregularly shaped objects, the robot can not create a model of the scene due to occlusion. One of our key insights is that based on previous sensory input we are only intere… ▽ More

    Submitted 12 April, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

  31. arXiv:2002.01867  [pdf, ps, other

    math.NT

    On existence of some special pair of primitive elements over finite fields

    Authors: C. Carvalho, J. P. G. Sousa, V. G. L. Neumann, G. Tizziotti

    Abstract: In this paper we generalize the results of Sharma, Awasthi and Gupta (see \cite{SAG}). We work over a field of any characteristic with $q = p^k$ elements and we give a sufficient condition for the existence of a primitive element $α\in \mathbb{F}_{p^k}$ such that $f(α)$ is also primitive in $\mathbb{F}_{p^k}$, where $f(x) \in \mathbb{F}_{p^k}(x)$ is a quotient of polynomials with some restrictions… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  32. arXiv:2001.08682  [pdf, other

    cs.LG stat.ML

    Expected Information Maximization: Using the I-Projection for Mixture Density Estimation

    Authors: Philipp Becker, Oleg Arenz, Gerhard Neumann

    Abstract: Modelling highly multi-modal data is a challenging problem in machine learning. Most algorithms are based on maximizing the likelihood, which corresponds to the M(oment)-projection of the data distribution to the model distribution. The M-projection forces the model to average over modes it cannot represent. In contrast, the I(information)-projection ignores such modes in the data and concentrates… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

  33. arXiv:1907.04710  [pdf, other

    cs.LG stat.ML

    Trust-Region Variational Inference with Gaussian Mixture Models

    Authors: Oleg Arenz, Mingjun Zhong, Gerhard Neumann

    Abstract: Many methods for machine learning rely on approximate inference from intractable probability distributions. Variational inference approximates such distributions by tractable models that can be subsequently used for approximate inference. Learning sufficiently accurate approximations requires a rich model family and careful exploration of the relevant modes of the target distribution. We propose a… ▽ More

    Submitted 4 August, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Journal ref: Journal of Machine Learning Research. 21(163):1-60, 2020

  34. arXiv:1905.07357  [pdf, other

    cs.LG stat.ML

    Recurrent Kalman Networks: Factorized Inference in High-Dimensional Deep Feature Spaces

    Authors: Philipp Becker, Harit Pandya, Gregor Gebhardt, Cheng Zhao, James Taylor, Gerhard Neumann

    Abstract: In order to integrate uncertainty estimates into deep time-series modelling, Kalman Filters (KFs) (Kalman et al., 1960) have been integrated with deep learning models, however, such approaches typically rely on approximate inference techniques such as variational inference which makes learning more complex and often less scalable due to approximation errors. We propose a new deep approach to Kalma… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: accepted at ICML 2019

  35. arXiv:1903.09458  [pdf, ps, other

    math.AG

    An extension of Delsarte, Goethals and Mac Williams theorem on minimal weight codewords to a class of Reed-Muller type codes

    Authors: Cicero Carvalho, Victor G. L. Neumann

    Abstract: In 1970 Delsarte, Goethals and Mac Williams published a seminal paper on generalized Reed-Muller codes where, among many important results, they proved that the minimal weight codewords of these codes are obtained through the evaluation of certain polynomials which are a specific product of linear factors, which they describe. In the present paper we extend this result to a class of Reed-Muller ty… ▽ More

    Submitted 22 March, 2019; originally announced March 2019.

    Comments: 28 pages

    MSC Class: 14G50; 94B27

  36. arXiv:1902.02823  [pdf, other

    cs.LG stat.ML

    Compatible Natural Gradient Policy Search

    Authors: Joni Pajarinen, Hong Linh Thai, Riad Akrour, Jan Peters, Gerhard Neumann

    Abstract: Trust-region methods have yielded state-of-the-art results in policy search. A common approach is to use KL-divergence to bound the region of trust resulting in a natural gradient policy update. We show that the natural gradient and trust region optimization are equivalent if we use the natural parameterization of a standard exponential policy distribution in combination with compatible value func… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.

  37. An Algorithmic Perspective on Imitation Learning

    Authors: Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J. Andrew Bagnell, Pieter Abbeel, Jan Peters

    Abstract: As robots and other intelligent agents move from simple environments and problems to more complex, unstructured settings, manually programming their behavior has become increasingly challenging and expensive. Often, it is easier for a teacher to demonstrate a desired behavior rather than attempt to manually engineer it. This process of learning from demonstrations, and the study of algorithms to d… ▽ More

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: 187 pages. Published in Foundations and Trends in Robotics

  38. arXiv:1808.10648  [pdf, other

    cs.LG cs.RO stat.ML

    Adaptation and Robust Learning of Probabilistic Movement Primitives

    Authors: Sebastian Gomez-Gonzalez, Gerhard Neumann, Bernhard Schölkopf, Jan Peters

    Abstract: Probabilistic representations of movement primitives open important new possibilities for machine learning in robotics. These representations are able to capture the variability of the demonstrations from a teacher as a probability distribution over trajectories, providing a sensible region of exploration and the ability to adapt to changes in the robot environment. However, to be able to capture… ▽ More

    Submitted 19 February, 2020; v1 submitted 31 August, 2018; originally announced August 2018.

  39. arXiv:1808.06453  [pdf, other

    cs.NI cs.LG

    Towards Fine Grained Network Flow Prediction

    Authors: Patrick Jahnke, Emmanuel Stapf, Jonas Mieseler, Gerhard Neumann, Patrick Eugster

    Abstract: One main challenge for the design of networks is that traffic load is not generally known in advance. This makes it hard to adequately devote resources such as to best prevent or mitigate bottlenecks. While several authors have shown how to predict traffic in a coarse grained manner by aggregating flows, fine grained prediction of traffic at the level of individual flows, including bursty traffic,… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

  40. arXiv:1807.06613  [pdf, other

    cs.MA cs.AI cs.LG eess.SY stat.ML

    Deep Reinforcement Learning for Swarm Systems

    Authors: Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

    Abstract: Recently, deep reinforcement learning (RL) methods have been applied successfully to multi-agent scenarios. Typically, these methods rely on a concatenation of agent states to represent the information content required for decentralized decision making. However, concatenation scales poorly to swarm systems with a large number of homogeneous agents as it does not exploit the fundamental properties… ▽ More

    Submitted 6 June, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: 31 pages, 12 figures, version 3 (published in JMLR Volume 20)

    Journal ref: Journal of Machine Learning Research 20(54):1-31, 2019

  41. arXiv:1806.06762  [pdf

    cs.RO

    Agricultural Robotics: The Future of Robotic Agriculture

    Authors: Tom Duckett, Simon Pearson, Simon Blackmore, Bruce Grieve, Wen-Hua Chen, Grzegorz Cielniak, Jason Cleaversmith, Jian Dai, Steve Davis, Charles Fox, Pål From, Ioannis Georgilas, Richie Gill, Iain Gould, Marc Hanheide, Alan Hunter, Fumiya Iida, Lyudmila Mihalyova, Samia Nefti-Meziani, Gerhard Neumann, Paolo Paoletti, Tony Pridmore, Dave Ross, Melvyn Smith, Martin Stoelen , et al. (5 additional authors not shown)

    Abstract: Agri-Food is the largest manufacturing sector in the UK. It supports a food chain that generates over £108bn p.a., with 3.9m employees in a truly international industry and exports £20bn of UK manufactured goods. However, the global food chain is under pressure from population growth, climate change, political pressures affecting migration, population drift from rural to urban regions and the demo… ▽ More

    Submitted 2 August, 2018; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: UK-RAS Network White Papers, ISSN 2398-4414

  42. arXiv:1804.08426  [pdf, ps, other

    cs.CL cs.AI

    LightRel SemEval-2018 Task 7: Lightweight and Fast Relation Classification

    Authors: Tyler Renslow, Günter Neumann

    Abstract: We present LightRel, a lightweight and fast relation classifier. Our goal is to develop a high baseline for different relation extraction tasks. By defining only very few data-internal, word-level features and external knowledge sources in the form of word clusters and word embeddings, we train a fast and simple linear classifier.

    Submitted 19 April, 2018; originally announced April 2018.

    Comments: SemEval-2018 task 7 Semantic Relation Extraction and Classification in Scientific Papers

  43. arXiv:1709.07224  [pdf, other

    cs.MA cs.AI cs.LG eess.SY stat.ML

    Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning

    Authors: Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

    Abstract: Swarm systems constitute a challenging problem for reinforcement learning (RL) as the algorithm needs to learn decentralized control policies that can cope with limited local sensing and communication abilities of the agents. While it is often difficult to directly define the behavior of the agents, simple communication protocols can be defined more easily using prior knowledge about the given tas… ▽ More

    Submitted 18 July, 2018; v1 submitted 21 September, 2017; originally announced September 2017.

    Comments: 13 pages, 4 figures, version 2, accepted at ANTS 2018

  44. arXiv:1709.06011  [pdf, other

    cs.MA cs.AI cs.LG eess.SY stat.ML

    Guided Deep Reinforcement Learning for Swarm Systems

    Authors: Maximilian Hüttenrauch, Adrian Šošić, Gerhard Neumann

    Abstract: In this paper, we investigate how to learn to control a group of cooperative agents with limited sensing capabilities such as robot swarms. The agents have only very basic sensor capabilities, yet in a group they can accomplish sophisticated tasks, such as distributed assembly or search and rescue tasks. Learning a policy for a group of agents is difficult due to distributed partial observability… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

    Comments: 15 pages, 8 figures, accepted at the AAMAS 2017 Autonomous Robots and Multirobot Systems (ARMS) Workshop

  45. arXiv:1704.04441  [pdf, other

    cs.CL

    How Robust Are Character-Based Word Embeddings in Tagging and MT Against Wrod Scramlbing or Randdm Nouse?

    Authors: Georg Heigold, Günter Neumann, Josef van Genabith

    Abstract: This paper investigates the robustness of NLP against perturbed word forms. While neural approaches can achieve (almost) human-like accuracy for certain tasks and conditions, they often are sensitive to small changes in the input such as non-canonical input (e.g., typos). Yet both stability and robustness are desired properties in applications involving user-generated content, and the more as huma… ▽ More

    Submitted 14 April, 2017; originally announced April 2017.

    Comments: 9 pages

  46. arXiv:1702.04396  [pdf, other

    cs.RO

    Hybrid control trajectory optimization under uncertainty

    Authors: Joni Pajarinen, Ville Kyrki, Michael Koval, Siddhartha Srinivasa, Jan Peters, Gerhard Neumann

    Abstract: Trajectory optimization is a fundamental problem in robotics. While optimization of continuous control trajectories is well developed, many applications require both discrete and continuous, i.e., hybrid, controls. Finding an optimal sequence of hybrid controls is challenging due to the exponential explosion of discrete control combinations. Our method, based on Differential Dynamic Programming (D… ▽ More

    Submitted 2 March, 2017; v1 submitted 14 February, 2017; originally announced February 2017.

  47. arXiv:1701.01663  [pdf, ps, other

    cs.IT math.AG

    On the next-to-minimal weight of projective Reed-Muller codes

    Authors: Cícero Carvalho, Victor G. L. Neumann

    Abstract: In this paper we present several values for the next-to-minimal weights of projective Reed-Muller codes. We work over $\mathbb{F}_q$ with $q \geq 3$ since in IEEE-IT 62(11) p. 6300-6303 (2016) we have determined the complete values for the next-to-minimal weights of binary projective Reed-Muller codes. As in loc. cit. here we also find examples of codewords with next-to-minimal weight whose set of… ▽ More

    Submitted 6 January, 2017; originally announced January 2017.

    Comments: 9 pages. arXiv admin note: text overlap with arXiv:1701.01658

    MSC Class: 94B27; 94B60

  48. The next-to-minimal weights of binary projective Reed-Muller codes

    Authors: Cícero Carvalho, Victor G. L. Neumann

    Abstract: Projective Reed-Muller codes were introduced by Lachaud, in 1988 and their dimension and minimum distance were determined by Serre and Sørensen in 1991. In coding theory one is also interested in the higher Hamming weights, to study the code performance. Yet, not many values of the higher Hamming weights are known for these codes, not even the second lowest weight (also known as next-to-minimal we… ▽ More

    Submitted 6 January, 2017; originally announced January 2017.

    Comments: 10 pages, Published in IEEE Transactions on Information Theory, vol. 62, issue 11, Nov. 2016

    MSC Class: 94B27; 94B60

  49. arXiv:1611.03231  [pdf, ps, other

    stat.ML cs.LG

    Policy Search with High-Dimensional Context Variables

    Authors: Voot Tangkaratt, Herke van Hoof, Simone Parisi, Gerhard Neumann, Jan Peters, Masashi Sugiyama

    Abstract: Direct contextual policy search methods learn to improve policy parameters and simultaneously generalize these parameters to different context or task variables. However, learning from high-dimensional context variables, such as camera images, is still a prominent problem in many real-world tasks. A naive application of unsupervised dimensionality reduction methods to the context variables, such a… ▽ More

    Submitted 10 November, 2016; originally announced November 2016.

  50. arXiv:1606.09197  [pdf, other

    cs.LG cs.RO

    Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

    Authors: Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann

    Abstract: Many of the recent trajectory optimization algorithms alternate between linear approximation of the system dynamics around the mean trajectory and conservative policy update. One way of constraining the policy change is by bounding the Kullback-Leibler (KL) divergence between successive policies. These approaches already demonstrated great experimental success in challenging problems such as end-t… ▽ More

    Submitted 2 July, 2018; v1 submitted 29 June, 2016; originally announced June 2016.