Showing 1–11 of 11 results for author: Peer, D

  1. arXiv:2402.03848

    cs.CL cs.AI

    ANLS* -- A Universal Document Processing Metric for Generative Large Language Models

    Authors: David Peer, Philemon Schöpf, Volckmar Nebendahl, Alexander Rietzler, Sebastian Stabinger

    Abstract: Traditionally, discriminative models have been the predominant choice for tasks like document classification and information extraction. These models make predictions that fall into a limited number of predefined classes, facilitating a binary true or false evaluation and enabling the direct calculation of metrics such as the F1 score. However, recent advancements in generative large language mode…

    Submitted 28 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  2. arXiv:2211.05200

    cs.CV cs.RO

    Affordance detection with Dynamic-Tree Capsule Networks

    Authors: Antonio Rodríguez-Sánchez, Simon Haller-Seeber, David Peer, Chris Engelhardt, Jakob Mittelberger, Matteo Saveriano

    Abstract: Affordance detection from visual input is a fundamental step in autonomous robotic manipulation. Existing solutions to the problem of affordance detection rely on convolutional neural networks. However, these networks do not consider the spatial arrangement of the input data and miss parts-to-whole relationships. Therefore, they fall short when confronted with novel, previously unseen object insta…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: IEEE-RAS International Conference on Humanoid Robots (Humanoids 2022)

  3. arXiv:2208.01134

    cs.LG

    Improving the Trainability of Deep Neural Networks through Layerwise Batch-Entropy Regularization

    Authors: David Peer, Bart Keulen, Sebastian Stabinger, Justus Piater, Antonio Rodríguez-Sánchez

    Abstract: Training deep neural networks is a very demanding task, especially challenging is how to adapt architectures to improve the performance of trained models. We can find that sometimes, shallow networks generalize better than deep networks, and the addition of more layers results in higher training and test errors. The deep residual learning framework addresses this degradation problem by adding skip…

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted at TMLR (07/2022): https://openreview.net/forum?id=LJohl5DnZf

  4. arXiv:2201.11091

    cs.CV cs.AI cs.LG

    Momentum Capsule Networks

    Authors: Josef Gugglberger, David Peer, Antonio Rodríguez-Sánchez

    Abstract: Capsule networks are a class of neural networks that achieved promising results on many computer vision tasks. However, baseline capsule networks have failed to reach state-of-the-art results on more complex datasets due to the high computation and memory requirements. We tackle this problem by proposing a new network architecture, called Momentum Capsule Network (MoCapsNet). MoCapsNets are inspir…

    Submitted 25 August, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  5. Greedy-layer Pruning: Speeding up Transformer Models for Natural Language Processing

    Authors: David Peer, Sebastian Stabinger, Stefan Engl, Antonio Rodriguez-Sanchez

    Abstract: Fine-tuning transformer models after unsupervised pre-training reaches a very high performance on many different natural language processing tasks. Unfortunately, transformers suffer from long inference times which greatly increases costs in production. One possible solution is to use knowledge distillation, which solves this problem by transferring information from large teacher models to smaller…

    Submitted 29 March, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted at Pattern Recognition Letters

  6. arXiv:2104.07393

    cs.CV

    Training Deep Capsule Networks with Residual Connections

    Authors: Josef Gugglberger, David Peer, Antonio Rodriguez-Sanchez

    Abstract: Capsule networks are a type of neural network that have recently gained increased popularity. They consist of groups of neurons, called capsules, which encode properties of objects or object parts. The connections between capsules encrypt part-whole relationships between objects through routing algorithms which route the output of capsules from lower level layers to upper level layers. Capsule net…

    Submitted 15 April, 2021; originally announced April 2021.

    Comments: 12 pages

  7. arXiv:2103.04331

    cs.LG cs.CV

    Auto-tuning of Deep Neural Networks by Conflicting Layer Removal

    Authors: David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

    Abstract: Designing neural network architectures is a challenging task and knowing which specific layers of a model must be adapted to improve the performance is almost a mystery. In this paper, we introduce a novel methodology to identify layers that decrease the test accuracy of trained models. Conflicting layers are detected as early as the beginning of training. In the worst-case scenario, we prove that…

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2011.02956

  8. arXiv:2102.11944

    cs.CV cs.AI cs.LG

    Arguments for the Unsuitability of Convolutional Neural Networks for Non-Local Tasks

    Authors: Sebastian Stabinger, David Peer, Antonio Rodríguez-Sánchez

    Abstract: Convolutional neural networks have established themselves over the past years as the state of the art method for image classification, and for many datasets, they even surpass humans in categorizing images. Unfortunately, the same architectures perform much worse when they have to compare parts of an image to each other to correctly classify this image. Until now, no well-formed theoretical argu…

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Under review at Neural Networks Journal

  9. arXiv:2011.02956

    cs.LG cs.CV

    Conflicting Bundles: Adapting Architectures Towards the Improved Training of Deep Neural Networks

    Authors: David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

    Abstract: Designing neural network architectures is a challenging task and knowing which specific layers of a model must be adapted to improve the performance is almost a mystery. In this paper, we introduce a novel theory and metric to identify layers that decrease the test accuracy of the trained models, this identification is done as early as at the beginning of training. In the worst-case, such a layer…

    Submitted 5 November, 2020; originally announced November 2020.

    Comments: Accepted at WACV2021

  10. arXiv:1905.08744

    cs.LG stat.ML

    Limitation of capsule networks

    Authors: David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

    Abstract: A recently proposed method in deep learning groups multiple neurons to capsules such that each capsule represents an object or part of an object. Routing algorithms route the output of capsules from lower-level layers to upper-level layers. In this paper, we prove that state-of-the-art routing procedures decrease the expressivity of capsule networks. More precisely, it is shown that EM-routing and…

    Submitted 19 January, 2021; v1 submitted 21 May, 2019; originally announced May 2019.

  11. arXiv:1812.09707

    cs.LG cs.AI stat.ML

    Increasing the adversarial robustness and explainability of capsule networks with $γ$-capsules

    Authors: David Peer, Sebastian Stabinger, Antonio Rodriguez-Sanchez

    Abstract: In this paper we introduce a new inductive bias for capsule networks and call networks that use this prior $γ$-capsule networks. Our inductive bias that is inspired by TE neurons of the inferior temporal cortex increases the adversarial robustness and the explainability of capsule networks. A theoretical framework with formal definitions of $γ$-capsule networks and metrics for evaluation are also…

    Submitted 5 December, 2019; v1 submitted 23 December, 2018; originally announced December 2018.