Skip to main content

Showing 1–22 of 22 results for author: Frintrop, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05601  [pdf, ps, other

    cs.LG

    Select High-Level Features: Efficient Experts from a Hierarchical Classification Network

    Authors: André Kelm, Niels Hannemann, Bruno Heberle, Lucas Schmidt, Tim Rolff, Christian Wilms, Ehsan Yaghoubi, Simone Frintrop

    Abstract: This study introduces a novel expert generation method that dynamically reduces task and computational complexity without compromising predictive performance. It is based on a new hierarchical classification network topology that combines sequential processing of generic low-level features with parallelism and nesting of high-level features. This structure allows for the innovative extraction tech… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: This two-page paper was accepted for a poster presentation at the 5th ICLR 2024 Workshop on Practical ML for Limited/Low Resource Settings (PML4LRS)

  2. arXiv:2311.05029  [pdf, other

    cs.CV

    S$^3$AD: Semi-supervised Small Apple Detection in Orchard Environments

    Authors: Robert Johanson, Christian Wilms, Ole Johannsen, Simone Frintrop

    Abstract: Crop detection is integral for precision agriculture applications such as automated yield estimation or fruit picking. However, crop detection, e.g., apple detection in orchard environments remains challenging due to a lack of large-scale datasets and the small relative size of the crops in the image. In this work, we address these challenges by reformulating the apple detection task in a semi-sup… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted at WACV 2024. The dataset MAD is available at http://www.inf.uni-hamburg.de/mad

  3. arXiv:2308.05128  [pdf, other

    cs.CV

    High-Level Parallelism and Nested Features for Dynamic Inference Cost and Top-Down Attention

    Authors: André Peter Kelm, Niels Hannemann, Bruno Heberle, Lucas Schmidt, Tim Rolff, Christian Wilms, Ehsan Yaghoubi, Simone Frintrop

    Abstract: This paper introduces a novel network topology that seamlessly integrates dynamic inference cost with a top-down attention mechanism, addressing two significant gaps in traditional deep learning models. Drawing inspiration from human perception, we combine sequential processing of generic low-level features with parallelism and nesting of high-level features. This design not only reflects a findin… ▽ More

    Submitted 7 March, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: This arXiv paper's findings on high-level parallelism and nested features directly contributes to 'Selecting High-Level Features: Efficient Experts from a Hierarchical Classification Network,' accepted at ICLR 2024's Practical ML for Low Resource Settings (PML4LRS) workshop (non-archival)

  4. arXiv:2307.15191  [pdf, other

    cs.CV

    Small, but important: Traffic light proposals for detecting small traffic lights and beyond

    Authors: Tom Sanitz, Christian Wilms, Simone Frintrop

    Abstract: Traffic light detection is a challenging problem in the context of self-driving cars and driver assistance systems. While most existing systems produce good results on large traffic lights, detecting small and tiny ones is often overlooked. A key problem here is the inherent downsampling in CNNs, leading to low-resolution features for detection. To mitigate this problem, we propose a new traffic l… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted at ICVS 2023

  5. arXiv:2306.01432  [pdf, other

    eess.AS cs.LG

    Audio-Visual Speech Enhancement with Score-Based Generative Models

    Authors: Julius Richter, Simone Frintrop, Timo Gerkmann

    Abstract: This paper introduces an audio-visual speech enhancement system that leverages score-based generative models, also known as diffusion models, conditioned on visual information. In particular, we exploit audio-visual embeddings obtained from a self-super\-vised learning model that has been fine-tuned on lipreading. The layer-wise features of its transformer-based encoder are aggregated, time-aligne… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Submitted to ITG Conference on Speech Communication

  6. arXiv:2304.07593  [pdf, other

    cs.CV cs.AI

    Teacher Network Calibration Improves Cross-Quality Knowledge Distillation

    Authors: Pia Čuk, Robin Senge, Mikko Lauri, Simone Frintrop

    Abstract: We investigate cross-quality knowledge distillation (CQKD), a knowledge distillation method where knowledge from a teacher network trained with full-resolution images is transferred to a student network that takes as input low-resolution images. As image size is a deciding factor for the computational load of computer vision applications, CQKD notably reduces the requirements by only using the stu… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: The implementation is available at: https://github.com/PiaCuk/distillistic

  7. arXiv:2211.13494  [pdf, other

    cs.CV cs.GR cs.HC cs.LG

    Immersive Neural Graphics Primitives

    Authors: Ke Li, Tim Rolff, Susanne Schmidt, Reinhard Bacher, Simone Frintrop, Wim Leemans, Frank Steinicke

    Abstract: Neural radiance field (NeRF), in particular its extension by instant neural graphics primitives, is a novel rendering method for view synthesis that uses real-world images to build photo-realistic immersive virtual scenes. Despite its potential, research on the combination of NeRF and virtual reality (VR) remains sparse. Currently, there is no integration into typical VR systems available, and the… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Submitted to IEEE VR, currently under review

  8. arXiv:2203.11358  [pdf, other

    cs.CV

    Segmenting Medical Instruments in Minimally Invasive Surgeries using AttentionMask

    Authors: Christian Wilms, Alexander Michael Gerlach, Rüdiger Schmitz, Simone Frintrop

    Abstract: Precisely locating and segmenting medical instruments in images of minimally invasive surgeries, medical instrument segmentation, is an essential first step for several tasks in medical image processing. However, image degradations, small instruments, and the generalization between different surgery types make medical instrument segmentation challenging. To cope with these challenges, we adapt the… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  9. arXiv:2202.11372  [pdf, other

    cs.CV

    Localizing Small Apples in Complex Apple Orchard Environments

    Authors: Christian Wilms, Robert Johanson, Simone Frintrop

    Abstract: The localization of fruits is an essential first step in automated agricultural pipelines for yield estimation or fruit picking. One example of this is the localization of apples in images of entire apple trees. Since the apples are very small objects in such scenarios, we tackle this problem by adapting the object proposal generation system AttentionMask that focuses on small objects. We adapt At… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  10. arXiv:2108.03503  [pdf, other

    cs.CV

    DeepFH Segmentations for Superpixel-based Object Proposal Refinement

    Authors: Christian Wilms, Simone Frintrop

    Abstract: Class-agnostic object proposal generation is an important first step in many object detection pipelines. However, object proposals of modern systems are rather inaccurate in terms of segmentation and only roughly adhere to object boundaries. Since typical refinement steps are usually not applicable to thousands of proposals, we propose a superpixel-based refinement system for object proposal gener… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted by IVC

  11. arXiv:2103.01977  [pdf, other

    cs.CV

    CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds

    Authors: Ge Gao, Mikko Lauri, Xiaolin Hu, Jianwei Zhang, Simone Frintrop

    Abstract: It is often desired to train 6D pose estimation systems on synthetic data because manual annotation is expensive. However, due to the large domain gap between the synthetic and real images, synthesizing color images is expensive. In contrast, this domain gap is considerably smaller and easier to fill for depth information. In this work, we present a system that regresses 6D object pose from depth… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted to ICRA 2021

  12. The MSR-Video to Text Dataset with Clean Annotations

    Authors: Haoran Chen, Jianmin Li, Simone Frintrop, Xiaolin Hu

    Abstract: Video captioning automatically generates short descriptions of the video content, usually in form of a single sentence. Many methods have been proposed for solving this task. A large dataset called MSR Video to Text (MSR-VTT) is often used as the benchmark dataset for testing the performance of the methods. However, we found that the human annotations, i.e., the descriptions of video contents in t… ▽ More

    Submitted 25 February, 2024; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: The paper is under consideration at Computer Vision and Image Understanding

    MSC Class: 68T45; 68T50 ACM Class: I.2.10; I.2.7

    Journal ref: Computer Vision and Image Understanding, 225, p.103581 (2022)

  13. arXiv:2101.04574  [pdf, other

    cs.CV

    Superpixel-based Refinement for Object Proposal Generation

    Authors: Christian Wilms, Simone Frintrop

    Abstract: Precise segmentation of objects is an important problem in tasks like class-agnostic object proposal generation or instance segmentation. Deep learning-based systems usually generate segmentations of objects based on coarse feature maps, due to the inherent downsampling in CNNs. This leads to segmentation boundaries not adhering well to the object boundaries in the image. To tackle this problem, w… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: Accepted at ICPR 2020. Code is available at https://github.com/chwilms/superpixelRefinement

  14. Multi-Sensor Next-Best-View Planning as Matroid-Constrained Submodular Maximization

    Authors: Mikko Lauri, Joni Pajarinen, Jan Peters, Simone Frintrop

    Abstract: 3D scene models are useful in robotics for tasks such as path planning, object manipulation, and structural inspection. We consider the problem of creating a 3D model using depth images captured by a team of multiple robots. Each robot selects a viewpoint and captures a depth image from it, and the images are fused to update the scene model. The process is repeated until a scene model of desired q… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: 8 pages, 7 figures. Accepted for publication in IEEE Robotics and Automation Letters

  15. arXiv:2001.08942  [pdf, other

    cs.CV

    6D Object Pose Regression via Supervised Learning on Point Clouds

    Authors: Ge Gao, Mikko Lauri, Yulong Wang, Xiaolin Hu, Jianwei Zhang, Simone Frintrop

    Abstract: This paper addresses the task of estimating the 6 degrees of freedom pose of a known 3D object from depth information represented by a point cloud. Deep features learned by convolutional neural networks from color information have been the dominant features to be used for inferring object poses, while depth information receives much less attention. However, depth information contains rich geometri… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  16. arXiv:1811.08728  [pdf, other

    cs.CV

    AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects

    Authors: Christian Wilms, Simone Frintrop

    Abstract: We propose a novel approach for class-agnostic object proposal generation, which is efficient and especially well-suited to detect small objects. Efficiency is achieved by scale-specific objectness attention maps which focus the processing on promising parts of the image and reduce the amount of sampled windows strongly. This leads to a system, which is $33\%$ faster than the state-of-the-art and… ▽ More

    Submitted 21 November, 2018; originally announced November 2018.

    Comments: Accepted at ACCV 2018. Code is available at https://github.com/chwilms/AttentionMask

  17. arXiv:1811.04309  [pdf, ps, other

    cs.CV

    Multi-label Object Attribute Classification using a Convolutional Neural Network

    Authors: Soubarna Banik, Mikko Lauri, Simone Frintrop

    Abstract: Objects of different classes can be described using a limited number of attributes such as color, shape, pattern, and texture. Learning to detect object attributes instead of only detecting objects can be helpful in dealing with a priori unknown objects. With this inspiration, a deep convolutional neural network for low-level object attribute classification, called the Deep Attribute Network (DAN)… ▽ More

    Submitted 10 November, 2018; originally announced November 2018.

    Comments: 14 pages, 7 figures

  18. arXiv:1809.04226  [pdf, other

    cs.RO

    Attention based visual analysis for fast grasp planning with multi-fingered robotic hand

    Authors: Zhen Deng, Ge Gao, Simone Frintrop, Jianwei Zhang

    Abstract: We present an attention based visual analysis framework to compute grasp-relevant information in order to guide grasp planning using a multi-fingered robotic hand. Our approach uses a computational visual attention model to locate regions of interest in a scene, and uses a deep convolutional neural network to detect grasp type and point for a sub-region of the object presented in a region of inter… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

  19. arXiv:1808.05498  [pdf, other

    cs.CV

    Occlusion Resistant Object Rotation Regression from Point Cloud Segments

    Authors: Ge Gao, Mikko Lauri, Jianwei Zhang, Simone Frintrop

    Abstract: Rotation estimation of known rigid objects is important for robotic applications such as dexterous manipulation. Most existing methods for rotation estimation use intermediate representations such as templates, global or local feature descriptors, or object coordinates, which require multiple steps in order to infer the object pose. We propose to directly regress a pose vector from raw point cloud… ▽ More

    Submitted 2 December, 2018; v1 submitted 16 August, 2018; originally announced August 2018.

    Comments: Proceeding of the ECCV18 workshop on Recovering 6D Object Pose

  20. arXiv:1704.04054  [pdf, other

    cs.CV

    Saliency-guided Adaptive Seeding for Supervoxel Segmentation

    Authors: Ge Gao, Mikko Lauri, Jianwei Zhang, Simone Frintrop

    Abstract: We propose a new saliency-guided method for generating supervoxels in 3D space. Rather than using an evenly distributed spatial seeding procedure, our method uses visual saliency to guide the process of supervoxel generation. This results in densely distributed, small, and precise supervoxels in salient regions which often contain objects, and larger supervoxels in less salient regions that often… ▽ More

    Submitted 19 October, 2017; v1 submitted 13 April, 2017; originally announced April 2017.

    Comments: 6 pages, accepted to IROS2017

  21. arXiv:1704.03706  [pdf, other

    cs.CV

    Object proposal generation applying the distance dependent Chinese restaurant process

    Authors: Mikko Lauri, Simone Frintrop

    Abstract: In application domains such as robotics, it is useful to represent the uncertainty related to the robot's belief about the state of its environment. Algorithms that only yield a single "best guess" as a result are not sufficient. In this paper, we propose object proposal generation based on non-parametric Bayesian inference that allows quantification of the likelihood of the proposals. We apply Ma… ▽ More

    Submitted 12 April, 2017; originally announced April 2017.

    Comments: To appear at Scandinavian Conference on Image Analysis (SCIA) 2017

  22. arXiv:1703.02610  [pdf, other

    cs.RO cs.AI

    Multi-Robot Active Information Gathering with Periodic Communication

    Authors: Mikko Lauri, Eero Heinänen, Simone Frintrop

    Abstract: A team of robots sharing a common goal can benefit from coordination of the activities of team members, hel** the team to reach the goal more reliably or quickly. We address the problem of coordinating the actions of a team of robots with periodic communication capability executing an information gathering task. We cast the problem as a multi-agent optimal decision-making problem with an informa… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: IEEE International Conference on Robotics and Automation (ICRA), 2017