Search | arXiv e-print repository

GP-net: Flexible Viewpoint Grasp Proposal

Authors: Anna Konrad, John McDonald, Rudi Villing

Abstract: We present the Grasp Proposal Network (GP-net), a Convolutional Neural Network model which can generate 6-DoF grasps from flexible viewpoints, e.g. as experienced by mobile manipulators. To train GP-net, we synthetically generate a dataset containing depth-images and ground-truth grasp information. In real-world experiments, we use the EGAD evaluation benchmark to evaluate GP-net against two commonly used algorithms, the Volumetric Grasping Network (VGN) and the Grasp Pose Detection package (GPD), on a PAL TIAGo mobile manipulator. In contrast to the state-of-the-art methods in robotic grasping, GP-net can be used for grasping objects from flexible, unknown viewpoints without the need to define the workspace and achieves a grasp success of 54.4% compared to 51.6% for VGN and 44.2% for GPD. We provide a ROS package along with our code and pre-trained models at https://aucoroboticsmu.github.io/GP-net/.

Submitted 12 October, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: Accepted to ICAR 2023

arXiv:2203.04874

doi 10.1109/IJCNN55064.2022.9892763

VGQ-CNN: Moving Beyond Fixed Cameras and Top-Grasps for Grasp Quality Prediction

Authors: A. Konrad, J. McDonald, R. Villing

Abstract: We present the Versatile Grasp Quality Convolutional Neural Network (VGQ-CNN), a grasp quality prediction network for 6-DOF grasps. VGQ-CNN can be used when evaluating grasps for objects seen from a wide range of camera poses or mobile robots without the need to retrain the network. By defining the grasp orientation explicitly as an input to the network, VGQ-CNN can evaluate 6-DOF grasp poses, moving beyond the 4-DOF grasps used in most image-based grasp evaluation methods like GQ-CNN. To train VGQ-CNN, we generate the new Versatile Grasp dataset (VG-dset) containing 6-DOF grasps observed from a wide range of camera poses. VGQ-CNN achieves a balanced accuracy of 82.1% on our test-split while generalising to a variety of camera poses. Meanwhile, it achieves competitive performance for overhead cameras and top-grasps with a balanced accuracy of 74.2% compared to GQ-CNN's 76.6%. We also propose a modified network architecture, FAST-VGQ-CNN, that speeds up inference using a shared encoder architecture and can make 128 grasp quality predictions in 12ms on a CPU. Code and data are available at https://aucoroboticsmu.github.io/vgq-cnn/.

Submitted 23 June, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

Comments: Accepted for International Joint Conference on Neural Networks (IJCNN) 2022

arXiv:2202.05199

doi 10.1109/TBME.2021.3130548

A Human-Centered Machine-Learning Approach for Muscle-Tendon Junction Tracking in Ultrasound Images

Authors: Christoph Leitner, Robert Jarolim, Bernhard Englmair, Annika Kruse, Karen Andrea Lara Hernandez, Andreas Konrad, Eric Su, Jörg Schröttner, Luke A. Kelly, Glen A. Lichtwark, Markus Tilp, Christian Baumgartner

Abstract: Biomechanical and clinical gait research observes muscles and tendons in limbs to study their functions and behaviour. Therefore, movements of distinct anatomical landmarks, such as muscle-tendon junctions, are frequently measured. We propose a reliable and time efficient machine-learning approach to track these junctions in ultrasound videos and support clinical biomechanists in gait analysis. In order to facilitate this process, a method based on deep-learning was introduced. We gathered an extensive dataset, covering 3 functional movements, 2 muscles, collected on 123 healthy and 38 impaired subjects with 3 different ultrasound systems, and providing a total of 66864 annotated ultrasound images in our network training. Furthermore, we used data collected across independent laboratories and curated by researchers with varying levels of experience. For the evaluation of our method a diverse test-set was selected that is independently verified by four specialists. We show that our model achieves similar performance scores to the four human specialists in identifying the muscle-tendon junction position. Our method provides time-efficient tracking of muscle-tendon junctions, with prediction times of up to 0.078 seconds per frame (approx. 100 times faster than manual labeling). All our codes, trained models and test-set were made publicly available and our model is provided as a free-to-use online service on https://deepmtj.org/.

Submitted 10 February, 2022; originally announced February 2022.

Comments: in IEEE Transactions on Biomedical Engineering

ACM Class: I.2.1

arXiv:2103.16476

doi 10.1371/journal.pone.0273708

Operations Research and Analytics to Combat Human Trafficking: A Systematic Review of Academic Literature

Authors: Geri L. Dimas, Renata A. Konrad, Kayse Lee Maass, Andrew C. Trapp

Abstract: Human trafficking is a widespread and compound social, economic, and human rights issue occurring in every region of the world. While there have been an increasing number of anti-human trafficking works from the Operations Research and Analytics domains in recent years, no systematic review of this literature currently exists. We fill this gap by providing a systematic literature review that identifies and classifies the body of Operations Research and Analytics research related to the anti-human trafficking domain, thereby illustrating the collective impact of the field to date. We classify 142 studies to identify current trends in methodologies, theoretical approaches, data sources, trafficking contexts, target regions, victim-survivor demographics, and focus within the well-established 4Ps principles. Using these findings, we discuss the extent to which the current literature aligns with the global demographics of human trafficking and identify existing research gaps to propose an agenda for Operations Research and Analytics researchers.

Submitted 11 May, 2022; v1 submitted 30 March, 2021; originally announced March 2021.

Comments: 34 pages, 10 Figures, 2 Tables

arXiv:2103.00191

FisheyeSuperPoint: Keypoint Detection and Description Network for Fisheye Images

Authors: Anna Konrad, Ciarán Eising, Ganesh Sistu, John McDonald, Rudi Villing, Senthil Yogamani

Abstract: Keypoint detection and description is a commonly used building block in computer vision systems particularly for robotics and autonomous driving. However, the majority of techniques to date have focused on standard cameras with little consideration given to fisheye cameras which are commonly used in urban driving and automated parking. In this paper, we propose a novel training and evaluation pipeline for fisheye images. We make use of SuperPoint as our baseline which is a self-supervised keypoint detector and descriptor that has achieved state-of-the-art results on homography estimation. We introduce a fisheye adaptation pipeline to enable training on undistorted fisheye images. We evaluate the performance on the HPatches benchmark, and, by introducing a fisheye based evaluation method for detection repeatability and descriptor matching correctness, on the Oxford RobotCar dataset.

Submitted 29 November, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

Comments: Accepted for Presentation at International Conference on Computer Vision Theory and Applications (VISAPP 2022)

arXiv:2005.02071

doi 10.1109/EMBC44109.2020.9176145

Automatic Tracking of the Muscle Tendon Junction in Healthy and Impaired Subjects using Deep Learning

Authors: Christoph Leitner, Robert Jarolim, Andreas Konrad, Annika Kruse, Markus Tilp, Jörg Schröttner, Christian Baumgartner

Abstract: Recording muscle tendon junction displacements during movement allows separate investigation of the muscle and tendon behaviour, respectively. In order to provide a fully-automatic tracking method, we employ a novel deep learning approach to detect the position of the muscle tendon junction in ultrasound images. We utilize the attention mechanism to enable the network to focus on relevant regions and to obtain a better interpretation of the results. Our data set consists of a large cohort of 79 healthy subjects and 28 subjects with movement limitations performing passive full range of motion and maximum contraction movements. Our trained network shows robust detection of the muscle tendon junction on a diverse data set of varying quality with a mean absolute error of 2.55 ± 1 mm. We show that our approach can be applied for various subjects and can be operated in real-time. The complete software package is available for open-source use via: https://github.com/luuleitner/deepMTJ

Submitted 5 May, 2020; originally announced May 2020.

Comments: Accepted version to be published in 2020, 42nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Montreal, Canada

arXiv:1809.07600

MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer

Authors: Gino Brunner, Andres Konrad, Yuyi Wang, Roger Wattenhofer

Abstract: We introduce MIDI-VAE, a neural network model based on Variational Autoencoders that is capable of handling polyphonic music with multiple instrument tracks, as well as modeling the dynamics of music by incorporating note durations and velocities. We show that MIDI-VAE can perform style transfer on symbolic music by automatically changing pitches, dynamics and instruments of a music piece from, e.g., a Classical to a Jazz style. We evaluate the efficacy of the style transfer by training separate style validation classifiers. Our model can also interpolate between short pieces of music, produce medleys and create mixtures of entire songs. The interpolations smoothly change pitches, dynamics and instrumentation to create a harmonic bridge between two music pieces. To the best of our knowledge, this work represents the first successful attempt at applying neural style transfer to complete musical compositions.

Submitted 20 September, 2018; originally announced September 2018.

Comments: Paper accepted at the 19th International Society for Music Information Retrieval Conference, ISMIR 2018, Paris, France

ACM Class: I.2.1; I.2.4; I.2.6; H.5.5

Showing 1–7 of 7 results for author: Konrad, A