Skip to main content

Showing 1–7 of 7 results for author: DiPietro, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2109.14563  [pdf, other

    cs.CV cs.LG

    Robust Temporal Ensembling for Learning with Noisy Labels

    Authors: Abel Brown, Benedikt Schifferer, Robert DiPietro

    Abstract: Successful training of deep neural networks with noisy labels is an essential capability as most real-world datasets contain some amount of mislabeled data. Left unmitigated, label noise can sharply degrade typical supervised learning approaches. In this paper, we present robust temporal ensembling (RTE), which combines robust loss with semi-supervised regularization methods to achieve noise-robus… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: includes additional baselines and hyperparam references

  2. arXiv:1907.08825  [pdf, other

    cs.CV

    Automated Surgical Activity Recognition with One Labeled Sequence

    Authors: Robert DiPietro, Gregory D. Hager

    Abstract: Prior work has demonstrated the feasibility of automated activity recognition in robot-assisted surgery from motion data. However, these efforts have assumed the availability of a large number of densely-annotated sequences, which must be provided manually by experts. This process is tedious, expensive, and error-prone. In this paper, we present the first analysis under the assumption of scarce an… ▽ More

    Submitted 20 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at MICCAI 2019

  3. arXiv:1806.03318  [pdf, other

    cs.CV

    Unsupervised Learning for Surgical Motion by Learning to Predict the Future

    Authors: Robert DiPietro, Gregory D. Hager

    Abstract: We show that it is possible to learn meaningful representations of surgical motion, without supervision, by learning to predict the future. An architecture that combines an RNN encoder-decoder and mixture density networks (MDNs) is developed to model the conditional distribution over future motion given past motion. We show that the learned encodings naturally cluster according to high-level activ… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: Accepted to MICCAI 2018

  4. arXiv:1708.01885  [pdf, other

    cs.CV

    Long Short-Term Memory Kalman Filters:Recurrent Neural Estimators for Pose Regularization

    Authors: Huseyin Coskun, Felix Achilles, Robert DiPietro, Nassir Navab, Federico Tombari

    Abstract: One-shot pose estimation for tasks such as body joint localization, camera pose estimation, and object tracking are generally noisy, and temporal filters have been extensively used for regularization. One of the most widely-used methods is the Kalman filter, which is both extremely simple and general. However, Kalman filters require a motion model and measurement model to be specified a priori, wh… ▽ More

    Submitted 6 August, 2017; originally announced August 2017.

    Comments: Accepted ICCV 2017

  5. arXiv:1702.07805  [pdf, other

    cs.NE

    Analyzing and Exploiting NARX Recurrent Neural Networks for Long-Term Dependencies

    Authors: Robert DiPietro, Christian Rupprecht, Nassir Navab, Gregory D. Hager

    Abstract: Recurrent neural networks (RNNs) have achieved state-of-the-art performance on many diverse tasks, from machine translation to surgical activity recognition, yet training RNNs to capture long-term dependencies remains difficult. To date, the vast majority of successful RNN architectures alleviate this problem using nearly-additive connections between states, as introduced by long short-term memory… ▽ More

    Submitted 20 April, 2018; v1 submitted 24 February, 2017; originally announced February 2017.

  6. arXiv:1612.00197  [pdf, other

    cs.CV

    Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses

    Authors: Christian Rupprecht, Iro Laina, Robert DiPietro, Maximilian Baust, Federico Tombari, Nassir Navab, Gregory D. Hager

    Abstract: Many prediction tasks contain uncertainty. In some cases, uncertainty is inherent in the task itself. In future prediction, for example, many distinct outcomes are equally valid. In other cases, uncertainty arises from the way data is labeled. For example, in object detection, many objects of interest often go unlabeled, and in human pose estimation, occluded joints are often labeled with ambiguou… ▽ More

    Submitted 22 August, 2017; v1 submitted 1 December, 2016; originally announced December 2016.

    Comments: ICCV 2017

  7. arXiv:1606.06329  [pdf, other

    cs.CV

    Recognizing Surgical Activities with Recurrent Neural Networks

    Authors: Robert DiPietro, Colin Lea, Anand Malpani, Narges Ahmidi, S. Swaroop Vedula, Gyusung I. Lee, Mija R. Lee, Gregory D. Hager

    Abstract: We apply recurrent neural networks to the task of recognizing surgical activities from robot kinematics. Prior work in this area focuses on recognizing short, low-level activities, or gestures, and has been based on variants of hidden Markov models and conditional random fields. In contrast, we work on recognizing both gestures and longer, higher-level activites, or maneuvers, and we model the map… ▽ More

    Submitted 22 June, 2016; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: Conditionally accepted at MICCAI 2016