Skip to main content

Showing 1–14 of 14 results for author: Milbich, T

.
  1. arXiv:2401.04079  [pdf, ps, other

    eess.IV cs.CV cs.LG

    RudolfV: A Foundation Model by Pathologists for Pathologists

    Authors: Jonas Dippel, Barbara Feulner, Tobias Winterhoff, Timo Milbich, Stephan Tietz, Simon Schallenberg, Gabriel Dernbach, Andreas Kunft, Simon Heinke, Marie-Lisa Eich, Julika Ribbat-Idel, Rosemarie Krupar, Philipp Anders, Niklas Prenißl, Philipp Jurmeister, David Horst, Lukas Ruff, Klaus-Robert Müller, Frederick Klauschen, Maximilian Alber

    Abstract: Artificial intelligence has started to transform histopathology impacting clinical diagnostics and biomedical research. However, while many computational pathology approaches have been proposed, most current AI models are limited with respect to generalization, application variety, and handling rare diseases. Recent efforts introduced self-supervised foundation models to address these challenges,… ▽ More

    Submitted 11 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  2. arXiv:2107.09562  [pdf, other

    cs.LG cs.CV

    Characterizing Generalization under Out-Of-Distribution Shifts in Deep Metric Learning

    Authors: Timo Milbich, Karsten Roth, Samarth Sinha, Ludwig Schmidt, Marzyeh Ghassemi, Björn Ommer

    Abstract: Deep Metric Learning (DML) aims to find representations suitable for zero-shot transfer to a priori unknown test distributions. However, common evaluation protocols only test a single, fixed data split in which train and test classes are assigned randomly. More realistic evaluations should consider a broad spectrum of distribution shifts with potentially varying degree and difficulty. In this work… ▽ More

    Submitted 29 November, 2021; v1 submitted 20 July, 2021; originally announced July 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  3. arXiv:2107.02790  [pdf, other

    cs.CV

    iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

    Authors: Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer

    Abstract: How would a static scene react to a local poke? What are the effects on other parts of an object if you could locally push it? There will be distinctive movement, despite evident variations caused by the stochastic nature of our world. These outcomes are governed by the characteristic kinematics of objects that dictate their overall motion caused by a local interaction. Conversely, the movement of… ▽ More

    Submitted 6 October, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

    Comments: ICCV 2021, Project page is available at https://bit.ly/3dJN4Lf

  4. arXiv:2106.11303  [pdf, other

    cs.CV

    Understanding Object Dynamics for Interactive Image-to-Video Synthesis

    Authors: Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer

    Abstract: What would be the effect of locally poking a static scene? We present an approach that learns naturally-looking global articulations caused by a local manipulation at a pixel level. Training requires only videos of moving objects but no information of the underlying manipulation of the physical scene. Our generative model learns to infer natural object dynamics as a response to user interaction an… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: CVPR 2021, project page available at https://bit.ly/3cxfA2L

  5. arXiv:2105.04551  [pdf, other

    cs.CV

    Stochastic Image-to-Video Synthesis using cINNs

    Authors: Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer

    Abstract: Video understanding calls for a model to learn the characteristic interplay between static scene content and its dynamics: Given an image, the model must be able to predict a future progression of the portrayed scene and, conversely, a video should be explained in terms of its static image content and all the remaining characteristics not present in the initial frame. This naturally suggests a bij… ▽ More

    Submitted 17 June, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Accepted to CVPR 2021

  6. arXiv:2103.04677  [pdf, other

    cs.CV

    Behavior-Driven Synthesis of Human Dynamics

    Authors: Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer

    Abstract: Generating and representing human behavior are of major importance for various computer vision applications. Commonly, human video synthesis represents behavior as sequences of postures while directly predicting their likely progressions or merely changing the appearance of the depicted persons, thus not being able to exercise control over their actual behavior during the synthesis process. In con… ▽ More

    Submitted 22 April, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR 2021 as Poster

  7. arXiv:2009.08348  [pdf, other

    cs.CV

    S2SD: Simultaneous Similarity-based Self-Distillation for Deep Metric Learning

    Authors: Karsten Roth, Timo Milbich, Björn Ommer, Joseph Paul Cohen, Marzyeh Ghassemi

    Abstract: Deep Metric Learning (DML) provides a crucial tool for visual similarity and zero-shot applications by learning generalizing embedding spaces, although recent work in DML has shown strong performance saturation across training objectives. However, generalization capacity is known to scale with the embedding space dimensionality. Unfortunately, high dimensional embeddings also create higher retriev… ▽ More

    Submitted 4 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted to ICML2021

  8. arXiv:2004.13458  [pdf, other

    cs.CV

    DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning

    Authors: Timo Milbich, Karsten Roth, Homanga Bharadhwaj, Samarth Sinha, Yoshua Bengio, Björn Ommer, Joseph Paul Cohen

    Abstract: Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown test classes. However, its prevailing learning paradigm is class-discriminative supervised training, w… ▽ More

    Submitted 10 September, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: published at ECCV 2020

  9. Sharing Matters for Generalization in Deep Metric Learning

    Authors: Timo Milbich, Karsten Roth, Biagio Brattoli, Björn Ommer

    Abstract: Learning the similarity between images constitutes the foundation for numerous vision tasks. The common paradigm is discriminative metric learning, which seeks an embedding that separates different training classes. However, the main challenge is to learn a metric that not only generalizes from training to novel, but related, test samples. It should also transfer to different object classes. So wh… ▽ More

    Submitted 9 September, 2021; v1 submitted 12 April, 2020; originally announced April 2020.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence

  10. arXiv:2003.11113  [pdf, other

    cs.CV

    PADS: Policy-Adapted Sampling for Visual Similarity Learning

    Authors: Karsten Roth, Timo Milbich, Björn Ommer

    Abstract: Learning visual similarity requires to learn relations, typically between triplets of images. Albeit triplet approaches being powerful, their computational complexity mostly limits training to only a subset of all possible training triplets. Thus, sampling strategies that decide when to use which training sample during learning are crucial. Currently, the prominent paradigm are fixed or curriculum… ▽ More

    Submitted 28 March, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR2020

  11. arXiv:2002.08473  [pdf, other

    cs.CV

    Revisiting Training Strategies and Generalization Performance in Deep Metric Learning

    Authors: Karsten Roth, Timo Milbich, Samarth Sinha, Prateek Gupta, Björn Ommer, Joseph Paul Cohen

    Abstract: Deep Metric Learning (DML) is arguably one of the most influential lines of research for learning visual similarities with many proposed approaches every year. Although the field benefits from the rapid progress, the divergence in training protocols, architectures, and parameter choices make an unbiased comparison difficult. To provide a consistent reference point, we revisit the most widely used… ▽ More

    Submitted 1 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: ICML 2020. Main paper 8.25 pages, 26 pages total

  12. arXiv:1911.07808  [pdf, other

    cs.CV

    Unsupervised Representation Learning by Discovering Reliable Image Relations

    Authors: Timo Milbich, Omair Ghori, Ferran Diego, Björn Ommer

    Abstract: Learning robust representations that allow to reliably establish relations between images is of paramount importance for virtually all of computer vision. Annotating the quadratic number of pairwise relations between training images is simply not feasible, while unsupervised inference is prone to noise, thus leaving the vast majority of these relations to be unreliable. To nevertheless find those… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: Accepted for Publication in 'Pattern Recognition Journal'

  13. arXiv:1903.06946  [pdf, other

    cs.CV

    Unsupervised Part-Based Disentangling of Object Shape and Appearance

    Authors: Dominik Lorenz, Leonard Bereska, Timo Milbich, Björn Ommer

    Abstract: Large intra-class variation is the result of changes in multiple object characteristics. Images, however, only show the superposition of different variable factors such as appearance or shape. Therefore, learning to disentangle and represent these different characteristics poses a great challenge, especially in the unsupervised case. Moreover, large object articulation calls for a flexible part-ba… ▽ More

    Submitted 17 June, 2019; v1 submitted 16 March, 2019; originally announced March 2019.

    Comments: CVPR 2019 Oral

  14. arXiv:1708.01191  [pdf, other

    cs.CV

    Unsupervised Video Understanding by Reconciliation of Posture Similarities

    Authors: Timo Milbich, Miguel Bautista, Ekaterina Sutter, Bjorn Ommer

    Abstract: Understanding human activity and being able to explain it in detail surpasses mere action classification by far in both complexity and value. The challenge is thus to describe an activity on the basis of its most fundamental constituents, the individual postures and their distinctive transitions. Supervised learning of such a fine-grained representation based on elementary poses is very tedious an… ▽ More

    Submitted 3 August, 2017; originally announced August 2017.

    Comments: Accepted by ICCV 2017