Skip to main content

Showing 1–50 of 51 results for author: Pock, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18087  [pdf, other

    cs.CV

    FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms

    Authors: Lea Bogensperger, Dominik Narnhofer, Alexander Falk, Konrad Schindler, Thomas Pock

    Abstract: Medical image segmentation is a crucial task that relies on the ability to accurately identify and isolate regions of interest in medical images. Thereby, generative approaches allow to capture the statistical properties of segmentation masks that are dependent on the respective structures. In this work we propose FlowSDF, an image-guided conditional flow matching framework to represent the signed… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2403.12710  [pdf, other

    cs.CV cs.LG

    Selective, Interpretable, and Motion Consistent Privacy Attribute Obfuscation for Action Recognition

    Authors: Filip Ilic, He Zhao, Thomas Pock, Richard P. Wildes

    Abstract: Concerns for the privacy of individuals captured in public imagery have led to privacy-preserving action recognition. Existing approaches often suffer from issues arising through obfuscation being applied globally and a lack of interpretability. Global obfuscation hides privacy sensitive regions, but also contextual regions important for action recognition. Lack of interpretability erodes trust in… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  3. arXiv:2311.08199  [pdf, other

    eess.IV cs.CV cs.LG

    Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale

    Authors: Robert Harb, Thomas Pock, Heimo Müller

    Abstract: We present a novel diffusion-based approach to generate synthetic histopathological Whole Slide Images (WSIs) at an unprecedented gigapixel scale. Synthetic WSIs have many potential applications: They can augment training datasets to enhance the performance of many computational pathology applications. They allow the creation of synthesized copies of datasets that can be shared without violating p… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    ACM Class: I.4.9; I.5.4; I.2.10

  4. arXiv:2306.16854  [pdf, other

    cs.LG

    On the Relationship Between RNN Hidden State Vectors and Semantic Ground Truth

    Authors: Edi Muškardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock

    Abstract: We examine the assumption that the hidden-state vectors of recurrent neural networks (RNNs) tend to form clusters of semantically similar vectors, which we dub the clustering hypothesis. While this hypothesis has been assumed in the analysis of RNNs in recent years, its validity has not been studied thoroughly on modern neural network architectures. We examine the clustering hypothesis in the cont… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  5. arXiv:2305.15988  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Non-Log-Concave and Nonsmooth Sampling via Langevin Monte Carlo Algorithms

    Authors: Tim Tsz-Kit Lau, Han Liu, Thomas Pock

    Abstract: We study the problem of approximate sampling from non-log-concave distributions, e.g., Gaussian mixtures, which is often challenging even in low dimensions due to their multimodality. We focus on performing this task via Markov chain Monte Carlo (MCMC) methods derived from discretizations of the overdamped Langevin diffusions, which are commonly known as Langevin Monte Carlo algorithms. Furthermor… ▽ More

    Submitted 29 May, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  6. arXiv:2303.05966  [pdf, other

    cs.CV

    Score-Based Generative Models for Medical Image Segmentation using Signed Distance Functions

    Authors: Lea Bogensperger, Dominik Narnhofer, Filip Ilic, Thomas Pock

    Abstract: Medical image segmentation is a crucial task that relies on the ability to accurately identify and isolate regions of interest in medical images. Thereby, generative approaches allow to capture the statistical properties of segmentation masks that are dependent on the respective structures. In this work we propose a conditional score-based generative modeling framework to represent the signed dist… ▽ More

    Submitted 21 July, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  7. arXiv:2302.10502  [pdf, other

    cs.LG cs.CV

    Learning Gradually Non-convex Image Priors Using Score Matching

    Authors: Erich Kobler, Thomas Pock

    Abstract: In this paper, we propose a unified framework of denoising score-based models in the context of graduated non-convex energy minimization. We show that for sufficiently large noise variance, the associated negative log density -- the energy -- becomes convex. Consequently, denoising score-based models essentially follow a graduated non-convexity heuristic. We apply this framework to learning genera… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 13 pages, 3 figures

    ACM Class: I.2.6; I.4.10

  8. Explicit Diffusion of Gaussian Mixture Model Based Image Priors

    Authors: Martin Zach, Thomas Pock, Erich Kobler, Antonin Chambolle

    Abstract: In this work we tackle the problem of estimating the density $f_X$ of a random variable $X$ by successive smoothing, such that the smoothed random variable $Y$ fulfills $(\partial_t - Δ_1)f_Y(\,\cdot\,, t) = 0$, $f_Y(\,\cdot\,, 0) = f_X$. With a focus on image processing, we propose a product/fields of experts model with Gaussian mixture experts that admits an analytic expression for… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Journal ref: Scale Space and Variational Methods in Computer Vision (2023). Lecture Notes in Computer Science, vol 14009. 3-15

  9. arXiv:2212.12499  [pdf, other

    cs.CV math.PR

    Posterior-Variance-Based Error Quantification for Inverse Problems in Imaging

    Authors: Dominik Narnhofer, Andreas Habring, Martin Holler, Thomas Pock

    Abstract: In this work, a method for obtaining pixel-wise error bounds in Bayesian regularization of inverse imaging problems is introduced. The proposed method employs estimates of the posterior variance together with techniques from conformal prediction in order to obtain coverage guarantees for the error bounds, without making any assumption on the underlying data distribution. It is generally applicable… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    MSC Class: 68U10; 62F15; 65C40; 65C60; 65J22

  10. arXiv:2210.13834  [pdf, other

    eess.IV cs.CV cs.LG

    Stable Deep MRI Reconstruction using Generative Priors

    Authors: Martin Zach, Florian Knoll, Thomas Pock

    Abstract: Data-driven approaches recently achieved remarkable success in magnetic resonance imaging (MRI) reconstruction, but integration into clinical routine remains challenging due to a lack of generalizability and interpretability. In this paper, we address these challenges in a unified framework based on generative image priors. We propose a novel deep neural network based regularizer which is trained… ▽ More

    Submitted 15 June, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  11. arXiv:2207.06261  [pdf, other

    cs.CV cs.LG

    Is Appearance Free Action Recognition Possible?

    Authors: Filip Ilic, Thomas Pock, Richard P. Wildes

    Abstract: Intuition might suggest that motion and dynamic information are key to video-based action recognition. In contrast, there is evidence that state-of-the-art deep-learning video understanding architectures are biased toward static information available in single frames. Presently, a methodology and corresponding dataset to isolate the effects of dynamic information in video are missing. Their absenc… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

  12. arXiv:2203.12658  [pdf, other

    eess.IV cs.CV cs.LG

    Computed Tomography Reconstruction using Generative Energy-Based Priors

    Authors: Martin Zach, Erich Kobler, Thomas Pock

    Abstract: In the past decades, Computed Tomography (CT) has established itself as one of the most important imaging techniques in medicine. Today, the applicability of CT is only limited by the deposited radiation dose, reduction of which manifests in noisy or incomplete measurements. Thus, the need for robust reconstruction algorithms arises. In this work, we learn a parametric regularizer with a global re… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  13. arXiv:2102.10863  [pdf, other

    cs.LG cs.AI

    Learning atrial fiber orientations and conductivity tensors from intracardiac maps using physics-informed neural networks

    Authors: Thomas Grandits, Simone Pezzuto, Francisco Sahli Costabal, Paris Perdikaris, Thomas Pock, Gernot Plank, Rolf Krause

    Abstract: Electroanatomical maps are a key tool in the diagnosis and treatment of atrial fibrillation. Current approaches focus on the activation times recorded. However, more information can be extracted from the available data. The fibers in cardiac tissue conduct the electrical wave faster, and their direction could be inferred from activation times. In this work, we employ a recently developed approach,… ▽ More

    Submitted 6 May, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: 10 pages, 3 figures

  14. arXiv:2102.06665  [pdf, other

    eess.IV cs.CV cs.LG

    Bayesian Uncertainty Estimation of Learned Variational MRI Reconstruction

    Authors: Dominik Narnhofer, Alexander Effland, Erich Kobler, Kerstin Hammernik, Florian Knoll, Thomas Pock

    Abstract: Recent deep learning approaches focus on improving quantitative scores of dedicated benchmarks, and therefore only reduce the observation-related (aleatoric) uncertainty. However, the model-immanent (epistemic) uncertainty is less frequently systematically analyzed. In this work, we introduce a Bayesian variational framework to quantify the epistemic uncertainty. To this end, we solve the linear i… ▽ More

    Submitted 22 October, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: 19 pages, 11 figures

    MSC Class: 68T45; 65K10; 65D19

  15. arXiv:2011.06539  [pdf, other

    cs.CV cs.LG eess.IV math.NA math.OC

    Shared Prior Learning of Energy-Based Models for Image Reconstruction

    Authors: Thomas Pinetz, Erich Kobler, Thomas Pock, Alexander Effland

    Abstract: We propose a novel learning-based framework for image reconstruction particularly designed for training without ground truth data, which has three major building blocks: energy-based learning, a patch-based Wasserstein loss functional, and shared prior learning. In energy-based learning, the parameters of an energy functional composed of a learned data fidelity term and a data-driven regularizer a… ▽ More

    Submitted 13 November, 2020; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 37 pages, 19 figures

    MSC Class: 49J15; 65C30; 65K10; 65L09; 68U10

  16. arXiv:2010.12436  [pdf, other

    cs.CV

    BP-MVSNet: Belief-Propagation-Layers for Multi-View-Stereo

    Authors: Christian Sormann, Patrick Knöbelreiter, Andreas Kuhn, Mattia Rossi, Thomas Pock, Friedrich Fraundorfer

    Abstract: In this work, we propose BP-MVSNet, a convolutional neural network (CNN)-based Multi-View-Stereo (MVS) method that uses a differentiable Conditional Random Field (CRF) layer for regularization. To this end, we propose to extend the BP layer and add what is necessary to successfully use it in the MVS setting. We therefore show how we can calculate a normalization based on the expected 3D error, whi… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: accepted at 3DV 2020

  17. arXiv:2006.08789  [pdf, other

    cs.CV math.NA math.OC

    Total Deep Variation: A Stable Regularizer for Inverse Problems

    Authors: Erich Kobler, Alexander Effland, Karl Kunisch, Thomas Pock

    Abstract: Various problems in computer vision and medical imaging can be cast as inverse problems. A frequent method for solving inverse problems is the variational approach, which amounts to minimizing an energy composed of a data fidelity term and a regularizer. Classically, handcrafted regularizers are used, which are commonly outperformed by state-of-the-art deep learning approaches. In this work, we co… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 30 pages, 12 figures. arXiv admin note: text overlap with arXiv:2001.05005

  18. arXiv:2003.06258  [pdf, other

    cs.CV cs.LG

    Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems

    Authors: Patrick Knöbelreiter, Christian Sormann, Alexander Shekhovtsov, Friedrich Fraundorfer, Thomas Pock

    Abstract: It has been proposed by many researchers that combining deep neural networks with graphical models can create more efficient and better regularized composite models. The main difficulties in implementing this in practice are associated with a discrepancy in suitable learning objectives as well as with the necessity of approximations for the inference. In this work we take one of the simplest infer… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  19. arXiv:2001.05005  [pdf, other

    math.OC cs.CV cs.LG

    Total Deep Variation for Linear Inverse Problems

    Authors: Erich Kobler, Alexander Effland, Karl Kunisch, Thomas Pock

    Abstract: Diverse inverse problems in imaging can be cast as variational problems composed of a task-specific data fidelity term and a regularization term. In this paper, we propose a novel learnable general-purpose regularizer exploiting recent architectural design patterns from deep learning. We cast the learning problem as a discrete sampled optimal control problem, for which we derive the adjoint state… ▽ More

    Submitted 17 February, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 21 pages, 10 figures

    MSC Class: 68T45; 93A30; 34H05; 49K15; 65L05

  20. arXiv:1912.10739  [pdf, other

    cs.CV

    Improving Optical Flow on a Pyramid Level

    Authors: Markus Hofinger, Samuel Rota Bulò, Lorenzo Porzi, Arno Knapitsch, Thomas Pock, Peter Kontschieder

    Abstract: In this work we review the coarse-to-fine spatial feature pyramid concept, which is used in state-of-the-art optical flow estimation networks to make exploration of the pixel flow search space computationally tractable and efficient. Within an individual pyramid level, we improve the cost volume construction process by departing from a war**- to a sampling-based strategy, which avoids ghosting a… ▽ More

    Submitted 18 July, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

  21. arXiv:1910.00888  [pdf, other

    cs.LG stat.ML

    On the estimation of the Wasserstein distance in generative models

    Authors: Thomas Pinetz, Daniel Soukup, Thomas Pock

    Abstract: Generative Adversarial Networks (GANs) have been used to model the underlying probability distribution of sample based datasets. GANs are notoriuos for training difficulties and their dependence on arbitrary hyperparameters. One recent improvement in GAN literature is to use the Wasserstein distance as loss function leading to Wasserstein Generative Adversarial Networks (WGANs). Using this as a ba… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: Accepted and presented at GCPR 2019 (http://gcpr2019.tu-dortmund.de/)

  22. arXiv:1907.13391  [pdf, other

    cs.CV

    Learned Collaborative Stereo Refinement

    Authors: Patrick Knöbelreiter, Thomas Pock

    Abstract: In this work, we propose a learning-based method to denoise and refine disparity maps of a given stereo method. The proposed variational network arises naturally from unrolling the iterates of a proximal gradient method applied to a variational energy defined in a joint disparity, color, and confidence image space. Our method allows to learn a robust collaborative regularizer leveraging the joint… ▽ More

    Submitted 31 July, 2019; originally announced July 2019.

    Comments: @German Conference on Pattern Recognition 2019

  23. arXiv:1907.12446  [pdf, other

    cs.CV

    Self-Supervised Learning for Stereo Reconstruction on Aerial Images

    Authors: Patrick Knöbelreiter, Christoph Vogel, Thomas Pock

    Abstract: Recent developments established deep learning as an inevitable tool to boost the performance of dense matching and stereo estimation. On the downside, learning these networks requires a substantial amount of training data to be successful. Consequently, the application of these models outside of the laboratory is far from straight forward. In this work we propose a self-supervised training procedu… ▽ More

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: Symposium Prize Paper Award @IGARSS 2018

  24. arXiv:1907.08488  [pdf, other

    math.OC cs.LG eess.IV

    An Optimal Control Approach to Early Stop** Variational Methods for Image Restoration

    Authors: Alexander Effland, Erich Kobler, Karl Kunisch, Thomas Pock

    Abstract: We investigate a well-known phenomenon of variational approaches in image processing, where typically the best image quality is achieved when the gradient flow process is stopped before converging to a stationary point. This paradox originates from a tradeoff between optimization and modelling errors of the underlying variational model and holds true even if deep learning methods are used to learn… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 14 figures

    MSC Class: 68T45; 93A30; 34H05; 49K15; 65L05

  25. arXiv:1905.11327  [pdf, ps, other

    cs.LG stat.ML

    Fast Decomposable Submodular Function Minimization using Constrained Total Variation

    Authors: K S Sesh Kumar, Francis Bach, Thomas Pock

    Abstract: We consider the problem of minimizing the sum of submodular set functions assuming minimization oracles of each summand function. Most existing approaches reformulate the problem as the convex minimization of the sum of the corresponding Lovász extensions and the squared Euclidean norm, leading to algorithms requiring total variation oracles of the summand functions; without further assumptions, t… ▽ More

    Submitted 27 May, 2019; originally announced May 2019.

  26. arXiv:1904.03537  [pdf, other

    math.OC cs.CV cs.LG math.NA

    Convex-Concave Backtracking for Inertial Bregman Proximal Gradient Algorithms in Non-Convex Optimization

    Authors: Mahesh Chandra Mukkamala, Peter Ochs, Thomas Pock, Shoham Sabach

    Abstract: Backtracking line-search is an old yet powerful strategy for finding a better step sizes to be used in proximal gradient algorithms. The main principle is to locally find a simple convex upper bound of the objective function, which in turn controls the step size that is used. In case of inertial proximal gradient algorithms, the situation becomes much more difficult and usually leads to very restr… ▽ More

    Submitted 5 November, 2019; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: 29 pages

    MSC Class: 90C25; 26B25; 49M27; 52A41; 65K05

  27. arXiv:1904.01112  [pdf, other

    eess.SP cs.CV cs.LG eess.IV

    Deep Learning Methods for Parallel Magnetic Resonance Image Reconstruction

    Authors: Florian Knoll, Kerstin Hammernik, Chi Zhang, Steen Moeller, Thomas Pock, Daniel K. Sodickson, Mehmet Akcakaya

    Abstract: Following the success of deep learning in a wide range of applications, neural network-based machine learning techniques have received interest as a means of accelerating magnetic resonance imaging (MRI). A number of ideas inspired by deep learning techniques from computer vision and image processing have been successfully applied to non-linear image reconstruction in the spirit of compressed sens… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: 14 pages, 7 figures

  28. arXiv:1811.03721  [pdf, other

    cs.CV

    Learning Energy Based Inpainting for Optical Flow

    Authors: Christoph Vogel, Patrick Knöbelreiter, Thomas Pock

    Abstract: Modern optical flow methods are often composed of a cascade of many independent steps or formulated as a black box neural network that is hard to interpret and analyze. In this work we seek for a plain, interpretable, but learnable solution. We propose a novel inpainting based algorithm that approaches the problem in three steps: feature selection and matching, selection of supporting points and e… ▽ More

    Submitted 8 November, 2018; originally announced November 2018.

    Journal ref: Proc. Asian Conf. on Computer Vision (ACCV), 2018

  29. arXiv:1804.03037  [pdf, other

    cs.CV physics.flu-dyn

    3D Fluid Flow Estimation with Integrated Particle Reconstruction

    Authors: Katrin Lasinger, Christoph Vogel, Thomas Pock, Konrad Schindler

    Abstract: The standard approach to densely reconstruct the motion in a volume of fluid is to inject high-contrast tracer particles and record their motion with multiple high-speed cameras. Almost all existing work processes the acquired multi-view video in two separate steps, utilizing either a pure Eulerian or pure Lagrangian approach. Eulerian methods perform a voxel-based reconstruction of particles per… ▽ More

    Submitted 21 November, 2019; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: To appear in International Journal of Computer Vision (IJCV)

  30. arXiv:1804.02872  [pdf, other

    cs.CV physics.flu-dyn

    Variational 3D-PIV with Sparse Descriptors

    Authors: Katrin Lasinger, Christoph Vogel, Thomas Pock, Konrad Schindler

    Abstract: 3D Particle Imaging Velocimetry (3D-PIV) aim to recover the flow field in a volume of fluid, which has been seeded with tracer particles and observed from multiple camera viewpoints. The first step of 3D-PIV is to reconstruct the 3D locations of the tracer particles from synchronous views of the volume. We propose a new method for iterative particle reconstruction (IPR), in which the locations and… ▽ More

    Submitted 9 April, 2018; originally announced April 2018.

    Comments: to be published in Measurement Science and Technology

  31. arXiv:1802.04546  [pdf, other

    cs.CV

    Robust Deformation Estimation in Wood-Composite Materials using Variational Optical Flow

    Authors: Markus Hofinger, Thomas Pock, Thomas Moosbrugger

    Abstract: Wood-composite materials are widely used today as they homogenize humidity related directional deformations. Quantification of these deformations as coefficients is important for construction and engineering and topic of current research but still a manual process. This work introduces a novel computer vision approach that automatically extracts these properties directly from scans of the wooden… ▽ More

    Submitted 13 February, 2018; originally announced February 2018.

    Comments: 8 pages, 8 figures, originally published in 23 rd Computer Vision Winter Workshop proceedings 2018 http://cmp.felk.cvut.cz/cvww2018/papers/28.pdf

    Journal ref: 23rd Computer Vision Winter Workshop proceedings February 2018 page 97-104

  32. arXiv:1710.01749  [pdf, other

    cs.CV

    Semantic 3D Reconstruction with Finite Element Bases

    Authors: Audrey Richard, Christoph Vogel, Maros Blaha, Thomas Pock, Konrad Schindler

    Abstract: We propose a novel framework for the discretisation of multi-label problems on arbitrary, continuous domains. Our work bridges the gap between general FEM discretisations, and labeling problems that arise in a variety of computer vision tasks, including for instance those derived from the generalised Potts model. Starting from the popular formulation of labeling as a convex relaxation by functiona… ▽ More

    Submitted 4 October, 2017; originally announced October 2017.

    Journal ref: BMVC 2017, 28th British Machine Vision Conference

  33. arXiv:1707.06427  [pdf, other

    cs.CV

    Scalable Full Flow with Learned Binary Descriptors

    Authors: Gottfried Munda, Alexander Shekhovtsov, Patrick Knöbelreiter, Thomas Pock

    Abstract: We propose a method for large displacement optical flow in which local matching costs are learned by a convolutional neural network (CNN) and a smoothness prior is imposed by a conditional random field (CRF). We tackle the computation- and memory-intensive operations on the 4D cost volume by a min-projection which reduces memory complexity from quadratic to linear and binary descriptors for effici… ▽ More

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: GCPR 2017

  34. arXiv:1704.00447  [pdf, other

    cs.CV

    Learning a Variational Network for Reconstruction of Accelerated MRI Data

    Authors: Kerstin Hammernik, Teresa Klatzer, Erich Kobler, Michael P Recht, Daniel K Sodickson, Thomas Pock, Florian Knoll

    Abstract: Purpose: To allow fast and high-quality reconstruction of clinical accelerated multi-coil MR data by learning a variational network that combines the mathematical structure of variational models with deep learning. Theory and Methods: Generalized compressed sensing reconstruction formulated as a variational model is embedded in an unrolled gradient descent scheme. All parameters of this formulat… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

    Comments: Submitted to Magnetic Resonance in Medicine

  35. arXiv:1703.05161  [pdf, other

    cs.CV

    Real-Time Panoramic Tracking for Event Cameras

    Authors: Christian Reinbacher, Gottfried Munda, Thomas Pock

    Abstract: Event cameras are a paradigm shift in camera technology. Instead of full frames, the sensor captures a sparse set of events caused by intensity changes. Since only the changes are transferred, those cameras are able to capture quick movements of objects in the scene or of the camera itself. In this work we propose a novel method to perform camera tracking of event cameras in a panoramic setting wi… ▽ More

    Submitted 21 March, 2017; v1 submitted 15 March, 2017; originally announced March 2017.

    Comments: Accepted to International Conference on Computational Photography 2017

  36. arXiv:1611.10229  [pdf, other

    cs.CV

    End-to-End Training of Hybrid CNN-CRF Models for Stereo

    Authors: Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock

    Abstract: We propose a novel and principled hybrid CNN+CRF model for stereo estimation. Our model allows to exploit the advantages of both, convolutional neural networks (CNNs) and conditional random fields (CRFs) in an unified approach. The CNNs compute expressive features for matching and distinctive color edges, which in turn are used to compute the unary and binary costs of the CRF. For inference, we ap… ▽ More

    Submitted 3 May, 2017; v1 submitted 30 November, 2016; originally announced November 2016.

    Comments: To appear at CVPR 2017

  37. arXiv:1607.06283  [pdf, other

    cs.CV

    Real-Time Intensity-Image Reconstruction for Event Cameras Using Manifold Regularisation

    Authors: Christian Reinbacher, Gottfried Graber, Thomas Pock

    Abstract: Event cameras or neuromorphic cameras mimic the human perception system as they measure the per-pixel intensity change rather than the actual intensity level. In contrast to traditional cameras, such cameras capture new information about the scene at MHz frequency in the form of sparse events. The high temporal resolution comes at the cost of losing the familiar per-pixel intensity information. In… ▽ More

    Submitted 4 August, 2016; v1 submitted 21 July, 2016; originally announced July 2016.

    Comments: Accepted to BMVC 2016 as oral presentation, 12 pages

  38. arXiv:1601.06274  [pdf, other

    cs.CV

    Solving Dense Image Matching in Real-Time using Discrete-Continuous Optimization

    Authors: Alexander Shekhovtsov, Christian Reinbacher, Gottfried Graber, Thomas Pock

    Abstract: Dense image matching is a fundamental low-level problem in Computer Vision, which has received tremendous attention from both discrete and continuous optimization communities. The goal of this paper is to combine the advantages of discrete and continuous optimization in a coherent framework. We devise a model based on energy minimization, to be optimized by both discrete and continuous algorithms… ▽ More

    Submitted 23 January, 2016; originally announced January 2016.

    Comments: 21 st Computer Vision Winter Workshop

  39. Acceleration of the PDHGM on strongly convex subspaces

    Authors: Tuomo Valkonen, Thomas Pock

    Abstract: We propose several variants of the primal-dual method due to Chambolle and Pock. Without requiring full strong convexity of the objective functions, our methods are accelerated on subspaces with strong convexity. This yields mixed rates, $O(1/N^2)$ with respect to initialisation and $O(1/N)$ with respect to the dual sequence, and the residual part of the primal sequence. We demonstrate the efficac… ▽ More

    Submitted 10 February, 2016; v1 submitted 20 November, 2015; originally announced November 2015.

    MSC Class: 90C25; 49M29; 94A08

  40. Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration

    Authors: Yun** Chen, Thomas Pock

    Abstract: Image restoration is a long-standing problem in low-level computer vision with many interesting applications. We describe a flexible learning framework based on the concept of nonlinear reaction diffusion models for various image restoration problems. By embodying recent improvements in nonlinear diffusion models, we propose a dynamic nonlinear reaction diffusion model with time-dependent paramete… ▽ More

    Submitted 20 August, 2016; v1 submitted 12 August, 2015; originally announced August 2015.

    Comments: 14 pages, 13 figures, to appear in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  41. arXiv:1503.05768  [pdf, other

    cs.CV

    On learning optimized reaction diffusion processes for effective image restoration

    Authors: Yun** Chen, Wei Yu, Thomas Pock

    Abstract: For several decades, image restoration remains an active research topic in low-level computer vision and hence new approaches are constantly emerging. However, many recently proposed algorithms achieve state-of-the-art performance only at the expense of very high computation time, which clearly limits their practical relevance. In this work, we propose a simple but effective approach with both hig… ▽ More

    Submitted 25 March, 2015; v1 submitted 19 March, 2015; originally announced March 2015.

    Comments: 9 pages, 3 figures, 3 tables. CVPR2015 oral presentation together with the supplemental material of 13 pages, 8 pages (Notes on diffusion networks)

  42. arXiv:1502.07770  [pdf, other

    cs.CV

    Total variation on a tree

    Authors: Vladimir Kolmogorov, Thomas Pock, Michal Rolinek

    Abstract: We consider the problem of minimizing the continuous valued total variation subject to different unary terms on trees and propose fast direct algorithms based on dynamic programming to solve these problems. We treat both the convex and the non-convex case and derive worst case complexities that are equal or better than existing methods. We show applications to total variation based 2D image proces… ▽ More

    Submitted 25 April, 2016; v1 submitted 26 February, 2015; originally announced February 2015.

    Comments: accepted to SIAM Journal on Imaging Sciences (SIIMS)

  43. A higher-order MRF based variational model for multiplicative noise reduction

    Authors: Yun** Chen, Wensen Feng, René Ranftl, Hong Qiao, Thomas Pock

    Abstract: The Fields of Experts (FoE) image prior model, a filter-based higher-order Markov Random Fields (MRF) model, has been shown to be effective for many image restoration problems. Motivated by the successes of FoE-based approaches, in this letter, we propose a novel variational model for multiplicative noise reduction based on the FoE image prior model. The resulted model corresponds to a non-convex… ▽ More

    Submitted 7 July, 2014; v1 submitted 21 April, 2014; originally announced April 2014.

    Comments: 5 pages, 5 figures, to appear in IEEE Signal Processing Letters

  44. arXiv:1404.4805  [pdf, other

    cs.CV math.OC

    iPiano: Inertial Proximal Algorithm for Non-Convex Optimization

    Authors: Peter Ochs, Yun** Chen, Thomas Brox, Thomas Pock

    Abstract: In this paper we study an algorithm for solving a minimization problem composed of a differentiable (possibly non-convex) and a convex (possibly non-differentiable) function. The algorithm iPiano combines forward-backward splitting with an inertial force. It can be seen as a non-smooth split version of the Heavy-ball method from Polyak. A rigorous analysis of the algorithm for the proposed class o… ▽ More

    Submitted 18 April, 2014; originally announced April 2014.

    Comments: 32pages, 7 figures, to appear in SIAM Journal on Imaging Sciences

  45. arXiv:1403.3522  [pdf, other

    cs.CV math.NA math.OC

    An inertial forward-backward algorithm for monotone inclusions

    Authors: Dirk A. Lorenz, Thomas Pock

    Abstract: In this paper, we propose an inertial forward backward splitting algorithm to compute a zero of the sum of two monotone operators, with one of the two operators being co-coercive. The algorithm is inspired by the accelerated gradient method of Nesterov, but can be applied to a much larger class of problems including convex-concave saddle point problems and general monotone inclusions. We prove con… ▽ More

    Submitted 12 September, 2014; v1 submitted 14 March, 2014; originally announced March 2014.

    Comments: The final publication is available at http://link.springer.com

  46. arXiv:1401.4112  [pdf, other

    cs.CV

    A bi-level view of inpainting - based image compression

    Authors: Yun** Chen, René Ranftl, Thomas Pock

    Abstract: Inpainting based image compression approaches, especially linear and non-linear diffusion models, are an active research topic for lossy image compression. The major challenge in these compression models is to find a small set of descriptive supporting points, which allow for an accurate reconstruction of the original image. It turns out in practice that this is a challenging problem even for the… ▽ More

    Submitted 9 May, 2014; v1 submitted 16 January, 2014; originally announced January 2014.

    Comments: 8 pages, 4 figures, best paper award of CVWW 2014, Computer Vision Winter Workshop, Křtiny, Czech Republic, 3-5th February 2014

  47. Revisiting loss-specific training of filter-based MRFs for image restoration

    Authors: Yun** Chen, Thomas Pock, René Ranftl, Horst Bischof

    Abstract: It is now well known that Markov random fields (MRFs) are particularly effective for modeling image priors in low-level vision. Recent years have seen the emergence of two main approaches for learning the parameters in MRFs: (1) probabilistic learning using sampling-based algorithms and (2) loss-specific training based on MAP estimate. After investigating existing training approaches, it turns out… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Comments: 10 pages, 2 figures, appear at 35th German Conference, GCPR 2013, Saarbrücken, Germany, September 3-6, 2013. Proceedings

  48. arXiv:1401.4105  [pdf, other

    cs.CV

    Learning $\ell_1$-based analysis and synthesis sparsity priors using bi-level optimization

    Authors: Yun** Chen, Thomas Pock, Horst Bischof

    Abstract: We consider the analysis operator and synthesis dictionary learning problems based on the the $\ell_1$ regularized sparse representation model. We reveal the internal relations between the $\ell_1$-based analysis model and synthesis model. We then introduce an approach to learn both analysis operator and synthesis dictionary simultaneously by using a unified framework of bi-level optimization. Our… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Comments: 5 pages, 1 figure, appear at the Workshop on Analysis Operator Learning vs. Dictionary Learning, NIPS 2012

  49. Insights into analysis operator learning: From patch-based sparse models to higher-order MRFs

    Authors: Yun** Chen, René Ranftl, Thomas Pock

    Abstract: This paper addresses a new learning algorithm for the recently introduced co-sparse analysis model. First, we give new insights into the co-sparse analysis model by establishing connections to filter-based MRF models, such as the Field of Experts (FoE) model of Roth and Black. For training, we introduce a technique called bi-level optimization to learn the analysis operators. Compared to existing… ▽ More

    Submitted 13 January, 2014; originally announced January 2014.

    Comments: 13 pages, 10 figures, accepted to IEEE Image Processing

  50. arXiv:1304.7153  [pdf, other

    cs.CV

    A Convex Approach for Image Hallucination

    Authors: Peter Innerhofer, Thomas Pock

    Abstract: In this paper we propose a global convex approach for image hallucination. Altering the idea of classical multi image super resolution (SU) systems to single image SU, we incorporate aligned images to hallucinate the output. Our work is based on the paper of Tappen et al. where they use a non-convex model for image hallucination. In comparison we formulate a convex primal optimization problem and… ▽ More

    Submitted 26 April, 2013; originally announced April 2013.

    Comments: submitted to ÖAGM-AAPR 2013, 8 pages, 3 figures

    Report number: OAGM-AAPR/2013/18