Skip to main content

Showing 1–27 of 27 results for author: Shekhovtsov, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.04187  [pdf, other

    cs.CC cs.IT

    Enumerating Complexity Revisited

    Authors: Alexander Shekhovtsov, Georgii Zakharov

    Abstract: Consider a subset of positive integers $S$. In this paper, we reduce the upper bound on the length of a minimum program that enumerates $S$ in terms of the probability of $S$ being enumerated by a random program. So far, the best-known upper bound was given by Solovay. Solovay proved that the minimum length of a program enumerating $S$ is bounded by $3$ times minus binary logarithm of the probab… ▽ More

    Submitted 14 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

  2. arXiv:2307.09883  [pdf, other

    cs.LG

    Symmetric Equilibrium Learning of VAEs

    Authors: Boris Flach, Dmitrij Schlesinger, Alexander Shekhovtsov

    Abstract: We view variational autoencoders (VAE) as decoder-encoder pairs, which map distributions in the data space to distributions in the latent space and vice versa. The standard learning approach for VAEs is the maximisation of the evidence lower bound (ELBO). It is asymmetric in that it aims at learning a latent variable model while using the encoder as an auxiliary means only. Moreover, it requires a… ▽ More

    Submitted 12 March, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 13 pages, 6 figures, accepted for AISTATS 2024

    ACM Class: I.2.6

  3. arXiv:2212.13185  [pdf, other

    cs.CV

    Generalized Differentiable RANSAC

    Authors: Tong Wei, Yash Patel, Alexander Shekhovtsov, Jiri Matas, Daniel Barath

    Abstract: We propose $\nabla$-RANSAC, a generalized differentiable RANSAC that allows learning the entire randomized robust estimation pipeline. The proposed approach enables the use of relaxation techniques for estimating the gradients in the sampling distribution, which are then propagated through a differentiable solver. The trainable quality function marginalizes over the scores from all the models esti… ▽ More

    Submitted 8 September, 2023; v1 submitted 26 December, 2022; originally announced December 2022.

  4. arXiv:2110.03549  [pdf, other

    cs.LG cs.NE

    Bias-Variance Tradeoffs in Single-Sample Binary Gradient Estimators

    Authors: Alexander Shekhovtsov

    Abstract: Discrete and especially binary random variables occur in many machine learning models, notably in variational autoencoders with binary latent states and in stochastic binary networks. When learning such models, a key tool is an estimator of the gradient of the expected loss with respect to the probabilities of binary variables. The straight-through (ST) estimator gained popularity due to its simpl… ▽ More

    Submitted 15 October, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 22 pages, GCPR 2021

  5. arXiv:2102.09310  [pdf, other

    cs.LG cs.CV stat.ML

    VAE Approximation Error: ELBO and Exponential Families

    Authors: Alexander Shekhovtsov, Dmitrij Schlesinger, Boris Flach

    Abstract: The importance of Variational Autoencoders reaches far beyond standalone generative models -- the approach is also used for learning latent representations and can be generalized to semi-supervised learning. This requires a thorough analysis of their commonly known shortcomings: posterior collapse and approximation errors. This paper analyzes VAE approximation errors caused by the combination of t… ▽ More

    Submitted 11 April, 2022; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: ICLR 2022 spotlight

  6. arXiv:2006.06880  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Reintroducing Straight-Through Estimators as Principled Methods for Stochastic Binary Networks

    Authors: Alexander Shekhovtsov, Viktor Yanush

    Abstract: Training neural networks with binary weights and activations is a challenging problem due to the lack of gradients and difficulty of optimization over discrete weights. Many successful experimental results have been achieved with empirical straight-through (ST) approaches, proposing a variety of ad-hoc rules for propagating gradients through non-differentiable activations and updating discrete wei… ▽ More

    Submitted 19 October, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 33 pages, DAGM 2021 version (presented, to be published)

  7. arXiv:2006.03143  [pdf, other

    stat.ML cs.LG

    Path Sample-Analytic Gradient Estimators for Stochastic Binary Networks

    Authors: Alexander Shekhovtsov, Viktor Yanush, Boris Flach

    Abstract: In neural networks with binary activations and or binary weights the training by gradient descent is complicated as the model has piecewise constant response. We consider stochastic binary networks, obtained by adding noises in front of activations. The expected model response becomes a smooth function of parameters, its gradient is well defined but it is challenging to estimate it accurately. We… ▽ More

    Submitted 4 November, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  8. arXiv:2004.08227  [pdf, other

    cs.LG cs.CV stat.ML

    MPLP++: Fast, Parallel Dual Block-Coordinate Ascent for Dense Graphical Models

    Authors: Siddharth Tourani, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

    Abstract: Dense, discrete Graphical Models with pairwise potentials are a powerful class of models which are employed in state-of-the-art computer vision and bio-imaging applications. This work introduces a new MAP-solver, based on the popular Dual Block-Coordinate Ascent principle. Surprisingly, by making a small change to the low-performing solver, the Max Product Linear Programming (MPLP) algorithm, we d… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted in ECCV-2018

  9. arXiv:2004.07715  [pdf, other

    cs.LG stat.ML

    Taxonomy of Dual Block-Coordinate Ascent Methods for Discrete Energy Minimization

    Authors: Siddharth Tourani, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

    Abstract: We consider the maximum-a-posteriori inference problem in discrete graphical models and study solvers based on the dual block-coordinate ascent rule. We map all existing solvers in a single framework, allowing for a better understanding of their design principles. We theoretically show that some block-optimizing updates are sub-optimal and how to strictly improve them. On a wide range of problem i… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted in AISTATS 2020

  10. arXiv:2003.06258  [pdf, other

    cs.CV cs.LG

    Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems

    Authors: Patrick Knöbelreiter, Christian Sormann, Alexander Shekhovtsov, Friedrich Fraundorfer, Thomas Pock

    Abstract: It has been proposed by many researchers that combining deep neural networks with graphical models can create more efficient and better regularized composite models. The main difficulties in implementing this in practice are associated with a discrepancy in suitable learning objectives as well as with the necessity of approximations for the inference. In this work we take one of the simplest infer… ▽ More

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: CVPR 2020

  11. arXiv:1907.00845  [pdf, other

    cs.DS math.PR

    Graph-based Nearest Neighbor Search: From Practice to Theory

    Authors: Liudmila Prokhorenkova, Aleksandr Shekhovtsov

    Abstract: Graph-based approaches are empirically shown to be very successful for the nearest neighbor search (NNS). However, there has been very little research on their theoretical guarantees. We fill this gap and rigorously analyze the performance of graph-based NNS algorithms, specifically focusing on the low-dimensional (d << \log n) regime. In addition to the basic greedy algorithm on nearest neighbor… ▽ More

    Submitted 20 August, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

  12. arXiv:1811.00639  [pdf, other

    cs.LG cs.NE stat.ML

    Stochastic Normalizations as Bayesian Learning

    Authors: Alexander Shekhovtsov, Boris Flach

    Abstract: In this work we investigate the reasons why Batch Normalization (BN) improves the generalization performance of deep networks. We argue that one major reason, distinguishing it from data-independent normalization methods, is randomness of batch statistics. This randomness appears in the parameters rather than in activations and admits an interpretation as a practical Bayesian learning. We apply th… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: Accepted to ACCV 2018

  13. arXiv:1803.10590  [pdf, other

    stat.ML cs.CV cs.LG cs.NE

    Feed-forward Uncertainty Propagation in Belief and Neural Networks

    Authors: Alexander Shekhovtsov, Boris Flach, Michal Busta

    Abstract: We propose a feed-forward inference method applicable to belief and neural networks. In a belief network, the method estimates an approximate factorized posterior of all hidden units given the input. In neural networks the method propagates uncertainty of the input through all the layers. In neural networks with injected noise, the method analytically takes into account uncertainties resulting fro… ▽ More

    Submitted 1 November, 2018; v1 submitted 28 March, 2018; originally announced March 2018.

    Comments: error corrections

  14. arXiv:1803.10560  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Normalization of Neural Networks using Analytic Variance Propagation

    Authors: Alexander Shekhovtsov, Boris Flach

    Abstract: We address the problem of estimating statistics of hidden units in a neural network using a method of analytic moment propagation. These statistics are useful for approximate whitening of the inputs in front of saturating non-linearities such as a sigmoid function. This is important for initialization of training and for reducing the accumulated scale and bias dependencies (compensating covariate… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

    Journal ref: In Proceedings of Computer Vision Winter Workshop 2018

  15. arXiv:1709.08524  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Generative learning for deep networks

    Authors: Boris Flach, Alexander Shekhovtsov, Ondrej Fikar

    Abstract: Learning, taking into account full distribution of the data, referred to as generative, is not feasible with deep neural networks (DNNs) because they model only the conditional distribution of the outputs given the inputs. Current solutions are either based on joint probability models facing difficult estimation problems or learn two separate networks, map** inputs to outputs (recognition) and v… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: submitted to AAAI

  16. arXiv:1707.06427  [pdf, other

    cs.CV

    Scalable Full Flow with Learned Binary Descriptors

    Authors: Gottfried Munda, Alexander Shekhovtsov, Patrick Knöbelreiter, Thomas Pock

    Abstract: We propose a method for large displacement optical flow in which local matching costs are learned by a convolutional neural network (CNN) and a smoothness prior is imposed by a conditional random field (CRF). We tackle the computation- and memory-intensive operations on the 4D cost volume by a min-projection which reduces memory complexity from quadratic to linear and binary descriptors for effici… ▽ More

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: GCPR 2017

  17. arXiv:1611.10229  [pdf, other

    cs.CV

    End-to-End Training of Hybrid CNN-CRF Models for Stereo

    Authors: Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock

    Abstract: We propose a novel and principled hybrid CNN+CRF model for stereo estimation. Our model allows to exploit the advantages of both, convolutional neural networks (CNNs) and conditional random fields (CRFs) in an unified approach. The CNNs compute expressive features for matching and distinctive color edges, which in turn are used to compute the unary and binary costs of the CRF. For inference, we ap… ▽ More

    Submitted 3 May, 2017; v1 submitted 30 November, 2016; originally announced November 2016.

    Comments: To appear at CVPR 2017

  18. arXiv:1607.08905  [pdf, other

    cs.CV cs.DS

    Complexity of Discrete Energy Minimization Problems

    Authors: Mengtian Li, Alexander Shekhovtsov, Daniel Huber

    Abstract: Discrete energy minimization is widely-used in computer vision and machine learning for problems such as MAP inference in graphical models. The problem, in general, is notoriously intractable, and finding the global optimal solution is known to be NP-hard. However, is it possible to approximate this problem with a reasonable ratio bound on the solution quality in polynomial time? We show in this p… ▽ More

    Submitted 29 July, 2016; originally announced July 2016.

    Comments: ECCV'16 accepted

  19. arXiv:1606.07015  [pdf, other

    cs.CV

    Joint M-Best-Diverse Labelings as a Parametric Submodular Minimization

    Authors: Alexander Kirillov, Alexander Shekhovtsov, Carsten Rother, Bogdan Savchynskyy

    Abstract: We consider the problem of jointly inferring the M-best diverse labelings for a binary (high-order) submodular energy of a graphical model. Recently, it was shown that this problem can be solved to a global optimum, for many practically interesting diversity measures. It was noted that the labelings are, so-called, nested. This nestedness property also holds for labelings of a class of parametric… ▽ More

    Submitted 23 June, 2016; v1 submitted 22 June, 2016; originally announced June 2016.

  20. arXiv:1601.06274  [pdf, other

    cs.CV

    Solving Dense Image Matching in Real-Time using Discrete-Continuous Optimization

    Authors: Alexander Shekhovtsov, Christian Reinbacher, Gottfried Graber, Thomas Pock

    Abstract: Dense image matching is a fundamental low-level problem in Computer Vision, which has received tremendous attention from both discrete and continuous optimization communities. The goal of this paper is to combine the advantages of discrete and continuous optimization in a coherent framework. We devise a model based on energy minimization, to be optimized by both discrete and continuous algorithms… ▽ More

    Submitted 23 January, 2016; originally announced January 2016.

    Comments: 21 st Computer Vision Winter Workshop

  21. arXiv:1508.07902  [pdf, other

    cs.CV cs.DS

    Maximum Persistency via Iterative Relaxed Inference with Graphical Models

    Authors: Alexander Shekhovtsov, Paul Swoboda, Bogdan Savchynskyy

    Abstract: We consider the NP-hard problem of MAP-inference for undirected discrete graphical models. We propose a polynomial time and practically efficient algorithm for finding a part of its optimal solution. Specifically, our algorithm marks some labels of the considered graphical model either as (i) optimal, meaning that they belong to all optimal solutions of the inference problem; (ii) non-optimal if t… ▽ More

    Submitted 3 February, 2017; v1 submitted 31 August, 2015; originally announced August 2015.

    Comments: Reworked version, submitted to PAMI

  22. arXiv:1505.00571  [pdf, other

    cs.CV cs.DM math.CO

    Higher Order Maximum Persistency and Comparison Theorems

    Authors: Alexander Shekhovtsov

    Abstract: We address combinatorial problems that can be formulated as minimization of a partially separable function of discrete variables (energy minimization in graphical models, weighted constraint satisfaction, pseudo-Boolean optimization, 0-1 polynomial programming). For polyhedral relaxations of such problems it is generally not true that variables integer in the relaxed solution will retain the same… ▽ More

    Submitted 4 May, 2015; originally announced May 2015.

    Comments: Submitted to CVIU Special Issuie on Inference in Graphical Models

  23. arXiv:1410.6641  [pdf, other

    cs.AI

    Partial Optimality by Pruning for MAP-Inference with General Graphical Models

    Authors: Paul Swoboda, Alexander Shekhovtsov, Jörg Hendrik Kappes, Christoph Schnörr, Bogdan Savchynskyy

    Abstract: We consider the energy minimization problem for undirected graphical models, also known as MAP-inference problem for Markov random fields which is NP-hard in general. We propose a novel polynomial time algorithm to obtain a part of its optimal non-relaxed integral solution. Our algorithm is initialized with variables taking integral values in the solution of a convex relaxation of the MAP-inferenc… ▽ More

    Submitted 18 August, 2015; v1 submitted 24 October, 2014; originally announced October 2014.

    Comments: 16 pages, 4 tables and 4 figures

  24. arXiv:1404.3653  [pdf, other

    cs.DM

    Maximum Persistency in Energy Minimization

    Authors: Alexander Shekhovtsov

    Abstract: We consider discrete pairwise energy minimization problem (weighted constraint satisfaction, max-sum labeling) and methods that identify a globally optimal partial assignment of variables. When finding a complete optimal assignment is intractable, determining optimal values for a part of variables is an interesting possibility. Existing methods are based on different sufficient conditions. We prop… ▽ More

    Submitted 16 June, 2014; v1 submitted 14 April, 2014; originally announced April 2014.

    Comments: Extended technical report for the CVPR 2014 paper. Update: correction to the proof of characterization theorem

  25. arXiv:1109.1480  [pdf, other

    cs.CV

    Curvature Prior for MRF-based Segmentation and Shape Inpainting

    Authors: Alexander Shekhovtsov, Pushmeet Kohli, Carsten Rother

    Abstract: Most image labeling problems such as segmentation and image reconstruction are fundamentally ill-posed and suffer from ambiguities and noise. Higher order image priors encode high level structural dependencies between pixels and are key to overcoming these problems. However, these priors in general lead to computationally intractable models. This paper addresses the problem of discovering compact… ▽ More

    Submitted 7 September, 2011; originally announced September 2011.

    Comments: 17 pages, 16 figures

    Report number: CTU--CMP--2011--11

  26. arXiv:1109.1149  [pdf, ps, other

    cs.DM cs.CV math.OC

    On Partial Opimality by Auxiliary Submodular Problems

    Authors: Alexander Shekhovtsov, Vaclav Hlavac

    Abstract: In this work, we prove several relations between three different energy minimization techniques. A recently proposed methods for determining a provably optimal partial assignment of variables by Ivan Kovtun (IK), the linear programming relaxation approach (LP) and the popular expansion move algorithm by Yuri Boykov. We propose a novel sufficient condition of optimal partial assignment, which is ba… ▽ More

    Submitted 6 September, 2011; originally announced September 2011.

    Comments: 9 pages, 0 figures; Control Systems and Computers #2/2011, Special issue: "Optimal Labeling Problem in Structural Pattern Recognition", pp. 71-78, issn 0130-5395

  27. arXiv:1109.1146  [pdf, other

    cs.DC cs.DM cs.DS

    A Distributed Mincut/Maxflow Algorithm Combining Path Augmentation and Push-Relabel

    Authors: Alexander Shekhovtsov, Vaclav Hlavac

    Abstract: We develop a novel distributed algorithm for the minimum cut problem. We primarily aim at solving large sparse problems. Assuming vertices of the graph are partitioned into several regions, the algorithm performs path augmentations inside the regions and updates of the push-relabel style between the regions. The interaction between regions is considered expensive (regions are loaded into the memor… ▽ More

    Submitted 6 September, 2011; originally announced September 2011.

    Comments: 40 pages, 15 figures

    Report number: K333-43/11, CTU-CMP-2011-03