Search | arXiv e-print repository

arXiv:2003.03759 [pdf, other]

3D Object Detection from a Single Fisheye Image Without a Single Fisheye Training Image

Authors: Elad Plaut, Erez Ben Yaacov, Bat El Shlomo

Abstract: Existing monocular 3D object detection methods have been demonstrated on rectilinear perspective images and fail in images with alternative projections such as those acquired by fisheye cameras. Previous works on object detection in fisheye images have focused on 2D object detection, partly due to the lack of 3D datasets of such images. In this work, we show how to use existing monocular 3D object… ▽ More Existing monocular 3D object detection methods have been demonstrated on rectilinear perspective images and fail in images with alternative projections such as those acquired by fisheye cameras. Previous works on object detection in fisheye images have focused on 2D object detection, partly due to the lack of 3D datasets of such images. In this work, we show how to use existing monocular 3D object detection models, trained only on rectilinear images, to detect 3D objects in images from fisheye cameras, without using any fisheye training data. We outperform the only existing method for monocular 3D object detection in panoramas on a benchmark of synthetic data, despite the fact that the existing method trains on the target non-rectilinear projection whereas we train only on rectilinear images. We also experiment with an internal dataset of real fisheye images. △ Less

Submitted 31 May, 2021; v1 submitted 8 March, 2020; originally announced March 2020.

Comments: 9 pages, 7 figures

ACM Class: I.4.8

arXiv:1812.10538 [pdf, other]

A Greedy Approach to $\ell_{0,\infty}$ Based Convolutional Sparse Coding

Authors: Elad Plaut, Raja Giryes

Abstract: Sparse coding techniques for image processing traditionally rely on a processing of small overlap** patches separately followed by averaging. This has the disadvantage that the reconstructed image no longer obeys the sparsity prior used in the processing. For this purpose convolutional sparse coding has been introduced, where a shift-invariant dictionary is used and the sparsity of the recovered… ▽ More Sparse coding techniques for image processing traditionally rely on a processing of small overlap** patches separately followed by averaging. This has the disadvantage that the reconstructed image no longer obeys the sparsity prior used in the processing. For this purpose convolutional sparse coding has been introduced, where a shift-invariant dictionary is used and the sparsity of the recovered image is maintained. Most such strategies target the $\ell_0$ "norm" or the $\ell_1$ norm of the whole image, which may create an imbalanced sparsity across various regions in the image. In order to face this challenge, the $\ell_{0,\infty}$ "norm" has been proposed as an alternative that "operates locally while thinking globally". The approaches taken for tackling the non-convexity of these optimization problems have been either using a convex relaxation or local pursuit algorithms. In this paper, we present an efficient greedy method for sparse coding and dictionary learning, which is specifically tailored to $\ell_{0,\infty}$, and is based on matching pursuit. We demonstrate the usage of our approach in salt-and-pepper noise removal and image inpainting. A code package which reproduces the experiments presented in this work is available at https://web.eng.tau.ac.il/~raja △ Less

Submitted 26 December, 2018; originally announced December 2018.

Comments: Accepted for publication in SIAM Journal on Imaging Sciences (SIIMS)

arXiv:1804.10253 [pdf, other]

From Principal Subspaces to Principal Components with Linear Autoencoders

Authors: Elad Plaut

Abstract: The autoencoder is an effective unsupervised learning model which is widely used in deep learning. It is well known that an autoencoder with a single fully-connected hidden layer, a linear activation function and a squared error cost function trains weights that span the same subspace as the one spanned by the principal component loading vectors, but that they are not identical to the loading vect… ▽ More The autoencoder is an effective unsupervised learning model which is widely used in deep learning. It is well known that an autoencoder with a single fully-connected hidden layer, a linear activation function and a squared error cost function trains weights that span the same subspace as the one spanned by the principal component loading vectors, but that they are not identical to the loading vectors. In this paper, we show how to recover the loading vectors from the autoencoder weights. △ Less

Submitted 28 December, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

Showing 1–3 of 3 results for author: Plaut, E