
Showing 1–15 of 15 results for author: Evangelidis, G

  1. arXiv:2306.07921  [pdf, other]

    cs.CV

    Continuous Cost Aggregation for Dual-Pixel Disparity Extraction

    Authors: Sagi Monin, Sagi Katz, Georgios Evangelidis

    Abstract: Recent works have shown that depth information can be obtained from Dual-Pixel (DP) sensors. A DP arrangement provides two views in a single shot, thus resembling a stereo image pair with a tiny baseline. However, the different point spread function (PSF) per view, as well as the small disparity range, makes the use of typical stereo matching algorithms problematic. To address the above shortcomin…

    Submitted 13 June, 2023; originally announced June 2023.

  2. arXiv:2212.08059  [pdf, other]

    cs.CV cs.AI cs.LG

    Rethinking Vision Transformers for MobileNet Size and Speed

    Authors: Yanyu Li, Ju Hu, Yang Wen, Georgios Evangelidis, Kamyar Salahi, Yanzhi Wang, Sergey Tulyakov, Jian Ren

    Abstract: With the success of Vision Transformers (ViTs) in computer vision tasks, recent arts try to optimize the performance and complexity of ViTs to enable efficient deployment on mobile devices. Multiple approaches are proposed to accelerate attention mechanism, improve inefficient designs, or incorporate mobile-friendly lightweight convolutions to form hybrid architectures. However, ViT and its varian…

    Submitted 4 September, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Code is available at: https://github.com/snap-research/EfficientFormer

  3. arXiv:2206.01191  [pdf, other]

    cs.CV

    EfficientFormer: Vision Transformers at MobileNet Speed

    Authors: Yanyu Li, Geng Yuan, Yang Wen, Ju Hu, Georgios Evangelidis, Sergey Tulyakov, Yanzhi Wang, Jian Ren

    Abstract: Vision Transformers (ViT) have shown rapid progress in computer vision tasks, achieving promising results on various benchmarks. However, due to the massive number of parameters and model design, e.g., attention mechanism, ViT-based models are generally times slower than lightweight convolutional networks. Therefore, the deployment of ViT for real-time applications is particularly challen…

    Submitted 10 October, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

  4. An Overview of Depth Cameras and Range Scanners Based on Time-of-Flight Technologies

    Authors: Radu Horaud, Miles Hansard, Georgios Evangelidis, Clement Menier

    Abstract: Time-of-flight (TOF) cameras are sensors that can measure the depths of scene-points, by illuminating the scene with a controlled laser or LED source, and then analyzing the reflected light. In this paper, we will first describe the underlying measurement principles of time-of-flight cameras, including: (i) pulsed-light cameras, which measure directly the time taken for a light pulse to travel fro…

    Submitted 12 December, 2020; originally announced December 2020.

    Journal ref: Machine Vision and Applications, 27(7), 2016

  5. Fusion of Range and Stereo Data for High-Resolution Scene-Modeling

    Authors: Georgios D. Evangelidis, Miles Hansard, Radu Horaud

    Abstract: This paper addresses the problem of range-stereo fusion, for the construction of high-resolution depth maps. In particular, we combine low-resolution depth data with high-resolution stereo data, in a maximum a posteriori (MAP) formulation. Unlike existing schemes that build on MRF optimizers, we infer the disparity map from a series of local energy minimization problems that are solved hierarchica…

    Submitted 12 December, 2020; originally announced December 2020.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(11), 2015

  6. arXiv:2010.02153  [pdf, ps, other]

    cs.CV

    Ego-Motion Alignment from Face Detections for Collaborative Augmented Reality

    Authors: Branislav Micusik, Georgios Evangelidis

    Abstract: Sharing virtual content among multiple smart glasses wearers is an essential feature of a seamless Collaborative Augmented Reality experience. To enable the sharing, local coordinate systems of the underlying 6D ego-pose trackers, running independently on each set of glasses, have to be spatially and temporally aligned with respect to each other. In this paper, we propose a novel lightweight solut…

    Submitted 5 October, 2020; originally announced October 2020.

  7. arXiv:2008.06399  [pdf, ps, other]

    cs.CV cs.RO

    Renormalization for Initialization of Rolling Shutter Visual-Inertial Odometry

    Authors: Branislav Micusik, Georgios Evangelidis

    Abstract: In this paper we deal with the initialization problem of a visual-inertial odometry system with rolling shutter cameras. Initialization is a prerequisite for using inertial signals and fusing them with visual data. We propose a novel statistical solution to the initialization problem on visual and inertial data simultaneously, by casting it into the renormalization scheme of Kanatani. The renormal…

    Submitted 24 March, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

  8. arXiv:2006.06017  [pdf, ps, other]

    cs.CV cs.RO

    Revisiting visual-inertial structure from motion for odometry and SLAM initialization

    Authors: Georgios Evangelidis, Branislav Micusik

    Abstract: In this paper, an efficient closed-form solution for the state initialization in visual-inertial odometry (VIO) and simultaneous localization and mapping (SLAM) is presented. Unlike the state-of-the-art, we do not derive linear equations from triangulating pairs of point observations. Instead, we build on a direct triangulation of the unknown $3D$ point paired with each of its observations. We sho…

    Submitted 28 January, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

  9. K-means clustering for efficient and robust registration of multi-view point sets

    Authors: Zutao Jiang, Jihua Zhu, Georgios D. Evangelidis, Changqing Zhang, Shanmin Pang, Yaochen Li

    Abstract: Generally, there are three main factors that determine the practical usability of registration, i.e., accuracy, robustness, and efficiency. In real-time applications, efficiency and robustness are more important. To promote these two abilities, we cast the multi-view registration into a clustering task. All the centroids are uniformly sampled from the initially aligned point sets involved in the m…

    Submitted 30 April, 2018; v1 submitted 14 October, 2017; originally announced October 2017.

  10. Joint Alignment of Multiple Point Sets with Batch and Incremental Expectation-Maximization

    Authors: Georgios Evangelidis, Radu Horaud

    Abstract: This paper addresses the problem of registering multiple point sets. Solutions to this problem are often approximated by repeatedly solving for pairwise registration, which results in an uneven treatment of the sets forming a pair: a model set and a data set. The main drawback of this strategy is that the model set may contain noise and outliers, which negatively affects the estimation of the regi…

    Submitted 6 March, 2017; v1 submitted 6 September, 2016; originally announced September 2016.

    Comments: 14 pages, 12 figures, 5 tables

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(6), 1397 - 1410, 2018

  11. Robust Head-Pose Estimation Based on Partially-Latent Mixture of Linear Regressions

    Authors: Vincent Drouard, Radu Horaud, Antoine Deleforge, Silèye Ba, Georgios Evangelidis

    Abstract: Head-pose estimation has many applications, such as social event analysis, human-robot and human-computer interaction, driving assistance, and so forth. Head-pose estimation is challenging because it must cope with changing illumination conditions, variabilities in face orientation and in appearance, partial occlusions of facial landmarks, as well as bounding-box-to-face alignment errors. We propo…

    Submitted 6 March, 2017; v1 submitted 31 March, 2016; originally announced March 2016.

    Comments: 12 pages, 5 figures, 3 tables

    Journal ref: IEEE Transactions on Image Processing, volume 26, Issue 3, 1428-1440, 2017

  12. Continuous Action Recognition Based on Sequence Alignment

    Authors: Kaustubh Kulkarni, Georgios Evangelidis, Jan Cech, Radu Horaud

    Abstract: Continuous action recognition is more challenging than isolated recognition because classification and segmentation must be simultaneously carried out. We build on the well known dynamic time warping (DTW) framework and devise a novel visual alignment technique, namely dynamic frame warping (DFW), which performs isolated recognition based on per-frame representation of videos, and on aligning a te…

    Submitted 2 June, 2014; originally announced June 2014.

    Journal ref: International Journal of Computer Vision 112(1), 90-114, 2015

  13. Cross-calibration of Time-of-flight and Colour Cameras

    Authors: Miles Hansard, Georgios Evangelidis, Quentin Pelorson, Radu Horaud

    Abstract: Time-of-flight cameras provide depth information, which is complementary to the photometric appearance of the scene in ordinary images. It is desirable to merge the depth and colour information, in order to obtain a coherent scene representation. However, the individual cameras will have different viewpoints, resolutions and fields of view, which means that they must be mutually calibrated. This p…

    Submitted 30 June, 2014; v1 submitted 31 January, 2014; originally announced January 2014.

    Comments: 18 pages, 12 figures, 3 tables

    Journal ref: Computer Vision and Image Understanding, 134, pp.105-115, 2015

  14. Automatic Detection of Calibration Grids in Time-of-Flight Images

    Authors: Miles Hansard, Radu Horaud, Michel Amat, Georgios Evangelidis

    Abstract: It is convenient to calibrate time-of-flight cameras by established methods, using images of a chequerboard pattern. The low resolution of the amplitude image, however, makes it difficult to detect the board reliably. Heuristic detection methods, based on connected image-components, perform very poorly on this data. An alternative, geometrically-principled method is introduced here, based on the H…

    Submitted 24 January, 2014; originally announced January 2014.

    Comments: 11 pages, 11 figures, 1 table

    Journal ref: Computer Vision and Image Understanding, 121, 2014

  15. arXiv:1309.7750  [pdf, ps, other]

    cs.LG

    An Extensive Experimental Study on the Cluster-based Reference Set Reduction for speeding-up the k-NN Classifier

    Authors: Stefanos Ougiaroglou, Georgios Evangelidis, Dimitris A. Dervos

    Abstract: The k-Nearest Neighbor (k-NN) classification algorithm is one of the most widely-used lazy classifiers because of its simplicity and ease of implementation. It is considered to be an effective classifier and has many applications. However, its major drawback is that when sequential search is used to find the neighbors, it involves high computational cost. Speeding-up k-NN search is still an active…

    Submitted 11 February, 2014; v1 submitted 30 September, 2013; originally announced September 2013.

    Comments: Proceeding of International Conference on Integrated Information (IC-InInfo 2011), pp. 12-15, Kos island, Greece, 2011