
Showing 1–12 of 12 results for author: Kanakis, M

  1. arXiv:2312.15471  [pdf, other]

    cs.CV cs.RO

    Residual Learning for Image Point Descriptors

    Authors: Rashik Shrestha, Ajad Chhatkuli, Menelaos Kanakis, Luc Van Gool

    Abstract: Local image feature descriptors have had a tremendous impact on the development and application of computer vision methods. It is therefore unsurprising that significant efforts are being made for learning-based image point descriptors. However, the advantage of learned methods over handcrafted methods in real applications is subtle and more nuanced than expected. Moreover, handcrafted descriptors…

    Submitted 24 December, 2023; originally announced December 2023.

  2. arXiv:2311.03345  [pdf, other]

    cs.CV

    Long-Term Invariant Local Features via Implicit Cross-Domain Correspondences

    Authors: Zador Pataki, Mohammad Altillawi, Menelaos Kanakis, Rémi Pautrat, Fengyi Shen, Ziyuan Liu, Luc Van Gool, Marc Pollefeys

    Abstract: Modern learning-based visual feature extraction networks perform well in intra-domain localization, however, their performance significantly declines when image pairs are captured across long-term visual domain variations, such as different seasonal and daytime variations. In this paper, our first contribution is a benchmark to investigate the performance impact of long-term variations on visual l…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 14 pages + 5 pages appendix, 13 figures

  3. arXiv:2309.08523  [pdf, other]

    cs.CV cs.GR

    Breathing New Life into 3D Assets with Generative Repainting

    Authors: Tianfu Wang, Menelaos Kanakis, Konrad Schindler, Luc Van Gool, Anton Obukhov

    Abstract: Diffusion-based text-to-image models ignited immense attention from the vision community, artists, and content creators. Broad adoption of these models is due to significant improvement in the quality of generations and efficient conditioning on various modalities, not just text. However, lifting the rich generative priors of these 2D models into 3D is challenging. Recent works have proposed vario…

    Submitted 18 October, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

  4. arXiv:2210.07239  [pdf, other]

    cs.CV

    Composite Learning for Robust and Effective Dense Predictions

    Authors: Menelaos Kanakis, Thomas E. Huang, David Bruggemann, Fisher Yu, Luc Van Gool

    Abstract: Multi-task learning promises better model generalization on a target task by jointly optimizing it with an auxiliary task. However, the current practice requires additional labeling efforts for the auxiliary task, while not guaranteeing better model performance. In this paper, we find that jointly training a dense prediction (target) task with a self-supervised (auxiliary) task can consistently im…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Winter Conference on Applications of Computer Vision (WACV), 2023

  5. arXiv:2203.03610  [pdf, other]

    cs.CV cs.LG cs.RO

    ZippyPoint: Fast Interest Point Detection, Description, and Matching through Mixed Precision Discretization

    Authors: Menelaos Kanakis, Simon Maurer, Matteo Spallanzani, Ajad Chhatkuli, Luc Van Gool

    Abstract: Efficient detection and description of geometric regions in images is a prerequisite in visual systems for localization and mapping. Such systems still rely on traditional hand-crafted methods for efficient generation of lightweight descriptors, a common limitation of the more powerful neural network models that come with high compute and specific hardware requirements. In this paper, we focus on…

    Submitted 8 April, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Computer Vision and Pattern Recognition Workshop (CVPRW), 2023

  6. arXiv:2112.09686  [pdf, other]

    cs.CV

    Efficient Visual Tracking with Exemplar Transformers

    Authors: Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool

    Abstract: The design of more complex and powerful neural network models has significantly advanced the state-of-the-art in visual object tracking. These advances can be attributed to deeper networks, or the introduction of new building blocks, such as transformers. However, in the pursuit of increased tracking performance, runtime is often hindered. Furthermore, efficient tracking architectures have receive…

    Submitted 4 October, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  7. arXiv:2105.07830  [pdf, other]

    cs.CV cs.LG

    Learning to Relate Depth and Semantics for Unsupervised Domain Adaptation

    Authors: Suman Saha, Anton Obukhov, Danda Pani Paudel, Menelaos Kanakis, Yuhua Chen, Stamatios Georgoulis, Luc Van Gool

    Abstract: We present an approach for encoding visual task relationships to improve model performance in an Unsupervised Domain Adaptation (UDA) setting. Semantic segmentation and monocular depth estimation are shown to be complementary tasks; in a multi-task learning setting, a proper encoding of their relationships can further improve performance on both tasks. Motivated by this observation, we propose a n…

    Submitted 3 July, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted at CVPR 2021; updated results according to the released source code

  8. arXiv:2104.13874  [pdf, other]

    cs.CV

    Exploring Relational Context for Multi-Task Dense Prediction

    Authors: David Bruggemann, Menelaos Kanakis, Anton Obukhov, Stamatios Georgoulis, Luc Van Gool

    Abstract: The timeline of computer vision research is marked with advances in learning and utilizing efficient contextual representations. Most of them, however, are targeted at improving model performance on a single downstream task. We consider a multi-task environment for dense prediction tasks, represented by a common backbone and independent task-specific heads. Our goal is to find the most efficient w…

    Submitted 23 August, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: International Conference on Computer Vision (ICCV) 2021

  9. arXiv:2008.10292  [pdf, other]

    cs.CV

    Automated Search for Resource-Efficient Branched Multi-Task Networks

    Authors: David Bruggemann, Menelaos Kanakis, Stamatios Georgoulis, Luc Van Gool

    Abstract: The multi-modal nature of many vision problems calls for neural network architectures that can perform multiple tasks concurrently. Typically, such architectures have been handcrafted in the literature. However, given the size and complexity of the problem, this manual architecture exploration likely exceeds human design abilities. In this paper, we propose a principled approach, rooted in differe…

    Submitted 11 May, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: British Machine Vision Conference (BMVC) 2020

  10. arXiv:2007.12540  [pdf, other]

    cs.CV cs.LG

    Reparameterizing Convolutions for Incremental Multi-Task Learning without Task Interference

    Authors: Menelaos Kanakis, David Bruggemann, Suman Saha, Stamatios Georgoulis, Anton Obukhov, Luc Van Gool

    Abstract: Multi-task networks are commonly utilized to alleviate the need for a large number of highly specialized single-task networks. However, two common challenges in developing multi-task models are often overlooked in literature. First, enabling the model to be inherently incremental, continuously incorporating information from new tasks without forgetting the previously learned ones (incremental lear…

    Submitted 24 July, 2020; originally announced July 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  11. arXiv:2007.06631  [pdf, other]

    cs.LG cs.CV stat.ML

    T-Basis: a Compact Representation for Neural Networks

    Authors: Anton Obukhov, Maxim Rakhuba, Stamatios Georgoulis, Menelaos Kanakis, Dengxin Dai, Luc Van Gool

    Abstract: We introduce T-Basis, a novel concept for a compact representation of a set of tensors, each of an arbitrary shape, which is often seen in Neural Networks. Each of the tensors in the set is modeled using Tensor Rings, though the concept applies to other Tensor Networks. Owing its name to the T-shape of nodes in diagram notation of Tensor Rings, T-Basis is simply a list of equally shaped three-dime…

    Submitted 13 July, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML 2020

  12. arXiv:1912.07124  [pdf, other]

    cs.CV

    Domain Agnostic Feature Learning for Image and Video Based Face Anti-spoofing

    Authors: Suman Saha, Wenhao Xu, Menelaos Kanakis, Stamatios Georgoulis, Yuhua Chen, Danda Pani Paudel, Luc Van Gool

    Abstract: Nowadays, the increasingly growing number of mobile and computing devices has led to a demand for safer user authentication systems. Face anti-spoofing is a measure towards this direction for bio-metric user authentication, and in particular face recognition, that tries to prevent spoof attacks. The state-of-the-art anti-spoofing techniques leverage the ability of deep neural networks to learn dis…

    Submitted 13 April, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

    Comments: CVPR 2020 Biometrics Workshop