Skip to main content

Showing 1–31 of 31 results for author: Komodakis, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.13103  [pdf, other

    eess.IV cs.CV cs.LG

    ToNNO: Tomographic Reconstruction of a Neural Network's Output for Weakly Supervised Segmentation of 3D Medical Images

    Authors: Marius Schmidt-Mengin, Alexis Benichoux, Shibeshih Belachew, Nikos Komodakis, Nikos Paragios

    Abstract: Annotating lots of 3D medical images for training segmentation models is time-consuming. The goal of weakly supervised semantic segmentation is to train segmentation models without using any ground truth segmentation masks. Our work addresses the case where only image-level categorical labels, indicating the presence or absence of a particular region of interest (such as tumours or lesions), are a… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024

  2. arXiv:2312.00648  [pdf, other

    cs.CV

    SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers

    Authors: Ioannis Kakogeorgiou, Spyros Gidaris, Konstantinos Karantzalos, Nikos Komodakis

    Abstract: Unsupervised object-centric learning aims to decompose scenes into interpretable object entities, termed slots. Slot-based auto-encoders stand out as a prominent method for this task. Within them, crucial aspects include guiding the encoder to generate object-specific slots and ensuring the decoder utilizes them during reconstruction. This work introduces two novel techniques, (i) an attention-bas… ▽ More

    Submitted 5 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: CVPR 2024 (Highlight). Code: https://github.com/gkakogeorgiou/spot

  3. arXiv:2307.09361  [pdf, other

    cs.CV cs.AI cs.LG

    MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments

    Authors: Spyros Gidaris, Andrei Bursuc, Oriane Simeoni, Antonin Vobecky, Nikos Komodakis, Matthieu Cord, Patrick Pérez

    Abstract: Self-supervised learning can be used for mitigating the greedy needs of Vision Transformer networks for very large fully-annotated datasets. Different classes of self-supervised learning offer representations with either good contextual reasoning properties, e.g., using masked image modeling strategies, or invariance to image perturbations, e.g., with contrastive methods. In this work, we propose… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  4. What to Hide from Your Students: Attention-Guided Masked Image Modeling

    Authors: Ioannis Kakogeorgiou, Spyros Gidaris, Bill Psomas, Yannis Avrithis, Andrei Bursuc, Konstantinos Karantzalos, Nikos Komodakis

    Abstract: Transformers and masked language modeling are quickly being adopted and explored in computer vision as vision transformers and masked image modeling (MIM). In this work, we argue that image token masking differs from token masking in text, due to the amount and correlation of tokens in an image. In particular, to generate a challenging pretext task for MIM, we advocate a shift from random masking… ▽ More

    Submitted 22 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: ECCV 2022. Codes and models are available at https://github.com/gkakogeorgiou/attmask

    Journal ref: European Conference on Computer Vision (2022)

  5. arXiv:2012.11552  [pdf, other

    cs.CV cs.LG

    OBoW: Online Bag-of-Visual-Words Generation for Self-Supervised Learning

    Authors: Spyros Gidaris, Andrei Bursuc, Gilles Puy, Nikos Komodakis, Matthieu Cord, Patrick Pérez

    Abstract: Learning image representations without human supervision is an important and active research field. Several recent approaches have successfully leveraged the idea of making such a representation invariant under different types of perturbations, especially via contrastive-based instance discrimination training. Although effective visual representations should indeed exhibit such invariances, there… ▽ More

    Submitted 29 October, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR2021. Code at https://github.com/valeoai/obow

  6. arXiv:2002.12247  [pdf, other

    cs.CV cs.LG

    Learning Representations by Predicting Bags of Visual Words

    Authors: Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord

    Abstract: Self-supervised representation learning targets to learn convnet-based image representations from unlabeled data. Inspired by the success of NLP methods in this area, in this work we propose a self-supervised approach based on spatially dense image descriptions that encode discrete visual concepts, here called visual words. To build such discrete representations, we quantize the feature maps of a… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: Accepted to CVPR2020

  7. arXiv:1912.01540  [pdf, other

    cs.CV cs.LG

    QUEST: Quantized embedding space for transferring knowledge

    Authors: Himalaya Jain, Spyros Gidaris, Nikos Komodakis, Patrick Pérez, Matthieu Cord

    Abstract: Knowledge distillation refers to the process of training a compact student network to achieve better accuracy by learning from a high capacity teacher network. Most of the existing knowledge distillation methods direct the student to follow the teacher by matching the teacher's output, feature maps or their distribution. In this work, we propose a novel way to achieve this goal: by distilling the… ▽ More

    Submitted 17 July, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted at ECCV 2020

  8. arXiv:1908.04197  [pdf, other

    eess.IV cs.CV cs.GR

    Deep Tone Map** Operator for High Dynamic Range Images

    Authors: Aakanksha Rana, Praveer Singh, Giuseppe Valenzise, Frederic Dufaux, Nikos Komodakis, Aljosa Smolic

    Abstract: A computationally fast tone map** operator (TMO) that can quickly adapt to a wide spectrum of high dynamic range (HDR) content is quintessential for visualization on varied low dynamic range (LDR) output devices such as movie screens or standard displays. Existing TMOs can successfully tone-map only a limited number of HDR content and require an extensive parameter tuning to yield the best subje… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

  9. arXiv:1906.05186  [pdf, other

    cs.CV cs.LG

    Boosting Few-Shot Visual Learning with Self-Supervision

    Authors: Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord

    Abstract: Few-shot learning and self-supervised learning address different facets of the same problem: how to train a model with little or no labeled data. Few-shot learning aims for optimization methods and models that can learn efficiently to recognize patterns in the low data regime. Self-supervised learning focuses instead on unlabeled data and looks into it for the supervisory signal to feed high capac… ▽ More

    Submitted 12 June, 2019; originally announced June 2019.

  10. arXiv:1905.01102  [pdf, other

    cs.CV cs.LG

    Generating Classification Weights with GNN Denoising Autoencoders for Few-Shot Learning

    Authors: Spyros Gidaris, Nikos Komodakis

    Abstract: Given an initial recognition model already trained on a set of base classes, the goal of this work is to develop a meta-model for few-shot learning. The meta-model, given as input some novel classes with few training examples per class, must properly adapt the existing recognition model into a new model that can correctly classify in a unified way both the novel and the base classes. To accomplish… ▽ More

    Submitted 3 May, 2019; originally announced May 2019.

    Comments: Oral presentation at CVPR 2019. The code and models of our paper will be published on: https://github.com/gidariss/wDAE_GNN_FewShot

  11. arXiv:1812.11027  [pdf, other

    cs.LG stat.ML

    Exploring Weight Symmetry in Deep Neural Networks

    Authors: Xu Shell Hu, Sergey Zagoruyko, Nikos Komodakis

    Abstract: We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the symmetry constraints, one would expect a dramatic drop in accuracy. Surprisingly, we show that this is not the case, and, depending on network size, symmetry can ha… ▽ More

    Submitted 10 January, 2019; v1 submitted 28 December, 2018; originally announced December 2018.

  12. arXiv:1809.06367  [pdf, other

    cs.LG cs.CV stat.ML

    Scattering Networks for Hybrid Representation Learning

    Authors: Edouard Oyallon, Sergey Zagoruyko, Gabriel Huang, Nikos Komodakis, Simon Lacoste-Julien, Matthew Blaschko, Eugene Belilovsky

    Abstract: Scattering networks are a class of designed Convolutional Neural Networks (CNNs) with fixed weights. We argue they can serve as generic representations for modelling images. In particular, by working in scattering space, we achieve competitive results both for supervised and unsupervised learning tasks, while making progress towards constructing more interpretable CNNs. For supervised learning, we… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1703.08961

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2018, pp.11

  13. arXiv:1804.09458  [pdf, other

    cs.CV cs.LG

    Dynamic Few-Shot Visual Learning without Forgetting

    Authors: Spyros Gidaris, Nikos Komodakis

    Abstract: The human visual system has the remarkably ability to be able to effortlessly learn novel concepts from only a few examples. Mimicking the same behavior on machine learning vision systems is an interesting and very challenging research problem with many practical advantages on real world vision applications. In this context, the goal of our work is to devise a few-shot visual learning system that… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: Accepted at CVPR 2018. Code and models will be published on: https://github.com/gidariss/FewShotWithoutForgetting

  14. arXiv:1803.07728  [pdf, other

    cs.CV cs.LG

    Unsupervised Representation Learning by Predicting Image Rotations

    Authors: Spyros Gidaris, Praveer Singh, Nikos Komodakis

    Abstract: Over the last years, deep convolutional neural networks (ConvNets) have transformed the field of computer vision thanks to their unparalleled capacity to learn high level semantic image features. However, in order to successfully learn those features, they usually require massive amounts of manually labeled data, which is both expensive and impractical to scale. Therefore, unsupervised semantic fe… ▽ More

    Submitted 20 March, 2018; originally announced March 2018.

    Comments: Accepted at ICLR2018. Code and models will be published on: https://github.com/gidariss/FeatureLearningRotNet

  15. arXiv:1802.03480  [pdf, other

    cs.LG cs.CV cs.NE

    GraphVAE: Towards Generation of Small Graphs Using Variational Autoencoders

    Authors: Martin Simonovsky, Nikos Komodakis

    Abstract: Deep learning on graphs has become a popular research topic with many applications. However, past work has concentrated on learning graph embedding tasks, which is in contrast with advances in generative models for images and text. Is it possible to transfer this progress to the domain of graphs? We propose to sidestep hurdles associated with linearization of such discrete structures by having a d… ▽ More

    Submitted 9 February, 2018; originally announced February 2018.

  16. arXiv:1709.01237  [pdf, ps, other

    cs.CV cs.LG math.NA

    Newton-type Methods for Inference in Higher-Order Markov Random Fields

    Authors: Hariprasad Kannan, Nikos Komodakis, Nikos Paragios

    Abstract: Linear programming relaxations are central to {\sc map} inference in discrete Markov Random Fields. The ability to properly solve the Lagrangian dual is a critical component of such methods. In this paper, we study the benefit of using Newton-type methods to solve the Lagrangian dual of a smooth version of the problem. We investigate their ability to achieve superior convergence behavior and to be… ▽ More

    Submitted 5 September, 2017; originally announced September 2017.

    Comments: 10 pages, 3 figures, 3 tables, CVPR 2017

    Journal ref: Poster at IEEE International Conference on Computer Vision and Pattern Recognition 2017

  17. arXiv:1706.00388  [pdf, other

    cs.CV

    DiracNets: Training Very Deep Neural Networks Without Skip-Connections

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: Deep neural networks with skip-connections, such as ResNet, show excellent performance in various image classification benchmarks. It is though observed that the initial motivation behind them - training deeper networks - does not actually hold true, and the benefits come from increased capacity, rather than from depth. Motivated by this, and inspired from ResNet, we propose a simple Dirac weight… ▽ More

    Submitted 26 January, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

  18. arXiv:1704.02901  [pdf, other

    cs.CV cs.LG cs.NE

    Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs

    Authors: Martin Simonovsky, Nikos Komodakis

    Abstract: A number of problems can be formulated as prediction on graph-structured data. In this work, we generalize the convolution operator from regular grids to arbitrary graphs while avoiding the spectral domain, which allows us to handle graphs of varying size and connectivity. To move beyond a simple diffusion, filter weights are conditioned on the specific edge labels in the neighborhood of a vertex.… ▽ More

    Submitted 8 August, 2017; v1 submitted 10 April, 2017; originally announced April 2017.

    Comments: Accepted to CVPR 2017; extended version

  19. Rotation equivariant vector field networks

    Authors: Diego Marcos, Michele Volpi, Nikos Komodakis, Devis Tuia

    Abstract: In many computer vision tasks, we expect a particular behavior of the output with respect to rotations of the input image. If this relationship is explicitly encoded, instead of treated as any other variation, the complexity of the problem is decreased, leading to a reduction in the size of the required model. In this paper, we propose the Rotation Equivariant Vector Field Networks (RotEqNet), a C… ▽ More

    Submitted 25 August, 2017; v1 submitted 29 December, 2016; originally announced December 2016.

    Comments: 10 pages, accepted at ICCV 2017

  20. arXiv:1612.04770  [pdf, other

    cs.CV cs.LG

    Detect, Replace, Refine: Deep Structured Prediction For Pixel Wise Labeling

    Authors: Spyros Gidaris, Nikos Komodakis

    Abstract: Pixel wise image labeling is an interesting and challenging problem with great significance in the computer vision community. In order for a dense labeling algorithm to be able to achieve accurate and precise results, it has to consider the dependencies that exist in the joint space of both the input and the output variables. An implicit approach for modeling those dependencies is by training a de… ▽ More

    Submitted 14 December, 2016; originally announced December 2016.

  21. arXiv:1612.03928  [pdf, other

    cs.CV

    Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: Attention plays a critical role in human visual experience. Furthermore, it has recently been demonstrated that attention can also play an important role in the context of applying artificial neural networks to a variety of tasks from fields such as computer vision and NLP. In this work we show that, by properly defining attention for convolutional neural networks, we can actually use this type of… ▽ More

    Submitted 12 February, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  22. arXiv:1609.05396  [pdf, other

    cs.CV cs.LG cs.NE

    A Deep Metric for Multimodal Registration

    Authors: Martin Simonovsky, Benjamín Gutiérrez-Becker, Diana Mateus, Nassir Navab, Nikos Komodakis

    Abstract: Multimodal registration is a challenging problem in medical imaging due the high variability of tissue appearance under different imaging modalities. The crucial component here is the choice of the right similarity measure. We make a step towards a general learning-based solution that can be adapted to specific situations and present a metric based on a convolutional neural network. Our network ca… ▽ More

    Submitted 17 September, 2016; originally announced September 2016.

    Comments: Accepted to MICCAI 2016; extended version

  23. arXiv:1608.02728  [pdf, other

    cs.CV cs.LG cs.NE

    OnionNet: Sharing Features in Cascaded Deep Classifiers

    Authors: Martin Simonovsky, Nikos Komodakis

    Abstract: The focus of our work is speeding up evaluation of deep neural networks in retrieval scenarios, where conventional architectures may spend too much time on negative examples. We propose to replace a monolithic network with our novel cascade of feature-sharing deep classifiers, called OnionNet, where subsequent stages may add both new layers as well as new feature channels to the previous ones. Imp… ▽ More

    Submitted 9 August, 2016; originally announced August 2016.

    Comments: Accepted to BMVC 2016

  24. arXiv:1606.04446  [pdf, other

    cs.CV

    Attend Refine Repeat: Active Box Proposal Generation via In-Out Localization

    Authors: Spyros Gidaris, Nikos Komodakis

    Abstract: The problem of computing category agnostic bounding box proposals is utilized as a core component in many computer vision tasks and thus has lately attracted a lot of attention. In this work we propose a new approach to tackle this problem that is based on an active strategy for generating box proposals that starts from a set of seed boxes, which are uniformly distributed on the image, and then pr… ▽ More

    Submitted 14 June, 2016; originally announced June 2016.

    Comments: Technical report. Code as well as box proposals computed for several datasets are available at:: https://github.com/gidariss/AttractioNet

  25. arXiv:1605.07146  [pdf, other

    cs.CV cs.LG cs.NE

    Wide Residual Networks

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: Deep residual networks were shown to be able to scale up to thousands of layers and still have improving performance. However, each fraction of a percent of improved accuracy costs nearly doubling the number of layers, and so training very deep residual networks has a problem of diminishing feature reuse, which makes these networks very slow to train. To tackle these problems, in this paper we con… ▽ More

    Submitted 14 June, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

  26. arXiv:1511.07763  [pdf, other

    cs.CV cs.LG cs.NE

    LocNet: Improving Localization Accuracy for Object Detection

    Authors: Spyros Gidaris, Nikos Komodakis

    Abstract: We propose a novel object localization methodology with the purpose of boosting the localization accuracy of state-of-the-art object detection systems. Our model, given a search region, aims at returning the bounding box of an object of interest inside this region. To accomplish its goal, it relies on assigning conditional probabilities to each row and column of this region, where these probabilit… ▽ More

    Submitted 7 April, 2016; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: Extended technical report -- short version to appear as oral paper on CVPR 2016. Code: https://github.com/gidariss/LocNet/

  27. arXiv:1505.01749  [pdf, other

    cs.CV cs.LG cs.NE

    Object detection via a multi-region & semantic segmentation-aware CNN model

    Authors: Spyros Gidaris, Nikos Komodakis

    Abstract: We propose an object detection system that relies on a multi-region deep convolutional neural network (CNN) that also encodes semantic segmentation-aware features. The resulting CNN-based representation aims at capturing a diverse set of discriminative appearance factors and exhibits localization sensitivity that is essential for accurate object localization. We exploit the above properties of our… ▽ More

    Submitted 23 September, 2015; v1 submitted 7 May, 2015; originally announced May 2015.

    Comments: Extended technical report -- short version to appear at ICCV 2015

  28. arXiv:1504.03641  [pdf, other

    cs.CV cs.LG cs.NE

    Learning to Compare Image Patches via Convolutional Neural Networks

    Authors: Sergey Zagoruyko, Nikos Komodakis

    Abstract: In this paper we show how to learn directly from image data (i.e., without resorting to manually-designed features) a general similarity function for comparing image patches, which is a task of fundamental importance for many computer vision problems. To encode such a function, we opt for a CNN-based model that is trained to account for a wide variety of changes in image appearance. To that end, w… ▽ More

    Submitted 14 April, 2015; originally announced April 2015.

    Comments: CVPR 2015

  29. arXiv:1409.4205  [pdf, other

    cs.CV

    Speeding-up Graphical Model Optimization via a Coarse-to-fine Cascade of Pruning Classifiers

    Authors: B. Conejo, N. Komodakis, S. Leprince, J. P. Avouac

    Abstract: We propose a general and versatile framework that significantly speeds-up graphical model optimization while maintaining an excellent solution accuracy. The proposed approach relies on a multi-scale pruning scheme that is able to progressively reduce the solution space by use of a novel strategy based on a coarse-to-fine cascade of learnt classifiers. We thoroughly experiment with classic computer… ▽ More

    Submitted 15 September, 2014; originally announced September 2014.

  30. arXiv:1406.5429  [pdf, ps, other

    math.NA cs.CV cs.LG math.OC

    Playing with Duality: An Overview of Recent Primal-Dual Approaches for Solving Large-Scale Optimization Problems

    Authors: Nikos Komodakis, Jean-Christophe Pesquet

    Abstract: Optimization methods are at the core of many problems in signal/image processing, computer vision, and machine learning. For a long time, it has been recognized that looking at the dual of an optimization problem may drastically simplify its solution. Deriving efficient strategies which jointly brings into play the primal and the dual problems is however a more recent idea which has generated many… ▽ More

    Submitted 3 December, 2014; v1 submitted 20 June, 2014; originally announced June 2014.

    ACM Class: G.1.6; I.4; I.5

  31. arXiv:1404.0533  [pdf, other

    cs.CV

    A Comparative Study of Modern Inference Techniques for Structured Discrete Energy Minimization Problems

    Authors: Jörg H. Kappes, Bjoern Andres, Fred A. Hamprecht, Christoph Schnörr, Sebastian Nowozin, Dhruv Batra, Sungwoong Kim, Bernhard X. Kausler, Thorben Kröger, Jan Lellmann, Nikos Komodakis, Bogdan Savchynskyy, Carsten Rother

    Abstract: Szeliski et al. published an influential study in 2006 on energy minimization methods for Markov Random Fields (MRF). This study provided valuable insights in choosing the best optimization technique for certain classes of problems. While these insights remain generally useful today, the phenomenal success of random field models means that the kinds of inference problems that have to be solved cha… ▽ More

    Submitted 2 April, 2014; originally announced April 2014.