Skip to main content

Showing 1–10 of 10 results for author: Perronnin, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:1611.08194  [pdf, other

    cs.CV

    Interferences in match kernels

    Authors: Naila Murray, Hervé Jégou, Florent Perronnin, Andrew Zisserman

    Abstract: We consider the design of an image representation that embeds and aggregates a set of local descriptors into a single vector. Popular representations of this kind include the bag-of-visual-words, the Fisher vector and the VLAD. When two such image representations are compared with the dot-product, the image-to-image similarity can be interpreted as a match kernel. In match kernels, one has to deal… ▽ More

    Submitted 24 November, 2016; originally announced November 2016.

    Comments: Accepted as regular paper in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  2. arXiv:1609.01882  [pdf, other

    cs.CV cs.DB cs.IT cs.LG

    Polysemous codes

    Authors: Matthijs Douze, Hervé Jégou, Florent Perronnin

    Abstract: This paper considers the problem of approximate nearest neighbor search in the compressed domain. We introduce polysemous codes, which offer both the distance estimation quality of product quantization and the efficient comparison of binary codes with Hamming distance. Their design is inspired by algorithms introduced in the 90's to construct channel-optimized vector quantizers. At search time, th… ▽ More

    Submitted 10 October, 2016; v1 submitted 7 September, 2016; originally announced September 2016.

    Comments: The final (author) version of our ECCV'16 paper

  3. arXiv:1603.00438  [pdf, other

    cs.CV

    Convolutional Patch Representations for Image Retrieval: an Unsupervised Approach

    Authors: Mattis Paulin, Julien Mairal, Matthijs Douze, Zaid Harchaoui, Florent Perronnin, Cordelia Schmid

    Abstract: Convolutional neural networks (CNNs) have recently received a lot of attention due to their ability to model local stationary structures in natural images in a multi-scale fashion, when learning all model parameters with supervision. While excellent performance was achieved for image classification when large amounts of labeled visual data are available, their success for un-supervised tasks such… ▽ More

    Submitted 1 March, 2016; originally announced March 2016.

  4. arXiv:1509.06243  [pdf, other

    cs.CV

    LEWIS: Latent Embeddings for Word Images and their Semantics

    Authors: Albert Gordo, Jon Almazan, Naila Murray, Florent Perronnin

    Abstract: The goal of this work is to bring semantics into the tasks of text recognition and retrieval in natural images. Although text recognition and retrieval have received a lot of attention in recent years, previous works have focused on recognizing or retrieving exactly the same word used as a query, without taking the semantics into consideration. In this paper, we ask the following question: \emph… ▽ More

    Submitted 21 September, 2015; originally announced September 2015.

    Comments: Accepted for publication at the International Conference on Computer Vision (ICCV) 2015

  5. arXiv:1507.06429  [pdf, other

    cs.CV

    Deep Fishing: Gradient Features from Deep Nets

    Authors: Albert Gordo, Adrien Gaidon, Florent Perronnin

    Abstract: Convolutional Networks (ConvNets) have recently improved image recognition performance thanks to end-to-end learning of deep feed-forward models from raw pixels. Deep learning is a marked departure from the previous state of the art, the Fisher Vector (FV), which relied on gradient-based encoding of local hand-crafted features. In this paper, we discuss a novel connection between these two approac… ▽ More

    Submitted 23 July, 2015; originally announced July 2015.

    Comments: To appear at BMVC 2015

  6. arXiv:1504.04763  [pdf, other

    cs.CV

    Understanding the Fisher Vector: a multimodal part model

    Authors: David Novotný, Diane Larlus, Florent Perronnin, Andrea Vedaldi

    Abstract: Fisher Vectors and related orderless visual statistics have demonstrated excellent performance in object detection, sometimes superior to established approaches such as the Deformable Part Models. However, it remains unclear how these models can capture complex appearance variations using visual codebooks of limited sizes and coarse geometric information. In this work, we propose to interpret Fish… ▽ More

    Submitted 18 April, 2015; originally announced April 2015.

  7. Label-Embedding for Image Classification

    Authors: Zeynep Akata, Florent Perronnin, Zaid Harchaoui, Cordelia Schmid

    Abstract: Attributes act as intermediate representations that enable parameter sharing between classes, a must when training data is scarce. We propose to view attribute-based image classification as a label-embedding problem: each class is embedded in the space of attribute vectors. We introduce a function that measures the compatibility between an image and a label embedding. The parameters of this functi… ▽ More

    Submitted 1 October, 2015; v1 submitted 30 March, 2015; originally announced March 2015.

    Comments: IEEE TPAMI preprint

  8. arXiv:1412.4940  [pdf, other

    cs.CV

    Discovering beautiful attributes for aesthetic image analysis

    Authors: Luca Marchesotti, Naila Murray, Florent Perronnin

    Abstract: Aesthetic image analysis is the study and assessment of the aesthetic properties of images. Current computational approaches to aesthetic image analysis either provide accurate or interpretable results. To obtain both accuracy and interpretability by humans, we advocate the use of learned and nameable visual attributes as mid-level features. For this purpose, we propose to discover and learn the v… ▽ More

    Submitted 16 December, 2014; originally announced December 2014.

    Comments: IJCV, 2014

  9. arXiv:1408.4325  [pdf, other

    cs.CV

    What makes an Image Iconic? A Fine-Grained Case Study

    Authors: Yangmuzi Zhang, Diane Larlus, Florent Perronnin

    Abstract: A natural approach to teaching a visual concept, e.g. a bird species, is to show relevant images. However, not all relevant images represent a concept equally well. In other words, they are not necessarily iconic. This observation raises three questions. Is iconicity a subjective property? If not, can we predict iconicity? And what exactly makes an image iconic? We provide answers to these questio… ▽ More

    Submitted 19 August, 2014; originally announced August 2014.

  10. arXiv:1406.0312  [pdf, other

    cs.CV

    Generalized Max Pooling

    Authors: Naila Murray, Florent Perronnin

    Abstract: State-of-the-art patch-based image representations involve a pooling operation that aggregates statistics computed from local descriptors. Standard pooling operations include sum- and max-pooling. Sum-pooling lacks discriminability because the resulting representation is strongly influenced by frequent yet often uninformative descriptors, but only weakly influenced by rare yet potentially highly-i… ▽ More

    Submitted 2 June, 2014; originally announced June 2014.

    Comments: (to appear) CVPR 2014 - IEEE Conference on Computer Vision & Pattern Recognition (2014)