Skip to main content

Showing 1–25 of 25 results for author: Audebert, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01400  [pdf, other

    cs.CV

    GalLoP: Learning Global and Local Prompts for Vision-Language Models

    Authors: Marc Lafon, Elias Ramzi, Clément Rambour, Nicolas Audebert, Nicolas Thome

    Abstract: Prompt learning has been widely adopted to efficiently adapt vision-language models (VLMs), e.g. CLIP, for few-shot image classification. Despite their success, most prompt learning methods trade-off between classification accuracy and robustness, e.g. in domain generalization or out-of-distribution (OOD) detection. In this work, we introduce Global-Local Prompts (GalLoP), a new prompt learning me… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: To be published at ECCV 2024

  2. arXiv:2405.09922  [pdf, other

    cs.CV

    Cross-sensor self-supervised training and alignment for remote sensing

    Authors: Valerio Marsocci, Nicolas Audebert

    Abstract: Large-scale "foundation models" have gained traction as a way to leverage the vast amounts of unlabeled remote sensing data collected every day. However, due to the multiplicity of Earth Observation satellites, these models should learn "sensor agnostic" representations, that generalize across sensor characteristics with minimal fine-tuning. This is complicated by data availability, as low-resolut… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2404.16409  [pdf, other

    cs.CV eess.IV

    Cross-sensor super-resolution of irregularly sampled Sentinel-2 time series

    Authors: Aimi Okabayashi, Nicolas Audebert, Simon Donike, Charlotte Pelletier

    Abstract: Satellite imaging generally presents a trade-off between the frequency of acquisitions and the spatial resolution of the images. Super-resolution is often advanced as a way to get the best of both worlds. In this work, we investigate multi-image super-resolution of satellite image time series, i.e. how multiple images of the same area acquired at different dates can help reconstruct a higher resol… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Journal ref: EARTHVISION 2024 IEEE/CVF CVPR Workshop. Large Scale Computer Vision for Remote Sensing Imagery, Jun 2024, Seattle, United States

  4. arXiv:2404.12667  [pdf, other

    cs.CV cs.AI cs.LG

    Detecting Out-Of-Distribution Earth Observation Images with Diffusion Models

    Authors: Georges Le Bellier, Nicolas Audebert

    Abstract: Earth Observation imagery can capture rare and unusual events, such as disasters and major landscape changes, whose visual appearance contrasts with the usual observations. Deep models trained on common remote sensing data will output drastically different features for these out-of-distribution samples, compared to those closer to their training dataset. Detecting them could therefore help anticip… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: EARTHVISION 2024 IEEE/CVF CVPR Workshop. Large Scale Computer Vision for Remote Sensing Imagery, Jun 2024, Seattle, United States

  5. arXiv:2311.16122  [pdf, other

    cs.CV cs.AI cs.LG

    Semantic Generative Augmentations for Few-Shot Counting

    Authors: Perla Doubinsky, Nicolas Audebert, Michel Crucianu, Hervé Le Borgne

    Abstract: With the availability of powerful text-to-image diffusion models, recent works have explored the use of synthetic data to improve image classification performances. These works show that it can effectively augment or even replace real data. In this work, we investigate how synthetic data can benefit few-shot class-agnostic counting. This requires to generate images that correspond to a given input… ▽ More

    Submitted 26 October, 2023; originally announced November 2023.

  6. arXiv:2309.08250  [pdf, other

    cs.CV

    Optimization of Rank Losses for Image Retrieval

    Authors: Elias Ramzi, Nicolas Audebert, Clément Rambour, André Araujo, Xavier Bitot, Nicolas Thome

    Abstract: In image retrieval, standard evaluation metrics rely on score ranking, \eg average precision (AP), recall at k (R@k), normalized discounted cumulative gain (NDCG). In this work we introduce a general framework for robust and decomposable rank losses optimization. It addresses two major challenges for end-to-end training of deep neural networks with rank losses: non-differentiability and non-decomp… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2207.04873

  7. arXiv:2304.10508  [pdf, other

    cs.CV cs.AI

    Wasserstein Loss for Semantic Editing in the Latent Space of GANs

    Authors: Perla Doubinsky, Nicolas Audebert, Michel Crucianu, Hervé Le Borgne

    Abstract: The latent space of GANs contains rich semantics reflecting the training data. Different methods propose to learn edits in latent space corresponding to semantic attributes, thus allowing to modify generated images. Most supervised methods rely on the guidance of classifiers to produce such edits. However, classifiers can lead to out-of-distribution regions and be fooled by adversarial samples. We… ▽ More

    Submitted 22 March, 2023; originally announced April 2023.

  8. arXiv:2207.04873  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Hierarchical Average Precision Training for Pertinent Image Retrieval

    Authors: Elias Ramzi, Nicolas Audebert, Nicolas Thome, Clément Rambour, Xavier Bitot

    Abstract: Image Retrieval is commonly evaluated with Average Precision (AP) or Recall@k. Yet, those metrics, are limited to binary labels and do not take into account errors' severity. This paper introduces a new hierarchical AP training method for pertinent image retrieval (HAP-PIER). HAPPIER is based on a new H-AP metric, which leverages a concept hierarchy to refine AP by integrating errors' importance a… ▽ More

    Submitted 22 July, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Journal ref: ECCV 2022, Oct 2022, Tel-Aviv, Israel

  9. arXiv:2202.03190  [pdf, other

    cs.NI cs.AI cs.NE

    Efficient Autoprecoder-based deep learning for massive MU-MIMO Downlink under PA Non-Linearities

    Authors: Xinying Cheng, Rafik Zayani, Marin Ferecatu, Nicolas Audebert

    Abstract: This paper introduces a new efficient autoprecoder (AP) based deep learning approach for massive multiple-input multiple-output (mMIMO) downlink systems in which the base station is equipped with a large number of antennas with energy-efficient power amplifiers (PAs) and serves multiple user terminals. We present AP-mMIMO, a new method that jointly eliminates the multiuser interference and compens… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Journal ref: IEEE Wireless Communications and Networking Conference, Apr 2022, Austin, United States

  10. arXiv:2111.00909  [pdf, other

    cs.LG cs.AI cs.CV

    Multi-Attribute Balanced Sampling for Disentangled GAN Controls

    Authors: Perla Doubinsky, Nicolas Audebert, Michel Crucianu, Hervé Le Borgne

    Abstract: Various controls over the generated data can be extracted from the latent space of a pre-trained GAN, as it implicitly encodes the semantics of the training data. The discovered controls allow to vary semantic attributes in the generated images but usually lead to entangled edits that affect multiple attributes at the same time. Supervised approaches typically sample and annotate a collection of l… ▽ More

    Submitted 27 January, 2022; v1 submitted 28 October, 2021; originally announced November 2021.

  11. arXiv:2110.01445  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Robust and Decomposable Average Precision for Image Retrieval

    Authors: Elias Ramzi, Nicolas Thome, Clément Rambour, Nicolas Audebert, Xavier Bitot

    Abstract: In image retrieval, standard evaluation metrics rely on score ranking, e.g. average precision (AP). In this paper, we introduce a method for robust and decomposable average precision (ROADMAP) addressing two major challenges for end-to-end training of deep neural networks with AP: non-differentiability and non-decomposability. Firstly, we propose a new differentiable approximation of the rank func… ▽ More

    Submitted 8 December, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Journal ref: Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Dec 2021, Sydney, Australia

  12. arXiv:2108.11629  [pdf, other

    cs.CV cs.NE eess.IV

    Web Image Context Extraction with Graph Neural Networks and Sentence Embeddings on the DOM tree

    Authors: Chen Dang, Hicham Randrianarivo, Raphaël Fournier-S'Niehotta, Nicolas Audebert

    Abstract: Web Image Context Extraction (WICE) consists in obtaining the textual information describing an image using the content of the surrounding webpage. A common preprocessing step before performing WICE is to render the content of the webpage. When done at a large scale (e.g., for search engine indexation), it may become very computationally costly (up to several seconds per page). To avoid this cost,… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Journal ref: GEM: Graph Embedding and Mining - ECML/PKDD Workshops, Sep 2021, Bilbao, Spain

  13. arXiv:2107.14009  [pdf, other

    cs.SD eess.AS

    PKSpell: Data-Driven Pitch Spelling and Key Signature Estimation

    Authors: Francesco Foscarin, Nicolas Audebert, Raphaël Fournier-S'Niehotta

    Abstract: We present PKSpell: a data-driven approach for the joint estimation of pitch spelling and key signatures from MIDI files. Both elements are fundamental for the production of a full-fledged musical score and facilitate many MIR tasks such as harmonic analysis, section identification, melodic similarity, and search in a digital music library. We design a deep recurrent neural network model that only… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: International Society for Music Information Retrieval Conference (ISMIR), Nov 2021, Online, India

  14. arXiv:2010.07830  [pdf, other

    cs.CV

    Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance Suite, Dataset Analysis and Multi-task Network Study

    Authors: Javiera Castillo-Navarro, Bertrand Le Saux, Alexandre Boulch, Nicolas Audebert, Sébastien Lefèvre

    Abstract: The development of semi-supervised learning techniques is essential to enhance the generalization capacities of machine learning algorithms. Indeed, raw image data are abundant while labels are scarce, therefore it is crucial to leverage unlabeled inputs to build better models. The availability of large databases have been key for the development of learning algorithms with high level performance.… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  15. arXiv:1909.01671  [pdf, other

    cs.NE cs.CV eess.IV

    Distance transform regression for spatially-aware deep semantic segmentation

    Authors: Nicolas Audebert, Alexandre Boulch, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: Understanding visual scenes relies more and more on dense pixel-wise classification obtained via deep fully convolutional neural networks. However, due to the nature of the networks, predictions often suffer from blurry boundaries and ill-segmented shapes, fueling the need for post-processing. This work introduces a new semantic segmentation regularization based on the regression of a distance tra… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

  16. arXiv:1907.06370  [pdf, other

    cs.CV

    Multimodal deep networks for text and image-based document classification

    Authors: Nicolas Audebert, Catherine Herold, Kuider Slimani, Cédric Vidal

    Abstract: Classification of document images is a critical step for archival of old manuscripts, online subscription and administrative procedures. Computer vision and deep learning have been suggested as a first solution to classify documents based on their visual appearance. However, achieving the fine-grained classification that is required in real-world setting cannot be achieved by visual analysis alone… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

  17. arXiv:1904.10674  [pdf, other

    cs.LG cs.CV cs.NE eess.IV

    Deep Learning for Classification of Hyperspectral Data: A Comparative Review

    Authors: Nicolas Audebert, Bertrand Saux, Sébastien Lefèvre

    Abstract: In recent years, deep learning techniques revolutionized the way remote sensing data are processed. Classification of hyperspectral data is no exception to the rule, but has intrinsic specificities which make application of deep learning less straightforward than with other optical data. This article presents a state of the art of previous machine learning approaches, reviews the various deep lear… ▽ More

    Submitted 24 April, 2019; originally announced April 2019.

  18. arXiv:1806.02583  [pdf, other

    cs.NE cs.CV

    Generative Adversarial Networks for Realistic Synthesis of Hyperspectral Samples

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: This work addresses the scarcity of annotated hyperspectral data required to train deep neural networks. Especially, we investigate generative adversarial networks and their application to the synthesis of consistent labeled spectra. By training such networks on public datasets, we show that these models are not only able to capture the underlying distribution, but also to generate genuine-looking… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

    Journal ref: International Geoscience and Remote Sensing Symposium (IGARSS 2018), Jul 2018, Valencia, Spain

  19. arXiv:1712.01600  [pdf, other

    cs.CV cs.LG

    Deep learning for semantic segmentation of remote sensing images with rich spectral content

    Authors: A Hamida, A. Benoît, P. Lambert, L Klein, C Amar, N. Audebert, S. Lefèvre

    Abstract: With the rapid development of Remote Sensing acquisition techniques, there is a need to scale and improve processing tools to cope with the observed increase of both data volume and richness. Among popular techniques in remote sensing, Deep Learning gains increasing interest but depends on the quality of the training data. Therefore, this paper presents recent Deep Learning approaches for fine or… ▽ More

    Submitted 5 December, 2017; originally announced December 2017.

    Comments: IEEE International Geoscience and Remote Sensing Symposium, Jul 2017, Fort Worth, United States. 2017

  20. arXiv:1711.08681  [pdf, other

    cs.NE cs.CV

    Beyond RGB: Very High Resolution Urban Remote Sensing With Multimodal Deep Networks

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: In this work, we investigate various methods to deal with semantic labeling of very high resolution multi-modal remote sensing data. Especially, we study how deep fully convolutional networks can be adapted to deal with multi-modal and multi-scale remote sensing data for semantic labeling. Our contributions are threefold: a) we present an efficient multi-scale approach to leverage both a large spa… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: ISPRS Journal of Photogrammetry and Remote Sensing, Elsevier, A Para{î}tre

  21. arXiv:1705.06057  [pdf, other

    cs.CV cs.NE

    Joint Learning from Earth Observation and OpenStreetMap Data to Get Faster Better Semantic Maps

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: In this work, we investigate the use of OpenStreetMap data for semantic labeling of Earth Observation images. Deep neural networks have been used in the past for remote sensing data classification from various sensors, including multispectral, hyperspectral, SAR and LiDAR data. While OpenStreetMap has already been used as ground truth data for training such networks, this abundant data source rema… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    Journal ref: EARTHVISION 2017 IEEE/ISPRS CVPR Workshop. Large Scale Computer Vision for Remote Sensing Imagery, Jul 2017, Honolulu, United States. 2017

  22. arXiv:1701.05818  [pdf, other

    cs.NE cs.CV

    Fusion of Heterogeneous Data in Convolutional Networks for Urban Semantic Labeling (Invited Paper)

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: In this work, we present a novel module to perform fusion of heterogeneous data using fully convolutional networks for semantic labeling. We introduce residual correction as a way to learn how to fuse predictions coming out of a dual stream architecture. Especially, we perform fusion of DSM and IRRG optical data on the ISPRS Vaihingen dataset over a urban area and obtain new state-of-the-art resul… ▽ More

    Submitted 20 January, 2017; originally announced January 2017.

    Comments: Joint Urban Remote Sensing Event (JURSE), Mar 2017, Dubai, United Arab Emirates. Joint Urban Remote Sensing Event 2017

  23. arXiv:1609.06861  [pdf, other

    cs.CV

    How Useful is Region-based Classification of Remote Sensing Images in a Deep Learning Framework?

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: In this paper, we investigate the impact of segmentation algorithms as a preprocessing step for classification of remote sensing images in a deep learning framework. Especially, we address the issue of segmenting the image into regions to be classified using pre-trained deep neural networks as feature extractors for an SVM-based classifier. An efficient segmentation as a preprocessing step… ▽ More

    Submitted 22 September, 2016; originally announced September 2016.

    Comments: IEEE International Geosciences and Remote Sensing Symposium (IGARSS), Jul 2016, Bei**g, China

  24. arXiv:1609.06846  [pdf, other

    cs.CV cs.NE

    Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: This work investigates the use of deep fully convolutional neural networks (DFCNN) for pixel-wise scene labeling of Earth Observation images. Especially, we train a variant of the SegNet architecture on remote sensing data over an urban area and study different strategies for performing accurate semantic segmentation. Our contributions are the following: 1) we transfer efficiently a DFCNN from gen… ▽ More

    Submitted 22 September, 2016; originally announced September 2016.

    Comments: Asian Conference on Computer Vision (ACCV16), Nov 2016, Taipei, Taiwan

  25. arXiv:1609.06845  [pdf, other

    cs.NE cs.CV

    On the usability of deep networks for object-based image analysis

    Authors: Nicolas Audebert, Bertrand Le Saux, Sébastien Lefèvre

    Abstract: As computer vision before, remote sensing has been radically changed by the introduction of Convolution Neural Networks. Land cover use, object detection and scene understanding in aerial images rely more and more on deep learning to achieve new state-of-the-art results. Recent architectures such as Fully Convolutional Networks (Long et al., 2015) can even produce pixel level annotations for sema… ▽ More

    Submitted 22 September, 2016; originally announced September 2016.

    Comments: in International Conference on Geographic Object-Based Image Analysis (GEOBIA), Sep 2016, Enschede, Netherlands