Search | arXiv e-print repository

Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection

Authors: Colin Decourt, Rufin VanRullen, Didier Salle, Thomas Oberlin

Abstract: In recent years, driven by the need for safer and more autonomous transport systems, the automotive industry has shifted toward integrating a growing number of Advanced Driver Assistance Systems (ADAS). Among the array of sensors employed for object recognition tasks, radar sensors have emerged as a formidable contender due to their abilities in adverse weather conditions or low-light scenarios and their robustness in maintaining consistent performance across diverse environments. However, the small size of radar datasets and the complexity of the labelling of those data limit the performance of radar object detectors. Driven by the promising results of self-supervised learning in computer vision, this paper presents RiCL, an instance contrastive learning framework to pre-train radar object detectors. We propose to exploit the detection from the radar and the temporal information to pre-train the radar object detection model in a self-supervised way using contrastive learning. We aim to pre-train an object detector's backbone, head and neck to learn with fewer data. Experiments on the CARRADA and the RADDet datasets show the effectiveness of our approach in learning generic representations of objects in range-Doppler maps. Notably, our pre-training strategy allows us to use only 20% of the labelled data to reach an mAP@0.5 similar to that of a supervised approach using the whole training set.

Submitted 13 February, 2024; originally announced February 2024.

Comments: 8 pages, 3 figures, 1 table
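
The abstract above does not state which contrastive objective RiCL uses, so the following is only a minimal sketch of a generic InfoNCE-style instance contrastive loss, written with NumPy. The function name `info_nce_loss`, the `temperature` value, and the convention that row i of `positives` is the positive pair of row i of `anchors` are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def info_nce_loss(anchors, positives, temperature=0.1):
    """Generic InfoNCE loss (illustrative, not the RiCL objective).

    Row i of `positives` is assumed to be the positive pair of row i of
    `anchors`; every other row in the batch acts as a negative.
    """
    # L2-normalise so that dot products are cosine similarities.
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)
    logits = a @ p.T / temperature  # (N, N) similarity matrix

    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # The matching pairs sit on the diagonal.
    return -np.mean(np.diag(log_prob))


# Hypothetical toy embeddings: four orthogonal instance features.
features = np.eye(4)
loss_matched = info_nce_loss(features, features)
loss_mismatched = info_nce_loss(features, np.roll(features, 1, axis=0))
```

In this toy setting the loss is close to zero when each anchor is paired with its own instance and much larger when the pairs are shuffled, which is the behaviour a contrastive pre-training objective relies on.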

arXiv:2311.17744 [pdf, other]

Variational Bayes image restoration with compressive autoencoders

Authors: Maud Biquard, Marie Chabert, Thomas Oberlin

Abstract: Regularization of inverse problems is of paramount importance in computational imaging. The ability of neural networks to learn efficient image representations has been recently exploited to design powerful data-driven regularizers. While state-of-the-art plug-and-play methods rely on an implicit regularization provided by neural denoisers, alternative Bayesian approaches consider Maximum A Posteriori (MAP) estimation in the latent space of a generative model, thus with an explicit regularization. However, state-of-the-art deep generative models require a huge amount of training data compared to denoisers. Besides, their complexity hampers the optimization involved in latent MAP derivation. In this work, we first propose to use compressive autoencoders instead. These networks, which can be seen as variational autoencoders with a flexible latent prior, are smaller and easier to train than state-of-the-art generative models. As a second contribution, we introduce the Variational Bayes Latent Estimation (VBLE) algorithm, which performs latent estimation within the framework of variational inference. Thanks to a simple yet efficient parameterization of the variational posterior, VBLE allows for fast and easy (approximate) posterior sampling. Experimental results on image datasets BSD and FFHQ demonstrate that VBLE reaches performance similar to that of state-of-the-art plug-and-play methods, while being able to quantify uncertainties faster than other existing posterior sampling techniques.

Submitted 25 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

arXiv:2212.11172 [pdf, other]

doi 10.1109/TITS.2024.3404076

A recurrent CNN for online object detection on raw radar frames

Authors: Colin Decourt, Rufin VanRullen, Didier Salle, Thomas Oberlin

Abstract: Automotive radar sensors provide valuable information for advanced driving assistance systems (ADAS). Radars can reliably estimate the distance to an object and the relative velocity, regardless of weather and light conditions. However, radar sensors suffer from low resolution and huge intra-class variations in the shape of objects. Exploiting the time information (e.g., multiple frames) has been shown to help better capture the dynamics of objects and, therefore, the variation in the shape of objects. Most temporal radar object detectors use 3D convolutions to learn spatial and temporal information. However, these methods are often non-causal and unsuitable for real-time applications. This work presents RECORD, a new recurrent CNN architecture for online radar object detection. We propose an end-to-end trainable architecture mixing convolutions and ConvLSTMs to learn spatio-temporal dependencies between successive frames. Our model is causal and requires only the past information encoded in the memory of the ConvLSTMs to detect objects. Our experiments show the relevance of such a method for detecting objects in different radar representations (range-Doppler, range-angle); it outperforms state-of-the-art models on the ROD2021 and CARRADA datasets while being less computationally expensive.

Submitted 20 May, 2024; v1 submitted 21 December, 2022; originally announced December 2022.

Comments: 11 pages, 4 figures, 5 tables

Journal ref: IEEE Transactions on Intelligent Transportation Systems, 2024

arXiv:2206.13768 [pdf, ps, other]

doi 10.1016/j.sigpro.2022.108905

Algorithms for audio inpainting based on probabilistic nonnegative matrix factorization

Authors: Ondřej Mokrý, Paul Magron, Thomas Oberlin, Cédric Févotte

Abstract: Audio inpainting, i.e., the task of restoring missing or occluded audio signal samples, usually relies on sparse representations or autoregressive modeling. In this paper, we propose to structure the spectrogram with nonnegative matrix factorization (NMF) in a probabilistic framework. First, we treat the missing samples as latent variables, and derive two expectation-maximization algorithms for estimating the parameters of the model, depending on whether we formulate the problem in the time- or time-frequency domain. Then, we treat the missing samples as parameters, and we address this novel problem by deriving an alternating minimization scheme. We assess the potential of these algorithms for the task of restoring short- to middle-length gaps in music signals. Experiments reveal great convergence properties of the proposed methods, as well as competitive performance when compared to state-of-the-art audio inpainting techniques.

Submitted 5 January, 2023; v1 submitted 28 June, 2022; originally announced June 2022.

arXiv:2204.01360 [pdf, other]

doi 10.1109/LSP.2022.3189275

Learning the Proximity Operator in Unfolded ADMM for Phase Retrieval

Authors: Pierre-Hugo Vial, Paul Magron, Thomas Oberlin, Cédric Févotte

Abstract: This paper considers the phase retrieval (PR) problem, which aims to reconstruct a signal from phaseless measurements such as magnitude or power spectrograms. PR is generally handled as a minimization problem involving a quadratic loss. Recent works have considered alternative discrepancy measures, such as the Bregman divergences, but it is still challenging to tailor the optimal loss for a given setting. In this paper we propose a novel strategy to automatically learn the optimal metric for PR. We unfold a recently introduced ADMM algorithm into a neural network, and we emphasize that the information about the loss used to formulate the PR problem is conveyed by the proximity operator involved in the ADMM updates. Therefore, we replace this proximity operator with trainable activation functions: learning these in a supervised setting is then equivalent to learning an optimal metric for PR. Experiments conducted with speech signals show that our approach outperforms the baseline ADMM, using a light and interpretable neural architecture.

Submitted 4 April, 2022; originally announced April 2022.

Comments: 10 pages, 5 figures, submitted to IEEE SPL

arXiv:2101.08661 [pdf, other]

Regularization via deep generative models: an analysis point of view

Authors: Thomas Oberlin, Mathieu Verm

Abstract: This paper proposes a new way of regularizing an inverse problem in imaging (e.g., deblurring or inpainting) by means of a deep generative neural network. Compared to end-to-end models, such approaches seem particularly interesting since the same network can be used for many different problems and experimental conditions, as long as the generative model is suited to the data. Previous works proposed to use a synthesis framework, where the estimation is performed on the latent vector, the solution being obtained afterwards via the decoder. Instead, we propose an analysis formulation where we directly optimize the image itself and penalize the latent vector. We illustrate the interest of such a formulation by running experiments of inpainting, deblurring and super-resolution. In many cases our technique achieves a clear improvement of the performance and seems to be more robust, in particular with respect to initialization.

Submitted 21 January, 2021; originally announced January 2021.

arXiv:2011.12818 [pdf, other]

Phase retrieval with Bregman divergences: Application to audio signal recovery

Authors: Pierre-Hugo Vial, Paul Magron, Thomas Oberlin, Cédric Févotte

Abstract: Phase retrieval aims to recover a signal from magnitude or power spectra measurements. It is often addressed by considering a minimization problem involving a quadratic cost function. We propose a different formulation based on Bregman divergences, which encompass divergences that are appropriate for audio signal processing applications. We derive a fast gradient algorithm to solve this problem.

Submitted 25 November, 2020; originally announced November 2020.

Comments: in Proceedings of iTWIST'20, Paper-ID: 16, Nantes, France, December, 2-4, 2020

arXiv:2011.10097 [pdf, other]

Compartment model-based nonlinear unmixing for kinetic analysis of dynamic PET images

Authors: Yanna Cruz Cavalcanti, Thomas Oberlin, Vinicius Ferraris, Nicolas Dobigeon, Maria Ribeiro, Clovis Tauber

Abstract: When no arterial input function is available, quantification of dynamic PET images requires a previous step devoted to the extraction of a reference time-activity curve (TAC). Factor analysis is often applied for this purpose. This paper introduces a novel approach that conducts a new kind of nonlinear factor analysis relying on a compartment model, and computes the kinetic parameters of specific binding tissues jointly. To this end, it capitalizes on data-driven parametric imaging methods to provide a physical description of the underlying PET data, directly relating the specific binding with the kinetics of the non-specific binding in the corresponding tissues. This characterization is introduced into the factor analysis formulation to yield a novel nonlinear unmixing model designed for PET image analysis. This model also explicitly introduces global kinetic parameters that allow for a direct estimation of the binding potential with respect to the free fractions in each non-specific binding tissue. The performance of the method is evaluated on synthetic and real data to demonstrate its potential interest.

Submitted 19 November, 2020; originally announced November 2020.

arXiv:2010.10255 [pdf, ps, other]

Phase recovery with Bregman divergences for audio source separation

Authors: Paul Magron, Pierre-Hugo Vial, Thomas Oberlin, Cédric Févotte

Abstract: Time-frequency audio source separation is usually achieved by estimating the short-time Fourier transform (STFT) magnitude of each source, and then applying a phase recovery algorithm to retrieve time-domain signals. In particular, the multiple input spectrogram inversion (MISI) algorithm has shown good performance in several recent works. This algorithm minimizes a quadratic reconstruction error between magnitude spectrograms. However, this loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. In this paper, we propose to reformulate phase recovery in audio source separation as a minimization problem involving Bregman divergences. To optimize the resulting objective, we derive a projected gradient descent algorithm. Experiments conducted on a speech enhancement task show that this approach outperforms MISI for several alternative losses, which highlights their relevance for audio source separation applications.

Submitted 9 February, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

arXiv:2010.00392 [pdf, other]

doi 10.1109/JSTSP.2021.3051870

Phase retrieval with Bregman divergences and application to audio signal recovery

Authors: Pierre-Hugo Vial, Paul Magron, Thomas Oberlin, Cédric Févotte

Abstract: Phase retrieval (PR) aims to recover a signal from the magnitudes of a set of inner products. This problem arises in many audio signal processing applications which operate on a short-time Fourier transform magnitude or power spectrogram, and discard the phase information. Recovering the missing phase from the resulting modified spectrogram is indeed necessary in order to synthesize time-domain signals. PR is commonly addressed by considering a minimization problem involving a quadratic loss function. In this paper, we adopt a different standpoint. Indeed, the quadratic loss does not properly account for some perceptual properties of audio, and alternative discrepancy measures such as beta-divergences have been preferred in many settings. Therefore, we formulate PR as a new minimization problem involving Bregman divergences. Since these divergences are not symmetric with respect to their two input arguments in general, they lead to two different formulations of the problem. To optimize the resulting objective, we derive two algorithms based on accelerated gradient descent and alternating direction method of multipliers. Experiments conducted on audio signal recovery from spectrograms that are either exact or estimated from noisy observations highlight the potential of our proposed methods for audio restoration. In particular, leveraging some of these Bregman divergences induces better performance than the quadratic loss when performing PR from spectrograms under very noisy conditions.

Submitted 13 January, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

Comments: 23 pages, 3 figures, accepted for publication in the IEEE Journal of Selected Topics in Signal Processing
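
The abstract above notes that Bregman divergences are in general not symmetric in their two arguments. As a minimal sketch of that point, the snippet below evaluates the textbook definition D_phi(x | y) = phi(x) - phi(y) - <grad phi(y), x - y> for two generating functions. The helper names (`bregman_divergence`, `half_sq_norm`, `neg_entropy`) and the test vectors are hypothetical; this illustrates the definition only, not the algorithms of the paper.

```python
import numpy as np


def bregman_divergence(phi, grad_phi, x, y):
    """D_phi(x | y) = phi(x) - phi(y) - <grad_phi(y), x - y>."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return phi(x) - phi(y) - np.dot(grad_phi(y), x - y)


# phi(v) = 0.5 * ||v||^2 generates half the squared Euclidean distance.
def half_sq_norm(v):
    return 0.5 * np.dot(v, v)


def half_sq_norm_grad(v):
    return v


# phi(v) = sum(v log v - v) generates the generalized Kullback-Leibler
# divergence (entries must be strictly positive).
def neg_entropy(v):
    return np.sum(v * np.log(v) - v)


def neg_entropy_grad(v):
    return np.log(v)


# Hypothetical nonnegative vectors, e.g. two small magnitude spectra.
x = np.array([1.0, 2.0])
y = np.array([1.0, 4.0])

d_quad = bregman_divergence(half_sq_norm, half_sq_norm_grad, x, y)
d_kl_xy = bregman_divergence(neg_entropy, neg_entropy_grad, x, y)
d_kl_yx = bregman_divergence(neg_entropy, neg_entropy_grad, y, x)
```

The quadratic case is symmetric, whereas `d_kl_xy` and `d_kl_yx` differ, which is why swapping the roles of the observed and estimated spectrograms yields two distinct problems.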

arXiv:2006.01034 [pdf, other]

Ordinal Non-negative Matrix Factorization for Recommendation

Authors: Olivier Gouvert, Thomas Oberlin, Cédric Févotte

Abstract: We introduce a new non-negative matrix factorization (NMF) method for ordinal data, called OrdNMF. Ordinal data are categorical data which exhibit a natural ordering between the categories. In particular, they can be found in recommender systems, either with explicit data (such as ratings) or implicit data (such as quantized play counts). OrdNMF is a probabilistic latent factor model that generalizes Bernoulli-Poisson factorization (BePoF) and Poisson factorization (PF) applied to binarized data. Contrary to these methods, OrdNMF circumvents binarization and can exploit a more informative representation of the data. We design an efficient variational algorithm based on a suitable model augmentation and related to variational PF. In particular, our algorithm preserves the scalability of PF and can be applied to huge sparse datasets. We report recommendation experiments on explicit and implicit datasets, and show that OrdNMF outperforms BePoF and PF applied to binarized data.

Submitted 2 September, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

Comments: Accepted for publication at ICML 2020

arXiv:2002.01225 [pdf, other]

Fast reconstruction of atomic-scale STEM-EELS images from sparse sampling

Authors: Etienne Monier, Thomas Oberlin, Nathalie Brun, Xiaoyan Li, Marcel Tencé, Nicolas Dobigeon

Abstract: This paper discusses the reconstruction of partially sampled spectrum-images to accelerate the acquisition in scanning transmission electron microscopy (STEM). The problem of image reconstruction has been widely considered in the literature for many imaging modalities, but only a few attempts handled 3D data such as spectral images acquired by STEM electron energy loss spectroscopy (EELS). Besides, among the methods proposed in the microscopy literature, some are fast but inaccurate while others provide accurate reconstruction but at the price of a high computation burden. Thus none of the proposed reconstruction methods fulfills our expectations in terms of accuracy and computation complexity. In this paper, we propose a fast and accurate reconstruction method suited for atomic-scale EELS. This method is compared to popular solutions such as beta process factor analysis (BPFA) which is used for the first time on STEM-EELS images. Experiments based on real as well as synthetic data will be conducted.

Submitted 4 February, 2020; originally announced February 2020.

arXiv:2001.02618 [pdf, other]

doi 10.3847/1538-3881/ab9301

Simulated JWST datasets for multispectral and hyperspectral image fusion

Authors: Claire Guilloteau, Thomas Oberlin, Olivier Berné, Nicolas Dobigeon

Abstract: This paper aims at providing a comprehensive framework to generate an astrophysical scene and to simulate realistic hyperspectral and multispectral data acquired by two JWST instruments, namely NIRCam Imager and NIRSpec IFU. We want to show that this simulation framework can be used to assess the benefits of fusing these images to recover an image of high spatial and spectral resolutions. To do so, we create a synthetic scene associated with a canonical infrared source, the Orion Bar. This scene combines pre-existing modelled spectra provided by the JWST Early Release Science Program 1288 and real high resolution spatial maps from the Hubble space and ALMA telescopes. We develop forward models including corresponding noises for the two JWST instruments based on their technical designs and physical features. JWST observations are then simulated by applying the forward models to the aforementioned synthetic scene. We test a dedicated fusion algorithm we developed on these simulated observations. We show the fusion process reconstructs the high spatio-spectral resolution scene with a good accuracy on most areas, and we identify some limitations of the method to be tackled in future works. The synthetic scene and observations presented in the paper are made publicly available and can be used for instance to evaluate instrument models (aboard the JWST or on the ground), pipelines, or more sophisticated algorithms dedicated to JWST data analysis. Besides, fusion methods such as the one presented in this paper are shown to be promising tools to fully exploit the unprecedented capabilities of the JWST.

Submitted 8 January, 2020; originally announced January 2020.

arXiv:1912.11868 [pdf, other]

Hyperspectral and multispectral image fusion under spectrally varying spatial blurs -- Application to high dimensional infrared astronomical imaging

Authors: Claire Guilloteau, Thomas Oberlin, Olivier Berné, Nicolas Dobigeon

Abstract: Hyperspectral imaging has become a significant source of valuable data for astronomers over the past decades. Current instrumental and observing time constraints allow direct acquisition of multispectral images, with high spatial but low spectral resolution, and hyperspectral images, with low spatial but high spectral resolution. To enhance scientific interpretation of the data, we propose a data fusion method which combines the benefits of each image to recover a high spatio-spectral resolution datacube. The proposed inverse problem accounts for the specificities of astronomical instruments, such as spectrally variant blurs. We provide a fast implementation by solving the problem in the frequency domain and in a low-dimensional subspace to efficiently handle the convolution operators as well as the high dimensionality of the data. We conduct experiments on a realistic synthetic dataset of simulated observation of the upcoming James Webb Space Telescope, and we show that our fusion algorithm outperforms state-of-the-art methods commonly used in remote sensing for Earth observation.

Submitted 26 December, 2019; originally announced December 2019.

arXiv:1905.13128 [pdf, ps, other]

Recommendation from Raw Data with Adaptive Compound Poisson Factorization

Authors: Olivier Gouvert, Thomas Oberlin, Cédric Févotte

Abstract: Count data are often used in recommender systems: they are widespread (song play counts, product purchases, clicks on web pages) and can reveal user preference without any explicit rating from the user. Such data are known to be sparse, over-dispersed and bursty, which makes their direct use in recommender systems challenging, often leading to pre-processing steps such as binarization. The aim of this paper is to build recommender systems from these raw data, by means of the recently proposed compound Poisson Factorization (cPF). The paper contributions are three-fold: we present a unified framework for discrete data (dcPF), leading to an adaptive and scalable algorithm; we show that our framework achieves a trade-off between Poisson Factorization (PF) applied to raw and binarized data; we study four specific instances that are relevant to recommendation and exhibit new links with combinatorics. Experiments with three different datasets show that dcPF is able to effectively adjust to over-dispersion, leading to better recommendation scores when compared with PF on either raw or binarized data.

Submitted 9 July, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

Comments: Accepted for publication at UAI 2019

arXiv:1807.11455 [pdf, other]

Factor analysis of dynamic PET images: beyond Gaussian noise

Authors: Yanna Cruz Cavalcanti, Thomas Oberlin, Nicolas Dobigeon, Cédric Févotte, Simon Stute, Maria-Joao Ribeiro, Clovis Tauber

Abstract: Factor analysis has proven to be a relevant tool for extracting tissue time-activity curves (TACs) in dynamic PET images, since it allows for an unsupervised analysis of the data. Reliable and interpretable results are possible only if considered with respect to suitable noise statistics. However, the noise in reconstructed dynamic PET images is very difficult to characterize, despite the Poissonian nature of the count-rates. Rather than explicitly modeling the noise distribution, this work proposes to study the relevance of several divergence measures to be used within a factor analysis framework. To this end, the $β$-divergence, widely used in other applicative domains, is considered to design the data-fitting term involved in three different factor models. The performances of the resulting algorithms are evaluated for different values of $β$, in a range covering Gaussian, Poissonian and Gamma-distributed noises. The results obtained on two different types of synthetic images and one real image show the interest of applying non-standard values of $β$ to improve factor analysis.

Submitted 26 March, 2019; v1 submitted 30 July, 2018; originally announced July 2018.

Comments: This manuscript has been accepted for publication in IEEE Trans. Medical Imaging
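
To make the role of β in the abstract above concrete, here is a minimal NumPy sketch of the β-divergence as it is commonly defined in the NMF literature. Under that standard definition, β = 2, 1 and 0 give the squared Euclidean, generalized Kullback-Leibler and Itakura-Saito divergences, which are the usual counterparts of Gaussian, Poisson and multiplicative Gamma noise. The function name `beta_divergence` and the sample vectors are assumptions for illustration; the factor models of the paper are not reproduced here.

```python
import numpy as np


def beta_divergence(x, y, beta):
    """Sum of element-wise beta-divergences d_beta(x | y) for positive x, y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if beta == 0:
        # Itakura-Saito divergence (limit beta -> 0).
        ratio = x / y
        return np.sum(ratio - np.log(ratio) - 1.0)
    if beta == 1:
        # Generalized Kullback-Leibler divergence (limit beta -> 1).
        return np.sum(x * np.log(x / y) - x + y)
    # General case; beta = 2 gives 0.5 * (x - y)^2.
    numerator = x**beta + (beta - 1.0) * y**beta - beta * x * y ** (beta - 1.0)
    return np.sum(numerator / (beta * (beta - 1.0)))


# Hypothetical positive data, e.g. a short time-activity curve and its fit.
observed = np.array([1.0, 2.0, 3.0])
fitted = np.array([2.0, 2.0, 1.0])

d_euclidean = beta_divergence(observed, fitted, 2)
d_kl = beta_divergence(observed, fitted, 1)
d_is = beta_divergence(observed, fitted, 0)
```

Intermediate, non-integer values of β interpolate between these special cases, which is what makes β a tunable parameter of the data-fitting term.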

arXiv:1807.08118 [pdf, other]

Coupled dictionary learning for unsupervised change detection between multi-sensor remote sensing images

Authors: Vinicius Ferraris, Nicolas Dobigeon, Yanna Cavalcanti, Thomas Oberlin, Marie Chabert

Abstract: Archetypal scenarios for change detection generally consider two images acquired through sensors of the same modality. However, in some specific cases such as emergency situations, the only images available may be those acquired through sensors of different modalities. This paper addresses the problem of unsupervisedly detecting changes between two observed images acquired by sensors of different modalities with possibly different resolutions. These sensor dissimilarities introduce additional issues in the context of operational change detection that are not addressed by most of the classical methods. This paper introduces a novel framework to effectively exploit the available information by modelling the two observed images as a sparse linear combination of atoms belonging to a pair of coupled overcomplete dictionaries learnt from each observed image. As they cover the same geographical location, codes are expected to be globally similar, except for possible changes in sparse spatial locations. Thus, the change detection task is envisioned through a dual code estimation which enforces spatial sparsity in the difference between the estimated codes associated with each image. This problem is formulated as an inverse problem which is iteratively solved using an efficient proximal alternating minimization algorithm accounting for nonsmooth and nonconvex functions. The proposed method is applied to real images with simulated yet realistic and real changes. A comparison with state-of-the-art change detection methods evidences the accuracy of the proposed strategy.

Submitted 2 September, 2019; v1 submitted 21 July, 2018; originally announced July 2018.

Comments: Submitted manuscript under consideration at Computer Vision and Image Understanding

arXiv:1802.10066 [pdf, other]

Reconstruction of partially sampled multi-band images - Application to STEM-EELS imaging

Authors: Étienne Monier, Thomas Oberlin, Nathalie Brun, Marcel Tencé, Marta de Frutos, Nicolas Dobigeon

Abstract: Electron microscopy has proven to be a very powerful tool to map the chemical nature of samples at various scales down to atomic resolution. However, many samples cannot be analyzed with an acceptable signal-to-noise ratio because of the radiation damage induced by the electron beam. This is particularly crucial for electron energy loss spectroscopy (EELS) which acquires spectral-spatial data and requires high beam intensity. Since scanning transmission electron microscopes (STEM) are able to acquire data cubes by scanning the electron probe over the sample and recording a spectrum for each spatial position, it is possible to design the scan pattern and to sample only specific pixels. As a consequence, partial acquisition schemes are now conceivable, provided a reconstruction of the full data cube is conducted as a post-processing step. This paper proposes two reconstruction algorithms for multi-band images acquired by STEM-EELS which exploit the spectral structure and the spatial smoothness of the image. The performance of the proposed schemes is illustrated through experiments conducted on a realistic phantom dataset as well as real EELS spectrum-images.

Submitted 27 February, 2018; originally announced February 2018.

arXiv:1801.01708 [pdf, other]

Negative Binomial Matrix Factorization for Recommender Systems

Authors: Olivier Gouvert, Thomas Oberlin, Cédric Févotte

Abstract: We introduce negative binomial matrix factorization (NBMF), a matrix factorization technique specially designed for analyzing over-dispersed count data. It can be viewed as an extension of Poisson matrix factorization (PF) perturbed by a multiplicative term which models exposure. This term brings a degree of freedom for controlling the dispersion, making NBMF more robust to outliers. We show that NBMF makes it possible to skip traditional pre-processing stages, such as binarization, which lead to loss of information. Two estimation approaches are presented: maximum likelihood and variational Bayes inference. We test our model with a recommendation task and show its ability to predict user tastes with better precision than PF.

Submitted 5 January, 2018; originally announced January 2018.

arXiv:1707.09867 [pdf, other]

Unmixing dynamic PET images with variable specific binding kinetics

Authors: Yanna Cruz Cavalcanti, Thomas Oberlin, Nicolas Dobigeon, Simon Stute, Maria Ribeiro, Clovis Tauber

Abstract: To analyze dynamic positron emission tomography (PET) images, various generic multivariate data analysis techniques have been considered in the literature, such as principal component analysis (PCA), independent component analysis (ICA), factor analysis and nonnegative matrix factorization (NMF). Nevertheless, these conventional approaches neglect any possible nonlinear variations in the time activity curves describing the kinetic behavior of tissues with specific binding, which limits their ability to recover a reliable, understandable and interpretable description of the data. This paper proposes an alternative analysis paradigm that accounts for spatial fluctuations in the exchange rate of the tracer between a free compartment and a specifically bound ligand compartment. The method relies on the concept of linear unmixing, usually applied in the hyperspectral domain, which combines NMF with a sum-to-one constraint that ensures an exhaustive description of the mixtures. The spatial variability of the signature corresponding to the specific binding tissue is explicitly modeled through a perturbed component. The performance of the method is assessed on both synthetic and real data and is shown to compete favorably when compared to other conventional analysis methods. The proposed method improved both factor estimation and proportions extraction for specific binding. Modeling the variability of the specific binding factor has a strong potential impact for dynamic PET image analysis.

Submitted 9 December, 2017; v1 submitted 19 July, 2017; originally announced July 2017.

arXiv:1609.06874 [pdf, other]

EEG reconstruction and skull conductivity estimation using a Bayesian model promoting structured sparsity

Authors: Facundo Costa, Hadj Batatia, Thomas Oberlin, Jean-Yves Tourneret

Abstract: M/EEG source localization is an open research issue. To solve it, it is important to have good knowledge of several physical parameters to build a reliable head operator. Amongst them, the value of the conductivity of the human skull has remained controversial. This report introduces a novel hierarchical Bayesian framework to estimate the skull conductivity jointly with the brain activity from the M/EEG measurements to improve the reconstruction quality. A partially collapsed Gibbs sampler is used to draw samples asymptotically distributed according to the associated posterior. The generated samples are then used to estimate the brain activity and the model hyperparameters jointly in a completely unsupervised framework. We use synthetic and real data to illustrate the improvement of the reconstruction. The performance of our method is also compared with two optimization algorithms introduced by Vallaghé et al. and Gutierrez et al. respectively, showing that our method is able to provide results of similar or better quality while remaining applicable in a wider array of situations.

Submitted 4 January, 2017; v1 submitted 22 September, 2016; originally announced September 2016.

Comments: Technical report

arXiv:1509.04576 [pdf, other]

Bayesian Structured Sparsity Priors for EEG Source Localization Technical Report

Authors: Facundo Costa, Hadj Batatia, Thomas Oberlin, Jean-Yves Tourneret

Abstract: This report introduces a new hierarchical Bayesian model for the EEG source localization problem. This model promotes structured sparsity to search for focal brain activity. This sparsity is obtained via a multivariate Bernoulli Laplacian prior assigned to the brain activity approximating an $\ell_{20}$ pseudo norm regularization in a Bayesian framework. A partially collapsed Gibbs sampler is used to draw samples asymptotically distributed according to the posterior associated with the proposed Bayesian model. The generated samples are used to estimate the brain activity and the model hyperparameters jointly in an unsupervised framework. Two different kinds of Metropolis-Hastings moves are introduced to accelerate the convergence of the Gibbs sampler. The first move is based on multiple dipole shifts within each MCMC chain whereas the second one exploits proposals associated with different MCMC chains. We use both synthetic and real data to compare the performance of the proposed method with the weighted $\ell_{21}$ mixed norm regularization and a method based on a multiple sparse prior, showing that our algorithm presents advantages in several scenarios.

Submitted 15 September, 2015; originally announced September 2015.

Comments: 38 pages, extended version of a paper that will be submitted for publication

arXiv:1212.0447 [pdf, ps, other]

Biological Database of Images and Genomes: tools for community annotations linking image and genomic information

Authors: Andrew T. Oberlin, Dominika A. Jurkovic, Mitchell F. Balish, Iddo Friedberg

Abstract: Genomic data and biomedical imaging data are undergoing exponential growth. However, our understanding of the phenotype-genotype connection linking the two types of data is lagging behind. While there are many types of software that enable the manipulation and analysis of image data and genomic data as separate entities, there is no framework established for linking the two. We present a generic set of software tools, BioDIG, that allows linking of image data to genomic data. BioDIG tools can be applied to a wide range of research problems that require linking images to genomes. BioDIG features the following: rapid construction of web-based workbenches, community-based annotation, user management, and web-services. By using BioDIG to create websites, researchers and curators can rapidly annotate large numbers of images with genomic information. Here we present the BioDIG software tools that include an image module, a genome module and a user management module. We also introduce a BioDIG-based website, MyDIG, which is being used to annotate images of Mycoplasma.

Submitted 3 December, 2012; originally announced December 2012.

arXiv:1211.5082 [pdf, ps, other]

The Monogenic Synchrosqueezed Wavelet Transform: A tool for the Decomposition/Demodulation of AM-FM images

Authors: Marianne Clausel, Thomas Oberlin, Valérie Perrier

Abstract: The synchrosqueezing method aims at decomposing 1D functions as superpositions of a small number of "Intrinsic Modes", supposed to be well separated both in time and frequency. Based on the unidimensional wavelet transform and its reconstruction properties, the synchrosqueezing transform provides a powerful representation of multicomponent signals in the time-frequency plane, together with a reconstruction of each mode. In this paper, a bidimensional version of the synchrosqueezing transform is defined, by considering a well-adapted extension of the concept of analytic signal to images: the monogenic signal. The natural bidimensional counterpart of the notion of Intrinsic Mode is then the concept of "Intrinsic Monogenic Mode" that we define. Thereafter, we investigate the properties of its associated Monogenic Wavelet Decomposition. This leads to a natural bivariate extension of the Synchrosqueezed Wavelet Transform, for decomposing and processing multicomponent images. Numerical tests validate the effectiveness of the method for different examples.

Submitted 20 November, 2012; originally announced November 2012.

MSC Class: 65T60; 92C55; 94A08

Showing 1–24 of 24 results for author: Oberlin, T