Skip to main content

Showing 1–15 of 15 results for author: Newson, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01331  [pdf, other

    cs.CV cs.AI cs.LG

    Restyling Unsupervised Concept Based Interpretable Networks with Generative Models

    Authors: Jayneel Parekh, Quentin Bouniot, Pavlo Mozharovskyi, Alasdair Newson, Florence d'Alché-Buc

    Abstract: Develo** inherently interpretable models for prediction has gained prominence in recent years. A subclass of these models, wherein the interpretable network relies on learning high-level concepts, are valued because of closeness of concept representations to human communication. However, the visualization and understanding of the learnt unsupervised dictionary of concepts encounters major limita… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Project page available at https://jayneelparekh.github.io/VisCoIN_project_page/

  2. arXiv:2406.08074  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    A Concept-Based Explainability Framework for Large Multimodal Models

    Authors: Jayneel Parekh, Pegah Khayatan, Mustafa Shukor, Alasdair Newson, Matthieu Cord

    Abstract: Large multimodal models (LMMs) combine unimodal encoders and large language models (LLMs) to perform multimodal tasks. Despite recent advancements towards the interpretability of these models, understanding internal representations of LMMs remains largely a mystery. In this paper, we present a novel framework for the interpretation of LMMs. We propose a dictionary learning based approach, applied… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.04206  [pdf, other

    cs.CV

    Diffusion-based image inpainting with internal learning

    Authors: Nicolas Cherel, Andrés Almansa, Yann Gousseau, Alasdair Newson

    Abstract: Diffusion models are now the undisputed state-of-the-art for image generation and image restoration. However, they require large amounts of computational power for training and inference. In this paper, we propose lightweight diffusion models for image inpainting that can be trained on a single image, or a few images. We show that our approach competes with large state-of-the-art models in specifi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures. EUSIPCO 2024

  4. arXiv:2312.08256  [pdf, other

    cs.CV

    A Compact and Semantic Latent Space for Disentangled and Controllable Image Editing

    Authors: Gwilherm Lesné, Yann Gousseau, Saïd Ladjal, Alasdair Newson

    Abstract: Recent advances in the field of generative models and in particular generative adversarial networks (GANs) have lead to substantial progress for controlled image editing, especially compared with the pre-deep learning era. Despite their powerful ability to apply realistic modifications to an image, these methods often lack properties like disentanglement (the capacity to edit attributes independen… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  5. arXiv:2311.01090  [pdf, other

    cs.CV

    Infusion: Internal Diffusion for Video Inpainting

    Authors: Nicolas Cherel, Andrés Almansa, Yann Gousseau, Alasdair Newson

    Abstract: Video inpainting is the task of filling a desired region in a video in a visually convincing manner. It is a very challenging task due to the high dimensionality of the signal and the temporal consistency required for obtaining convincing results. Recently, diffusion models have shown impressive results in modeling complex data distributions, including images and videos. Diffusion models remain no… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 12 pages, 8 figures

  6. Patch-Based Stochastic Attention for Image Editing

    Authors: Nicolas Cherel, Andrés Almansa, Yann Gousseau, Alasdair Newson

    Abstract: Attention mechanisms have become of crucial importance in deep learning in recent years. These non-local operations, which are similar to traditional patch-based methods in image processing, complement local convolutions. However, computing the full attention matrix is an expensive step with heavy memory and computational loads. These limitations curb network architectures and performances, in par… ▽ More

    Submitted 1 November, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: 17 pages, 12 figures. Accepted version for publication in Computer Vision and Image Understanding (CVIU)

    Journal ref: Computer Vision and Image Understanding, Volume 238, 2024, 103866,

  7. arXiv:2202.02183  [pdf, other

    cs.CV

    Feature-Style Encoder for Style-Based GAN Inversion

    Authors: Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier

    Abstract: We propose a novel architecture for GAN inversion, which we call Feature-Style encoder. The style encoder is key for the manipulation of the obtained latent codes, while the feature encoder is crucial for optimal image reconstruction. Our model achieves accurate inversion of real images from the latent space of a pre-trained style-based GAN model, obtaining better perceptual quality and lower reco… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  8. arXiv:2106.11895  [pdf, other

    cs.CV

    A Latent Transformer for Disentangled Face Editing in Images and Videos

    Authors: Xu Yao, Alasdair Newson, Yann Gousseau, Pierre Hellier

    Abstract: High quality facial image editing is a challenging problem in the movie post-production industry, requiring a high degree of control and identity preservation. Previous works that attempt to tackle this problem may suffer from the entanglement of facial attributes and the loss of the person's identity. Furthermore, many algorithms are limited to a certain task. To tackle these limitations, we prop… ▽ More

    Submitted 17 August, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Accepted by ICCV 2021. Source codes are available at https://github.com/InterDigitalInc/latent-transformer

  9. arXiv:2103.16214  [pdf, other

    cs.CV

    Multi-View Radar Semantic Segmentation

    Authors: Arthur Ouaknine, Alasdair Newson, Patrick Pérez, Florence Tupin, Julien Rebut

    Abstract: Understanding the scene around the ego-vehicle is key to assisted and autonomous driving. Nowadays, this is mostly conducted using cameras and laser scanners, despite their reduced performances in adverse weather conditions. Automotive radars are low-cost active sensors that measure properties of surrounding objects, including their relative speed, and have the key advantage of not being impacted… ▽ More

    Submitted 24 August, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: 16 pages, 9 figures. Accepted at ICCV 2021

  10. arXiv:2006.07827  [pdf, other

    cs.CV

    PCAAE: Principal Component Analysis Autoencoder for organising the latent space of generative networks

    Authors: Chi-Hieu Pham, Saïd Ladjal, Alasdair Newson

    Abstract: Autoencoders and generative models produce some of the most spectacular deep learning results to date. However, understanding and controlling the latent space of these models presents a considerable challenge. Drawing inspiration from principal component analysis and autoencoder, we propose the Principal Component Analysis Autoencoder (PCAAE). This is a novel autoencoder whose latent space verifie… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

    Comments: Preprint with Appendix

  11. arXiv:2005.04410  [pdf, other

    cs.CV

    High Resolution Face Age Editing

    Authors: Xu Yao, Gilles Puy, Alasdair Newson, Yann Gousseau, Pierre Hellier

    Abstract: Face age editing has become a crucial task in film post-production, and is also becoming popular for general purpose photography. Recently, adversarial training has produced some of the most visually impressive results for image manipulation, including the face aging/de-aging task. In spite of considerable progress, current methods often present visual artifacts and can only deal with low-resoluti… ▽ More

    Submitted 9 May, 2020; originally announced May 2020.

  12. arXiv:2005.01456  [pdf, other

    cs.CV

    CARRADA Dataset: Camera and Automotive Radar with Range-Angle-Doppler Annotations

    Authors: A. Ouaknine, A. Newson, J. Rebut, F. Tupin, P. Pérez

    Abstract: High quality perception is essential for autonomous driving (AD) systems. To reach the accuracy and robustness that are required by such systems, several types of sensors must be combined. Currently, mostly cameras and laser scanners (lidar) are deployed to build a representation of the world around the vehicle. While radar sensors have been used for a long time in the automotive industry, they ar… ▽ More

    Submitted 26 May, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 9 pages, 5 figues. Accepted at ICPR 2020. Erratum: results in Table III have been updated since the ICPR proceedings, models are selected using the PP metric instead of the previously used PR metric

    ACM Class: I.2.10; I.4.8

  13. arXiv:1904.07099  [pdf, other

    cs.CV cs.AI

    Processsing Simple Geometric Attributes with Autoencoders

    Authors: Alasdair Newson, Andrés Almansa, Yann Gousseau, Saïd Ladjal

    Abstract: Image synthesis is a core problem in modern deep learning, and many recent architectures such as autoencoders and Generative Adversarial networks produce spectacular results on highly complex data, such as images of faces or landscapes. While these results open up a wide range of new, advanced synthesis applications, there is also a severe lack of theoretical understanding of how these networks wo… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

  14. arXiv:1904.01277  [pdf, other

    cs.CV cs.LG

    A PCA-like Autoencoder

    Authors: Saïd Ladjal, Alasdair Newson, Chi-Hieu Pham

    Abstract: An autoencoder is a neural network which data projects to and from a lower dimensional latent space, where this data is easier to understand and model. The autoencoder consists of two sub-networks, the encoder and the decoder, which carry out these transformations. The neural network is trained such that the output is as close to the input as possible, the data having gone through an information b… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

  15. arXiv:1503.05528  [pdf, ps, other

    cs.CV cs.MM eess.IV math.NA

    Video Inpainting of Complex Scenes

    Authors: Alasdair Newson, Andrés Almansa, Matthieu Fradet, Yann Gousseau, Patrick Pérez

    Abstract: We propose an automatic video inpainting algorithm which relies on the optimisation of a global, patch-based functional. Our algorithm is able to deal with a variety of challenging situations which naturally arise in video inpainting, such as the correct reconstruction of dynamic textures, multiple moving objects and moving background. Furthermore, we achieve this in an order of magnitude less exe… ▽ More

    Submitted 8 June, 2015; v1 submitted 18 March, 2015; originally announced March 2015.

    Journal ref: SIAM Journal on Imaging Sciences, Society for Industrial and Applied Mathematics, 2014, 7 (4), pp.1993-2019