Skip to main content

Showing 1–9 of 9 results for author: Oldfield, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.12550  [pdf, other

    cs.CV cs.LG

    Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

    Authors: James Oldfield, Markos Georgopoulos, Grigorios G. Chrysos, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Jiankang Deng, Ioannis Patras

    Abstract: The Mixture of Experts (MoE) paradigm provides a powerful way to decompose dense layers into smaller, modular computations often more amenable to human interpretation, debugging, and editability. However, a major challenge lies in the computational cost of scaling the number of experts high enough to achieve fine-grained specialization. In this paper, we propose the Multilinear Mixture of Experts… ▽ More

    Submitted 31 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Github: https://github.com/james-oldfield/muMoE. Project page: https://james-oldfield.github.io/muMoE/

  2. arXiv:2305.14053  [pdf, other

    cs.CV cs.LG

    Parts of Speech-Grounded Subspaces in Vision-Language Models

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Latent image representations arising from vision-language models have proved immensely useful for a variety of downstream tasks. However, their utility is limited by their entanglement with respect to different visual attributes. For instance, recent work has shown that CLIP image representations are often biased toward specific visual properties (such as objects or actions) in an unpredictable ma… ▽ More

    Submitted 12 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  3. arXiv:2206.02104  [pdf, other

    cs.CV

    ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences

    Authors: Christos Tzelepis, James Oldfield, Georgios Tzimiropoulos, Ioannis Patras

    Abstract: This work addresses the problem of discovering non-linear interpretable paths in the latent space of pre-trained GANs in a model-agnostic manner. In the proposed method, the discovery is driven by a set of pairs of natural language sentences with contrasting semantics, named semantic dipoles, that serve as the limits of the interpretation that we require by the trainable latent paths to encode. By… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  4. arXiv:2206.00048  [pdf, other

    cs.CV cs.LG

    PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Recent advances in the understanding of Generative Adversarial Networks (GANs) have led to remarkable progress in visual editing and synthesis tasks, capitalizing on the rich semantics that are embedded in the latent spaces of pre-trained GANs. However, existing methods are often tailored to specific GAN architectures and are limited to either discovering global semantic directions that do not fac… ▽ More

    Submitted 6 February, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted at ICLR 2023. Code available at: https://github.com/james-oldfield/PandA

  5. arXiv:2112.12911  [pdf, other

    cs.CV

    Cluster-guided Image Synthesis with Unconditional Models

    Authors: Markos Georgopoulos, James Oldfield, Grigorios G Chrysos, Yannis Panagakis

    Abstract: Generative Adversarial Networks (GANs) are the driving force behind the state-of-the-art in image generation. Despite their ability to synthesize high-resolution photo-realistic images, generating content with on-demand conditioning of different granularity remains a challenge. This challenge is usually tackled by annotating massive datasets with the attributes of interest, a laborious task that i… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

  6. arXiv:2111.11736  [pdf, other

    cs.CV

    Tensor Component Analysis for Interpreting the Latent Space of GANs

    Authors: James Oldfield, Markos Georgopoulos, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: This paper addresses the problem of finding interpretable directions in the latent space of pre-trained Generative Adversarial Networks (GANs) to facilitate controllable image synthesis. Such interpretable directions correspond to transformations that can affect both the style and geometry of the synthetic images. However, existing approaches that utilise linear techniques to find these transforma… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: BMVC 2021

  7. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  8. arXiv:2006.03985  [pdf, other

    cs.CV

    Enhancing Facial Data Diversity with Style-based Face Aging

    Authors: Markos Georgopoulos, James Oldfield, Mihalis A. Nicolaou, Yannis Panagakis, Maja Pantic

    Abstract: A significant limiting factor in training fair classifiers relates to the presence of dataset bias. In particular, face datasets are typically biased in terms of attributes such as gender, age, and race. If not mitigated, bias leads to algorithms that exhibit unfair behaviour towards such groups. In this work, we address the problem of increasing the diversity of face datasets with respect to age.… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: IEEE CVPR 2020 WORKSHOP ON FAIR, DATA EFFICIENT AND TRUSTED COMPUTER VISION

  9. arXiv:1904.04772  [pdf, other

    cs.CV cs.LG

    Adversarial Learning of Disentangled and Generalizable Representations for Visual Attributes

    Authors: James Oldfield, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: Recently, a multitude of methods for image-to-image translation have demonstrated impressive results on problems such as multi-domain or multi-attribute transfer. The vast majority of such works leverages the strengths of adversarial learning and deep convolutional autoencoders to achieve realistic results by well-capturing the target data distribution. Nevertheless, the most prominent representat… ▽ More

    Submitted 30 January, 2021; v1 submitted 9 April, 2019; originally announced April 2019.