Skip to main content

Showing 1–15 of 15 results for author: Nicolaou, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.12550  [pdf, other

    cs.CV cs.LG

    Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

    Authors: James Oldfield, Markos Georgopoulos, Grigorios G. Chrysos, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Jiankang Deng, Ioannis Patras

    Abstract: The Mixture of Experts (MoE) paradigm provides a powerful way to decompose dense layers into smaller, modular computations often more amenable to human interpretation, debugging, and editability. However, a major challenge lies in the computational cost of scaling the number of experts high enough to achieve fine-grained specialization. In this paper, we propose the Multilinear Mixture of Experts… ▽ More

    Submitted 31 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Github: https://github.com/james-oldfield/muMoE. Project page: https://james-oldfield.github.io/muMoE/

  2. arXiv:2309.14883  [pdf, other

    cs.CV cs.LG

    Locality-preserving Directions for Interpreting the Latent Space of Satellite Image GANs

    Authors: Georgia Kourmouli, Nikos Kostagiolas, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: We present a locality-aware method for interpreting the latent space of wavelet-based Generative Adversarial Networks (GANs), that can well capture the large spatial and spectral variability that is characteristic to satellite imagery. By focusing on preserving locality, the proposed method is able to decompose the weight-space of pre-trained GANs and recover interpretable directions that correspo… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  3. arXiv:2305.14053  [pdf, other

    cs.CV cs.LG

    Parts of Speech-Grounded Subspaces in Vision-Language Models

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Latent image representations arising from vision-language models have proved immensely useful for a variety of downstream tasks. However, their utility is limited by their entanglement with respect to different visual attributes. For instance, recent work has shown that CLIP image representations are often biased toward specific visual properties (such as objects or actions) in an unpredictable ma… ▽ More

    Submitted 12 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  4. arXiv:2208.02089  [pdf, other

    cs.CV cs.LG eess.IV

    Unsupervised Discovery of Semantic Concepts in Satellite Imagery with Style-based Wavelet-driven Generative Models

    Authors: Nikos Kostagiolas, Mihalis A. Nicolaou, Yannis Panagakis

    Abstract: In recent years, considerable advancements have been made in the area of Generative Adversarial Networks (GANs), particularly with the advent of style-based architectures that address many key shortcomings - both in terms of modeling capabilities and network interpretability. Despite these improvements, the adoption of such approaches in the domain of satellite imagery is not straightforward. Typi… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 11 pages, 5 figures, accepted at SETN 2022

  5. arXiv:2206.00048  [pdf, other

    cs.CV cs.LG

    PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Recent advances in the understanding of Generative Adversarial Networks (GANs) have led to remarkable progress in visual editing and synthesis tasks, capitalizing on the rich semantics that are embedded in the latent spaces of pre-trained GANs. However, existing methods are often tailored to specific GAN architectures and are limited to either discovering global semantic directions that do not fac… ▽ More

    Submitted 6 February, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted at ICLR 2023. Code available at: https://github.com/james-oldfield/PandA

  6. arXiv:2111.11736  [pdf, other

    cs.CV

    Tensor Component Analysis for Interpreting the Latent Space of GANs

    Authors: James Oldfield, Markos Georgopoulos, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: This paper addresses the problem of finding interpretable directions in the latent space of pre-trained Generative Adversarial Networks (GANs) to facilitate controllable image synthesis. Such interpretable directions correspond to transformations that can affect both the style and geometry of the synthetic images. However, existing approaches that utilise linear techniques to find these transforma… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: BMVC 2021

  7. arXiv:2108.04240  [pdf, other

    q-bio.QM cs.LG

    Classification of Influenza Hemagglutinin Protein Sequences using Convolutional Neural Networks

    Authors: Charalambos Chrysostomou, Floris Alexandrou, Mihalis A. Nicolaou, Huseyin Seker

    Abstract: The Influenza virus can be considered as one of the most severe viruses that can infect multiple species with often fatal consequences to the hosts. The Hemagglutinin (HA) gene of the virus can be a target for antiviral drug development realised through accurate identification of its sub-types and possible the targeted hosts. This paper focuses on accurately predicting if an Influenza type A virus… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  8. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  9. arXiv:2006.03985  [pdf, other

    cs.CV

    Enhancing Facial Data Diversity with Style-based Face Aging

    Authors: Markos Georgopoulos, James Oldfield, Mihalis A. Nicolaou, Yannis Panagakis, Maja Pantic

    Abstract: A significant limiting factor in training fair classifiers relates to the presence of dataset bias. In particular, face datasets are typically biased in terms of attributes such as gender, age, and race. If not mitigated, bias leads to algorithms that exhibit unfair behaviour towards such groups. In this work, we address the problem of increasing the diversity of face datasets with respect to age.… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: IEEE CVPR 2020 WORKSHOP ON FAIR, DATA EFFICIENT AND TRUSTED COMPUTER VISION

  10. arXiv:1905.13694  [pdf, other

    cs.CV

    Multimodal Joint Emotion and Game Context Recognition in League of Legends Livestreams

    Authors: Charles Ringer, James Alfred Walker, Mihalis A. Nicolaou

    Abstract: Video game streaming provides the viewer with a rich set of audio-visual data, conveying information both with regards to the game itself, through game footage and audio, as well as the streamer's emotional state and behaviour via webcam footage and audio. Analysing player behaviour and discovering correlations with game context is crucial for modelling and understanding important aspects of lives… ▽ More

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: 8 Pages, IEEE Conference on Games 2019

  11. arXiv:1904.04772  [pdf, other

    cs.CV cs.LG

    Adversarial Learning of Disentangled and Generalizable Representations for Visual Attributes

    Authors: James Oldfield, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: Recently, a multitude of methods for image-to-image translation have demonstrated impressive results on problems such as multi-domain or multi-attribute transfer. The vast majority of such works leverages the strengths of adversarial learning and deep convolutional autoencoders to achieve realistic results by well-capturing the target data distribution. Nevertheless, the most prominent representat… ▽ More

    Submitted 30 January, 2021; v1 submitted 9 April, 2019; originally announced April 2019.

  12. arXiv:1807.09715  [pdf, other

    cs.CV

    Deep Unsupervised Multi-View Detection of Video Game Stream Highlights

    Authors: Charles Ringer, Mihalis A. Nicolaou

    Abstract: We consider the problem of automatic highlight-detection in video game streams. Currently, the vast majority of highlight-detection systems for games are triggered by the occurrence of hard-coded game events (e.g., score change, end-game), while most advanced tools and techniques are based on detection of highlights via visual analysis of game footage. We argue that in the context of game streamin… ▽ More

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: Foundation of Digital Games 2018, 6 pages

  13. arXiv:1804.10938  [pdf, other

    cs.CV cs.AI cs.HC eess.IV stat.ML

    Deep Affect Prediction in-the-wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond

    Authors: Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A. Nicolaou, Athanasios Papaioannou, Guoying Zhao, Björn Schuller, Irene Kotsia, Stefanos Zafeiriou

    Abstract: Automatic understanding of human affect using visual signals is of great importance in everyday human-machine interactions. Appraising human emotional states, behaviors and reactions displayed in real-world settings, can be accomplished using latent continuous dimensions (e.g., the circumplex model of affect). Valence (i.e., how positive or negative is an emotion) & arousal (i.e., power of the act… ▽ More

    Submitted 1 February, 2019; v1 submitted 29 April, 2018; originally announced April 2018.

  14. End-to-End Multimodal Emotion Recognition using Deep Neural Networks

    Authors: Panagiotis Tzirakis, George Trigeorgis, Mihalis A. Nicolaou, Björn Schuller, Stefanos Zafeiriou

    Abstract: Automatic affect recognition is a challenging task due to the various modalities emotions can be expressed with. Applications can be found in many domains including multimedia retrieval and human computer interaction. In recent years, deep neural networks have been used with great success in determining emotional states. Inspired by this success, we propose an emotion recognition system using audi… ▽ More

    Submitted 27 April, 2017; originally announced April 2017.

  15. arXiv:1303.3240  [pdf, other

    cs.LG cs.CV stat.ML

    A Unified Framework for Probabilistic Component Analysis

    Authors: Mihalis A. Nicolaou, Stefanos Zafeiriou, Maja Pantic

    Abstract: We present a unifying framework which reduces the construction of probabilistic component analysis techniques to a mere selection of the latent neighbourhood, thus providing an elegant and principled framework for creating novel component analysis models as well as constructing probabilistic equivalents of deterministic component analysis methods. Under our framework, we unify many very popular an… ▽ More

    Submitted 14 November, 2014; v1 submitted 13 March, 2013; originally announced March 2013.