Skip to main content

Showing 1–49 of 49 results for author: Panagakis, Y

.
  1. arXiv:2406.10685  [pdf, other

    cs.LG

    Scale Equivariant Graph Metanetworks

    Authors: Ioannis Kalogeropoulos, Giorgos Bouritsas, Yannis Panagakis

    Abstract: This paper pertains to an emerging machine learning paradigm: learning higher-order functions, i.e. functions whose inputs are functions themselves, $\textit{particularly when these inputs are Neural Networks (NNs)}$. With the growing interest in architectures that process NNs, a recurring design principle has permeated the field: adhering to the permutation symmetries arising from the connectioni… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 31 pages

  2. arXiv:2405.18045  [pdf, other

    cs.LG cs.CV

    Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses

    Authors: Panagiotis Koromilas, Giorgos Bouritsas, Theodoros Giannakopoulos, Mihalis Nicolaou, Yannis Panagakis

    Abstract: What do different contrastive learning (CL) losses actually optimize for? Although multiple CL methods have demonstrated remarkable representation learning capabilities, the differences in their inner workings remain largely opaque. In this work, we analyse several CL families and prove that, under certain conditions, they admit the same minimisers when optimizing either their batch-level objectiv… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024. Code available at: https://github.com/pakoromilas/DHEL-KCL.git

  3. arXiv:2403.20287  [pdf, other

    cs.CV cs.LG

    Benchmarking Counterfactual Image Generation

    Authors: Thomas Melistas, Nikos Spyrou, Nefeli Gkouti, Pedro Sanchez, Athanasios Vlontzos, Yannis Panagakis, Giorgos Papanastasiou, Sotirios A. Tsaftaris

    Abstract: Generative AI has revolutionised visual content editing, empowering users to effortlessly modify images and videos. However, not all edits are equal. To perform realistic edits in domains such as natural image or medical imaging, modifications must respect causal relationships inherent to the data generation process. Such image editing falls into the counterfactual image generation regime. Evaluat… ▽ More

    Submitted 10 June, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  4. arXiv:2402.12550  [pdf, other

    cs.CV cs.LG

    Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

    Authors: James Oldfield, Markos Georgopoulos, Grigorios G. Chrysos, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Jiankang Deng, Ioannis Patras

    Abstract: The Mixture of Experts (MoE) paradigm provides a powerful way to decompose dense layers into smaller, modular computations often more amenable to human interpretation, debugging, and editability. However, a major challenge lies in the computational cost of scaling the number of experts high enough to achieve fine-grained specialization. In this paper, we propose the Multilinear Mixture of Experts… ▽ More

    Submitted 31 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Github: https://github.com/james-oldfield/muMoE. Project page: https://james-oldfield.github.io/muMoE/

  5. arXiv:2311.17968  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    Latent Alignment with Deep Set EEG Decoders

    Authors: Stylianos Bakas, Siegfried Ludwig, Dimitrios A. Adamos, Nikolaos Laskaris, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: The variability in EEG signals between different individuals poses a significant challenge when implementing brain-computer interfaces (BCI). Commonly proposed solutions to this problem include deep learning models, due to their increased capacity and generalization, as well as explicit domain adaptation techniques. Here, we introduce the Latent Alignment method that won the Benchmarks for EEG Tra… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    ACM Class: I.2.6

  6. arXiv:2309.14883  [pdf, other

    cs.CV cs.LG

    Locality-preserving Directions for Interpreting the Latent Space of Satellite Image GANs

    Authors: Georgia Kourmouli, Nikos Kostagiolas, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: We present a locality-aware method for interpreting the latent space of wavelet-based Generative Adversarial Networks (GANs), that can well capture the large spatial and spectral variability that is characteristic to satellite imagery. By focusing on preserving locality, the proposed method is able to decompose the weight-space of pre-trained GANs and recover interpretable directions that correspo… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  7. arXiv:2309.11140  [pdf, other

    cs.SD cs.LG eess.AS

    Investigating Personalization Methods in Text to Music Generation

    Authors: Manos Plitsis, Theodoros Kouzelis, Georgios Paraskevopoulos, Vassilis Katsouros, Yannis Panagakis

    Abstract: In this work, we investigate the personalization of text-to-music diffusion models in a few-shot setting. Motivated by recent advances in the computer vision domain, we are the first to explore the combination of pre-trained text-to-audio diffusers with two established personalization methods. We experiment with the effect of audio-specific data augmentation on the overall system performance and a… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Submitted to ICASSP 2024, Examples at https://zelaki.github.io/

  8. arXiv:2307.16584  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Audio-visual video-to-speech synthesis with synthesized input audio

    Authors: Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

    Abstract: Video-to-speech synthesis involves reconstructing the speech signal of a speaker from a silent video. The implicit assumption of this task is that the sound signal is either missing or contains a high amount of noise/corruption such that it is not useful for processing. Previous works in the literature either use video inputs only or employ both video and audio inputs during training, and discard… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2306.15464  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Large-scale unsupervised audio pre-training for video-to-speech synthesis

    Authors: Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

    Abstract: Video-to-speech synthesis is the task of reconstructing the speech signal from a silent video of a speaker. Most established approaches to date involve a two-step process, whereby an intermediate representation from the video, such as a spectrogram, is extracted first and then passed to a vocoder to produce the raw audio. Some recent work has focused on end-to-end synthesis, whereby the generation… ▽ More

    Submitted 31 July, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Corrected typos. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  10. arXiv:2305.14053  [pdf, other

    cs.CV cs.LG

    Parts of Speech-Grounded Subspaces in Vision-Language Models

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Latent image representations arising from vision-language models have proved immensely useful for a variety of downstream tasks. However, their utility is limited by their entanglement with respect to different visual attributes. For instance, recent work has shown that CLIP image representations are often biased toward specific visual properties (such as objects or actions) in an unpredictable ma… ▽ More

    Submitted 12 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  11. arXiv:2303.05582  [pdf, other

    cs.LG cs.IR cs.IT eess.SP

    Generalization analysis of an unfolding network for analysis-based Compressed Sensing

    Authors: Vicky Kouni, Yannis Panagakis

    Abstract: Unfolding networks have shown promising results in the Compressed Sensing (CS) field. Yet, the investigation of their generalization ability is still in its infancy. In this paper, we perform generalization analysis of a state-of-the-art ADMM-based unfolding network, which jointly learns a decoder for CS and a sparsifying redundant analysis operator. To this end, we first impose a structural const… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  12. arXiv:2208.02089  [pdf, other

    cs.CV cs.LG eess.IV

    Unsupervised Discovery of Semantic Concepts in Satellite Imagery with Style-based Wavelet-driven Generative Models

    Authors: Nikos Kostagiolas, Mihalis A. Nicolaou, Yannis Panagakis

    Abstract: In recent years, considerable advancements have been made in the area of Generative Adversarial Networks (GANs), particularly with the advent of style-based architectures that address many key shortcomings - both in terms of modeling capabilities and network interpretability. Despite these improvements, the adoption of such approaches in the domain of satellite imagery is not straightforward. Typi… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 11 pages, 5 figures, accepted at SETN 2022

  13. Fast Multilevel Algorithms for Compressive Principle Component Pursuit

    Authors: Vahan Hovhannisyan, Yannis Panagakis, Panos Parpas, Stefanos Zafeiriou

    Abstract: Recovering a low-rank matrix from highly corrupted measurements arises in compressed sensing of structured high-dimensional signals (e.g., videos and hyperspectral images among others). Robust principal component analysis (RPCA), solved via principal component pursuit (PCP), recovers a low-rank matrix from sparse corruptions that are of unknown value and support by decomposing the observation matr… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Journal ref: SIAM Journal on Imaging Sciences 12.1 (2019): 624-649

  14. arXiv:2206.00048  [pdf, other

    cs.CV cs.LG

    PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Recent advances in the understanding of Generative Adversarial Networks (GANs) have led to remarkable progress in visual editing and synthesis tasks, capitalizing on the rich semantics that are embedded in the latent spaces of pre-trained GANs. However, existing methods are often tailored to specific GAN architectures and are limited to either discovering global semantic directions that do not fac… ▽ More

    Submitted 6 February, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted at ICLR 2023. Code available at: https://github.com/james-oldfield/PandA

  15. arXiv:2205.07050  [pdf, other

    cs.IT cs.IR cs.LG

    DECONET: an Unfolding Network for Analysis-based Compressed Sensing with Generalization Error Bounds

    Authors: Vicky Kouni, Yannis Panagakis

    Abstract: We present a new deep unfolding network for analysis-sparsity-based Compressed Sensing. The proposed network coined Decoding Network (DECONET) jointly learns a decoder that reconstructs vectors from their incomplete, noisy measurements and a redundant sparsifying analysis operator, which is shared across the layers of DECONET. Moreover, we formulate the hypothesis class of DECONET and estimate its… ▽ More

    Submitted 26 April, 2023; v1 submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted in IEEE Transactions on Signal Processing

  16. arXiv:2202.12950  [pdf, other

    eess.SP cs.AI cs.LG

    2021 BEETL Competition: Advancing Transfer Learning for Subject Independence & Heterogenous EEG Data Sets

    Authors: Xiaoxi Wei, A. Aldo Faisal, Moritz Grosse-Wentrup, Alexandre Gramfort, Sylvain Chevallier, Vinay Jayaram, Camille Jeunet, Stylianos Bakas, Siegfried Ludwig, Konstantinos Barmpas, Mehdi Bahri, Yannis Panagakis, Nikolaos Laskaris, Dimitrios A. Adamos, Stefanos Zafeiriou, William C. Duong, Stephen M. Gordon, Vernon J. Lawhern, Maciej Śliwowski, Vincent Rouanne, Piotr Tempczyk

    Abstract: Transfer learning and meta-learning offer some of the most promising avenues to unlock the scalability of healthcare and consumer technologies driven by biosignal data. This is because current methods cannot generalise well across human subjects' data and handle learning from different heterogeneously collected data sets, thus limiting the scale of training data. On the other side, developments in… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: PrePrint of the NeurIPS2021 BEETL Competition Submitted to Proceedings of Machine Learning Research (PMLR)

  17. arXiv:2202.03267  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    Team Cogitat at NeurIPS 2021: Benchmarks for EEG Transfer Learning Competition

    Authors: Stylianos Bakas, Siegfried Ludwig, Konstantinos Barmpas, Mehdi Bahri, Yannis Panagakis, Nikolaos Laskaris, Dimitrios A. Adamos, Stefanos Zafeiriou

    Abstract: Building subject-independent deep learning models for EEG decoding faces the challenge of strong covariate-shift across different datasets, subjects and recording sessions. Our approach to address this difficulty is to explicitly align feature distributions at various layers of the deep learning model, using both simple statistical techniques as well as trainable methods with more representational… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    ACM Class: I.2.6

  18. arXiv:2112.12911  [pdf, other

    cs.CV

    Cluster-guided Image Synthesis with Unconditional Models

    Authors: Markos Georgopoulos, James Oldfield, Grigorios G Chrysos, Yannis Panagakis

    Abstract: Generative Adversarial Networks (GANs) are the driving force behind the state-of-the-art in image generation. Despite their ability to synthesize high-resolution photo-realistic images, generating content with on-demand conditioning of different granularity remains a challenge. This challenge is usually tackled by annotating massive datasets with the attributes of interest, a laborious task that i… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

  19. arXiv:2111.11736  [pdf, other

    cs.CV

    Tensor Component Analysis for Interpreting the Latent Space of GANs

    Authors: James Oldfield, Markos Georgopoulos, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: This paper addresses the problem of finding interpretable directions in the latent space of pre-trained Generative Adversarial Networks (GANs) to facilitate controllable image synthesis. Such interpretable directions correspond to transformations that can affect both the style and geometry of the synthetic images. However, existing approaches that utilise linear techniques to find these transforma… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: BMVC 2021

  20. arXiv:2110.13859  [pdf, other

    cs.LG cs.AI cs.CV

    Defensive Tensorization

    Authors: Adrian Bulat, Jean Kossaifi, Sourav Bhattacharya, Yannis Panagakis, Timothy Hospedales, Georgios Tzimiropoulos, Nicholas D Lane, Maja Pantic

    Abstract: We propose defensive tensorization, an adversarial defence technique that leverages a latent high-order factorization of the network. The layers of a network are first expressed as factorized tensor layers. Tensor dropout is then applied in the latent subspace, therefore resulting in dense reconstructed weights, without the sparsity or perturbations typically induced by the randomization.Our appro… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: To be presented at BMVC 2021

  21. arXiv:2110.10009  [pdf, other

    cs.LG cs.HC

    EEGminer: Discovering Interpretable Features of Brain Activity with Learnable Filters

    Authors: Siegfried Ludwig, Stylianos Bakas, Dimitrios A. Adamos, Nikolaos Laskaris, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Patterns of brain activity are associated with different brain processes and can be used to identify different brain states and make behavioral predictions. However, the relevant features are not readily apparent and accessible. To mine informative latent representations from multichannel recordings of ongoing EEG activity, we propose a novel differentiable decoding pipeline consisting of learnabl… ▽ More

    Submitted 2 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: 14 pages, 8 figures

    ACM Class: I.2.6

  22. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  23. arXiv:2104.07916  [pdf, other

    cs.CV

    Augmenting Deep Classifiers with Polynomial Neural Networks

    Authors: Grigorios G Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar

    Abstract: Deep neural networks have been the driving force behind the success in classification tasks, e.g., object and audio recognition. Impressive results and generalization have been achieved by a variety of recently proposed architectures, the majority of which are seemingly disconnected. In this work, we cast the study of deep classifiers under a unifying framework. In particular, we express state-of-… ▽ More

    Submitted 11 August, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at ECCV'22

  24. arXiv:2104.05077  [pdf, other

    cs.LG cs.CV

    CoPE: Conditional image generation using Polynomial Expansions

    Authors: Grigorios G Chrysos, Markos Georgopoulos, Yannis Panagakis

    Abstract: Generative modeling has evolved to a notable field of machine learning. Deep polynomial neural networks (PNNs) have demonstrated impressive results in unsupervised image generation, where the task is to map an input vector (i.e., noise) to a synthesized image. However, the success of PNNs has not been replicated in conditional generation tasks, such as super-resolution. Existing PNNs focus on sing… ▽ More

    Submitted 27 October, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted in NeurIPS 2021

  25. arXiv:2009.04075  [pdf, other

    cs.LG stat.ML

    Multilinear Latent Conditioning for Generating Unseen Attribute Combinations

    Authors: Markos Georgopoulos, Grigorios Chrysos, Maja Pantic, Yannis Panagakis

    Abstract: Deep generative models rely on their inductive bias to facilitate generalization, especially for problems with high dimensional data, like images. However, empirical studies have shown that variational autoencoders (VAE) and generative adversarial networks (GAN) lack the generalization ability that occurs naturally in human perception. For example, humans can visualize a woman smiling after only s… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: published at International Conference on Machine Learning 2020

  26. arXiv:2006.13026  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Polynomial Neural Networks

    Authors: Grigorios Chrysos, Stylianos Moschoglou, Giorgos Bouritsas, Jiankang Deng, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Deep Convolutional Neural Networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $Π$-Nets,… ▽ More

    Submitted 27 February, 2021; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Code: https://github.com/grigorisg9gr/polynomial_nets. arXiv admin note: substantial text overlap with arXiv:2003.03828

  27. arXiv:2006.03985  [pdf, other

    cs.CV

    Enhancing Facial Data Diversity with Style-based Face Aging

    Authors: Markos Georgopoulos, James Oldfield, Mihalis A. Nicolaou, Yannis Panagakis, Maja Pantic

    Abstract: A significant limiting factor in training fair classifiers relates to the presence of dataset bias. In particular, face datasets are typically biased in terms of attributes such as gender, age, and race. If not mitigated, bias leads to algorithms that exhibit unfair behaviour towards such groups. In this work, we address the problem of increasing the diversity of face datasets with respect to age.… ▽ More

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: IEEE CVPR 2020 WORKSHOP ON FAIR, DATA EFFICIENT AND TRUSTED COMPUTER VISION

  28. arXiv:2005.07302  [pdf, other

    cs.CV

    Investigating Bias in Deep Face Analysis: The KANFace Dataset and Empirical Study

    Authors: Markos Georgopoulos, Yannis Panagakis, Maja Pantic

    Abstract: Deep learning-based methods have pushed the limits of the state-of-the-art in face analysis. However, despite their success, these models have raised concerns regarding their bias towards certain demographics. This bias is inflicted both by limited diversity across demographics in the training set, as well as the design of the algorithms. In this work, we investigate the demographic bias of deep l… ▽ More

    Submitted 8 September, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

  29. arXiv:2003.03828  [pdf, other

    cs.LG cs.CV stat.ML

    $Π-$nets: Deep Polynomial Neural Networks

    Authors: Grigorios G. Chrysos, Stylianos Moschoglou, Giorgos Bouritsas, Yannis Panagakis, Jiankang Deng, Stefanos Zafeiriou

    Abstract: Deep Convolutional Neural Networks (DCNNs) is currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $Π$-Nets, a… ▽ More

    Submitted 26 March, 2020; v1 submitted 8 March, 2020; originally announced March 2020.

    Comments: Accepted in CVPR 2020

  30. arXiv:1912.05833  [pdf, other

    cs.LG eess.AS stat.ML

    Speech-driven facial animation using polynomial fusion of features

    Authors: Triantafyllos Kefalas, Konstantinos Vougioukas, Yannis Panagakis, Stavros Petridis, Jean Kossaifi, Maja Pantic

    Abstract: Speech-driven facial animation involves using a speech signal to generate realistic videos of talking faces. Recent deep learning approaches to facial synthesis rely on extracting low-dimensional representations and concatenating them, followed by a decoding step of the concatenated vector. This accounts for only first-order interactions of the features and ignores higher-order interactions. In th… ▽ More

    Submitted 19 February, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  31. arXiv:1908.06571  [pdf, other

    cs.LG stat.ML

    PolyGAN: High-Order Polynomial Generators

    Authors: Grigorios Chrysos, Stylianos Moschoglou, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Generative Adversarial Networks (GANs) have become the gold standard when it comes to learning generative models for high-dimensional distributions. Since their advent, numerous variations of GANs have been introduced in the literature, primarily focusing on utilization of novel loss functions, optimization/regularization strategies and network architectures. In this paper, we turn our attention t… ▽ More

    Submitted 13 October, 2019; v1 submitted 18 August, 2019; originally announced August 2019.

  32. arXiv:1906.06196  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Factorized Higher-Order CNNs with an Application to Spatio-Temporal Emotion Estimation

    Authors: Jean Kossaifi, Antoine Toisoul, Adrian Bulat, Yannis Panagakis, Timothy Hospedales, Maja Pantic

    Abstract: Training deep neural networks with spatio-temporal (i.e., 3D) or multidimensional convolutions of higher-order is computationally challenging due to millions of unknown parameters across dozens of layers. To alleviate this, one approach is to apply low-rank tensor decompositions to convolution kernels in order to compress the network and reduce its number of parameters. Alternatively, new convolut… ▽ More

    Submitted 31 March, 2020; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: IEEE CVPR 2020

  33. arXiv:1904.04772  [pdf, other

    cs.CV cs.LG

    Adversarial Learning of Disentangled and Generalizable Representations for Visual Attributes

    Authors: James Oldfield, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: Recently, a multitude of methods for image-to-image translation have demonstrated impressive results on problems such as multi-domain or multi-attribute transfer. The vast majority of such works leverages the strengths of adversarial learning and deep convolutional autoencoders to achieve realistic results by well-capturing the target data distribution. Nevertheless, the most prominent representat… ▽ More

    Submitted 30 January, 2021; v1 submitted 9 April, 2019; originally announced April 2019.

  34. arXiv:1902.10758  [pdf, other

    cs.LG stat.ML

    Tensor Dropout for Robust Learning

    Authors: Arinbjörn Kolbeinsson, Jean Kossaifi, Yannis Panagakis, Adrian Bulat, Anima Anandkumar, Ioanna Tzoulaki, Paul Matthews

    Abstract: CNNs achieve remarkable performance by leveraging deep, over-parametrized architectures, trained on large datasets. However, they have limited generalization ability to data outside the training domain, and a lack of robustness to noise and adversarial attacks. By building better inductive biases, we can improve robustness and also obtain smaller networks that are more memory and computationally e… ▽ More

    Submitted 11 December, 2020; v1 submitted 27 February, 2019; originally announced February 2019.

  35. SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild

    Authors: Jean Kossaifi, Robert Walecki, Yannis Panagakis, Jie Shen, Maximilian Schmitt, Fabien Ringeval, **g Han, Vedhas Pandit, Antoine Toisoul, Bjorn Schuller, Kam Star, Elnar Hajiyev, Maja Pantic

    Abstract: Natural human-computer interaction and audio-visual human behaviour sensing systems, which would achieve robust performance in-the-wild are more needed than ever as digital devices are increasingly becoming an indispensable part of our life. Accurately annotated real-world data are the crux in devising such systems. However, existing databases usually consider controlled settings, low demographic… ▽ More

    Submitted 18 November, 2019; v1 submitted 9 January, 2019; originally announced January 2019.

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2019

  36. arXiv:1802.04636  [pdf, other

    cs.CV

    Modeling of Facial Aging and Kinship: A Survey

    Authors: Markos Georgopoulos, Yannis Panagakis, Maja Pantic

    Abstract: Computational facial models that capture properties of facial cues related to aging and kinship increasingly attract the attention of the research community, enabling the development of reliable methods for age progression, age estimation, age-invariant facial characterization, and kinship verification from visual data. In this paper, we review recent advances in modeling of facial aging and kinsh… ▽ More

    Submitted 1 December, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

  37. arXiv:1801.07580  [pdf, other

    cs.CV

    Side Information for Face Completion: a Robust PCA Approach

    Authors: Niannan Xue, Jiankang Deng, Shiyang Cheng, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Robust principal component analysis (RPCA) is a powerful method for learning low-rank feature representation of various visual data. However, for certain types as well as significant amount of error corruption, it fails to yield satisfactory results; a drawback that can be alleviated by exploiting domain-dependent prior knowledge or information. In this paper, we propose two models for the RPCA th… ▽ More

    Submitted 19 January, 2018; originally announced January 2018.

    Comments: arXiv admin note: text overlap with arXiv:1702.00648

  38. arXiv:1801.06665  [pdf, other

    cs.CV cs.LG

    Visual Data Augmentation through Learning

    Authors: Grigorios G. Chrysos, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several million samples, which constitutes their extension a colossal task. In addition, the state-of-the-art data-driven methods demand a vast amount of data, hence a… ▽ More

    Submitted 20 January, 2018; originally announced January 2018.

  39. arXiv:1801.06432  [pdf, other

    stat.ML cs.CV cs.LG

    Robust Kronecker Component Analysis

    Authors: Mehdi Bahri, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Dictionary learning and component analysis models are fundamental for learning compact representations that are relevant to a given task (feature extraction, dimensionality reduction, denoising, etc.). The model complexity is encoded by means of specific structure, such as sparsity, low-rankness, or nonnegativity. Unfortunately, approaches like K-SVD - that learn dictionaries for sparse coding via… ▽ More

    Submitted 10 November, 2018; v1 submitted 18 January, 2018; originally announced January 2018.

    Comments: In IEEE Transactions on Pattern Analysis and Machine Intelligence, Special Issue on Compact and Efficient Feature Representation and Learning in Computer Vision, 2018. Contains appendices. arXiv admin note: text overlap with arXiv:1703.07886

  40. Multi-Attribute Robust Component Analysis for Facial UV Maps

    Authors: Stylianos Moschoglou, Evangelos Ververas, Yannis Panagakis, Mihalis Nicolaou, Stefanos Zafeiriou

    Abstract: Recently, due to the collection of large scale 3D face models, as well as the advent of deep learning, a significant progress has been made in the field of 3D face alignment "in-the-wild". That is, many methods have been proposed that establish sparse or dense 3D correspondences between a 2D facial image and a 3D face model. The utilization of 3D face alignment introduces new challenges and resear… ▽ More

    Submitted 15 December, 2017; originally announced December 2017.

  41. arXiv:1712.00684  [pdf, other

    cs.CV

    GAGAN: Geometry-Aware Generative Adversarial Networks

    Authors: Jean Kossaifi, Linh Tran, Yannis Panagakis, Maja Pantic

    Abstract: Deep generative models learned through adversarial training have become increasingly popular for their ability to generate naturalistic image textures. However, aside from their texture, the visual appearance of objects is significantly influenced by their shape geometry; information which is not taken into account by existing generative models. This paper introduces the Geometry-Aware Generative… ▽ More

    Submitted 27 March, 2018; v1 submitted 2 December, 2017; originally announced December 2017.

  42. An Adversarial Neuro-Tensorial Approach For Learning Disentangled Representations

    Authors: Mengjiao Wang, Zhixin Shu, Shiyang Cheng, Yannis Panagakis, Dimitris Samaras, Stefanos Zafeiriou

    Abstract: Several factors contribute to the appearance of an object in a visual scene, including pose, illumination, and deformation, among others. Each factor accounts for a source of variability in the data, while the multiplicative interactions of these factors emulate the entangled variability, giving rise to the rich structure of visual object appearance. Disentangling such unobserved factors from visu… ▽ More

    Submitted 24 February, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Journal ref: International Journal of Computer Vision, 2019

  43. arXiv:1709.04836  [pdf, other

    stat.ML cs.CV cs.LG

    Informed Non-convex Robust Principal Component Analysis with Features

    Authors: Niannan Xue, Jiankang Deng, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: We revisit the problem of robust principal component analysis with features acting as prior side information. To this aim, a novel, elegant, non-convex optimization approach is proposed to decompose a given observation matrix into a low-rank core and the corresponding sparse residual. Rigorous theoretical analysis of the proposed algorithm results in exact recovery guarantees with low computationa… ▽ More

    Submitted 14 September, 2017; originally announced September 2017.

  44. arXiv:1703.07886  [pdf, other

    stat.ML cs.CV

    Robust Kronecker-Decomposable Component Analysis for Low-Rank Modeling

    Authors: Mehdi Bahri, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Dictionary learning and component analysis are part of one of the most well-studied and active research fields, at the intersection of signal and image processing, computer vision, and statistical machine learning. In dictionary learning, the current methods of choice are arguably K-SVD and its variants, which learn a dictionary (i.e., a decomposition) for sparse coding via Singular Value Decompos… ▽ More

    Submitted 26 July, 2017; v1 submitted 22 March, 2017; originally announced March 2017.

    Comments: Accepted for publication at ICCV 2017

  45. arXiv:1702.00648  [pdf, other

    cs.CV

    Side Information in Robust Principal Component Analysis: Algorithms and Applications

    Authors: Niannan Xue, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Robust Principal Component Analysis (RPCA) aims at recovering a low-rank subspace from grossly corrupted high-dimensional (often visual) data and is a cornerstone in many machine learning and computer vision applications. Even though RPCA has been shown to be very successful in solving many rank minimisation problems, there are still cases where degenerate or suboptimal solutions are obtained. Thi… ▽ More

    Submitted 28 March, 2017; v1 submitted 2 February, 2017; originally announced February 2017.

  46. arXiv:1701.05360  [pdf, other

    cs.CV

    3D Face Morphable Models "In-the-Wild"

    Authors: James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: 3D Morphable Models (3DMMs) are powerful statistical models of 3D facial shape and texture, and among the state-of-the-art methods for reconstructing facial shape from single images. With the advent of new 3D sensors, many 3D facial datasets have been collected containing both neutral as well as expressive faces. However, all datasets are captured under controlled conditions. Thus, even though pow… ▽ More

    Submitted 19 January, 2017; originally announced January 2017.

  47. arXiv:1612.00750  [pdf, other

    cs.SI stat.ML

    Non-Negative Matrix Factorizations for Multiplex Network Analysis

    Authors: Vladimir Gligorijevic, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Networks have been a general tool for representing, analyzing, and modeling relational data arising in several domains. One of the most important aspect of network analysis is community detection or network clustering. Until recently, the major focus have been on discovering community structure in single (i.e., monoplex) networks. However, with the advent of relational data with multiple modalitie… ▽ More

    Submitted 25 January, 2017; v1 submitted 30 November, 2016; originally announced December 2016.

    Comments: 12 pages, 4 figures, 3 tables

  48. arXiv:1610.09555  [pdf, other

    cs.LG

    TensorLy: Tensor Learning in Python

    Authors: Jean Kossaifi, Yannis Panagakis, Anima Anandkumar, Maja Pantic

    Abstract: Tensors are higher-order extensions of matrices. While matrix methods form the cornerstone of machine learning and data analysis, tensor methods have been gaining increasing traction. However, software support for tensor operations is not on the same footing. In order to bridge this gap, we have developed \emph{TensorLy}, a high-level API for tensor methods and deep tensorized neural networks in P… ▽ More

    Submitted 9 May, 2018; v1 submitted 29 October, 2016; originally announced October 2016.

  49. arXiv:1502.00852  [pdf, other

    cs.CV

    Face frontalization for Alignment and Recognition

    Authors: Christos Sagonas, Yannis Panagakis, Stefanos Zafeiriou, Maja Pantic

    Abstract: Recently, it was shown that excellent results can be achieved in both face landmark localization and pose-invariant face recognition. These breakthroughs are attributed to the efforts of the community to manually annotate facial images in many different poses and to collect 3D faces data. In this paper, we propose a novel method for joint face landmark localization and frontal face reconstruction… ▽ More

    Submitted 3 February, 2015; originally announced February 2015.

    Comments: 8 pages, 8 figures