Skip to main content

Showing 1–42 of 42 results for author: Chrysos, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19299  [pdf, other

    cs.CV

    PNeRV: A Polynomial Neural Representation for Videos

    Authors: Sonam Gupta, Snehal Singh Tomar, Grigorios G Chrysos, Sukhendu Das, A. N. Rajagopalan

    Abstract: Extracting Implicit Neural Representations (INRs) on video data poses unique challenges due to the additional temporal dimension. In the context of videos, INRs have predominantly relied on a frame-only parameterization, which sacrifices the spatiotemporal continuity observed in pixel-level (spatial) representations. To mitigate this, we introduce Polynomial Neural Representation for Videos (PNeRV… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 25 pages, 17 figures, published at TMLR, Feb 2024

  2. arXiv:2406.18781  [pdf, other

    math.OC cs.DM cs.LG

    Learning to Remove Cuts in Integer Linear Programming

    Authors: Pol Puigdemont, Stratis Skoulakis, Grigorios Chrysos, Volkan Cevher

    Abstract: Cutting plane methods are a fundamental approach for solving integer linear programs (ILPs). In each iteration of such methods, additional linear constraints (cuts) are introduced to the constraint set with the aim of excluding the previous fractional optimal solution while not affecting the optimal integer solution. In this work, we explore a novel approach within cutting plane methods: instead o… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: International Conference on Machine Learning

    MSC Class: 68R01

  3. arXiv:2406.16906  [pdf, other

    eess.SP cs.AI cs.LG

    REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates

    Authors: Arshia Afzal, Grigorios Chrysos, Volkan Cevher, Mahsa Shoaran

    Abstract: EEG-based seizure detection models face challenges in terms of inference speed and memory efficiency, limiting their real-time implementation in clinical devices. This paper introduces a novel graph-based residual state update mechanism (REST) for real-time EEG signal analysis in applications such as epileptic seizure detection. By leveraging a combination of graph neural networks and recurrent st… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted paper at International Confrence on Machine Learning (ICML 2024). Visit our website: https://arshiaafzal.github.io/REST/

  4. arXiv:2405.19201  [pdf, other

    cs.CV cs.AI cs.NE

    Going beyond compositional generalization, DDPMs can produce zero-shot interpolation

    Authors: Justin Deschenaux, Igor Krawczuk, Grigorios Chrysos, Volkan Cevher

    Abstract: Denoising Diffusion Probabilistic Models (DDPMs) exhibit remarkable capabilities in image generation, with studies suggesting that they can generalize by composing latent factors learned from the training data. In this work, we go further and study DDPMs trained on strictly separate subsets of the data distribution with large gaps on the support of the latent factors. We show that such a model can… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.04346  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Revisiting character-level adversarial attacks

    Authors: Elias Abad Rocamora, Yongtao Wu, Fanghui Liu, Grigorios G. Chrysos, Volkan Cevher

    Abstract: Adversarial attacks in Natural Language Processing apply perturbations in the character or token levels. Token-level attacks, gaining prominence for their use of gradient-based methods, are susceptible to altering sentence semantics, leading to invalid adversarial examples. While character-level attacks easily maintain semantics, they have received less attention as they cannot easily adopt popula… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted in ICML 2024

  6. arXiv:2403.19920  [pdf, other

    cs.CV

    MI-NeRF: Learning a Single Face NeRF from Multiple Identities

    Authors: Aggelina Chatziagapi, Grigorios G. Chrysos, Dimitris Samaras

    Abstract: In this work, we introduce a method that learns a single dynamic neural radiance field (NeRF) from monocular talking face videos of multiple identities. NeRFs have shown remarkable results in modeling the 4D dynamics and appearance of human faces. However, they require per-identity optimization. Although recent approaches have proposed techniques to reduce the training and rendering time, increasi… ▽ More

    Submitted 2 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Project page: https://aggelinacha.github.io/MI-NeRF/

  7. arXiv:2403.13134  [pdf, other

    cs.LG cs.AI stat.ML

    Robust NAS under adversarial training: benchmark, theory, and beyond

    Authors: Yongtao Wu, Fanghui Liu, Carl-Johann Simon-Gabriel, Grigorios G Chrysos, Volkan Cevher

    Abstract: Recent developments in neural architecture search (NAS) emphasize the significance of considering robust architectures against malicious data. However, there is a notable absence of benchmark evaluations and theoretical guarantees for searching these robust architectures, especially when adversarial training is considered. In this work, we aim to address these two challenges, making twofold contri… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  8. arXiv:2403.09889  [pdf, other

    cs.LG

    Generalization of Scaled Deep ResNets in the Mean-Field Regime

    Authors: Yihang Chen, Fanghui Liu, Yi** Lu, Grigorios G. Chrysos, Volkan Cevher

    Abstract: Despite the widespread empirical success of ResNet, the generalization properties of deep ResNet are rarely explored beyond the lazy training regime. In this work, we investigate \emph{scaled} ResNet in the limit of infinitely deep and wide neural networks, of which the gradient flow is described by a partial differential equation in the large-neural network limit, i.e., the \emph{mean-field} regi… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: ICLR 2024 (Spotlight)

  9. arXiv:2402.12550  [pdf, other

    cs.CV cs.LG

    Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

    Authors: James Oldfield, Markos Georgopoulos, Grigorios G. Chrysos, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Jiankang Deng, Ioannis Patras

    Abstract: The Mixture of Experts (MoE) paradigm provides a powerful way to decompose dense layers into smaller, modular computations often more amenable to human interpretation, debugging, and editability. However, a major challenge lies in the computational cost of scaling the number of experts high enough to achieve fine-grained specialization. In this paper, we propose the Multilinear Mixture of Experts… ▽ More

    Submitted 31 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Github: https://github.com/james-oldfield/muMoE. Project page: https://james-oldfield.github.io/muMoE/

  10. arXiv:2402.09177  [pdf, other

    cs.LG cs.AI cs.CL

    Leveraging the Context through Multi-Round Interactions for Jailbreaking Attacks

    Authors: Yixin Cheng, Markos Georgopoulos, Volkan Cevher, Grigorios G. Chrysos

    Abstract: Large Language Models (LLMs) are susceptible to Jailbreaking attacks, which aim to extract harmful information by subtly modifying the attack query. As defense mechanisms evolve, directly obtaining harmful information becomes increasingly challenging for Jailbreaking attacks. In this work, inspired by human practices of indirect context to elicit harmful information, we focus on a new attack form… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 29 pages

  11. arXiv:2401.17992  [pdf, other

    cs.CV cs.LG

    Multilinear Operator Networks

    Authors: Yixin Cheng, Grigorios G. Chrysos, Markos Georgopoulos, Volkan Cevher

    Abstract: Despite the remarkable capabilities of deep neural networks in image recognition, the dependence on activation functions remains a largely unexplored area and has yet to be eliminated. On the other hand, Polynomial Networks is a class of models that does not require activation functions, but have yet to perform on par with modern architectures. In this work, we aim close this gap and propose MONet… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: International Conference on Learning Representations Poster(2024)

  12. arXiv:2401.11618  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Efficient local linearity regularization to overcome catastrophic overfitting

    Authors: Elias Abad Rocamora, Fanghui Liu, Grigorios G. Chrysos, Pablo M. Olmos, Volkan Cevher

    Abstract: Catastrophic overfitting (CO) in single-step adversarial training (AT) results in abrupt drops in the adversarial test accuracy (even down to 0%). For models trained with multi-step AT, it has been observed that the loss function behaves locally linearly with respect to the input, this is however lost in single-step AT. To address CO in single-step AT, several methods have been proposed to enforce… ▽ More

    Submitted 28 February, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: Accepted in ICLR 2024

  13. arXiv:2311.01575  [pdf, other

    cs.LG

    On the Convergence of Encoder-only Shallow Transformers

    Authors: Yongtao Wu, Fanghui Liu, Grigorios G Chrysos, Volkan Cevher

    Abstract: In this paper, we aim to build the global convergence theory of encoder-only shallow Transformers under a realistic setting from the perspective of architectures, initialization, and scaling under a finite width regime. The difficulty lies in how to tackle the softmax in self-attention mechanism, the core ingredient of Transformer. In particular, we diagnose the scaling scheme, carefully tackle th… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  14. arXiv:2310.18672  [pdf, other

    cs.LG cs.DM

    Maximum Independent Set: Self-Training through Dynamic Programming

    Authors: Lorenzo Brusca, Lars C. P. M. Quaedvlieg, Stratis Skoulakis, Grigorios G Chrysos, Volkan Cevher

    Abstract: This work presents a graph neural network (GNN) framework for solving the maximum independent set (MIS) problem, inspired by dynamic programming (DP). Specifically, given a graph, we propose a DP-like recursive algorithm based on GNNs that firstly constructs two smaller sub-graphs, predicts the one with the larger MIS, and then uses it in the next recursive call. To train our algorithm, we require… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted in NeurIPS 2023

  15. arXiv:2306.05325  [pdf, other

    cs.LG

    Federated Learning under Covariate Shifts with Generalization Guarantees

    Authors: Ali Ramezani-Kebrya, Fanghui Liu, Thomas Pethick, Grigorios Chrysos, Volkan Cevher

    Abstract: This paper addresses intra-client and inter-client covariate shifts in federated learning (FL) with a focus on the overall generalization performance. To handle covariate shifts, we formulate a new global model training paradigm and propose Federated Importance-Weighted Empirical Risk Minimization (FTW-ERM) along with improving density ratio matching methods without requiring perfect knowledge of… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  16. arXiv:2305.19377  [pdf, other

    cs.LG

    Benign Overfitting in Deep Neural Networks under Lazy Training

    Authors: Zhenyu Zhu, Fanghui Liu, Grigorios G Chrysos, Francesco Locatello, Volkan Cevher

    Abstract: This paper focuses on over-parameterized deep neural networks (DNNs) with ReLU activation functions and proves that when the data distribution is well-separated, DNNs can achieve Bayes-optimal test error for classification while obtaining (nearly) zero-training error under the lazy training regime. For this purpose, we unify three interrelated concepts of overparameterization, benign overfitting,… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted in ICML 2023

  17. arXiv:2303.13896  [pdf, other

    cs.CV cs.LG

    Regularization of polynomial networks for image recognition

    Authors: Grigorios G Chrysos, Bohan Wang, Jiankang Deng, Volkan Cevher

    Abstract: Deep Neural Networks (DNNs) have obtained impressive performance across tasks, however they still remain as black boxes, e.g., hard to theoretically analyze. At the same time, Polynomial Networks (PNs) have emerged as an alternative method with a promising performance and improved interpretability but have yet to reach the performance of the powerful DNN baselines. In this work, we aim to close th… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR'23

  18. arXiv:2302.08872  [pdf, other

    cs.LG

    Revisiting adversarial training for the worst-performing class

    Authors: Thomas Pethick, Grigorios G. Chrysos, Volkan Cevher

    Abstract: Despite progress in adversarial training (AT), there is a substantial gap between the top-performing and worst-performing classes in many datasets. For example, on CIFAR10, the accuracies for the best and worst classes are 74% and 23%, respectively. We argue that this gap can be reduced by explicitly optimizing for the worst-performing class, resulting in a min-max-max optimization formulation. Ou… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Code accessible at: https://github.com/LIONS-EPFL/class-focused-online-learning-code

  19. arXiv:2209.07736  [pdf, other

    cs.LG cs.AI

    Extrapolation and Spectral Bias of Neural Nets with Hadamard Product: a Polynomial Net Study

    Authors: Yongtao Wu, Zhenyu Zhu, Fanghui Liu, Grigorios G Chrysos, Volkan Cevher

    Abstract: Neural tangent kernel (NTK) is a powerful tool to analyze training dynamics of neural networks and their generalization bounds. The study on NTK has been devoted to typical neural network architectures, but it is incomplete for neural networks with Hadamard products (NNs-Hp), e.g., StyleGAN and polynomial neural networks (PNNs). In this work, we derive the finite-width NTK formulation for a specia… ▽ More

    Submitted 16 October, 2022; v1 submitted 16 September, 2022; originally announced September 2022.

  20. arXiv:2209.07263  [pdf, other

    cs.LG cs.AI

    Robustness in deep learning: The good (width), the bad (depth), and the ugly (initialization)

    Authors: Zhenyu Zhu, Fanghui Liu, Grigorios G Chrysos, Volkan Cevher

    Abstract: We study the average robustness notion in deep neural networks in (selected) wide and narrow, deep and shallow, as well as lazy and non-lazy training settings. We prove that in the under-parameterized setting, width has a negative effect while it improves robustness in the over-parameterized setting. The effect of depth closely depends on the initialization and the training mode. In particular, wh… ▽ More

    Submitted 9 February, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted in NeurIPS 2022

  21. arXiv:2209.07238  [pdf, other

    cs.LG cs.AI

    Generalization Properties of NAS under Activation and Skip Connection Search

    Authors: Zhenyu Zhu, Fanghui Liu, Grigorios G Chrysos, Volkan Cevher

    Abstract: Neural Architecture Search (NAS) has fostered the automatic discovery of state-of-the-art neural architectures. Despite the progress achieved with NAS, so far there is little attention to theoretical guarantees on NAS. In this work, we study the generalization properties of NAS under a unifying framework enabling (deep) layer skip connection search and activation function search. To this end, we d… ▽ More

    Submitted 1 November, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted in NeurIPS 2022

  22. arXiv:2209.07235  [pdf, ps, other

    cs.LG cs.AI cs.CR

    Sound and Complete Verification of Polynomial Networks

    Authors: Elias Abad Rocamora, Mehmet Fatih Sahin, Fanghui Liu, Grigorios G Chrysos, Volkan Cevher

    Abstract: Polynomial Networks (PNs) have demonstrated promising performance on face and image recognition recently. However, robustness of PNs is unclear and thus obtaining certificates becomes imperative for enabling their adoption in real-world applications. Existing verification algorithms on ReLU neural networks (NNs) based on classical branch and bound (BaB) techniques cannot be trivially applied to PN… ▽ More

    Submitted 22 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted in NeurIPS 2022

  23. arXiv:2206.06811  [pdf, other

    eess.AS cs.LG

    Adversarial Audio Synthesis with Complex-valued Polynomial Networks

    Authors: Yongtao Wu, Grigorios G Chrysos, Volkan Cevher

    Abstract: Time-frequency (TF) representations in audio synthesis have been increasingly modeled with real-valued networks. However, overlooking the complex-valued nature of TF representations can result in suboptimal performance and require additional modules (e.g., for modeling the phase). To this end, we introduce complex-valued polynomial networks, called APOLLO, that integrate such complex-valued repres… ▽ More

    Submitted 21 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted as oral presentation in Workshop on Machine Learning for Audio Synthesis at ICML 2022

  24. arXiv:2202.13473  [pdf, other

    cs.LG cs.CV

    The Spectral Bias of Polynomial Neural Networks

    Authors: Moulik Choraria, Leello Tadesse Dadi, Grigorios Chrysos, Julien Mairal, Volkan Cevher

    Abstract: Polynomial neural networks (PNNs) have been recently shown to be particularly effective at image generation and face recognition, where high-frequency information is critical. Previous studies have revealed that neural networks demonstrate a $\textit{spectral bias}$ towards low-frequency functions, which yields faster learning of low-frequency components during training. Inspired by such studies,… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: Accepted at the International Conference on Learning Representations(ICLR) 2022

  25. arXiv:2202.05068  [pdf, other

    cs.LG

    Controlling the Complexity and Lipschitz Constant improves polynomial nets

    Authors: Zhenyu Zhu, Fabian Latorre, Grigorios G Chrysos, Volkan Cevher

    Abstract: While the class of Polynomial Nets demonstrates comparable performance to neural networks (NN), it currently has neither theoretical generalization characterization nor robustness guarantees. To this end, we derive new complexity bounds for the set of Coupled CP-Decomposition (CCP) and Nested Coupled CP-decomposition (NCP) models of Polynomial Nets in terms of the $\ell_\infty$-operator-norm and t… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

  26. arXiv:2112.12911  [pdf, other

    cs.CV

    Cluster-guided Image Synthesis with Unconditional Models

    Authors: Markos Georgopoulos, James Oldfield, Grigorios G Chrysos, Yannis Panagakis

    Abstract: Generative Adversarial Networks (GANs) are the driving force behind the state-of-the-art in image generation. Despite their ability to synthesize high-resolution photo-realistic images, generating content with on-demand conditioning of different granularity remains a challenge. This challenge is usually tackled by annotating massive datasets with the attributes of interest, a laborious task that i… ▽ More

    Submitted 23 December, 2021; originally announced December 2021.

  27. arXiv:2109.08580  [pdf, other

    cs.CV cs.LG eess.IV

    Self-Supervised Neural Architecture Search for Imbalanced Datasets

    Authors: Aleksandr Timofeev, Grigorios G. Chrysos, Volkan Cevher

    Abstract: Neural Architecture Search (NAS) provides state-of-the-art results when trained on well-curated datasets with annotated labels. However, annotating data or even having balanced number of samples can be a luxury for practitioners from different scientific fields, e.g., in the medical domain. To that end, we propose a NAS-based framework that bears the threefold contributions: (a) we focus on the se… ▽ More

    Submitted 20 September, 2021; v1 submitted 17 September, 2021; originally announced September 2021.

    Comments: Published in ICML 2021 Workshop: Self-Supervised Learning for Reasoning and Perception. Code: https://github.com/TimofeevAlex/ssnas_imbalanced

  28. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  29. arXiv:2107.02859  [pdf, other

    cs.CV

    Poly-NL: Linear Complexity Non-local Layers with Polynomials

    Authors: Francesca Babiloni, Ioannis Marras, Filippos Kokkinos, Jiankang Deng, Grigorios Chrysos, Stefanos Zafeiriou

    Abstract: Spatial self-attention layers, in the form of Non-Local blocks, introduce long-range dependencies in Convolutional Neural Networks by computing pairwise similarities among all possible positions. Such pairwise functions underpin the effectiveness of non-local layers, but also determine a complexity that scales quadratically with respect to the input size both in space and time. This is a severely… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 11 pages, 4 figures

  30. arXiv:2104.07916  [pdf, other

    cs.CV

    Augmenting Deep Classifiers with Polynomial Neural Networks

    Authors: Grigorios G Chrysos, Markos Georgopoulos, Jiankang Deng, Jean Kossaifi, Yannis Panagakis, Anima Anandkumar

    Abstract: Deep neural networks have been the driving force behind the success in classification tasks, e.g., object and audio recognition. Impressive results and generalization have been achieved by a variety of recently proposed architectures, the majority of which are seemingly disconnected. In this work, we cast the study of deep classifiers under a unifying framework. In particular, we express state-of-… ▽ More

    Submitted 11 August, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at ECCV'22

  31. arXiv:2104.05077  [pdf, other

    cs.LG cs.CV

    CoPE: Conditional image generation using Polynomial Expansions

    Authors: Grigorios G Chrysos, Markos Georgopoulos, Yannis Panagakis

    Abstract: Generative modeling has evolved to a notable field of machine learning. Deep polynomial neural networks (PNNs) have demonstrated impressive results in unsupervised image generation, where the task is to map an input vector (i.e., noise) to a synthesized image. However, the success of PNNs has not been replicated in conditional generation tasks, such as super-resolution. Existing PNNs focus on sing… ▽ More

    Submitted 27 October, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted in NeurIPS 2021

  32. arXiv:2009.04075  [pdf, other

    cs.LG stat.ML

    Multilinear Latent Conditioning for Generating Unseen Attribute Combinations

    Authors: Markos Georgopoulos, Grigorios Chrysos, Maja Pantic, Yannis Panagakis

    Abstract: Deep generative models rely on their inductive bias to facilitate generalization, especially for problems with high dimensional data, like images. However, empirical studies have shown that variational autoencoders (VAE) and generative adversarial networks (GAN) lack the generalization ability that occurs naturally in human perception. For example, humans can visualize a woman smiling after only s… ▽ More

    Submitted 8 September, 2020; originally announced September 2020.

    Comments: published at International Conference on Machine Learning 2020

  33. arXiv:2007.09250  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Controllable Generation with Self-Training

    Authors: Grigorios G Chrysos, Jean Kossaifi, Zhiding Yu, Anima Anandkumar

    Abstract: Recent generative adversarial networks (GANs) are able to generate impressive photo-realistic images. However, controllable generation with GANs remains a challenging research problem. Achieving controllable generation requires semantically interpretable and disentangled factors of variation. It is challenging to achieve this goal using simple fixed distributions such as Gaussian distribution. Ins… ▽ More

    Submitted 2 May, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: Accepted in IJCNN 2021

  34. arXiv:2006.13026  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Polynomial Neural Networks

    Authors: Grigorios Chrysos, Stylianos Moschoglou, Giorgos Bouritsas, Jiankang Deng, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Deep Convolutional Neural Networks (DCNNs) are currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $Π$-Nets,… ▽ More

    Submitted 27 February, 2021; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: Published in IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Code: https://github.com/grigorisg9gr/polynomial_nets. arXiv admin note: substantial text overlap with arXiv:2003.03828

  35. arXiv:2003.03828  [pdf, other

    cs.LG cs.CV stat.ML

    $Π-$nets: Deep Polynomial Neural Networks

    Authors: Grigorios G. Chrysos, Stylianos Moschoglou, Giorgos Bouritsas, Yannis Panagakis, Jiankang Deng, Stefanos Zafeiriou

    Abstract: Deep Convolutional Neural Networks (DCNNs) is currently the method of choice both for generative, as well as for discriminative learning in computer vision and machine learning. The success of DCNNs can be attributed to the careful selection of their building blocks (e.g., residual blocks, rectifiers, sophisticated normalization schemes, to mention but a few). In this paper, we propose $Π$-Nets, a… ▽ More

    Submitted 26 March, 2020; v1 submitted 8 March, 2020; originally announced March 2020.

    Comments: Accepted in CVPR 2020

  36. arXiv:2002.04147  [pdf, other

    eess.IV cs.CV

    Reconstructing the Noise Manifold for Image Denoising

    Authors: Ioannis Marras, Grigorios G. Chrysos, Ioannis Alexiou, Gregory Slabaugh, Stefanos Zafeiriou

    Abstract: Deep Convolutional Neural Networks (CNNs) have been successfully used in many low-level vision problems like image denoising. Although the conditional image generation techniques have led to large improvements in this task, there has been little effort in providing conditional generative adversarial networks (cGAN)[42] with an explicit way of understanding the image noise for object-independent de… ▽ More

    Submitted 6 March, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: 18 pages, 8 figures

  37. arXiv:1908.06571  [pdf, other

    cs.LG stat.ML

    PolyGAN: High-Order Polynomial Generators

    Authors: Grigorios Chrysos, Stylianos Moschoglou, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: Generative Adversarial Networks (GANs) have become the gold standard when it comes to learning generative models for high-dimensional distributions. Since their advent, numerous variations of GANs have been introduced in the literature, primarily focusing on utilization of novel loss functions, optimization/regularization strategies and network architectures. In this paper, we turn our attention t… ▽ More

    Submitted 13 October, 2019; v1 submitted 18 August, 2019; originally announced August 2019.

  38. arXiv:1805.08657  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Robust Conditional Generative Adversarial Networks

    Authors: Grigorios G. Chrysos, Jean Kossaifi, Stefanos Zafeiriou

    Abstract: Conditional generative adversarial networks (cGAN) have led to large improvements in the task of conditional image generation, which lies at the heart of computer vision. The major focus so far has been on performance improvement, while there has been little effort in making cGAN more robust to noise. The regression (of the generator) might lead to arbitrarily large errors in the output, which mak… ▽ More

    Submitted 13 March, 2019; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: To appear in ICLR 2019

  39. arXiv:1803.03330  [pdf, other

    cs.CV

    Motion deblurring of faces

    Authors: Grigorios G. Chrysos, Paolo Favaro, Stefanos Zafeiriou

    Abstract: Face analysis is a core part of computer vision, in which remarkable progress has been observed in the past decades. Current methods achieve recognition and tracking with invariance to fundamental modes of variation such as illumination, 3D pose, expressions. Notwithstanding, a much less standing mode of variation is motion deblurring, which however presents substantial challenges in face analysis… ▽ More

    Submitted 8 March, 2018; originally announced March 2018.

  40. arXiv:1801.06665  [pdf, other

    cs.CV cs.LG

    Visual Data Augmentation through Learning

    Authors: Grigorios G. Chrysos, Yannis Panagakis, Stefanos Zafeiriou

    Abstract: The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several million samples, which constitutes their extension a colossal task. In addition, the state-of-the-art data-driven methods demand a vast amount of data, hence a… ▽ More

    Submitted 20 January, 2018; originally announced January 2018.

  41. arXiv:1704.08772  [pdf, other

    cs.CV cs.AI cs.LG

    Deep Face Deblurring

    Authors: Grigorios G. Chrysos, Stefanos Zafeiriou

    Abstract: Blind deblurring consists a long studied task, however the outcomes of generic methods are not effective in real world blurred images. Domain-specific methods for deblurring targeted object categories, e.g. text or faces, frequently outperform their generic counterparts, hence they are attracting an increasing amount of attention. In this work, we develop such a domain-specific method to tackle de… ▽ More

    Submitted 25 May, 2017; v1 submitted 27 April, 2017; originally announced April 2017.

  42. A Comprehensive Performance Evaluation of Deformable Face Tracking "In-the-Wild"

    Authors: Grigorios G. Chrysos, Epameinondas Antonakos, Patrick Snape, Akshay Asthana, Stefanos Zafeiriou

    Abstract: Recently, technologies such as face detection, facial landmark localisation and face recognition and verification have matured enough to provide effective and efficient solutions for imagery captured under arbitrary conditions (referred to as "in-the-wild"). This is partially attributed to the fact that comprehensive "in-the-wild" benchmarks have been developed for face detection, landmark localis… ▽ More

    Submitted 28 February, 2017; v1 submitted 18 March, 2016; originally announced March 2016.

    Comments: E. Antonakos and P. Snape contributed equally and have joint second authorship