Skip to main content

Showing 1–50 of 98 results for author: Vazirgiannis, M

.
  1. arXiv:2406.19219  [pdf, other

    cs.DL

    Metrics to Detect Small-Scale and Large-Scale Citation Orchestration

    Authors: Iakovos Evdaimon, John P. A. Ioannidis, Giannis Nikolentzos, Michail Chatzianastasis, George Panagopoulos, Michalis Vazirgiannis

    Abstract: Citation counts and related metrics have pervasive uses and misuses in academia and research appraisal, serving as scholarly influence and recognition measures. Hence, comprehending the citation patterns exhibited by authors is essential for assessing their research impact and contributions within their respective fields. Although the h-index, introduced by Hirsch in 2005, has emerged as a popular… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.18380  [pdf, ps, other

    cs.LG

    KAGNNs: Kolmogorov-Arnold Networks meet Graph Learning

    Authors: Roman Bresson, Giannis Nikolentzos, George Panagopoulos, Michail Chatzianastasis, Jun Pang, Michalis Vazirgiannis

    Abstract: In recent years, Graph Neural Networks (GNNs) have become the de facto tool for learning node and graph representations. Most GNNs typically consist of a sequence of neighborhood aggregation (a.k.a., message passing) layers. Within each of these layers, the representation of each node is updated from an aggregation and transformation of its neighbours representations at the previous layer. The upp… ▽ More

    Submitted 1 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.14142  [pdf, other

    q-bio.QM cs.LG q-bio.BM

    Geometric Self-Supervised Pretraining on 3D Protein Structures using Subgraphs

    Authors: Michail Chatzianastasis, George Dasoulas, Michalis Vazirgiannis

    Abstract: Protein representation learning aims to learn informative protein embeddings capable of addressing crucial biological questions, such as protein function prediction. Although sequence-based transformer models have shown promising results by leveraging the vast amount of protein sequence data in a self-supervised way, there is still a gap in applying these methods to 3D protein structures. In this… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. Explaining Predictions by Characteristic Rules

    Authors: Amr Alkhatib, Henrik Boström, Michalis Vazirgiannis

    Abstract: Characteristic rules have been advocated for their ability to improve interpretability over discriminative rules within the area of rule learning. However, the former type of rule has not yet been used by techniques for explaining predictions. A novel explanation technique, called CEGA (Characteristic Explanatory General Association rules), is proposed, which employs association rule mining to agg… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022

    Journal ref: In: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13713. Springer, Cham (2023)

  5. arXiv:2405.11055  [pdf, other

    cs.CL cs.AI

    Leveraging Discourse Structure for Extractive Meeting Summarization

    Authors: Virgile Rennard, Guokan Shang, Michalis Vazirgiannis, Julie Hunter

    Abstract: We introduce an extractive summarization system for meetings that leverages discourse structure to better identify salient information from complex multi-party discussions. Using discourse graphs to represent semantic relations between the contents of utterances in a meeting, we train a GNN-based node classification model to select the most important utterances, which are then combined to create a… ▽ More

    Submitted 21 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  6. arXiv:2404.17947  [pdf, other

    cs.LG cs.AI cs.CR

    Bounding the Expected Robustness of Graph Neural Networks Subject to Node Feature Attacks

    Authors: Yassine Abbahaddou, Sofiane Ennadir, Johannes F. Lutzeyer, Michalis Vazirgiannis, Henrik Boström

    Abstract: Graph Neural Networks (GNNs) have demonstrated state-of-the-art performance in various graph representation learning tasks. Recently, studies revealed their vulnerability to adversarial attacks. In this work, we theoretically define the concept of expected robustness in the context of attributed graphs and relate it to the classical definition of adversarial robustness in the graph representation… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted at ICLR 2024

  7. arXiv:2403.01535  [pdf, other

    cs.LG cs.SI

    Neural Graph Generator: Feature-Conditioned Graph Generation using Latent Diffusion Models

    Authors: Iakovos Evdaimon, Giannis Nikolentzos, Michail Chatzianastasis, Hadi Abdine, Michalis Vazirgiannis

    Abstract: Graph generation has emerged as a crucial task in machine learning, with significant challenges in generating graphs that accurately reflect specific properties. Existing methods often fall short in efficiently addressing this need as they struggle with the high-dimensional complexity and varied nature of graph properties. In this paper, we introduce the Neural Graph Generator (NGG), a novel appro… ▽ More

    Submitted 23 April, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  8. arXiv:2403.00523  [pdf, ps, other

    q-fin.GN cs.CR cs.SI

    Assessing the Efficacy of Heuristic-Based Address Clustering for Bitcoin

    Authors: Hugo Schnoering, Pierre Porthaux, Michalis Vazirgiannis

    Abstract: Exploring transactions within the Bitcoin blockchain entails examining the transfer of bitcoins among several hundred million entities. However, it is often impractical and resource-consuming to study such a vast number of entities. Consequently, entity clustering serves as an initial step in most analytical studies. This process often employs heuristics grounded in the practices and behaviors of… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 20 pages

  9. arXiv:2402.13987  [pdf, other

    cs.LG

    A Simple and Yet Fairly Effective Defense for Graph Neural Networks

    Authors: Sofiane Ennadir, Yassine Abbahaddou, Johannes F. Lutzeyer, Michalis Vazirgiannis, Henrik Boström

    Abstract: Graph Neural Networks (GNNs) have emerged as the dominant approach for machine learning on graph-structured data. However, concerns have arisen regarding the vulnerability of GNNs to small adversarial perturbations. Existing defense methods against such perturbations suffer from high time complexity and can negatively impact the model's performance on clean graphs. To address these challenges, thi… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted at AAAI-24

  10. arXiv:2402.02862  [pdf, other

    stat.ML cs.LG

    Graph Neural Machine: A New Model for Learning with Tabular Data

    Authors: Giannis Nikolentzos, Siyun Wang, Johannes Lutzeyer, Michalis Vazirgiannis

    Abstract: In recent years, there has been a growing interest in map** data from different domains to graph structures. Among others, neural network models such as the multi-layer perceptron (MLP) can be modeled as graphs. In fact, MLPs can be represented as directed acyclic graphs. Graph neural networks (GNNs) have recently become the standard tool for performing machine learning tasks on graphs. In this… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

  11. arXiv:2312.04843  [pdf, other

    cs.CL cs.AI

    FREDSum: A Dialogue Summarization Corpus for French Political Debates

    Authors: Virgile Rennard, Guokan Shang, Damien Grari, Julie Hunter, Michalis Vazirgiannis

    Abstract: Recent advances in deep learning, and especially the invention of encoder-decoder architectures, has significantly improved the performance of abstractive summarization systems. The majority of research has focused on written documents, however, neglecting the problem of multi-party dialogue summarization. In this paper, we present a dataset of French political debates for the purpose of enhancing… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted at EMNLP2023 Findings

  12. arXiv:2311.12491  [pdf, other

    cs.CR cs.DC cs.LG q-fin.GN

    Heuristics for Detecting CoinJoin Transactions on the Bitcoin Blockchain

    Authors: Hugo Schnoering, Michalis Vazirgiannis

    Abstract: This research delves into the intricacies of Bitcoin, a decentralized peer-to-peer network, and its associated blockchain, which records all transactions since its inception. While this ensures integrity and transparency, the transparent nature of Bitcoin potentially compromises users' privacy rights. To address this concern, users have adopted CoinJoin, a method that amalgamates multiple transact… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  13. arXiv:2311.11967  [pdf, other

    cs.CL

    Automatic Analysis of Substantiation in Scientific Peer Reviews

    Authors: Yanzhu Guo, Guokan Shang, Virgile Rennard, Michalis Vazirgiannis, Chloé Clavel

    Abstract: With the increasing amount of problematic peer reviews in top AI conferences, the community is urgently in need of automatic quality control measures. In this paper, we restrict our attention to substantiation -- one popular quality aspect indicating whether the claims in a review are sufficiently supported by evidence -- and provide a solution automatizing this evaluation process. To achieve this… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023 Findings

  14. arXiv:2311.09807  [pdf, other

    cs.CL

    The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text

    Authors: Yanzhu Guo, Guokan Shang, Michalis Vazirgiannis, Chloé Clavel

    Abstract: This study investigates the consequences of training language models on synthetic data generated by their predecessors, an increasingly prevalent practice given the prominence of powerful generative models. Diverging from the usual emphasis on performance metrics, we focus on the impact of this training methodology on linguistic diversity, especially when conducted recursively over time. To assess… ▽ More

    Submitted 16 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 Findings

  15. arXiv:2308.08945  [pdf, other

    cs.LG cs.AI

    Interpretable Graph Neural Networks for Tabular Data

    Authors: Amr Alkhatib, Sofiane Ennadir, Henrik Boström, Michalis Vazirgiannis

    Abstract: Data in tabular format is frequently occurring in real-world applications. Graph Neural Networks (GNNs) have recently been extended to effectively handle such data, allowing feature interactions to be captured through representation learning. However, these approaches essentially produce black-box models, in the form of deep neural networks, precluding users from following the logic behind the mod… ▽ More

    Submitted 19 April, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: 18 pages, 12 figures

  16. arXiv:2308.03210  [pdf, other

    cs.LG

    Time-Parameterized Convolutional Neural Networks for Irregularly Sampled Time Series

    Authors: Chrysoula Kosma, Giannis Nikolentzos, Michalis Vazirgiannis

    Abstract: Irregularly sampled multivariate time series are ubiquitous in several application domains, leading to sparse, not fully-observed and non-aligned observations across different variables. Standard sequential neural network architectures, such as recurrent neural networks (RNNs) and convolutional neural networks (CNNs), consider regular spacing between observation times, posing significant challenge… ▽ More

    Submitted 9 August, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

  17. arXiv:2308.01208  [pdf, other

    cs.IR cs.LG q-fin.CP stat.ML

    Adaptive Collaborative Filtering with Personalized Time Decay Functions for Financial Product Recommendation

    Authors: Ashraf Ghiye, Baptiste Barreau, Laurent Carlier, Michalis Vazirgiannis

    Abstract: Classical recommender systems often assume that historical data are stationary and fail to account for the dynamic nature of user preferences, limiting their ability to provide reliable recommendations in time-sensitive settings. This assumption is particularly problematic in finance, where financial products exhibit continuous changes in valuations, leading to frequent shifts in client interests.… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 10 pages, 1 figure, 2 tables, to be published in the Seventeenth ACM Conference on Recommender Systems (RecSys '23)

  18. arXiv:2307.14680  [pdf, other

    cs.LG

    TimeGNN: Temporal Dynamic Graph Learning for Time Series Forecasting

    Authors: Nancy Xu, Chrysoula Kosma, Michalis Vazirgiannis

    Abstract: Time series forecasting lies at the core of important real-world applications in many fields of science and engineering. The abundance of large time series datasets that consist of complex patterns and long-term dependencies has led to the development of various neural network architectures. Graph neural network approaches, which jointly learn a graph structure based on the correlation of raw valu… ▽ More

    Submitted 30 November, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  19. arXiv:2307.14367  [pdf, other

    q-bio.QM cs.CL cs.LG

    Prot2Text: Multimodal Protein's Function Generation with GNNs and Transformers

    Authors: Hadi Abdine, Michail Chatzianastasis, Costas Bouyioukos, Michalis Vazirgiannis

    Abstract: In recent years, significant progress has been made in the field of protein function prediction with the development of various machine-learning approaches. However, most existing methods formulate the task as a multi-classification problem, i.e. assigning predefined labels to proteins. In this work, we propose a novel approach, Prot2Text, which predicts a protein's function in a free text style,… ▽ More

    Submitted 20 April, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 38(10), 10757-10765 (2024)

  20. arXiv:2307.05217  [pdf, other

    cs.LG cs.SI

    Supervised Attention Using Homophily in Graph Neural Networks

    Authors: Michail Chatzianastasis, Giannis Nikolentzos, Michalis Vazirgiannis

    Abstract: Graph neural networks have become the standard approach for dealing with learning problems on graphs. Among the different variants of graph neural networks, graph attention networks (GATs) have been applied with great success to different tasks. In the GAT model, each node assigns an importance score to its neighbors using an attention mechanism. However, similar to other graph neural networks, GA… ▽ More

    Submitted 15 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted at ICANN 2023

  21. arXiv:2306.05955  [pdf, other

    cs.LG stat.ML

    Path Neural Networks: Expressive and Accurate Graph Neural Networks

    Authors: Gaspard Michel, Giannis Nikolentzos, Johannes Lutzeyer, Michalis Vazirgiannis

    Abstract: Graph neural networks (GNNs) have recently become the standard approach for learning with graph-structured data. Prior work has shed light into their potential, but also their limitations. Unfortunately, it was shown that standard GNNs are limited in their expressive power. These models are no more powerful than the 1-dimensional Weisfeiler-Leman (1-WL) algorithm in terms of distinguishing non-iso… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023

  22. arXiv:2304.10851  [pdf, other

    cs.LG

    What Do GNNs Actually Learn? Towards Understanding their Representations

    Authors: Giannis Nikolentzos, Michail Chatzianastasis, Michalis Vazirgiannis

    Abstract: In recent years, graph neural networks (GNNs) have achieved great success in the field of graph representation learning. Although prior work has shed light into the expressiveness of those models (\ie whether they can distinguish pairs of non-isomorphic graphs), it is still not clear what structural information is encoded into the node representations that are learned by those models. In this pape… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

  23. arXiv:2304.00869  [pdf, other

    cs.CL

    GreekBART: The First Pretrained Greek Sequence-to-Sequence Model

    Authors: Iakovos Evdaimon, Hadi Abdine, Christos Xypolopoulos, Stamatis Outsios, Michalis Vazirgiannis, Giorgos Stamou

    Abstract: The era of transfer learning has revolutionized the fields of Computer Vision and Natural Language Processing, bringing powerful pretrained models with exceptional performance across a variety of tasks. Specifically, Natural Language Processing tasks have been dominated by transformer-based language models. In Natural Language Inference and Natural Language Generation tasks, the BERT model and its… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  24. arXiv:2302.05894  [pdf, other

    cs.LG

    Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia

    Authors: Michail Chatzianastasis, Loukas Ilias, Dimitris Askounis, Michalis Vazirgiannis

    Abstract: Alzheimer's dementia (AD) affects memory, thinking, and language, deteriorating person's life. An early diagnosis is very important as it enables the person to receive medical help and ensure quality of life. Therefore, leveraging spontaneous speech in conjunction with machine learning methods for recognizing AD patients has emerged into a hot topic. Most of the previous works employ Convolutional… ▽ More

    Submitted 5 April, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: Accepted at ICASSP 2023

  25. arXiv:2301.08831  [pdf, other

    cs.LG

    Explainable Multilayer Graph Neural Network for Cancer Gene Prediction

    Authors: Michail Chatzianastasis, Michalis Vazirgiannis, Zijun Zhang

    Abstract: The identification of cancer genes is a critical yet challenging problem in cancer genomics research. Existing computational methods, including deep graph neural networks, fail to exploit the multilayered gene-gene interactions or provide limited explanation for their predictions. These methods are restricted to a single biological network, which cannot capture the full complexity of tumorigenesis… ▽ More

    Submitted 3 May, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  26. arXiv:2211.08972  [pdf, other

    cs.LG cs.SI stat.ML

    New Frontiers in Graph Autoencoders: Joint Community Detection and Link Prediction

    Authors: Guillaume Salha-Galvan, Johannes F. Lutzeyer, George Dasoulas, Romain Hennequin, Michalis Vazirgiannis

    Abstract: Graph autoencoders (GAE) and variational graph autoencoders (VGAE) emerged as powerful methods for link prediction (LP). Their performances are less impressive on community detection (CD), where they are often outperformed by simpler alternatives such as the Louvain method. It is still unclear to what extent one can improve CD with GAE and VGAE, especially in the absence of node features. It is mo… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: This NeurIPS 2022 GLFrontiers workshop paper summarizes results from the following journal article: arXiv:2202.00961. arXiv admin note: text overlap with arXiv:2205.14651

  27. arXiv:2211.04248  [pdf, other

    cs.LG cs.SI stat.ML

    Improving Graph Neural Networks at Scale: Combining Approximate PageRank and CoreRank

    Authors: Ariel R. Ramos Vela, Johannes F. Lutzeyer, Anastasios Giovanidis, Michalis Vazirgiannis

    Abstract: Graph Neural Networks (GNNs) have achieved great successes in many learning tasks performed on graph structures. Nonetheless, to propagate information GNNs rely on a message passing scheme which can become prohibitively expensive when working with industrial-scale graphs. Inspired by the PPRGo model, we propose the CorePPR model, a scalable solution that utilises a learnable convex combination of… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted at the "NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)"

  28. arXiv:2211.02501  [pdf, other

    cs.LG

    Weisfeiler and Leman go Hyperbolic: Learning Distance Preserving Node Representations

    Authors: Giannis Nikolentzos, Michail Chatzianastasis, Michalis Vazirgiannis

    Abstract: In recent years, graph neural networks (GNNs) have emerged as a promising tool for solving machine learning problems on graphs. Most GNNs are members of the family of message passing neural networks (MPNNs). There is a close connection between these models and the Weisfeiler-Leman (WL) test of isomorphism, an algorithm that can successfully test isomorphism for a broad class of graphs. Recently, m… ▽ More

    Submitted 28 February, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted at AISTATS 2023

  29. arXiv:2210.17378  [pdf, other

    cs.CL

    Questioning the Validity of Summarization Datasets and Improving Their Factual Consistency

    Authors: Yanzhu Guo, Chloé Clavel, Moussa Kamal Eddine, Michalis Vazirgiannis

    Abstract: The topic of summarization evaluation has recently attracted a surge of attention due to the rapid development of abstractive summarization systems. However, the formulation of the task is rather ambiguous, neither the linguistic nor the natural language processing community has succeeded in giving a mutually agreed-upon definition. Due to this lack of well-defined formulation, a large number of p… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  30. arXiv:2210.06576  [pdf, other

    cs.CL

    DATScore: Evaluating Translation with Data Augmented Translations

    Authors: Moussa Kamal Eddine, Guokan Shang, Michalis Vazirgiannis

    Abstract: The rapid development of large pretrained language models has revolutionized not only the field of Natural Language Generation (NLG) but also its evaluation. Inspired by the recent work of BARTScore: a metric leveraging the BART language model to evaluate the quality of generated text from various aspects, we introduce DATScore. DATScore uses data augmentation techniques to improve the evaluation… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  31. Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization

    Authors: Hadi Abdine, Moussa Kamal Eddine, Michalis Vazirgiannis, Davide Buscaldi

    Abstract: Word sense induction (WSI) is a difficult problem in natural language processing that involves the unsupervised automatic detection of a word's senses (i.e. meanings). Recent work achieves significant results on the WSI task by pre-training a language model that can exclusively disambiguate word senses, whereas others employ previously pre-trained language models in conjunction with additional str… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  32. arXiv:2208.04163  [pdf, other

    cs.CL

    Abstractive Meeting Summarization: A Survey

    Authors: Virgile Rennard, Guokan Shang, Julie Hunter, Michalis Vazirgiannis

    Abstract: A system that could reliably identify and sum up the most important points of a conversation would be valuable in a wide variety of real-world contexts, from business meetings to medical consultations to customer service calls. Recent advances in deep learning, and especially the invention of encoder-decoder architectures, has significantly improved language generation systems, opening the door to… ▽ More

    Submitted 25 April, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: pre-MIT Press publication version for TACL journal

  33. arXiv:2207.13441  [pdf, ps, other

    cs.LG cs.AI

    Time Series Forecasting Models Copy the Past: How to Mitigate

    Authors: Chrysoula Kosma, Giannis Nikolentzos, Nancy Xu, Michalis Vazirgiannis

    Abstract: Time series forecasting is at the core of important application domains posing significant challenges to machine learning algorithms. Recently neural network architectures have been widely applied to the problem of time series forecasting. Most of these models are trained by minimizing a loss function that measures predictions' deviation from the real values. Typical loss functions include mean sq… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: accepted at ICANN'22

  34. arXiv:2207.03579  [pdf, other

    cs.SI cs.AI

    DGraph: A Large-Scale Financial Dataset for Graph Anomaly Detection

    Authors: Xuanwen Huang, Yang Yang, Yang Wang, Chun** Wang, Zhisheng Zhang, Jiarong Xu, Lei Chen, Michalis Vazirgiannis

    Abstract: Graph Anomaly Detection (GAD) has recently become a hot research spot due to its practicability and theoretical value. Since GAD emphasizes the application and the rarity of anomalous samples, enriching the varieties of its datasets is fundamental work. Thus, this paper present DGraph, a real-world dynamic graph in the finance domain. DGraph overcomes many limitations of current GAD datasets. It c… ▽ More

    Submitted 9 June, 2023; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022. Dataset Url: https://dgraph.xinye.com/

  35. arXiv:2205.14275  [pdf, ps, other

    cs.CV cs.LG

    Image Keypoint Matching using Graph Neural Networks

    Authors: Nancy Xu, Giannis Nikolentzos, Michalis Vazirgiannis, Henrik Boström

    Abstract: Image matching is a key component of many tasks in computer vision and its main objective is to find correspondences between features extracted from different natural images. When images are represented as graphs, image matching boils down to the problem of graph matching which has been studied intensively in the past. In recent years, graph neural networks have shown great potential in the graph… ▽ More

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: Complex Networks

  36. arXiv:2204.07436  [pdf, other

    cs.SI cs.CL

    Political Communities on Twitter: Case Study of the 2022 French Presidential Election

    Authors: Hadi Abdine, Yanzhu Guo, Virgile Rennard, Michalis Vazirgiannis

    Abstract: With the significant increase in users on social media platforms, a new means of political campaigning has appeared. Twitter and Facebook are now notable campaigning tools during elections. Indeed, the candidates and their parties now take to the internet to interact and spread their ideas. In this paper, we aim to identify political communities formed on Twitter during the 2022 French presidentia… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

  37. arXiv:2204.05351  [pdf, other

    cs.LG

    Graph Ordering Attention Networks

    Authors: Michail Chatzianastasis, Johannes F. Lutzeyer, George Dasoulas, Michalis Vazirgiannis

    Abstract: Graph Neural Networks (GNNs) have been successfully used in many problems involving graph-structured data, achieving state-of-the-art performance. GNNs typically employ a message-passing scheme, in which every node aggregates information from its neighbors using a permutation-invariant aggregation function. Standard well-examined choices such as the mean or sum aggregation functions have limited c… ▽ More

    Submitted 21 November, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at AAAI 2023

  38. arXiv:2203.10945  [pdf, other

    cs.CL

    AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization

    Authors: Moussa Kamal Eddine, Nadi Tomeh, Nizar Habash, Joseph Le Roux, Michalis Vazirgiannis

    Abstract: Like most natural language understanding and generation tasks, state-of-the-art models for summarization are transformer-based sequence-to-sequence architectures that are pretrained on large corpora. While most existing models focused on English, Arabic remained understudied. In this paper we propose AraBART, the first Arabic model in which the encoder and the decoder are pretrained end-to-end, ba… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  39. arXiv:2202.00961  [pdf, other

    cs.LG cs.SI stat.ML

    Modularity-Aware Graph Autoencoders for Joint Community Detection and Link Prediction

    Authors: Guillaume Salha-Galvan, Johannes F. Lutzeyer, George Dasoulas, Romain Hennequin, Michalis Vazirgiannis

    Abstract: Graph autoencoders (GAE) and variational graph autoencoders (VGAE) emerged as powerful methods for link prediction. Their performances are less impressive on community detection problems where, according to recent and concurring experimental evaluations, they are often outperformed by simpler alternatives such as the Louvain method. It is currently still unclear to which extent one can improve com… ▽ More

    Submitted 20 June, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Accepted for publication in Elsevier's Neural Networks journal in 2022

  40. arXiv:2201.01346  [pdf, other

    cs.DL

    Exploratory Analysis of Academic Collaborations between French and US

    Authors: George Panagopoulos, Michalis Vazirgiannis

    Abstract: International academic collaborations cultivate diversity in the research landscape and facilitate multiperspective methods, as the scope of each country's science depends on its needs, history, wealth etc. Moreover the quality of science differ significantly amongst nations\cite{king2004scientific}, which renders international collaborations a potential source to understand the dynamics between c… ▽ More

    Submitted 3 February, 2022; v1 submitted 4 January, 2022; originally announced January 2022.

  41. arXiv:2112.00566  [pdf, ps, other

    cs.CL

    NLP Research and Resources at DaSciM, Ecole Polytechnique

    Authors: Hadi Abdine, Yanzhu Guo, Moussa Kamal Eddine, Giannis Nikolentzos, Stamatis Outsios, Guokan Shang, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: DaSciM (Data Science and Mining) part of LIX at Ecole Polytechnique, established in 2013 and since then producing research results in the area of large scale data analysis via methods of machine and deep learning. The group has been specifically active in the area of NLP and text mining with interesting results at methodological and resources level. Here follow our different contributions of inter… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  42. arXiv:2110.08559  [pdf, other

    cs.CL

    FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metricsfor Automatic Text Generation

    Authors: Moussa Kamal Eddine, Guokan Shang, Antoine J. -P. Tixier, Michalis Vazirgiannis

    Abstract: Fast and reliable evaluation metrics are key to R&D progress. While traditional natural language generation metrics are fast, they are not very reliable. Conversely, new metrics based on large pretrained language models are much more reliable, but require significant computational resources. In this paper, we propose FrugalScore, an approach to learn a fixed, low cost version of any expensive NLG… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

  43. arXiv:2110.01872  [pdf, ps, other

    stat.ML cs.LG

    Permute Me Softly: Learning Soft Permutations for Graph Representations

    Authors: Giannis Nikolentzos, George Dasoulas, Michalis Vazirgiannis

    Abstract: Graph neural networks (GNNs) have recently emerged as a dominant paradigm for machine learning with graphs. Research on GNNs has mainly focused on the family of message passing neural networks (MPNNs). Similar to the Weisfeiler-Leman (WL) test of isomorphism, these models follow an iterative neighborhood aggregation procedure to update vertex representations, and they next compute graph representa… ▽ More

    Submitted 4 July, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted at IEEE Transactions on Pattern Analysis and Machine Intelligence

  44. arXiv:2110.01485  [pdf, other

    cs.CL

    JuriBERT: A Masked-Language Model Adaptation for French Legal Text

    Authors: Stella Douka, Hadi Abdine, Michalis Vazirgiannis, Rajaa El Hamdani, David Restrepo Amariles

    Abstract: Language models have proven to be very useful when adapted to specific domains. Nonetheless, little research has been done on the adaptation of domain-specific BERT models in the French language. In this paper, we focus on creating a language model adapted to French legal text with the goal of hel** law professionals. We conclude that some specific tasks do not benefit from generic language mode… ▽ More

    Submitted 12 February, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 7 pages

  45. arXiv:2109.10234  [pdf, other

    cs.CL

    BERTweetFR : Domain Adaptation of Pre-Trained Language Models for French Tweets

    Authors: Yanzhu Guo, Virgile Rennard, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: We introduce BERTweetFR, the first large-scale pre-trained language model for French tweets. Our model is initialized using the general-domain French language model CamemBERT which follows the base architecture of RoBERTa. Experiments show that BERTweetFR outperforms all previous general-domain French language models on two downstream Twitter NLP tasks of offensiveness identification and named ent… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Accepted at the Seventh Workshop on Noisy User-generated Text (W-NUT 2021)

  46. arXiv:2109.01785  [pdf, other

    cs.LG cs.SI stat.ML

    Node Feature Kernels Increase Graph Convolutional Network Robustness

    Authors: Mohamed El Amine Seddik, Changmin Wu, Johannes F. Lutzeyer, Michalis Vazirgiannis

    Abstract: The robustness of the much-used Graph Convolutional Networks (GCNs) to perturbations of their input is becoming a topic of increasing importance. In this paper, the random GCN is introduced for which a random matrix theory analysis is possible. This analysis suggests that if the graph is sufficiently perturbed, or in the extreme case random, then the GCN fails to benefit from the node features. It… ▽ More

    Submitted 21 February, 2022; v1 submitted 4 September, 2021; originally announced September 2021.

    Comments: 16 pages, 5 figures

  47. arXiv:2109.00909  [pdf, ps, other

    cs.LG cs.AI cs.SI

    Sparsifying the Update Step in Graph Neural Networks

    Authors: Johannes F. Lutzeyer, Changmin Wu, Michalis Vazirgiannis

    Abstract: Message-Passing Neural Networks (MPNNs), the most prominent Graph Neural Network (GNN) framework, celebrate much success in the analysis of graph-structured data. Concurrently, the sparsification of Neural Network models attracts a great amount of academic and industrial interest. In this paper we conduct a structured, empirical study of the effect of sparsification on the trainable part of MPNNs… ▽ More

    Submitted 15 April, 2022; v1 submitted 2 September, 2021; originally announced September 2021.

    Comments: 11 pages

  48. arXiv:2108.04623  [pdf, other

    cs.LG cs.AI cs.SI

    Maximizing Influence with Graph Neural Networks

    Authors: George Panagopoulos, Nikolaos Tziortziotis, Michalis Vazirgiannis, Fragkiskos D. Malliaros

    Abstract: Finding the seed set that maximizes the influence spread over a network is a well-known NP-hard problem. Though a greedy algorithm can provide near-optimal solutions, the subproblem of influence estimation renders the solutions inefficient. In this work, we propose \textsc{Glie}, a graph neural network that learns how to estimate the influence spread of the independent cascade. \textsc{Glie} relie… ▽ More

    Submitted 14 October, 2023; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: 8, IEEE/ACM ASONAM 2023

  49. arXiv:2108.01053  [pdf, other

    cs.LG cs.IR cs.SI

    Cold Start Similar Artists Ranking with Gravity-Inspired Graph Autoencoders

    Authors: Guillaume Salha-Galvan, Romain Hennequin, Benjamin Chapus, Viet-Anh Tran, Michalis Vazirgiannis

    Abstract: On an artist's profile page, music streaming services frequently recommend a ranked list of "similar artists" that fans also liked. However, implementing such a feature is challenging for new artists, for which usage data on the service (e.g. streams or likes) is not yet available. In this paper, we model this cold start similar artists ranking problem as a link prediction task in a directed and a… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 15th ACM Conference on Recommender Systems (RecSys 2021)

  50. arXiv:2105.04885  [pdf, other

    cs.LG cs.AI cs.CV

    Graph-based Neural Architecture Search with Operation Embeddings

    Authors: Michail Chatzianastasis, George Dasoulas, Georgios Siolas, Michalis Vazirgiannis

    Abstract: Neural Architecture Search (NAS) has recently gained increased attention, as a class of approaches that automatically searches in an input space of network architectures. A crucial part of the NAS pipeline is the encoding of the architecture that consists of the applied computational blocks, namely the operations and the links between them. Most of the existing approaches either fail to capture th… ▽ More

    Submitted 17 August, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: 12 pages, 10 figures