Skip to main content

Showing 1–50 of 218 results for author: Liò, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14442  [pdf, other

    cs.LG cs.AI cs.CE q-bio.BM q-bio.MN

    Graph Representation Learning Strategies for Omics Data: A Case Study on Parkinson's Disease

    Authors: Elisa Gómez de Lope, Saurabh Deshpande, Ramón Viñas Torné, Pietro Liò, Enrico Glaab, Stéphane P. A. Bordas

    Abstract: Omics data analysis is crucial for studying complex diseases, but its high dimensionality and heterogeneity challenge classical statistical and machine learning methods. Graph neural networks have emerged as promising alternatives, yet the optimal strategies for their design and optimization in real-world biomedical challenges remain unclear. This study evaluates various graph representation learn… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Submitted to Machine Learning in Computational Biology 2024 as an extended abstract, 2 pages + 1 appendix

  2. arXiv:2406.13864  [pdf, other

    cs.LG q-bio.BM

    Evaluating representation learning on the protein structure universe

    Authors: Arian R. Jamasb, Alex Morehead, Chaitanya K. Joshi, Zuobai Zhang, Kieran Didi, Simon V. Mathis, Charles Harris, Jian Tang, Jianlin Cheng, Pietro Lio, Tom L. Blundell

    Abstract: We introduce ProteinWorkshop, a comprehensive benchmark suite for representation learning on protein structures with Geometric Graph Neural Networks. We consider large-scale pre-training and downstream tasks on both experimental and predicted structures to enable the systematic evaluation of the quality of the learned structural representation and their usefulness in capturing functional relations… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ICLR 2024

  3. arXiv:2406.13839  [pdf, other

    q-bio.BM cs.LG q-bio.GN

    RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design

    Authors: Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian R. Jamasb, Charles Harris, Simon V. Mathis, Kieran Didi, Bryan Hooi, Pietro Liò

    Abstract: We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon SE(3) flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges posed by RNA modeling. We formulate RNA structures as a set of rigid-body frames and associated loss functions which account for larger, more conformationally fle… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: To be presented as an Oral at ICML 2024 Structured Probabilistic Inference & Generative Modeling Workshop, and a Spotlight at ICML 2024 AI4Science Workshop

  4. arXiv:2406.05832  [pdf, other

    q-bio.QM cs.LG q-bio.BM

    Improving Antibody Design with Force-Guided Sampling in Diffusion Models

    Authors: Paulina Kulytė, Francisco Vargas, Simon Valentin Mathis, Yu Guang Wang, José Miguel Hernández-Lobato, Pietro Liò

    Abstract: Antibodies, crucial for immune defense, primarily rely on complementarity-determining regions (CDRs) to bind and neutralize antigens, such as viruses. The design of these CDRs determines the antibody's affinity and specificity towards its target. Generative models, particularly denoising diffusion probabilistic models (DDPMs), have shown potential to advance the structure-based design of CDR regio… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  5. arXiv:2406.04501  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    FLUID-LLM: Learning Computational Fluid Dynamics with Spatiotemporal-aware Large Language Models

    Authors: Max Zhu, Adrián Bazaga, Pietro Liò

    Abstract: Learning computational fluid dynamics (CFD) traditionally relies on computationally intensive simulations of the Navier-Stokes equations. Recently, large language models (LLMs) have shown remarkable pattern recognition and reasoning abilities in natural language processing (NLP) and computer vision (CV). However, these models struggle with the complex geometries inherent in fluid dynamics. We intr… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2406.03145  [pdf, other

    cs.LG

    E(n) Equivariant Message Passing Cellular Networks

    Authors: Veljko Kovač, Erik J. Bekkers, Pietro Liò, Floor Eijkelboom

    Abstract: This paper introduces E(n) Equivariant Message Passing Cellular Networks (EMPCNs), an extension of E(n) Equivariant Graph Neural Networks to CW-complexes. Our approach addresses two aspects of geometric message passing networks: 1) enhancing their expressiveness by incorporating arbitrary cells, and 2) achieving this in a computationally efficient way with a decoupled EMPCNs technique. We demonstr… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2406.03143  [pdf, other

    cs.CV cs.CR

    ZeroPur: Succinct Training-Free Adversarial Purification

    Authors: Xiuli Bi, Zonglin Yang, Bo Liu, Xiaodong Cun, Chi-Man Pun, Pietro Lio, Bin Xiao

    Abstract: Adversarial purification is a kind of defense technique that can defend various unseen adversarial attacks without modifying the victim classifier. Existing methods often depend on external generative models or cooperation between auxiliary functions and victim classifiers. However, retraining generative models, auxiliary functions, or victim classifiers relies on the domain of the fine-tuned data… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures, under review

  8. arXiv:2406.01805  [pdf, other

    cs.LG cs.AI

    TabMDA: Tabular Manifold Data Augmentation for Any Classifier using Transformers with In-context Subsetting

    Authors: Andrei Margeloiu, Adrián Bazaga, Nikola Simidjievski, Pietro Liò, Mateja Jamnik

    Abstract: Tabular data is prevalent in many critical domains, yet it is often challenging to acquire in large quantities. This scarcity usually results in poor performance of machine learning models on such data. Data augmentation, a common strategy for performance improvement in vision and language tasks, typically underperforms for tabular data due to the lack of explicit symmetries in the input space. To… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2406.01781  [pdf, other

    cs.LG

    DEFT: Efficient Finetuning of Conditional Diffusion Models by Learning the Generalised $h$-transform

    Authors: Alexander Denker, Francisco Vargas, Shreyas Padhy, Kieran Didi, Simon Mathis, Vincent Dutordoir, Riccardo Barbano, Emile Mathieu, Urszula Julia Komorowska, Pietro Lio

    Abstract: Generative modelling paradigms based on denoising diffusion processes have emerged as a leading candidate for conditional sampling in inverse problems. In many real-world applications, we often have access to large, expensively trained unconditional diffusion models, which we aim to exploit for improving conditional sampling. Most recent approaches are motivated heuristically and lack a unifying f… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.09236

  10. arXiv:2406.00216  [pdf, other

    cs.AI

    The Explanation Necessity for Healthcare AI

    Authors: Michail Mamalakis, Héloïse de Vareilles, Graham Murray, Pietro Lio, John Suckling

    Abstract: Explainability is often critical to the acceptable implementation of artificial intelligence (AI). Nowhere is this more important than healthcare where decision-making directly impacts patients and trust in AI systems is essential. This trust is often built on the explanations and interpretations the AI provides. Despite significant advancements in AI interpretability, there remains the need for c… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  11. arXiv:2405.20882  [pdf, other

    cs.LG

    Sheaf HyperNetworks for Personalized Federated Learning

    Authors: Bao Nguyen, Lorenzo Sani, Xinchi Qiu, Pietro Liò, Nicholas D. Lane

    Abstract: Graph hypernetworks (GHNs), constructed by combining graph neural networks (GNNs) with hypernetworks (HNs), leverage relational data across various domains such as neural architecture search, molecular property prediction and federated learning. Despite GNNs and HNs being individually successful, we show that GHNs present problems compromising their performance, such as over-smoothing and heteroph… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 25 pages, 12 figures, 7 tables, pre-print under review

  12. arXiv:2405.19204  [pdf, other

    eess.IV cs.CV

    Contrastive-Adversarial and Diffusion: Exploring pre-training and fine-tuning strategies for sulcal identification

    Authors: Michail Mamalakis, Héloïse de Vareilles, Shun-Chin Jim Wu, Ingrid Agartz, Lynn Egeland Mørch-Johnsen, Jane Garrison, Jon Simons, Pietro Lio, John Suckling, Graham Murray

    Abstract: In the last decade, computer vision has witnessed the establishment of various training and learning approaches. Techniques like adversarial learning, contrastive learning, diffusion denoising learning, and ordinary reconstruction learning have become standard, representing state-of-the-art methods extensively employed for fully training or pre-training networks across various vision tasks. The ex… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  13. arXiv:2405.12474  [pdf, other

    cs.LG cs.SI

    How Universal Polynomial Bases Enhance Spectral Graph Neural Networks: Heterophily, Over-smoothing, and Over-squashing

    Authors: Keke Huang, Yu Guang Wang, Ming Li, and Pietro Liò

    Abstract: Spectral Graph Neural Networks (GNNs), alternatively known as graph filters, have gained increasing prevalence for heterophily graphs. Optimal graph filters rely on Laplacian eigendecomposition for Fourier transform. In an attempt to avert prohibitive computations, numerous polynomial filters have been proposed. However, polynomials in the majority of these filters are predefined and remain fixed… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.18177

  14. arXiv:2405.10008  [pdf, other

    cs.CV

    Solving the enigma: Deriving optimal explanations of deep networks

    Authors: Michail Mamalakis, Antonios Mamalakis, Ingrid Agartz, Lynn Egeland Mørch-Johnsen, Graham Murray, John Suckling, Pietro Lio

    Abstract: The accelerated progress of artificial intelligence (AI) has popularized deep learning models across domains, yet their inherent opacity poses challenges, notably in critical fields like healthcare, medicine and the geosciences. Explainable AI (XAI) has emerged to shed light on these "black box" models, hel** decipher their decision making process. Nevertheless, different XAI methods yield highl… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: keywords: XAI, neuroscience, brain, 3D, 2D, computer vision, classification

  15. arXiv:2405.04189  [pdf

    cs.CV

    Artificial Intelligence-powered fossil shark tooth identification: Unleashing the potential of Convolutional Neural Networks

    Authors: Andrea Barucci, Giulia Ciacci, Pietro Liò, Tiago Azevedo, Andrea Di Cencio, Marco Merella, Giovanni Bianucci, Giulia Bosio, Simone Casati, Alberto Collareta

    Abstract: All fields of knowledge are being impacted by Artificial Intelligence. In particular, the Deep Learning paradigm enables the development of data analysis tools that support subject matter experts in a variety of sectors, from physics up to the recognition of ancient languages. Palaeontology is now observing this trend as well. This study explores the capability of Convolutional Neural Networks (CN… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 40 pages, 8 figures

  16. arXiv:2405.01155  [pdf, other

    cs.LG q-bio.BM

    SynFlowNet: Towards Molecule Design with Guaranteed Synthesis Pathways

    Authors: Miruna Cretu, Charles Harris, Julien Roy, Emmanuel Bengio, Pietro Liò

    Abstract: Recent breakthroughs in generative modelling have led to a number of works proposing molecular generation models for drug discovery. While these models perform well at capturing drug-like motifs, they are known to often produce synthetically inaccessible molecules. This is because they are trained to compose atoms or fragments in a way that approximates the training distribution, but they are not… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Presented at ICLR 2024 GEM Workshop

  17. arXiv:2403.15297  [pdf, other

    cs.AI

    Sphere Neural-Networks for Rational Reasoning

    Authors: Tiansi Dong, Mateja Jamnik, Pietro Liò

    Abstract: The success of Large Language Models (LLMs), e.g., ChatGPT, is witnessed by their planetary popularity, their capability of human-like communication, and also by their steadily improved reasoning performance. However, it remains unclear whether LLMs reason. It is an open problem how traditional neural networks can be qualitatively extended to go beyond the statistic paradigm and achieve high-level… ▽ More

    Submitted 24 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  18. arXiv:2403.08549  [pdf, other

    cs.NE cs.AR

    Wet TinyML: Chemical Neural Network Using Gene Regulation and Cell Plasticity

    Authors: Samitha Somathilaka, Adrian Ratwatte, Sasitharan Balasubramaniam, Mehmet Can Vuran, Witawas Srisa-an, Pietro Liò

    Abstract: In our earlier work, we introduced the concept of Gene Regulatory Neural Network (GRNN), which utilizes natural neural network-like structures inherent in biological cells to perform computing tasks using chemical inputs. We define this form of chemical-based neural network as Wet TinyML. The GRNN structures are based on the gene regulatory network and have weights associated with each link based… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted as a full paper by the tinyML Research Symposium 2024

  19. arXiv:2403.07954  [pdf, other

    cs.LG eess.SP

    Optimizing Polynomial Graph Filters: A Novel Adaptive Krylov Subspace Approach

    Authors: Keke Huang, Wencai Cao, Hoang Ta, Xiaokui Xiao, Pietro Liò

    Abstract: Graph Neural Networks (GNNs), known as spectral graph filters, find a wide range of applications in web networks. To bypass eigendecomposition, polynomial graph filters are proposed to approximate graph filters by leveraging various polynomial bases for filter training. However, no existing studies have explored the diverse polynomial graph filters from a unified perspective for optimization. In… ▽ More

    Submitted 20 May, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  20. arXiv:2403.04106  [pdf

    cs.AI

    Understanding Biology in the Age of Artificial Intelligence

    Authors: Elsa Lawrence, Adham El-Shazly, Srijit Seal, Chaitanya K Joshi, Pietro Liò, Shantanu Singh, Andreas Bender, Pietro Sormanni, Matthew Greenig

    Abstract: Modern life sciences research is increasingly relying on artificial intelligence approaches to model biological systems, primarily centered around the use of machine learning (ML) models. Although ML is undeniably useful for identifying patterns in large, complex data sets, its widespread application in biological sciences represents a significant deviation from traditional methods of scientific i… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  21. arXiv:2402.13033  [pdf, other

    cs.LG cs.IR cs.SI

    Enhancing Real-World Complex Network Representations with Hyperedge Augmentation

    Authors: Xiangyu Zhao, Zehui Li, Mingzhu Shen, Guy-Bart Stan, Pietro Liò, Yiren Zhao

    Abstract: Graph augmentation methods play a crucial role in improving the performance and enhancing generalisation capabilities in Graph Neural Networks (GNNs). Existing graph augmentation methods mainly perturb the graph structures and are usually limited to pairwise node relations. These methods cannot fully address the complexities of real-world large-scale networks that often involve higher-order node r… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Preprint. Under review. 17 pages, 4 figures, 14 tables. arXiv admin note: text overlap with arXiv:2306.05108

  22. arXiv:2402.10793  [pdf, other

    cs.LG cs.AI

    Masked Attention is All You Need for Graphs

    Authors: David Buterez, Jon Paul Janet, Dino Oglic, Pietro Lio

    Abstract: Graph neural networks (GNNs) and variations of the message passing algorithm are the predominant means for learning on graphs, largely due to their flexibility, speed, and satisfactory performance. The design of powerful and general purpose GNNs, however, requires significant research efforts and often relies on handcrafted, carefully-chosen message passing operators. Motivated by this, we propose… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  23. arXiv:2402.08871  [pdf, other

    cs.LG stat.ML

    Position: Topological Deep Learning is the New Frontier for Relational Learning

    Authors: Theodore Papamarkou, Tolga Birdal, Michael Bronstein, Gunnar Carlsson, Justin Curry, Yue Gao, Mustafa Hajij, Roland Kwitt, Pietro Liò, Paolo Di Lorenzo, Vasileios Maroulas, Nina Miolane, Farzana Nasrin, Karthikeyan Natesan Ramamurthy, Bastian Rieck, Simone Scardapane, Michael T. Schaub, Petar Veličković, Bei Wang, Yusu Wang, Guo-Wei Wei, Ghada Zamzmi

    Abstract: Topological deep learning (TDL) is a rapidly evolving field that uses topological features to understand and design deep learning models. This paper posits that TDL is the new frontier for relational learning. TDL may complement graph representation learning and geometric deep learning by incorporating topological concepts, and can thus provide a natural choice for various machine learning setting… ▽ More

    Submitted 30 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  24. arXiv:2402.07309  [pdf, other

    cs.LG cs.CL stat.ML

    HyperBERT: Mixing Hypergraph-Aware Layers with Language Models for Node Classification on Text-Attributed Hypergraphs

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Hypergraphs are characterized by complex topological structure, representing higher-order interactions among multiple entities through hyperedges. Lately, hypergraph-based deep learning methods to learn informative data representations for the problem of node classification on text-attributed hypergraphs have garnered increasing research attention. However, existing methods struggle to simultaneou… ▽ More

    Submitted 25 May, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  25. arXiv:2402.06445  [pdf, other

    cs.LG

    The Deep Equilibrium Algorithmic Reasoner

    Authors: Dobrik Georgiev, Pietro Liò, Davide Buffelli

    Abstract: Recent work on neural algorithmic reasoning has demonstrated that graph neural networks (GNNs) could learn to execute classical algorithms. Doing so, however, has always used a recurrent architecture, where each iteration of the GNN aligns with an algorithm's iteration. Since an algorithm's solution is often an equilibrium, we conjecture and empirically validate that one can train a network to sol… ▽ More

    Submitted 9 April, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  26. arXiv:2312.09236  [pdf, other

    cs.LG q-bio.BM

    A framework for conditional diffusion modelling with applications in motif scaffolding for protein design

    Authors: Kieran Didi, Francisco Vargas, Simon V Mathis, Vincent Dutordoir, Emile Mathieu, Urszula J Komorowska, Pietro Lio

    Abstract: Many protein design applications, such as binder or enzyme design, require scaffolding a structural motif with high precision. Generative modelling paradigms based on denoising diffusion processes emerged as a leading candidate to address this motif scaffolding problem and have shown early experimental success in some cases. In the diffusion paradigm, motif scaffolding is treated as a conditional… ▽ More

    Submitted 13 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages

  27. arXiv:2312.07511  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems

    Authors: Alexandre Duval, Simon V. Mathis, Chaitanya K. Joshi, Victor Schmidt, Santiago Miret, Fragkiskos D. Malliaros, Taco Cohen, Pietro Liò, Yoshua Bengio, Michael Bronstein

    Abstract: Recent advances in computational modelling of atomic systems, spanning molecules, proteins, and materials, represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space. In these graphs, the geometric attributes transform according to the inherent physical symmetries of 3D atomic systems, including rotations and translations in Euclidean space, as well as node permutations.… ▽ More

    Submitted 13 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  28. arXiv:2312.04193  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Language Model Knowledge Distillation for Efficient Question Answering in Spanish

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Recent advances in the development of pre-trained Spanish language models has led to significant progress in many Natural Language Processing (NLP) tasks, such as question answering. However, the lack of efficient models imposes a barrier for the adoption of such models in resource-constrained environments. Therefore, smaller distilled models for the Spanish language could be proven to be highly s… ▽ More

    Submitted 16 March, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: ICLR 2024 Tiny Paper (6 pages, 2 tables)

  29. arXiv:2312.02225  [pdf, other

    physics.med-ph cs.CV cs.LG eess.IV

    Digital Histopathology with Graph Neural Networks: Concepts and Explanations for Clinicians

    Authors: Alessandro Farace di Villaforesta, Lucie Charlotte Magister, Pietro Barbiero, Pietro Liò

    Abstract: To address the challenge of the ``black-box" nature of deep learning in medical settings, we combine GCExplainer - an automated concept discovery solution - along with Logic Explained Networks to provide global explanations for Graph Neural Networks. We demonstrate this using a generally applicable graph construction and classification pipeline, involving panoptic segmentation with HoVer-Net and c… ▽ More

    Submitted 28 December, 2023; v1 submitted 3 December, 2023; originally announced December 2023.

  30. arXiv:2312.00677  [pdf, other

    eess.IV cs.CV

    Unsupervised Adaptive Implicit Neural Representation Learning for Scan-Specific MRI Reconstruction

    Authors: Junwei Yang, Pietro Liò

    Abstract: In recent studies on MRI reconstruction, advances have shown significant promise for further accelerating the MRI acquisition. Most state-of-the-art methods require a large amount of fully-sampled data to optimise reconstruction models, which is impractical and expensive under certain clinical settings. On the other hand, for unsupervised scan-specific reconstruction methods, overfitting is likely… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  31. arXiv:2312.00661  [pdf, other

    eess.IV cs.CV

    Dual-Domain Multi-Contrast MRI Reconstruction with Synthesis-based Fusion Network

    Authors: Junwei Yang, Pietro Liò

    Abstract: Purpose: To develop an efficient dual-domain reconstruction framework for multi-contrast MRI, with the focus on minimising cross-contrast misalignment in both the image and the frequency domains to enhance optimisation. Theory and Methods: Our proposed framework, based on deep learning, facilitates the optimisation for under-sampled target contrast using fully-sampled reference contrast that is qu… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  32. arXiv:2311.18839  [pdf, other

    cs.CV

    TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios

    Authors: Lihao Liu, Yanqi Cheng, Zhongying Deng, Shujun Wang, Dongdong Chen, Xiaowei Hu, Pietro Liò, Carola-Bibiane Schönlieb, Angelica Aviles-Rivero

    Abstract: Multi-object tracking in traffic videos is a crucial research area, offering immense potential for enhancing traffic monitoring accuracy and promoting road safety measures through the utilisation of advanced machine learning algorithms. However, existing datasets for multi-object tracking in traffic videos often feature limited instances or focus on single classes, which cannot well simulate the c… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 17 pages, 7 figures

  33. arXiv:2311.18177  [pdf, other

    cs.LG cs.SI eess.SP

    An Effective Universal Polynomial Basis for Spectral Graph Neural Networks

    Authors: Keke Huang, Pietro Liò

    Abstract: Spectral Graph Neural Networks (GNNs), also referred to as graph filters have gained increasing prevalence for heterophily graphs. Optimal graph filters rely on Laplacian eigendecomposition for Fourier transform. In an attempt to avert the prohibitive computations, numerous polynomial filters by leveraging distinct polynomials have been proposed to approximate the desired graph filters. However, p… ▽ More

    Submitted 5 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

  34. arXiv:2311.17250  [pdf, other

    cs.LG hep-ph quant-ph

    Fourier Neural Differential Equations for learning Quantum Field Theories

    Authors: Isaac Brant, Alexander Norcliffe, Pietro Liò

    Abstract: A Quantum Field Theory is defined by its interaction Hamiltonian, and linked to experimental data by the scattering matrix. The scattering matrix is calculated as a perturbative series, and represented succinctly as a first order differential equation in time. Neural Differential Equations (NDEs) learn the time derivative of a residual network's hidden state, and have proven efficacy in learning d… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 9 pages, 6 figures

  35. arXiv:2311.15112  [pdf, other

    cs.LG cs.AI

    Everybody Needs a Little HELP: Explaining Graphs via Hierarchical Concepts

    Authors: Jonas Jürß, Lucie Charlotte Magister, Pietro Barbiero, Pietro Liò, Nikola Simidjievski

    Abstract: Graph neural networks (GNNs) have led to major breakthroughs in a variety of domains such as drug discovery, social network analysis, and travel time estimation. However, they lack interpretability which hinders human trust and thereby deployment to settings with high-stakes decisions. A line of interpretable methods approach this by discovering a small set of relevant concepts as subgraphs in the… ▽ More

    Submitted 2 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: 33 pages, 16 figures, accepted at the NeurIPS 2023 GLFrontiers Workshop

  36. arXiv:2311.13610  [pdf, other

    cs.CV eess.IV

    TRIDENT: The Nonlinear Trilogy for Implicit Neural Representations

    Authors: Zhenda Shen, Yanqi Cheng, Raymond H. Chan, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Implicit neural representations (INRs) have garnered significant interest recently for their ability to model complex, high-dimensional data without explicit parameterisation. In this work, we introduce TRIDENT, a novel function for implicit neural representations characterised by a trilogy of nonlinearities. Firstly, it is designed to represent high-order features through order compactness. Secon… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  37. arXiv:2311.11891  [pdf, other

    cs.LG cs.SI stat.ML

    AMES: A Differentiable Embedding Space Selection Framework for Latent Graph Inference

    Authors: Yuan Lu, Haitz Sáez de Ocáriz Borde, Pietro Liò

    Abstract: In real-world scenarios, although data entities may possess inherent relationships, the specific graph illustrating their connections might not be directly accessible. Latent graph inference addresses this issue by enabling Graph Neural Networks (GNNs) to operate on point cloud data, dynamically learning the necessary graph structure. These graphs are often derived from a latent embedding space, w… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  38. arXiv:2311.11628  [pdf, other

    cs.LG

    Incorporating LLM Priors into Tabular Learners

    Authors: Max Zhu, Siniša Stanivuk, Andrija Petrovic, Mladen Nikolic, Pietro Lio

    Abstract: We present a method to integrate Large Language Models (LLMs) and traditional tabular data classification techniques, addressing LLMs challenges like data serialization sensitivity and biases. We introduce two strategies utilizing LLMs for ranking categorical variables and generating priors on correlations between continuous variables and targets, enhancing performance in few-shot scenarios. We fo… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Table Representation Learning Workshop at NeurIPS 2023

  39. arXiv:2311.10092  [pdf, other

    cs.CV

    Traffic Video Object Detection using Motion Prior

    Authors: Lihao Liu, Yanqi Cheng, Dongdong Chen, **g He, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Traffic videos inherently differ from generic videos in their stationary camera setup, thus providing a strong motion prior where objects often move in a specific direction over a short time interval. Existing works predominantly employ generic video object detection framework for traffic video object detection, which yield certain advantages such as broad applicability and robustness to diverse s… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 11 pages, 4 figures

  40. arXiv:2311.10051  [pdf, other

    cs.LG

    Tabular Few-Shot Generalization Across Heterogeneous Feature Spaces

    Authors: Max Zhu, Katarzyna Kobalczyk, Andrija Petrovic, Mladen Nikolic, Mihaela van der Schaar, Boris Delibasic, Petro Lio

    Abstract: Despite the prevalence of tabular datasets, few-shot learning remains under-explored within this domain. Existing few-shot methods are not directly applicable to tabular datasets due to varying column relationships, meanings, and permutational invariance. To address these challenges, we propose FLAT-a novel approach to tabular few-shot learning, encompassing knowledge sharing between datasets with… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Tabular learning, Deep learning, Few shot learning

  41. arXiv:2311.06547  [pdf, other

    cs.LG

    From Charts to Atlas: Merging Latent Spaces into One

    Authors: Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

    Abstract: Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We investigate in this study the aggregation of such latent spaces to create a unified space encompassing the combined information. To this end, we introduce Relative Latent Space Aggregation, a two-step approach that first renders the spaces comparable using relative rep… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: To appear in the NeurReps workshop @ NeurIPS 2023

  42. arXiv:2311.05767  [pdf, other

    cs.LG

    Dirichlet Energy Enhancement of Graph Neural Networks by Framelet Augmentation

    Authors: Jialin Chen, Yuelin Wang, Cristian Bodnar, Rex Ying, Pietro Lio, Yu Guang Wang

    Abstract: Graph convolutions have been a pivotal element in learning graph representations. However, recursively aggregating neighboring information with graph convolutions leads to indistinguishable node features in deep layers, which is known as the over-smoothing issue. The performance of graph neural networks decays fast as the number of stacked layers increases, and the Dirichlet energy associated with… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  43. arXiv:2310.18376  [pdf, other

    cs.CL cs.LG

    SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: In recent years, the task of text-to-SQL translation, which converts natural language questions into executable SQL queries, has gained significant attention for its potential to democratize data access. Despite its promise, challenges such as adapting to unseen databases and aligning natural language with SQL syntax have hindered widespread adoption. To overcome these issues, we introduce SQLform… ▽ More

    Submitted 27 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 13 pages, 4 figures, 11 tables

  44. arXiv:2310.07684  [pdf, other

    cs.AI cs.SI

    Hypergraph Neural Networks through the Lens of Message Passing: A Common Perspective to Homophily and Architecture Design

    Authors: Lev Telyatnikov, Maria Sofia Bucarelli, Guillermo Bernardez, Olga Zaghen, Simone Scardapane, Pietro Lio

    Abstract: Most of the current hypergraph learning methodologies and benchmarking datasets in the hypergraph realm are obtained by lifting procedures from their graph analogs, leading to overshadowing specific characteristics of hypergraphs. This paper attempts to confront some pending questions in that regard: Q1 Can the concept of homophily play a crucial role in Hypergraph Neural Networks (HNNs)? Q2 Is th… ▽ More

    Submitted 5 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  45. arXiv:2309.17116  [pdf, other

    cs.LG

    Sheaf Hypergraph Networks

    Authors: Iulia Duta, Giulia Cassarà, Fabrizio Silvestri, Pietro Liò

    Abstract: Higher-order relations are widespread in nature, with numerous phenomena involving complex interactions that extend beyond simple pairwise connections. As a result, advancements in higher-order processing can accelerate the growth of various fields requiring structured data. Current approaches typically represent these interactions using hypergraphs. We enhance this representation by introducing c… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2023)

  46. arXiv:2309.16540  [pdf, other

    cs.CL cs.LG stat.ML

    Unsupervised Pretraining for Fact Verification by Language Model Distillation

    Authors: Adrián Bazaga, Pietro Liò, Gos Micklem

    Abstract: Fact verification aims to verify a claim using evidence from a trustworthy knowledge base. To address this challenge, algorithms must produce features for every claim that are both semantically meaningful, and compact enough to find a semantic alignment with the source information. In contrast to previous work, which tackled the alignment problem by learning over annotated corpora of claims and th… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 Camera Ready

  47. arXiv:2309.00903  [pdf, other

    cs.CV cs.AI

    An explainable three dimension framework to uncover learning patterns: A unified look in variable sulci recognition

    Authors: Michail Mamalakis, Heloise de Vareilles, Atheer AI-Manea, Samantha C. Mitchell, Ingrid Arartz, Lynn Egeland Morch-Johnsen, Jane Garrison, Jon Simons, Pietro Lio, John Suckling, Graham Murray

    Abstract: Detecting the significant features of the learning process of an artificial intelligence framework in the entire training and validation dataset can be determined as 'global' explanations. Studies in the literature lack of accurate, low-complexity, and three-dimensional (3D) global explanations which are crucial in neuroimaging, a field with a complex representational space that demands more than… ▽ More

    Submitted 7 June, 2024; v1 submitted 2 September, 2023; originally announced September 2023.

  48. arXiv:2308.12316  [pdf, other

    cs.LG

    Graph Neural Stochastic Differential Equations

    Authors: Richard Bergna, Felix Opolka, Pietro Liò, Jose Miguel Hernandez-Lobato

    Abstract: We present a novel model Graph Neural Stochastic Differential Equations (Graph Neural SDEs). This technique enhances the Graph Neural Ordinary Differential Equations (Graph Neural ODEs) by embedding randomness into data representation using Brownian motion. This inclusion allows for the assessment of prediction uncertainty, a crucial aspect frequently missed in current models. In our framework, we… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: 9 main pages, 6 of appendix (15 in total), submitted for the Learning on Graph (LoG) conference

  49. arXiv:2308.11978  [pdf, other

    cs.LG cs.AI q-bio.BM stat.ML

    Will More Expressive Graph Neural Networks do Better on Generative Tasks?

    Authors: Xiandong Zou, Xiangyu Zhao, Pietro Liò, Yiren Zhao

    Abstract: Graph generation poses a significant challenge as it involves predicting a complete graph with multiple nodes and edges based on simply a given label. This task also carries fundamental importance to numerous real-world applications, including de-novo drug and molecular design. In recent years, several successful methods have emerged in the field of graph generation. However, these approaches suff… ▽ More

    Submitted 20 February, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 2nd Learning on Graphs Conference (LoG 2023). 26 pages, 5 figures, 11 tables

  50. arXiv:2308.11068  [pdf, other

    cs.LG cs.AI cs.NI

    Topological Graph Signal Compression

    Authors: Guillermo Bernárdez, Lev Telyatnikov, Eduard Alarcón, Albert Cabellos-Aparicio, Pere Barlet-Ros, Pietro Liò

    Abstract: Recently emerged Topological Deep Learning (TDL) methods aim to extend current Graph Neural Networks (GNN) by naturally processing higher-order interactions, going beyond the pairwise relations and local neighborhoods defined by graph representations. In this paper we propose a novel TDL-based method for compressing signals over graphs, consisting in two main steps: first, disjoint sets of higher-… ▽ More

    Submitted 5 December, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted as Oral at the Second Learning on Graphs Conference (LoG 2023). The recording of the talk can be found in https://www.youtube.com/watch?v=OcruIkiRkiU