-
A construction of directed strongly regular graphs with parameters (63,11,8,1,2)
Authors:
Andries E. Brouwer,
Dean Crnković,
Andrea Švob
Abstract:
In this paper, we prove the existence of directed strongly regular graphs with parameters $(63,11,8,1,2)$. We construct a pair of nonisomorphic dsrg(63,11,8,1,2), where one is obtained from the other by reversing all arrows. Both directed strongly regular graphs have $L_2(8):3$ as the full automorphism group.
Submitted 17 April, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
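The parameter set in this entry can be sanity-checked against the standard counting identity for directed strongly regular graphs. The sketch below assumes the parameters are ordered $(v,k,t,λ,μ)$, so that the adjacency matrix $A$ satisfies $A^2 = tI + λA + μ(J - I - A)$; multiplying both sides by the all-ones vector gives $k(k + μ - λ) = t + (v-1)μ$. The function name is hypothetical, and the identity is only a necessary condition: it says nothing about the construction in the paper.

```python
def dsrg_counting_identity_holds(v, k, t, lam, mu):
    """Check the necessary condition k(k + mu - lam) = t + (v - 1) * mu.

    Assumes the parameter order (v, k, t, lambda, mu): v vertices,
    in- and out-degree k, t undirected edges at each vertex, and
    lambda / mu directed 2-paths between adjacent / non-adjacent pairs.
    """
    return k * (k + mu - lam) == t + (v - 1) * mu


# Both sides equal 132 for the parameters in the title.
print(dsrg_counting_identity_holds(63, 11, 8, 1, 2))  # True
```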
-
Atom-Level Optical Chemical Structure Recognition with Limited Supervision
Authors:
Martijn Oldenhof,
Edward De Brouwer,
Adam Arany,
Yves Moreau
Abstract:
Identifying the chemical structure from a graphical representation, or image, of a molecule is a challenging pattern recognition task that would greatly benefit drug development. Yet, existing methods for chemical structure recognition do not typically generalize well, and show diminished effectiveness when confronted with domains where data is sparse, or costly to generate, such as hand-drawn molecule images. To address this limitation, we propose a new chemical structure recognition tool that delivers state-of-the-art performance and can adapt to new domains with a limited number of data samples and supervision. Unlike previous approaches, our method provides atom-level localization, and can therefore segment the image into the different atoms and bonds. Our model is the first model to perform OCSR with atom-level entity detection with only SMILES supervision. Through rigorous and extensive benchmarking, we demonstrate the preeminence of our chemical structure recognition approach in terms of data efficiency, accuracy, and atom-level entity prediction.
Submitted 2 April, 2024;
originally announced April 2024.
-
Benchmarking Observational Studies with Experimental Data under Right-Censoring
Authors:
Ilker Demirel,
Edward De Brouwer,
Zeshan Hussain,
Michael Oberst,
Anthony Philippakis,
David Sontag
Abstract:
Drawing causal inferences from observational studies (OS) requires unverifiable validity assumptions; however, one can falsify those assumptions by benchmarking the OS with experimental data from a randomized controlled trial (RCT). A major limitation of existing procedures is not accounting for censoring, despite the abundance of RCTs and OSes that report right-censored time-to-event outcomes. We consider two cases where censoring time (1) is independent of time-to-event and (2) depends on time-to-event the same way in OS and RCT. For the former, we adopt a censoring-doubly-robust signal for the conditional average treatment effect (CATE) to facilitate an equivalence test of CATEs in OS and RCT, which serves as a proxy for testing if the validity assumptions hold. For the latter, we show that the same test can still be used even though unbiased CATE estimation may not be possible. We verify the effectiveness of our censoring-aware tests via semi-synthetic experiments and analyze RCT and OS data from the Women's Health Initiative study.
Submitted 23 February, 2024;
originally announced February 2024.
-
High-Resolution Maps of Left Atrial Displacements and Strains Estimated with 3D CINE MRI and Unsupervised Neural Networks
Authors:
Christoforos Galazis,
Samuel Shepperd,
Emma Brouwer,
Sandro Queirós,
Ebraham Alskaf,
Mustafa Anjari,
Amedeo Chiribiri,
Jack Lee,
Anil A. Bharath,
Marta Varela
Abstract:
The functional analysis of the left atrium (LA) is important for evaluating cardiac health and understanding diseases like atrial fibrillation. Cine MRI is ideally placed for the detailed 3D characterisation of LA motion and deformation, but it is lacking appropriate acquisition and analysis tools. In this paper, we present Analysis for Left Atrial Displacements and Deformations using unsupervIsed neural Networks, \textit{Aladdin}, to automatically and reliably characterise regional LA deformations from high-resolution 3D Cine MRI. The tool includes: an online few-shot segmentation network (Aladdin-S), an online unsupervised image registration network (Aladdin-R), and a strain calculations pipeline tailored to the LA. We create maps of LA Displacement Vector Field (DVF) magnitude and LA principal strain values from images of 10 healthy volunteers and 8 patients with cardiovascular disease (CVD). We additionally create an atlas of these biomarkers using the data from the healthy volunteers. Aladdin is able to accurately track the LA wall across the cardiac cycle and characterize its motion and deformation. The overall DVF magnitude and principal strain values are significantly higher in the healthy group vs CVD patients: $2.85 \pm 1.59~mm$ and $0.09 \pm 0.05$ vs $1.96 \pm 0.74~mm$ and $0.03 \pm 0.04$, respectively. The time course of these metrics is also different in the two groups, with a more marked active contraction phase observed in the healthy cohort. Finally, utilizing the LA atlas allows us to identify regional deviations from the population distribution that may indicate focal tissue abnormalities. The proposed tool for the quantification of novel regional LA deformation biomarkers should have important clinical applications. The source code, anonymized images, generated maps and atlas are publicly available: https://github.com/cgalaz01/aladdin_cmr_la.
Submitted 14 December, 2023;
originally announced December 2023.
-
Some locally Kneser graphs
Authors:
A. E. Brouwer
Abstract:
The Kneser graph $K(n,d)$ is the graph on the $d$-subsets of an $n$-set, adjacent when disjoint. Clearly, $K(n+d,d)$ is locally $K(n,d)$. Hall showed for $n \ge 3d+1$ that there are no further examples. Here we give other examples of locally $K(n,d)$ graphs for $n = 3d$, and some further sporadic examples. It follows that Hall's bound is best possible.
Submitted 5 December, 2023;
originally announced December 2023.
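The claim that $K(n+d,d)$ is locally $K(n,d)$ can be checked by brute force for small parameters. This is a minimal sketch, not the paper's method: it builds the Kneser graph on the $d$-subsets of `range(n + d)` and confirms that the neighbours of each vertex $S$ are exactly the $d$-subsets of the complement of $S$, adjacent when disjoint. All function names are hypothetical.

```python
from itertools import combinations


def kneser_graph(n, d):
    """Return K(n, d) as a dict: d-subset of range(n) -> set of disjoint d-subsets."""
    vertices = [frozenset(c) for c in combinations(range(n), d)]
    return {u: {w for w in vertices if u.isdisjoint(w)} for u in vertices}


def local_graph(graph, vertex):
    """Return the subgraph induced on the neighbours of `vertex`."""
    nbrs = graph[vertex]
    return {u: graph[u] & nbrs for u in nbrs}


def is_locally_kneser(n, d):
    """Check that every local graph of K(n + d, d) is K(n, d) on the complement."""
    big = kneser_graph(n + d, d)
    ground = frozenset(range(n + d))
    for s in big:
        local = local_graph(big, s)
        # Expected vertex set: all d-subsets of the n-set left after removing s.
        expected = {frozenset(c) for c in combinations(ground - s, d)}
        if set(local) != expected:
            return False
        # Expected adjacency: disjointness, as in K(n, d).
        if any(local[u] != {w for w in expected if u.isdisjoint(w)} for u in local):
            return False
    return True


# K(7, 2) is locally K(5, 2), the Petersen graph.
print(is_locally_kneser(5, 2))  # True
```

The check only covers the generic example; the further locally $K(n,d)$ graphs announced in the abstract for $n = 3d$ are not constructed here.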
-
Self-Supervised Versus Supervised Training for Segmentation of Organoid Images
Authors:
Asmaa Haja,
Eric Brouwer,
Lambert Schomaker
Abstract:
The process of annotating relevant data in the field of digital microscopy can be both time-consuming and especially expensive due to the required technical skills and human-expert knowledge. Consequently, large amounts of microscopic image data sets remain unlabeled, preventing their effective exploitation using deep-learning algorithms. In recent years it has been shown that a lot of relevant information can be drawn from unlabeled data. Self-supervised learning (SSL) is a promising solution based on learning intrinsic features under a pretext task that is similar to the main task without requiring labels. The trained result is transferred to the main task - image segmentation in our case. A ResNet50 U-Net was first trained to restore images of liver progenitor organoids from augmented images using the Structural Similarity Index Metric (SSIM), alone, and using SSIM combined with L1 loss. Both the encoder and decoder were trained in tandem. The weights were transferred to another U-Net model designed for segmentation with frozen encoder weights, using Binary Cross Entropy, Dice, and Intersection over Union (IoU) losses. For comparison, we used the same U-Net architecture to train two supervised models, one utilizing the ResNet50 encoder as well as a simple CNN. Results showed that self-supervised learning models using a 25\% pixel drop or image blurring augmentation performed better than the other augmentation techniques using the IoU loss. When trained on only 114 images for the main task, the self-supervised learning approach outperforms the supervised method achieving an F1-score of 0.85, with higher stability, in contrast to an F1=0.78 scored by the supervised method. Furthermore, when trained with larger data sets (1,000 images), self-supervised learning is still able to perform better, achieving an F1-score of 0.92, contrasting to a score of 0.85 for the supervised method.
Submitted 18 November, 2023;
originally announced November 2023.
-
BLIS-Net: Classifying and Analyzing Signals on Graphs
Authors:
Charles Xu,
Laney Goldman,
Valentina Guo,
Benjamin Hollander-Bodie,
Maedee Trank-Greene,
Ian Adelstein,
Edward De Brouwer,
Rex Ying,
Smita Krishnaswamy,
Michael Perlmutter
Abstract:
Graph neural networks (GNNs) have emerged as a powerful tool for tasks such as node classification and graph classification. However, much less work has been done on signal classification, where the data consists of many functions (referred to as signals) defined on the vertices of a single graph. These tasks require networks designed differently from those designed for traditional GNN tasks. Indeed, traditional GNNs rely on localized low-pass filters, and signals of interest may have intricate multi-frequency behavior and exhibit long range interactions. This motivates us to introduce the BLIS-Net (Bi-Lipschitz Scattering Net), a novel GNN that builds on the previously introduced geometric scattering transform. Our network is able to capture both local and global signal structure and is able to capture both low-frequency and high-frequency information. We make several crucial changes to the original geometric scattering architecture which we prove increase the ability of our network to capture information about the input signal and show that BLIS-Net achieves superior performance on both synthetic and real-world data sets based on traffic flow and fMRI data.
Submitted 26 October, 2023;
originally announced October 2023.
-
Manifold Filter-Combine Networks
Authors:
Joyce Chew,
Edward De Brouwer,
Smita Krishnaswamy,
Deanna Needell,
Michael Perlmutter
Abstract:
We introduce a class of manifold neural networks (MNNs) that we call Manifold Filter-Combine Networks (MFCNs), that aims to further our understanding of MNNs, analogous to how the aggregate-combine framework helps with the understanding of graph neural networks (GNNs). This class includes a wide variety of subclasses that can be thought of as the manifold analog of various popular GNNs. We then consider a method, based on building a data-driven graph, for implementing such networks when one does not have global knowledge of the manifold, but merely has access to finitely many sample points. We provide sufficient conditions for the network to provably converge to its continuum limit as the number of sample points tends to infinity. Unlike previous work (which focused on specific graph constructions), our rate of convergence does not directly depend on the number of filters used. Moreover, it exhibits linear dependence on the depth of the network rather than the exponential dependence obtained previously. Additionally, we provide several examples of interesting subclasses of MFCNs and of the rates of convergence that are obtained under specific graph constructions.
Submitted 5 September, 2023; v1 submitted 8 July, 2023;
originally announced July 2023.
-
Inferring dynamic regulatory interaction graphs from time series data with perturbations
Authors:
Dhananjay Bhaskar,
Sumner Magruder,
Edward De Brouwer,
Aarthi Venkat,
Frederik Wenkel,
Guy Wolf,
Smita Krishnaswamy
Abstract:
Complex systems are characterized by intricate interactions between entities that evolve dynamically over time. Accurate inference of these dynamic relationships is crucial for understanding and predicting system behavior. In this paper, we propose Regulatory Temporal Interaction Network Inference (RiTINI) for inferring time-varying interaction graphs in complex systems using a novel combination of space-and-time graph attentions and graph neural ordinary differential equations (ODEs). RiTINI leverages time-lapse signals on a graph prior, as well as perturbations of signals at various nodes in order to effectively capture the dynamics of the underlying system. This approach is distinct from traditional causal inference networks, which are limited to inferring acyclic and static graphs. In contrast, RiTINI can infer cyclic, directed, and time-varying graphs, providing a more comprehensive and accurate representation of complex systems. The graph attention mechanism in RiTINI allows the model to adaptively focus on the most relevant interactions in time and space, while the graph neural ODEs enable continuous-time modeling of the system's dynamics. We evaluate RiTINI's performance on various simulated and real-world datasets, demonstrating its state-of-the-art capability in inferring interaction graphs compared to previous methods.
Submitted 13 June, 2023;
originally announced June 2023.
-
A Heat Diffusion Perspective on Geodesic Preserving Dimensionality Reduction
Authors:
Guillaume Huguet,
Alexander Tong,
Edward De Brouwer,
Yanlei Zhang,
Guy Wolf,
Ian Adelstein,
Smita Krishnaswamy
Abstract:
Diffusion-based manifold learning methods have proven useful in representation learning and dimensionality reduction of modern high dimensional, high throughput, noisy datasets. Such datasets are especially present in fields like biology and physics. While it is thought that these methods preserve underlying manifold structure of data by learning a proxy for geodesic distances, no specific theoretical links have been established. Here, we establish such a link via results in Riemannian geometry explicitly connecting heat diffusion to manifold distances. In this process, we also formulate a more general heat kernel based manifold embedding method that we call heat geodesic embeddings. This novel perspective makes clearer the choices available in manifold learning and denoising. Results show that our method outperforms existing state of the art in preserving ground truth manifold distances, and preserving cluster structure in toy datasets. We also showcase our method on single cell RNA-sequencing datasets with both continuum and cluster structure, where our method enables interpolation of withheld timepoints of data. Finally, we show that parameters of our more general method can be configured to give results similar to PHATE (a state-of-the-art diffusion based manifold learning method) as well as SNE (an attraction/repulsion neighborhood based method that forms the basis of t-SNE).
Submitted 30 May, 2023;
originally announced May 2023.
-
Weakly Supervised Knowledge Transfer with Probabilistic Logical Reasoning for Object Detection
Authors:
Martijn Oldenhof,
Adam Arany,
Yves Moreau,
Edward De Brouwer
Abstract:
Training object detection models usually requires instance-level annotations, such as the positions and labels of all objects present in each image. Such supervision is unfortunately not always available and, more often, only image-level information is provided, also known as weak supervision. Recent works have addressed this limitation by leveraging knowledge from a richly annotated domain. However, the scope of weak supervision supported by these approaches has been very restrictive, preventing them from using all available information. In this work, we propose ProbKT, a framework based on probabilistic logical reasoning that allows object detection models to be trained with arbitrary types of weak supervision. We empirically show on different datasets that using all available information is beneficial, as ProbKT leads to significant improvement on the target domain and better generalization compared to existing baselines. We also showcase the ability of our approach to handle complex logic statements as supervision signals.
Submitted 9 March, 2023;
originally announced March 2023.
-
Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections
Authors:
Edward De Brouwer,
Rahul G. Krishnan
Abstract:
Neural ordinary differential equations (Neural ODEs) are an effective framework for learning dynamical systems from irregularly sampled time series data. These models provide a continuous-time latent representation of the underlying dynamical system where new observations at arbitrary time points can be used to update the latent representation of the dynamical system. Existing parameterizations for the dynamics functions of Neural ODEs limit the ability of the model to retain global information about the time series; specifically, a piece-wise integration of the latent process between observations can result in a loss of memory on the dynamic patterns of previously observed data points. We propose PolyODE, a Neural ODE that models the latent continuous-time process as a projection onto a basis of orthogonal polynomials. This formulation enforces long-range memory and preserves a global representation of the underlying dynamical system. Our construction is backed by favourable theoretical guarantees and in a series of experiments, we demonstrate that it outperforms previous works in the reconstruction of past and future data, and in downstream prediction tasks.
Submitted 3 March, 2023;
originally announced March 2023.
-
The unique coclique extension property for apartments of buildings
Authors:
Andries E. Brouwer,
Jan Draisma,
Çiçek Güven
Abstract:
We show that the Kneser graph of objects of a fixed type in a building of spherical type has the unique coclique extension property when the corresponding representation has minuscule weight and also when the diagram is simply laced and the representation is adjoint.
Submitted 22 November, 2022;
originally announced November 2022.
-
Learning predictive checklists from continuous medical data
Authors:
Yukti Makhija,
Edward De Brouwer,
Rahul G. Krishnan
Abstract:
Checklists, while being only recently introduced in the medical domain, have become highly popular in daily clinical practice due to their combined effectiveness and great interpretability. Checklists are usually designed by expert clinicians who manually collect and analyze available evidence. However, the increasing quantity of available medical data is calling for a partially automated checklist design. Recent works have taken a step in that direction by learning predictive checklists from categorical data. In this work, we propose to extend this approach to accommodate learning checklists from continuous medical data using a mixed-integer programming approach. We show that this extension outperforms a range of explainable machine learning baselines on the prediction of sepsis from intensive care clinical trajectories.
Submitted 13 November, 2022;
originally announced November 2022.
-
Deep Counterfactual Estimation with Categorical Background Variables
Authors:
Edward De Brouwer
Abstract:
Referred to as the third rung of the causal inference ladder, counterfactual queries typically ask the "What if?" question retrospectively. The standard approach to estimating counterfactuals relies on a structural equation model that accurately reflects the underlying data generating process. However, such models are seldom available in practice and one usually wishes to infer them from observational data alone. Unfortunately, the correct structural equation model is in general not identifiable from the observed factual distribution. Nevertheless, in this work, we show that under the assumption that the main latent contributors to the treatment responses are categorical, the counterfactuals can still be reliably predicted. Building upon this assumption, we introduce CounterFactual Query Prediction (CFQP), a novel method to infer counterfactuals from continuous observations when the background variables are categorical. We show that our method significantly outperforms previously available deep-learning-based counterfactual methods, both theoretically and empirically on time series and image data. Our code is available at https://github.com/edebrouwer/cfqp.
Submitted 16 January, 2023; v1 submitted 11 October, 2022;
originally announced October 2022.
-
The equivalence of two inequalities for quasisymmetric designs
Authors:
A. E. Brouwer
Abstract:
It has been an open problem whether Hobart's inequality on the parameters of a quasisymmetric 2-design is independent of earlier known restrictions. In this note we show that it is equivalent to inequalities found by Neumaier and Calderbank. We also give some more parameter sets ruled out by the Blokhuis-Calderbank inequality.
Submitted 16 March, 2022;
originally announced March 2022.
-
Majorana Algebra for the Hoffman-Singleton Graph
Authors:
Andries E. Brouwer,
Alexander A. Ivanov
Abstract:
Majorana theory is an axiomatic tool introduced by A. A. Ivanov in 2009 for studying the Monster group M and its subgroups through the 196884-dimensional Conway-Griess-Norton algebra. The group U3(5) is the socle of the centralizer in M of a subgroup of order 25. The involutions of this U3(5)-subgroup are 2A-involutions in the Monster. Therefore, U3(5) possesses a Majorana representation based on the embedding in the Monster. We prove that this is the unique Majorana representation of U3(5), and calculate its dimension, which is 798.
Submitted 15 March, 2022;
originally announced March 2022.
-
Predicting the impact of treatments over time with uncertainty aware neural differential equations
Authors:
Edward De Brouwer,
Javier González Hernández,
Stephanie Hyland
Abstract:
Predicting the impact of treatments from observational data only still represents a major challenge despite recent significant advances in time series modeling. Treatment assignments are usually correlated with the predictors of the response, resulting in a lack of data support for counterfactual predictions and therefore in poor quality estimates. Developments in causal inference have led to methods addressing this confounding by requiring a minimum level of overlap. However, overlap is difficult to assess and usually not satisfied in practice. In this work, we propose Counterfactual ODE (CF-ODE), a novel method to predict the impact of treatments continuously over time using Neural Ordinary Differential Equations equipped with uncertainty estimates. This makes it possible to assess specifically which treatment outcomes can be reliably predicted. We demonstrate over several longitudinal data sets that CF-ODE provides more accurate predictions and more reliable uncertainty estimates than previously available methods.
Submitted 24 February, 2022;
originally announced February 2022.
-
Learning dynamical systems from data: A simple cross-validation perspective, part III: Irregularly-Sampled Time Series
Authors:
Jonghyeon Lee,
Edward De Brouwer,
Boumediene Hamzi,
Houman Owhadi
Abstract:
A simple and interpretable way to learn a dynamical system from data is to interpolate its vector-field with a kernel. In particular, this strategy is highly efficient (both in terms of accuracy and complexity) when the kernel is data-adapted using Kernel Flows (KF)~\cite{Owhadi19} (which uses gradient-based optimization to learn a kernel based on the premise that a kernel is good if there is no significant loss in accuracy if half of the data is used for interpolation). Despite its previous successes, this strategy (based on interpolating the vector field driving the dynamical system) breaks down when the observed time series is not regularly sampled in time. In this work, we propose to address this problem by directly approximating the vector field of the dynamical system by incorporating time differences between observations in the (KF) data-adapted kernels. We compare our approach with the classical one over different benchmark dynamical systems and show that it significantly improves the forecasting accuracy while remaining simple, fast, and robust.
Submitted 25 November, 2021;
originally announced November 2021.
-
The magnitude vector of images
Authors:
Michael F. Adamer,
Edward De Brouwer,
Leslie O'Bray,
Bastian Rieck
Abstract:
The magnitude of a finite metric space has recently emerged as a novel invariant quantity that measures the effective size of a metric space. Despite encouraging first results demonstrating the descriptive abilities of the magnitude, such as being able to detect the boundary of a metric space, the potential use cases of magnitude remain under-explored. In this work, we investigate the properties of the magnitude on images, an important data modality in many machine learning applications. By endowing each individual image with its own metric space, we are able to define the concept of magnitude on images and analyse the individual contribution of each pixel with the magnitude vector. In particular, we theoretically show that the previously known properties of boundary detection translate to edge detection abilities in images. Furthermore, we demonstrate practical use cases of magnitude for machine learning applications and propose a novel magnitude model that consists of a computationally efficient magnitude computation and a learnable metric. By doing so, we address the computational hurdle that used to make magnitude impractical for many applications and open the way for the adoption of magnitude in machine learning research.
Submitted 7 October, 2022; v1 submitted 28 October, 2021;
originally announced October 2021.
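The magnitude entry above does not spell out the definition, so the following is only a minimal sketch under the standard convention (assumed here, not taken from the abstract): for points with pairwise distances d(i, j), form the similarity matrix Z with entries exp(-d(i, j)), solve Z w = 1 for the weight vector w (the magnitude vector), and sum its entries to get the magnitude. The helper names `solve` and `magnitude_vector` are illustrative, and the small Gauss-Jordan solver stands in for a linear-algebra library.

```python
import math

def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                for c in range(col, n + 1):
                    m[r][c] -= f * m[col][c]
    return [m[i][n] / m[i][i] for i in range(n)]

def magnitude_vector(dist):
    """Weights w with Z w = 1, where Z[i][j] = exp(-dist[i][j]) (assumed definition)."""
    z = [[math.exp(-d) for d in row] for row in dist]
    return solve(z, [1.0] * len(dist))

# Two points at distance 1: both weights equal 1 / (1 + e^-1).
two_points = [[0.0, 1.0], [1.0, 0.0]]
weights = magnitude_vector(two_points)
magnitude = sum(weights)
```

For two points at distance d the magnitude works out to 2 / (1 + e^-d), which tends to 1 as the points merge and to 2 as they move far apart; this is the sense in which magnitude behaves like an effective number of points.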
-
Triple intersection numbers for the Paley graphs
Authors:
Andries E. Brouwer,
William J. Martin
Abstract:
We give a tight bound for the triple intersection numbers of Paley graphs. In particular, we show that any three vertices have a common neighbor in Paley graphs of order larger than 25.
Submitted 8 September, 2021;
originally announced September 2021.
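The common-neighbor statement in the Paley graph entry above can be checked by brute force on small cases. The sketch below is restricted to prime orders q with q ≡ 1 (mod 4) (an assumption made for simplicity; Paley graphs also exist for prime powers), and the function names are illustrative.

```python
from itertools import combinations

def paley_neighbors(q):
    """Adjacency sets of the Paley graph on Z_q, for a prime q with q % 4 == 1."""
    squares = {(x * x) % q for x in range(1, q)}  # nonzero quadratic residues
    return [{(v + s) % q for s in squares} for v in range(q)]

def min_common_neighbors(q):
    """Smallest number of common neighbors over all triples of distinct vertices."""
    nbrs = paley_neighbors(q)
    return min(len(nbrs[a] & nbrs[b] & nbrs[c])
               for a, b, c in combinations(range(q), 3))
```

Calling `min_common_neighbors(29)` enumerates all 3654 triples of the order-29 graph; the order-5 graph is a 5-cycle, where no vertex is adjacent to three others, so it gives a quick negative example.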
-
Strongly regular graphs satisfying the 4-vertex condition
Authors:
A. E. Brouwer,
F. Ihringer,
W. M. Kantor
Abstract:
We survey the area of strongly regular graphs satisfying the 4-vertex condition and find several new families. We describe a switching operation on collinearity graphs of polar spaces that produces cospectral graphs. The obtained graphs satisfy the 4-vertex condition if the original graph belongs to a symplectic polar space.
Submitted 8 September, 2022; v1 submitted 30 June, 2021;
originally announced July 2021.
-
Topological Graph Neural Networks
Authors:
Max Horn,
Edward De Brouwer,
Michael Moor,
Yves Moreau,
Bastian Rieck,
Karsten Borgwardt
Abstract:
Graph neural networks (GNNs) are a powerful architecture for tackling graph learning tasks, yet have been shown to be oblivious to eminent substructures such as cycles. We present TOGL, a novel layer that incorporates global topological information of a graph using persistent homology. TOGL can be easily integrated into any type of GNN and is strictly more expressive (in terms of the Weisfeiler--Lehman graph isomorphism test) than message-passing GNNs. Augmenting GNNs with TOGL leads to improved predictive performance for graph and node classification tasks, both on synthetic data sets, which can be classified by humans using their topology but not by ordinary GNNs, and on real-world data.
Submitted 17 March, 2022; v1 submitted 15 February, 2021;
originally announced February 2021.
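The remark in the TOGL entry above that Weisfeiler--Lehman-bounded models miss cycles can be illustrated with plain 1-WL color refinement. This is a generic sketch of the test, not of TOGL itself; the function name and the two example graphs are illustrative choices. Both graphs are refined jointly so that colors are comparable between them.

```python
from collections import Counter

def wl_indistinguishable(g1, g2):
    """True if 1-WL color refinement yields equal color histograms for g1 and g2."""
    # Disjoint union of both graphs, so that one shared palette is used.
    union = {(0, v): [(0, u) for u in nb] for v, nb in g1.items()}
    union.update({(1, v): [(1, u) for u in nb] for v, nb in g2.items()})
    colors = {v: 0 for v in union}
    for _ in range(len(union)):  # enough rounds for the coloring to stabilize
        sig = {v: (colors[v], tuple(sorted(colors[u] for u in union[v])))
               for v in union}
        palette = {s: i for i, s in enumerate(sorted(set(sig.values())))}
        colors = {v: palette[sig[v]] for v in union}
    hist = [Counter(c for (g, _), c in colors.items() if g == k) for k in (0, 1)]
    return hist[0] == hist[1]

hexagon = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
two_triangles = {0: [1, 2], 1: [0, 2], 2: [0, 1], 3: [4, 5], 4: [3, 5], 5: [3, 4]}
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3, 5], 5: [4]}
```

The hexagon and the pair of triangles are both 2-regular, so refinement never separates them even though one contains a 6-cycle and the other two 3-cycles; the path is separated immediately because its end vertices have degree 1.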
-
Longitudinal modeling of MS patient trajectories improves predictions of disability progression
Authors:
Edward De Brouwer,
Thijs Becker,
Yves Moreau,
Eva Kubala Havrdova,
Maria Trojano,
Sara Eichau,
Serkan Ozakbas,
Marco Onofrj,
Pierre Grammond,
Jens Kuhle,
Ludwig Kappos,
Patrizia Sola,
Elisabetta Cartechini,
Jeannette Lechner-Scott,
Raed Alroughani,
Oliver Gerlach,
Tomas Kalincik,
Franco Granella,
Francois GrandMaison,
Roberto Bergamaschi,
Maria Jose Sa,
Bart Van Wijmeersch,
Aysun Soysal,
Jose Luis Sanchez-Menoyo,
Claudio Solaro
, et al. (16 additional authors not shown)
Abstract:
Research in Multiple Sclerosis (MS) has recently focused on extracting knowledge from real-world clinical data sources. This type of data is more abundant than data produced during clinical trials and potentially more informative about real-world clinical practice. However, this comes at the cost of less curated and controlled data sets. In this work, we address the task of optimally extracting information from longitudinal patient data in the real-world setting with a special focus on the sporadic sampling problem. Using the MSBase registry, we show that with machine learning methods suited for patient trajectories modeling, such as recurrent neural networks and tensor factorization, we can predict disability progression of patients in a two-year horizon with an ROC-AUC of 0.86, which represents a 33% decrease in the ranking pair error (1-AUC) compared to reference methods using static clinical features. Compared to the models available in the literature, this work uses the most complete patient history for MS disease progression prediction.
Submitted 9 November, 2020;
originally announced November 2020.
-
Expressive Graph Informer Networks
Authors:
Jaak Simm,
Adam Arany,
Edward De Brouwer,
Yves Moreau
Abstract:
Applying machine learning to molecules is challenging because of their natural representation as graphs rather than vectors. Several architectures have been recently proposed for deep learning from molecular graphs, but they suffer from information bottlenecks because they only pass information from a graph node to its direct neighbors. Here, we introduce a more expressive route-based multi-attention mechanism that incorporates features from routes between node pairs. We call the resulting method Graph Informer. A single network layer can therefore attend to nodes several steps away. We show empirically that the proposed method compares favorably against existing approaches in two prediction tasks: (1) 13C Nuclear Magnetic Resonance (NMR) spectra, improving the state-of-the-art with an MAE of 1.35 ppm and (2) predicting drug bioactivity and toxicity. Additionally, we develop a variant called injective Graph Informer that is provably as powerful as the Weisfeiler-Lehman test for graph isomorphism. Furthermore, we demonstrate that the route information allows the method to be informed about the nonlocal topology of the graph and, thus, even go beyond the capabilities of the Weisfeiler-Lehman test.
Submitted 14 September, 2020; v1 submitted 25 July, 2019;
originally announced July 2019.
-
GRU-ODE-Bayes: Continuous modeling of sporadically-observed time series
Authors:
Edward De Brouwer,
Jaak Simm,
Adam Arany,
Yves Moreau
Abstract:
Modeling real-world multidimensional time series can be particularly challenging when these are sporadically observed (i.e., sampling is irregular both in time and across dimensions), such as in the case of clinical patient data. To address these challenges, we propose (1) a continuous-time version of the Gated Recurrent Unit, building upon the recent Neural Ordinary Differential Equations (Chen et al., 2018), and (2) a Bayesian update network that processes the sporadic observations. We bring these two ideas together in our GRU-ODE-Bayes method. We then demonstrate that the proposed method encodes a continuity prior for the latent process and that it can exactly represent the Fokker-Planck dynamics of complex processes driven by a multidimensional stochastic differential equation. Additionally, empirical evaluation shows that our method outperforms the state of the art on both synthetic data and real-world data with applications in healthcare and climate forecasting. What is more, the continuity prior is shown to be well suited for settings with a low number of samples.
Submitted 28 November, 2019; v1 submitted 29 May, 2019;
originally announced May 2019.
-
Deep Ensemble Tensor Factorization for Longitudinal Patient Trajectories Classification
Authors:
Edward De Brouwer,
Jaak Simm,
Adam Arany,
Yves Moreau
Abstract:
We present a generative approach to classify scarcely observed longitudinal patient trajectories. The available time series are represented as tensors and factorized using generative deep recurrent neural networks. The learned factors represent the patient data in a compact way and can then be used in a downstream classification task. For more robustness and accuracy in the predictions, we used an ensemble of those deep generative models to mimic Bayesian posterior sampling. We illustrate the performance of our architecture on an intensive-care case study of in-hospital mortality prediction with 96 longitudinal measurement types measured across the first 48 hours from admission. Our combination of generative and ensemble strategies achieves an AUC of over 0.85, and outperforms the SAPS-II mortality score and GRU baselines.
Submitted 28 November, 2018; v1 submitted 26 November, 2018;
originally announced November 2018.
-
The smallest eigenvalues of Hamming graphs, Johnson graphs and other distance-regular graphs with classical parameters
Authors:
Andries E. Brouwer,
Sebastian M. Cioabă,
Ferdinand Ihringer,
Matt McGinnis
Abstract:
We prove a conjecture by Van Dam and Sotirov on the smallest eigenvalue of (distance-$j$) Hamming graphs and a conjecture by Karloff on the smallest eigenvalue of (distance-$j$) Johnson graphs. More generally, we study the smallest eigenvalue and the second largest eigenvalue in absolute value of the graphs of the relations of classical $P$- and $Q$-polynomial association schemes.
Submitted 20 April, 2018; v1 submitted 26 September, 2017;
originally announced September 2017.
-
Uniqueness of codes using semidefinite programming
Authors:
Andries E. Brouwer,
Sven C. Polak
Abstract:
For $n,d,w \in \mathbb{N}$, let $A(n,d,w)$ denote the maximum size of a binary code of word length $n$, minimum distance $d$ and constant weight $w$. Schrijver recently showed using semidefinite programming that $A(23,8,11)=1288$, and the second author that $A(22,8,11)=672$ and $A(22,8,10)=616$. Here we show uniqueness of the codes achieving these bounds.
Let $A(n,d)$ denote the maximum size of a binary code of word length $n$ and minimum distance $d$. Gijswijt, Mittelmann and Schrijver showed that $A(20,8)=256$. We show that there are several nonisomorphic codes achieving this bound, and classify all such codes with all distances divisible by 4.
Submitted 24 November, 2018; v1 submitted 7 September, 2017;
originally announced September 2017.
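To make the notation $A(n,d,w)$ in the entry above concrete, here is a small sketch that computes the length, minimum distance and weights of a given binary code. The example code is not from the paper: it is the seven lines of the Fano plane, written as cyclic shifts of the support {0, 1, 3} modulo 7, and the function name is illustrative.

```python
from itertools import combinations

def code_parameters(words):
    """Return (length, minimum Hamming distance, set of weights) of a binary code."""
    n = len(words[0])
    d = min(sum(a != b for a, b in zip(u, v)) for u, v in combinations(words, 2))
    weights = {sum(w) for w in words}
    return n, d, weights

# Lines of the Fano plane as cyclic shifts of the support {0, 1, 3} mod 7.
fano = [tuple(1 if (i - s) % 7 in (0, 1, 3) else 0 for i in range(7))
        for s in range(7)]
```

Any two lines of the Fano plane meet in exactly one point, so two codewords differ in 3 + 3 - 2 = 4 positions. The code therefore has length 7, minimum distance 4 and constant weight 3, and its 7 words match the known value $A(7,4,3)=7$. The sketch only verifies the parameters of a given code; it does not prove maximality or uniqueness.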
-
Counterexamples to conjectures about Subset Takeaway and counting linear extensions of a Boolean lattice
Authors:
Andries E. Brouwer,
J. Daniel Christensen
Abstract:
We develop an algorithm for efficiently computing recursively defined functions on posets. We illustrate this algorithm by disproving conjectures about the game Subset Takeaway (Chomp on a hypercube) and computing the number of linear extensions of the lattice of a 7-cube and related lattices.
Submitted 10 July, 2017; v1 submitted 9 February, 2017;
originally announced February 2017.
-
Distance-regular graphs where the distance-$d$ graph has fewer distinct eigenvalues
Authors:
A. E. Brouwer,
M. A. Fiol
Abstract:
Let the Kneser graph $K$ of a distance-regular graph $Γ$ be the graph on the same vertex set as $Γ$, where two vertices are adjacent when they have maximal distance in $Γ$. We study the situation where the Bose-Mesner algebra of $Γ$ is not generated by the adjacency matrix of $K$. In particular, we obtain strong results in the so-called `half antipodal' case.
Submitted 1 September, 2014;
originally announced September 2014.
-
Notes on simplicial rook graphs
Authors:
Andries E. Brouwer,
Sebastian M. Cioabă,
Willem H. Haemers,
Jason R. Vermette
Abstract:
The simplicial rook graph ${\rm SR}(m,n)$ is the graph of which the vertices are the sequences of nonnegative integers of length $m$ summing to $n$, where two such sequences are adjacent when they differ in precisely two places. We show that ${\rm SR}(m,n)$ has integral eigenvalues, and smallest eigenvalue $s = \max (-n, -{m \choose 2})$, and that this graph has a large part of its spectrum in common with the Johnson graph $J(m+n-1,n)$. We determine the automorphism group and several other properties.
Submitted 24 August, 2014;
originally announced August 2014.
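The definition of ${\rm SR}(m,n)$ in the entry above translates directly into a brute-force construction. The sketch below builds the graph for small parameters; the function names are illustrative, and the quadratic pair loop is chosen for clarity rather than speed.

```python
from itertools import combinations
from math import comb

def compositions(m, n):
    """All length-m tuples of nonnegative integers summing to n (m >= 1)."""
    if m == 1:
        return [(n,)]
    return [(k,) + rest
            for k in range(n + 1)
            for rest in compositions(m - 1, n - k)]

def simplicial_rook_graph(m, n):
    """Adjacency sets of SR(m, n): vertices adjacent when they differ in exactly two places."""
    verts = compositions(m, n)
    adj = {v: set() for v in verts}
    for u, v in combinations(verts, 2):
        if sum(a != b for a, b in zip(u, v)) == 2:
            adj[u].add(v)
            adj[v].add(u)
    return adj

sr33 = simplicial_rook_graph(3, 3)
```

For m = n = 3 this gives comb(5, 2) = 10 vertices. Each vertex has degree n(m-1) = 6, since for every pair of positions (i, j) the sum a_i + a_j can be redistributed in a_i + a_j other ways, and these counts add up to (m-1)n over all pairs.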
-
Godsil-McKay switching and isomorphism
Authors:
Aida Abiad,
Andries E. Brouwer,
Willem H. Haemers
Abstract:
Godsil-McKay switching is an operation on graphs that doesn't change the spectrum of the adjacency matrix. Usually (but not always) the obtained graph is non-isomorphic with the original graph. We present a straightforward sufficient condition for being isomorphic after switching, and give examples which show that this condition is not necessary. For some graph products we obtain sufficient conditions for being non-isomorphic after switching. As an example we find that the tensor product of the $\ell\times m$ grid ($\ell>m\geq 2$) and a graph with at least one vertex of degree two is not determined by its adjacency spectrum.
Submitted 16 June, 2014;
originally announced June 2014.
-
Lossy gossip and composition of metrics
Authors:
Andries E. Brouwer,
Jan Draisma,
Bart J. Frenk
Abstract:
We study the monoid generated by n-by-n distance matrices under tropical (or min-plus) multiplication. Using the tropical geometry of the orthogonal group, we prove that this monoid is a finite polyhedral fan of dimension n(n-1)/2, and we compute the structure of this fan for n up to 5. The monoid captures gossip among n gossipers over lossy phone lines, and contains the gossip monoid over ordinary phone lines as a submonoid. We prove several new results about this submonoid, as well. In particular, we establish a sharp bound on chains of calls in each of which someone learns something new.
Submitted 9 January, 2015; v1 submitted 23 May, 2014;
originally announced May 2014.
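The tropical (min-plus) product used in the entry above replaces ordinary multiplication by addition and ordinary addition by taking the minimum. A minimal sketch with two small symmetric matrices chosen purely for illustration (they are not taken from the paper):

```python
def min_plus(a, b):
    """Tropical product: (a * b)[i][j] = min over k of a[i][k] + b[k][j]."""
    n = len(a)
    return [[min(a[i][k] + b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

# Satisfies the triangle inequality, so squaring changes nothing.
metric = [[0, 1, 2],
          [1, 0, 1],
          [2, 1, 0]]

# Violates it (5 > 1 + 1), so squaring shortens the long entry.
d = [[0, 5, 1],
     [5, 0, 1],
     [1, 1, 0]]

squared = min_plus(d, d)
```

A symmetric matrix with zero diagonal is idempotent under this product exactly when it satisfies the triangle inequality, which is why `metric` is unchanged while the entry 5 in `d` drops to 2 via the detour through the third point.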
-
The degrees of a system of parameters of the ring of invariants of a binary form
Authors:
Andries E. Brouwer,
Jan Draisma,
Mihaela Popoviciu
Abstract:
We consider the degrees of the elements of a homogeneous system of parameters for the ring of invariants of a binary form, give a divisibility condition, and a complete classification for forms of degree at most 8.
Submitted 23 April, 2014;
originally announced April 2014.
-
Sylvester versus Gundelfinger
Authors:
Andries E. Brouwer,
Mihaela Popoviciu
Abstract:
Let $V_n$ be the ${\rm SL}_2$-module of binary forms of degree $n$ and let $V = V_1 \oplus V_3 \oplus V_4$. We show that the minimum number of generators of the algebra $R = \mathbb{C}[V]^{{\rm SL}_2}$ of polynomial functions on $V$ invariant under the action of ${\rm SL}_2$ equals 63. This settles a 143-year-old question.
Submitted 19 October, 2012;
originally announced October 2012.
-
Two distance-regular graphs
Authors:
Andries E. Brouwer,
Dmitrii V. Pasechnik
Abstract:
We construct two families of distance-regular graphs, namely the subgraph of the dual polar graph of type B_3(q) induced on the vertices far from a fixed point, and the subgraph of the dual polar graph of type D_4(q) induced on the vertices far from a fixed edge. The latter is the extended bipartite double of the former.
Submitted 3 July, 2011;
originally announced July 2011.
-
The Elementary Divisors of the Incidence Matrix of Skew Lines in PG(3,q)
Authors:
Andries E. Brouwer,
Joshua E. Ducey,
Peter Sin
Abstract:
The elementary divisors of the incidence matrix of lines in PG(3,q) are computed, where two lines are incident if and only if they are skew.
Submitted 6 October, 2011; v1 submitted 28 February, 2011;
originally announced March 2011.
-
SL2-modules of small homological dimension
Authors:
Andries E. Brouwer,
Mihaela Popoviciu
Abstract:
Let Vn be the SL2-module of binary forms of degree n and let V = Vn1+...+Vnp . We consider the algebra R of polynomial functions on V invariant under the action of SL2. The measure of the intricacy of these algebras is the length of their chains of syzygies, called homological dimension hdR. Popov gave in 1983 a classification of the cases in which hdR <=10 for a single binary form (p = 1) or hdR <=3 for a system of two or more binary forms (p > 1). We extend Popov's result and determine for p = 1 the cases with hdR <= 100, and for p > 1 those with hdR <= 15. In these cases we give a set of homogeneous parameters and a set of generators for the algebra R.
Submitted 21 February, 2011;
originally announced February 2011.
-
The invariants of the binary decimic
Authors:
Andries E. Brouwer,
Mihaela Popoviciu
Abstract:
We consider the algebra of invariants of binary forms of degree 10 with complex coefficients, construct a system of parameters with degrees 2, 4, 6, 6, 8, 9, 10, 14 and find the 106 basic invariants.
Submitted 4 February, 2010;
originally announced February 2010.
-
The invariants of the binary nonic
Authors:
Andries E. Brouwer,
Mihaela Popoviciu
Abstract:
We consider the algebra of invariants of binary forms of degree 9 with complex coefficients, find the 92 basic invariants, give an explicit system of parameters and show the existence of four more systems of parameters with different sets of degrees.
Submitted 3 February, 2010;
originally announced February 2010.
-
Equivariant Groebner bases and the Gaussian two-factor model
Authors:
Andries E. Brouwer,
Jan Draisma
Abstract:
Exploiting symmetry in Groebner basis computations is difficult when the symmetry takes the form of a group acting by automorphisms on monomials in finitely many variables. This is largely due to the fact that the group elements, being invertible, cannot preserve a term order. By contrast, inspired by work of Aschenbrenner and Hillar, we introduce the concept of equivariant Groebner basis in a setting where a _monoid_ acts by _homomorphisms_ on monomials in potentially infinitely many variables. We require that the action be compatible with a term order, and under some further assumptions derive a Buchberger-type algorithm for computing equivariant Groebner bases. Using this algorithm and the monoid of strictly increasing functions N -> N we prove that the kernel of the ring homomorphism R[y_{ij} | i,j in N, i > j] -> R[s_i,t_i | i in N], y_{ij} -> s_i s_j + t_i t_j is generated by two types of polynomials: off-diagonal 3x3-minors and pentads. This confirms a conjecture by Drton, Sturmfels, and Sullivant on the Gaussian two-factor model from algebraic statistics.
Submitted 6 January, 2010; v1 submitted 11 August, 2009;
originally announced August 2009.
-
Algebraic Graph Theory (a short course for postgraduate students and researchers)
Authors:
A. E. Brouwer,
W. H. Haemers
Abstract:
This submission has been withdrawn by arXiv administration.
Submitted 12 June, 2008; v1 submitted 30 May, 2008;
originally announced June 2008.