-
Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning
Authors:
Danqing Wang,
Antonis Antoniades,
Kha-Dinh Luong,
Edwin Zhang,
Mert Kosan,
Jiachen Li,
Ambuj Singh,
William Yang Wang,
Lei Li
Abstract:
Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations is hard in real-world datasets due to a lack of human-annotated ground truth, which limits their use in areas like molecular sciences. Additionally, the increasing scale of these datasets poses a challenge for random search-based methods. In this paper, we develop a novel global explanation model, RLHEX, for molecular property prediction. It aligns the counterfactual explanations with human-defined principles, making the explanations more interpretable and easier for experts to evaluate. RLHEX includes a VAE-based graph generator to generate global explanations and an adapter to adjust the latent representation space to human-defined principles. Optimized by Proximal Policy Optimization (PPO), the global explanations produced by RLHEX cover 4.12% more input graphs and reduce the distance between the counterfactual explanation set and the input set by 0.47% on average across three molecular datasets. RLHEX provides a flexible framework to incorporate different human-designed principles into the counterfactual explanation generation process, aligning these explanations with domain expertise. The code and data are released at https://github.com/dqwang122/RLHEX.
Submitted 19 June, 2024;
originally announced June 2024.
-
DGCLUSTER: A Neural Framework for Attributed Graph Clustering via Modularity Maximization
Authors:
Aritra Bhowmick,
Mert Kosan,
Zexi Huang,
Ambuj Singh,
Sourav Medya
Abstract:
Graph clustering is a fundamental and challenging task in the field of graph mining where the objective is to group the nodes into clusters, taking into consideration the topology of the graph. It has several applications in diverse domains spanning social network analysis, recommender systems, computer vision, and bioinformatics. In this work, we propose a novel method, DGCluster, which primarily optimizes the modularity objective using graph neural networks and scales linearly with the graph size. Our method does not require the number of clusters to be specified as a part of the input and can also leverage the availability of auxiliary node-level information. We extensively test DGCluster on several real-world datasets of varying sizes, across multiple popular cluster quality metrics. Our approach consistently outperforms the state-of-the-art methods, demonstrating significant performance gains in almost all settings.
Submitted 19 December, 2023;
originally announced December 2023.
-
GNNX-BENCH: Unravelling the Utility of Perturbation-based GNN Explainers through In-depth Benchmarking
Authors:
Mert Kosan,
Samidha Verma,
Burouj Armgaan,
Khushbu Pahwa,
Ambuj Singh,
Sourav Medya,
Sayan Ranu
Abstract:
Numerous explainability methods have been proposed to shed light on the inner workings of GNNs. Despite the inclusion of empirical evaluations in all the proposed algorithms, the interrogative aspects of these evaluations lack diversity. As a result, various facets of explainability pertaining to GNNs, such as a comparative analysis of counterfactual reasoners, their stability to variational factors such as different GNN architectures, noise, stochasticity in non-convex loss surfaces, feasibility amidst domain constraints, and so forth, have yet to be formally investigated. Motivated by this need, we present a benchmarking study on perturbation-based explainability methods for GNNs, aiming to systematically evaluate and compare a wide range of explainability techniques. Among the key findings of our study, we identify the Pareto-optimal methods that exhibit superior efficacy and stability in the presence of noise. Nonetheless, our study reveals that all algorithms are affected by stability issues when faced with noisy data. Furthermore, we have established that the current generation of counterfactual explainers often fails to provide feasible recourses due to violations of topological constraints encoded by domain-specific considerations. Overall, this benchmarking study empowers stakeholders in the field of GNNs with a comprehensive understanding of the state-of-the-art explainability methods, potential research problems for further enhancement, and the implications of their application in real-world scenarios.
Submitted 14 March, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Robust Ante-hoc Graph Explainer using Bilevel Optimization
Authors:
Kha-Dinh Luong,
Mert Kosan,
Arlei Lopes Da Silva,
Ambuj Singh
Abstract:
Explaining the decisions made by machine learning models for high-stakes applications is critical for increasing transparency and guiding improvements to these decisions. This is particularly true in the case of models for graphs, where decisions often depend on complex patterns combining rich structural and attribute data. While recent work has focused on designing so-called post-hoc explainers, the broader question of what constitutes a good explanation remains open. One intuitive property is that explanations should be sufficiently informative to reproduce the predictions given the data. In other words, a good explainer can be repurposed as a predictor. Post-hoc explainers do not achieve this goal as their explanations are highly dependent on fixed model parameters (e.g., learned GNN weights). To address this challenge, we propose RAGE (Robust Ante-hoc Graph Explainer), a novel and flexible ante-hoc explainer designed to discover explanations for graph neural networks using bilevel optimization, with a focus on the chemical domain. RAGE can effectively identify molecular substructures that contain the full information needed for prediction while enabling users to rank these explanations in terms of relevance. Our experiments on various molecular classification tasks show that RAGE explanations are better than existing post-hoc and ante-hoc approaches.
Submitted 4 June, 2024; v1 submitted 25 May, 2023;
originally announced May 2023.
-
Link Prediction without Graph Neural Networks
Authors:
Zexi Huang,
Mert Kosan,
Arlei Silva,
Ambuj Singh
Abstract:
Link prediction, which consists of predicting edges based on graph features, is a fundamental task in many graph applications. As for several related problems, Graph Neural Networks (GNNs), which are based on an attribute-centric message-passing paradigm, have become the predominant framework for link prediction. GNNs have consistently outperformed traditional topology-based heuristics, but what contributes to their performance? Are there simpler approaches that achieve comparable or better results? To answer these questions, we first identify important limitations in how GNN-based link prediction methods handle the intrinsic class imbalance of the problem -- due to the graph sparsity -- in their training and evaluation. Moreover, we propose Gelato, a novel topology-centric framework that applies a topological heuristic to a graph enhanced by attribute information via graph learning. Our model is trained end-to-end with an N-pair loss on an unbiased training set to address class imbalance. Experiments show that Gelato is 145% more accurate, trains 11 times faster, infers 6,000 times faster, and has less than half of the trainable parameters compared to state-of-the-art GNNs for link prediction.
Submitted 22 May, 2023;
originally announced May 2023.
-
Global Counterfactual Explainer for Graph Neural Networks
Authors:
Mert Kosan,
Zexi Huang,
Sourav Medya,
Sayan Ranu,
Ambuj Singh
Abstract:
Graph neural networks (GNNs) find applications in various domains such as computational biology, natural language processing, and computer security. Owing to their popularity, there is an increasing need to explain GNN predictions since GNNs are black-box machine learning models. One way to address this is counterfactual reasoning, where the objective is to change the GNN prediction by minimal changes in the input graph. Existing methods for counterfactual explanation of GNNs are limited to instance-specific local reasoning. This approach has two major limitations: it cannot offer global recourse policies, and it overloads human cognitive ability with too much information. In this work, we study the global explainability of GNNs through global counterfactual reasoning. Specifically, we want to find a small set of representative counterfactual graphs that explains all input graphs. Towards this goal, we propose GCFExplainer, a novel algorithm powered by vertex-reinforced random walks on an edit map of graphs with a greedy summary. Extensive experiments on real graph datasets show that the global explanation from GCFExplainer provides important high-level insights into the model behavior and achieves a 46.9% gain in recourse coverage and a 9.5% reduction in recourse cost compared to the state-of-the-art local counterfactual explainers.
Submitted 10 November, 2022; v1 submitted 20 October, 2022;
originally announced October 2022.
-
Event Detection on Dynamic Graphs
Authors:
Mert Kosan,
Arlei Silva,
Sourav Medya,
Brian Uzzi,
Ambuj Singh
Abstract:
Event detection is a critical task for timely decision-making in graph analytics applications. Despite the recent progress towards deep learning on graphs, event detection on dynamic graphs presents particular challenges to existing architectures. Real-life events are often associated with sudden deviations from the normal behavior of the graph. However, existing approaches for dynamic node embedding are unable to capture the graph-level dynamics related to events. In this paper, we propose DyGED, a simple yet novel deep learning model for event detection on dynamic graphs. DyGED learns correlations between the graph macro dynamics (i.e., a sequence of graph-level representations) and labeled events. Moreover, our approach combines structural and temporal self-attention mechanisms to effectively account for application-specific node and time importance. Our experimental evaluation, using a representative set of datasets, demonstrates that DyGED outperforms competing solutions in terms of event detection accuracy by up to 8.5% while being more scalable than the top alternatives. We also present case studies illustrating key features of our model.
Submitted 13 February, 2023; v1 submitted 23 October, 2021;
originally announced October 2021.
-
Feature-based Individual Fairness in k-Clustering
Authors:
Debajyoti Kar,
Mert Kosan,
Debmalya Mandal,
Sourav Medya,
Arlei Silva,
Palash Dey,
Swagato Sanyal
Abstract:
Ensuring fairness in machine learning algorithms is a challenging and essential task. We consider the problem of clustering a set of points while satisfying fairness constraints. While there have been several attempts to capture group fairness in the $k$-clustering problem, fairness at an individual level is relatively less explored. We introduce a new notion of individual fairness in $k$-clustering based on features not necessarily used for clustering. We show that this problem is NP-hard and does not admit a constant factor approximation. Therefore, we design a randomized algorithm that guarantees approximation both in terms of minimizing the clustering distance objective and individual fairness under natural restrictions on the distance metric and fairness constraints. Finally, our experimental results against six competing baselines validate that our algorithm produces individually fairer clusters than the fairest baseline by 12.5% on average while also being less costly in terms of the clustering objective than the best baseline by 34.5% on average.
Submitted 3 February, 2023; v1 submitted 9 September, 2021;
originally announced September 2021.
-
On Fusible Rings
Authors:
M. Tamer Kosan,
Jerzy Matczuk
Abstract:
We answer in the negative two of the questions posed in [4]. We also establish a new characterization of semiprime left Goldie rings by showing that a semiprime ring $R$ is left Goldie if and only if it is regular left fusible and has finite left Goldie dimension.
Submitted 1 January, 2019;
originally announced January 2019.
-
On UJ-rings
Authors:
M. Tamer Kosan,
Andre Leroy,
Jerzy Matczuk
Abstract:
UJ-rings, i.e. rings in which every unit can be written in the form $1 + x$ for some $x \in J(R)$, are studied. The behavior of UJ-rings under various algebraic constructions is investigated. In particular, it is shown that the problem of lifting the UJ property from a ring $R$ to the polynomial ring $R[x]$ is equivalent to Köthe's problem for $F_2$-algebras.
Submitted 29 August, 2017;
originally announced August 2017.
-
Rings with each right ideal automorphism-invariant
Authors:
M. Tamer Koşan,
Truong Cong Quynh,
Ashish K. Srivastava
Abstract:
In this paper, we study rings having the property that every right ideal is automorphism-invariant. Such rings are called right $a$-rings. It is shown that (1) a right $a$-ring is a direct sum of a square-full semisimple artinian ring and a right square-free ring, (2) a ring $R$ is semisimple artinian if and only if the matrix ring $\mathbb{M}_n(R)$ for some $n>1$ is a right $a$-ring, (3) every right $a$-ring is stably-finite, (4) a right $a$-ring is von Neumann regular if and only if it is semiprime, and (5) a prime right $a$-ring is simple artinian. We also describe the structure of an indecomposable right artinian right non-singular right $a$-ring as a triangular matrix ring of certain block matrices.
Submitted 30 August, 2015; v1 submitted 7 March, 2015;
originally announced March 2015.
-
Correspondences of coclosed submodules
Authors:
Septimiu Crivei,
Hatice Inankıl,
M. Tamer Koşan,
Gabriela Olteanu
Abstract:
We establish an order-preserving bijective correspondence between the sets of coclosed elements of some bounded lattices related by suitable Galois connections. As an application, we deduce that if $M$ is a finitely generated quasi-projective left $R$-module with $S=\mathrm{End}_R(M)$ and $N$ is an $M$-generated left $R$-module, then there exists an order-preserving bijective correspondence between the sets of coclosed left $R$-submodules of $N$ and coclosed left $S$-submodules of $\mathrm{Hom}_R(M,N)$.
Submitted 4 March, 2012;
originally announced March 2012.