Showing 1–31 of 31 results for author: Ioannidis, V N

  1. arXiv:2406.11200  [pdf, other]

    cs.LG cs.CL

    AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval

    Authors: Shirley Wu, Shiyu Zhao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis N. Ioannidis, Karthik Subbian, Jure Leskovec, James Zou

    Abstract: Large language model (LLM) agents have demonstrated impressive capability in utilizing external tools and knowledge to boost accuracy and reduce hallucinations. However, developing the prompting techniques that make LLM agents able to effectively use external tools and knowledge is a heuristic and laborious task. Here, we introduce AvaTaR, a novel and automatic framework that optimizes an LLM agen…

    Submitted 17 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures, 6 tables

  2. arXiv:2405.00988  [pdf, other]

    cs.CL cs.LG

    Context-Aware Clustering using Large Language Models

    Authors: Sindhu Tipirneni, Ravinarayana Adkathimar, Nurendra Choudhary, Gaurush Hiranandani, Rana Ali Amjad, Vassilis N. Ioannidis, Changhe Yuan, Chandan K. Reddy

    Abstract: Despite the remarkable success of Large Language Models (LLMs) in text understanding and generation, their potential for text clustering tasks remains underexplored. We observed that powerful closed-source LLMs provide good quality clusterings of entity sets but are not scalable due to the massive compute power required and the associated costs. Thus, we propose CACTUS (Context-Aware ClusTering wi…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 16 pages

    ACM Class: I.2.7; I.2.m

  3. arXiv:2404.13207  [pdf, other]

    cs.IR cs.LG

    STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases

    Authors: Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou, Jure Leskovec

    Abstract: Answering real-world complex queries, such as complex product search, often requires accurate retrieval from semi-structured knowledge bases that involve a blend of unstructured (e.g., textual descriptions of products) and structured (e.g., entity relations of products) information. However, previous works have mostly studied textual and relational retrieval tasks as separate topics. To address the…

    Submitted 20 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 26 pages, 6 figures

  4. arXiv:2402.07999  [pdf, other]

    cs.LG cs.SI

    NetInfoF Framework: Measuring and Exploiting Network Usable Information

    Authors: Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

    Abstract: Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well? More specifically, do the graph structure and the node features carry enough usable information for the task? Our goals are (1) to develop a fast tool to measure how much information is in the graph structure and in the node features, and (2) to e…

    Submitted 20 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICLR 2024 (Spotlight)

  5. arXiv:2310.03320  [pdf, other]

    cs.LG

    BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs

    Authors: Zifeng Wang, Zichen Wang, Balasubramaniam Srinivasan, Vassilis N. Ioannidis, Huzefa Rangwala, Rishita Anubhai

    Abstract: Foundation models (FMs) are able to leverage large volumes of unlabeled data to demonstrate superior performance across a wide range of tasks. However, FMs developed for biomedical domains have largely remained unimodal, i.e., independently trained and used for tasks on protein sequences alone, small molecule structures alone, or clinical data alone. To overcome this limitation of biomedical FMs,…

    Submitted 18 January, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  6. arXiv:2309.13885  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.SI

    TouchUp-G: Improving Feature Representation through Graph-Centric Finetuning

    Authors: Jing Zhu, Xiang Song, Vassilis N. Ioannidis, Danai Koutra, Christos Faloutsos

    Abstract: How can we enhance the node features acquired from Pretrained Models (PMs) to better suit downstream graph learning tasks? Graph Neural Networks (GNNs) have become the state-of-the-art approach for many high-impact, real-world graph applications. For feature-rich graphs, a prevalent practice involves utilizing a PM directly to generate features, without incorporating any domain adaptation techniqu…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: preprint, ongoing work

  7. arXiv:2306.02592  [pdf, other]

    cs.CL cs.AI cs.LG

    Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

    Authors: Han Xie, Da Zheng, Jun Ma, Houyu Zhang, Vassilis N. Ioannidis, Xiang Song, Qing Ping, Sheng Wang, Carl Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi

    Abstract: Model pre-training on large text corpora has been demonstrated effective for various downstream applications in the NLP domain. In the graph mining domain, a similar analogy can be drawn for pre-training graph models on large graphs in the hope of benefiting downstream graph applications, which has also been explored by several recent studies. However, no existing study has ever investigated the p…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: To be published in the KDD 2023 proceedings as a full paper

  8. arXiv:2306.00899  [pdf, other]

    cs.LG cs.IR cs.SI

    Pitfalls in Link Prediction with Graph Neural Networks: Understanding the Impact of Target-link Inclusion & Better Practices

    Authors: Jing Zhu, Yuhang Zhou, Vassilis N. Ioannidis, Shengyi Qian, Wei Ai, Xiang Song, Danai Koutra

    Abstract: While Graph Neural Networks (GNNs) are remarkably successful in a variety of high-impact applications, we demonstrate that, in link prediction, the common practices of including the edges being predicted in the graph at training and/or test time have an outsized impact on the performance of low-degree nodes. We theoretically and empirically investigate how these practices impact node-level performance acr…

    Submitted 17 December, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Extended Version of our WSDM'24 paper. 8 pages, 2 page appendix

  9. arXiv:2304.10668  [pdf, other]

    cs.LG

    Train Your Own GNN Teacher: Graph-Aware Distillation on Textual Graphs

    Authors: Costas Mavromatis, Vassilis N. Ioannidis, Shen Wang, Da Zheng, Soji Adeshina, Jun Ma, Han Zhao, Christos Faloutsos, George Karypis

    Abstract: How can we learn effective node representations on textual graphs? Graph Neural Networks (GNNs) that use Language Models (LMs) to encode textual information of graphs achieve state-of-the-art performance in many node classification tasks. Yet, combining GNNs with LMs has not been widely explored for practical deployments due to its scalability issues. In this work, we tackle this challenge by deve…

    Submitted 20 April, 2023; originally announced April 2023.

  10. arXiv:2302.00109  [pdf, other]

    cs.LG

    OrthoReg: Improving Graph-regularized MLPs via Orthogonality Regularization

    Authors: Hengrui Zhang, Shen Wang, Vassilis N. Ioannidis, Soji Adeshina, Jiani Zhang, Xiao Qin, Christos Faloutsos, Da Zheng, George Karypis, Philip S. Yu

    Abstract: Graph Neural Networks (GNNs) are currently dominating in modeling graph-structure data, while their high reliance on graph structure for inference significantly impedes them from widespread applications. By contrast, Graph-regularized MLPs (GR-MLPs) implicitly inject the graph structure information into model weights, while their performance can hardly match that of GNNs in most tasks. This motiva…

    Submitted 31 January, 2023; originally announced February 2023.

  11. arXiv:2210.00116  [pdf, other]

    cs.LG cs.AI q-bio.GN stat.ME stat.ML

    Predicting Cellular Responses with Variational Causal Inference and Refined Relational Information

    Authors: Yulun Wu, Robert A. Barton, Zichen Wang, Vassilis N. Ioannidis, Carlo De Donno, Layne C. Price, Luis F. Voloch, George Karypis

    Abstract: Predicting the responses of a cell under perturbations may bring important benefits to drug discovery and personalized therapeutics. In this work, we propose a novel graph variational Bayesian causal inference framework to predict a cell's gene expressions under counterfactual perturbations (perturbations that this cell did not factually receive), leveraging information representing biological kno…

    Submitted 17 April, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

  12. arXiv:2209.05935  [pdf, ps, other]

    stat.ML cs.AI cs.LG math.ST q-bio.GN

    Variational Causal Inference

    Authors: Yulun Wu, Layne C. Price, Zichen Wang, Vassilis N. Ioannidis, Robert A. Barton, George Karypis

    Abstract: Estimating an individual's potential outcomes under counterfactual treatments is a challenging task for traditional causal inference and supervised learning approaches when the outcome is high-dimensional (e.g. gene expressions, impulse responses, human faces) and covariates are relatively limited. In this case, to construct one's outcome under a counterfactual treatment, it is crucial to leverage…

    Submitted 31 January, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

  13. arXiv:2206.10781  [pdf, ps, other]

    cs.LG cs.CL

    Efficient and effective training of language and graph neural network models

    Authors: Vassilis N. Ioannidis, Xiang Song, Da Zheng, Houyu Zhang, Jun Ma, Yi Xu, Belinda Zeng, Trishul Chilimbi, George Karypis

    Abstract: Can we combine heterogeneous graph structure with text to learn high-quality semantic and behavioural representations? Graph neural networks (GNNs) encode numerical node attributes and graph structure to achieve impressive performance in a variety of supervised learning tasks. Current GNN approaches are challenged by textual features, which typically need to be encoded to a numerical vector before…

    Submitted 21 June, 2022; originally announced June 2022.

  14. arXiv:2206.08473  [pdf, other]

    cs.LG

    A Robust Stacking Framework for Training Deep Graph Models with Multifaceted Node Features

    Authors: Jiuhai Chen, Jonas Mueller, Vassilis N. Ioannidis, Tom Goldstein, David Wipf

    Abstract: Graph Neural Networks (GNNs) with numerical node features and graph structure as inputs have demonstrated superior performance on various supervised learning tasks with graph data. However, the numerical node features utilized by GNNs are commonly extracted from raw data which is of text or tabular (numeric/categorical) type in most real-world applications. The best models for such data types in mo…

    Submitted 16 June, 2022; originally announced June 2022.

  15. arXiv:2112.05785  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    TempoQR: Temporal Question Reasoning over Knowledge Graphs

    Authors: Costas Mavromatis, Prasanna Lakkur Subramanyam, Vassilis N. Ioannidis, Soji Adeshina, Phillip R. Howard, Tetiana Grinberg, Nagib Hakim, George Karypis

    Abstract: Knowledge Graph Question Answering (KGQA) involves retrieving facts from a Knowledge Graph (KG) using natural language queries. A KG is a curated set of facts consisting of entities linked by relations. Certain facts also include temporal information, forming a Temporal KG (TKG). Although many natural questions involve explicit or implicit time constraints, question answering (QA) over TKGs has bee…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  16. arXiv:2110.13413  [pdf, other]

    cs.LG

    Does your graph need a confidence boost? Convergent boosted smoothing on graphs with tabular node features

    Authors: Jiuhai Chen, Jonas Mueller, Vassilis N. Ioannidis, Soji Adeshina, Yangkun Wang, Tom Goldstein, David Wipf

    Abstract: For supervised learning with tabular data, decision tree ensembles produced via boosting techniques generally dominate real-world applications involving iid training/test sets. However, for graph data where the iid assumption is violated due to structured relations between samples, it remains unclear how to best incorporate this structure within existing boosting pipelines. To this end, we propose…

    Submitted 4 October, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

  17. arXiv:2110.06290  [pdf, other]

    cs.LG

    Scalable Consistency Training for Graph Neural Networks via Self-Ensemble Self-Distillation

    Authors: Cole Hawkins, Vassilis N. Ioannidis, Soji Adeshina, George Karypis

    Abstract: Consistency training is a popular method to improve deep learning models in computer vision and natural language processing. Graph neural networks (GNNs) have achieved remarkable performance in a variety of network science learning tasks, but to date no work has studied the effect of consistency training on large-scale graph problems. GNNs scale to large graphs by minibatch training and subsample…

    Submitted 12 October, 2021; originally announced October 2021.

  18. arXiv:2104.08637  [pdf, other]

    cs.SI cs.AI eess.SP

    Unveiling Anomalous Edges and Nominal Connectivity of Attributed Networks

    Authors: Konstantinos D. Polyzos, Costas Mavromatis, Vassilis N. Ioannidis, Georgios B. Giannakis

    Abstract: Uncovering anomalies in attributed networks has recently gained popularity due to its importance in unveiling outliers and flagging adversarial behavior in a gamut of data and network science applications including the Internet of Things (IoT), finance, and security, to list a few. The present work deals with uncovering anomalous edges in attributed graphs using two distinct formulations with comple…

    Submitted 17 April, 2021; originally announced April 2021.

  19. arXiv:2007.12731  [pdf, other]

    cs.IR cs.AI cs.CL

    COVID-19 Knowledge Graph: Accelerating Information Retrieval and Discovery for Scientific Literature

    Authors: Colby Wise, Vassilis N. Ioannidis, Miguel Romero Calvo, Xiang Song, George Price, Ninad Kulkarni, Ryan Brand, Parminder Bhatia, George Karypis

    Abstract: The coronavirus disease (COVID-19) has claimed the lives of over 350,000 people and infected more than 6 million people worldwide. Several search engines have surfaced to provide researchers with additional tools to find and retrieve information from the rapidly growing corpora on COVID-19. These engines lack extraction and visualization tools necessary to retrieve and interpret complex relations…

    Submitted 24 July, 2020; originally announced July 2020.

  20. arXiv:2007.10445  [pdf, other]

    cs.LG stat.ML

    PanRep: Graph neural networks for extracting universal node embeddings in heterogeneous graphs

    Authors: Vassilis N. Ioannidis, Da Zheng, George Karypis

    Abstract: Learning unsupervised node embeddings facilitates several downstream tasks such as node classification and link prediction. A node embedding is universal if it is designed to be used by and benefit various downstream tasks. This work introduces PanRep, a graph neural network (GNN) model, for unsupervised learning of universal node representations for heterogeneous graphs. PanRep consists of a GNN e…

    Submitted 4 March, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

  21. arXiv:2007.10261  [pdf, ps, other]

    cs.LG stat.ML

    Few-shot link prediction via graph neural networks for Covid-19 drug-repurposing

    Authors: Vassilis N. Ioannidis, Da Zheng, George Karypis

    Abstract: Predicting interactions among heterogeneous graph structured data has numerous applications such as knowledge graph completion, recommendation systems and drug discovery. Oftentimes, the links to be predicted belong to rare types such as the case in repurposing drugs for novel diseases. This motivates the task of few-shot link prediction. Typically, GCNs are ill-equipped in learning such rare link…

    Submitted 20 July, 2020; originally announced July 2020.

  22. arXiv:2003.07729  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    Tensor Graph Convolutional Networks for Multi-relational and Robust Learning

    Authors: Vassilis N. Ioannidis, Antonio G. Marques, Georgios B. Giannakis

    Abstract: The era of "data deluge" has sparked renewed interest in graph-based learning methods and their widespread applications ranging from sociology and biology to transportation and communications. In this context of graph-aware methods, the present paper introduces a tensor-graph convolutional network (TGCN) for scalable semi-supervised learning (SSL) from data associated with a collection of graphs,…

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: Graph Convolutional Networks, Robustness, Adversarial Attacks, Semi-supervised learning, Multi-relational/Heterogeneous networks. arXiv admin note: text overlap with arXiv:1910.09590, arXiv:1811.02061

  23. arXiv:2001.09882  [pdf, other]

    eess.SP cs.LG cs.SI

    Efficient and Stable Graph Scattering Transforms via Pruning

    Authors: Vassilis N. Ioannidis, Siheng Chen, Georgios B. Giannakis

    Abstract: Graph convolutional networks (GCNs) have well-documented performance in various graph learning tasks, but their analysis is still in its infancy. Graph scattering transforms (GSTs) offer training-free deep GCN models that extract features from graph data, and are amenable to generalization and stability analyses. The price paid by GSTs is exponential complexity in space and time that increases wit…

    Submitted 27 January, 2020; originally announced January 2020.

  24. arXiv:1910.09590  [pdf, ps, other]

    cs.LG stat.ML

    Edge Dithering for Robust Adaptive Graph Convolutional Networks

    Authors: Vassilis N. Ioannidis, Georgios B. Giannakis

    Abstract: Graph convolutional networks (GCNs) are vulnerable to perturbations of the graph structure that are either random or adversarially designed. The perturbed links modify the graph neighborhoods, which critically affects the performance of GCNs in semi-supervised learning (SSL) tasks. Aiming at robustifying GCNs conditioned on the perturbed graph, the present paper generates multiple auxiliary grap…

    Submitted 21 October, 2019; originally announced October 2019.

  25. arXiv:1910.09589  [pdf, other]

    cs.LG stat.ML

    GraphSAC: Detecting anomalies in large-scale graphs

    Authors: Vassilis N. Ioannidis, Dimitris Berberidis, Georgios B. Giannakis

    Abstract: A graph-based sampling and consensus (GraphSAC) approach is introduced to effectively detect anomalous nodes in large-scale graphs. Existing approaches rely on connectivity and attributes of all nodes to assign an anomaly score per node. However, nodal attributes and network links might be compromised by adversaries, rendering these holistic approaches vulnerable. Alleviating this limitation, Grap…

    Submitted 21 October, 2019; originally announced October 2019.

  26. arXiv:1811.02061  [pdf, other]

    cs.LG stat.ML

    A Recurrent Graph Neural Network for Multi-Relational Data

    Authors: Vassilis N. Ioannidis, Antonio G. Marques, Georgios B. Giannakis

    Abstract: The era of data deluge has sparked the interest in graph-based learning methods in a number of disciplines such as sociology, biology, neuroscience, or engineering. In this paper, we introduce a graph recurrent neural network (GRNN) for scalable semi-supervised learning from multi-relational data. Key aspects of the novel GRNN architecture are the use of multi-relational graphs, the dynamic adapta…

    Submitted 17 February, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: Submitted to ICASSP 2019

  27. arXiv:1809.08353  [pdf, other]

    stat.ML cs.LG cs.SI eess.SP

    Coupled Graphs and Tensor Factorization for Recommender Systems and Community Detection

    Authors: Vassilis N. Ioannidis, Ahmed S. Zamzam, Georgios B. Giannakis, Nicholas D. Sidiropoulos

    Abstract: Joint analysis of data from multiple information repositories facilitates uncovering the underlying structure in heterogeneous datasets. Single and coupled matrix-tensor factorization (CMTF) has been widely used in this context for imputation-based recommendation from ratings, social network, and other user-item data. When this side information is in the form of item-item correlation matrices or g…

    Submitted 30 May, 2019; v1 submitted 21 September, 2018; originally announced September 2018.

    Comments: This paper is submitted to the IEEE Transactions on Knowledge and Data Engineering. A preliminary version of this work was accepted for presentation in the special track of GlobalSIP on Tensor Methods for Signal Processing and Machine Learning

  28. arXiv:1805.06095  [pdf, other]

    cs.LG eess.SP stat.ML

    Semi-Blind Inference of Topologies and Dynamical Processes over Graphs

    Authors: Vassilis N. Ioannidis, Yanning Shen, Georgios B. Giannakis

    Abstract: Network science provides valuable insights across numerous disciplines including sociology, biology, neuroscience and engineering. A task of major practical importance in these application domains is inferring the network structure from noisy observations at a subset of nodes. Available methods for topology inference typically assume that the process over the network is observed at all nodes. Howe…

    Submitted 15 May, 2018; originally announced May 2018.

  29. arXiv:1711.10353  [pdf, other]

    stat.ML cs.LG eess.SP

    Kernel-based Inference of Functions over Graphs

    Authors: Vassilis N. Ioannidis, Meng Ma, Athanasios N. Nikolakopoulos, Georgios B. Giannakis, Daniel Romero

    Abstract: The study of networks has witnessed an explosive growth over the past decades with several ground-breaking methods introduced. A particularly interesting -- and prevalent in several fields of study -- problem is that of inferring a function defined over the nodes of a network. This work presents a versatile kernel-based framework for tackling this inference problem that naturally subsumes and gene…

    Submitted 10 April, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: To be published as a chapter in 'Adaptive Learning Methods for Nonlinear System Modeling', Elsevier Publishing, Eds. D. Comminiello and J.C. Principe (2018). This chapter surveys recent work on kernel-based inference of functions over graphs, including arXiv:1612.03615, arXiv:1605.07174, and arXiv:1711.09306

  30. arXiv:1711.09306  [pdf, other]

    cs.LG eess.SP stat.ML

    Inference of Spatio-Temporal Functions over Graphs via Multi-Kernel Kriged Kalman Filtering

    Authors: Vassilis N. Ioannidis, Daniel Romero, Georgios B. Giannakis

    Abstract: Inference of space-time varying signals on graphs emerges naturally in a plethora of network science related applications. A frequently encountered challenge pertains to reconstructing such dynamic processes, given their values over a subset of vertices and time instants. The present paper develops a graph-aware kernel-based kriged Kalman filter that accounts for the spatio-temporal variations, an…

    Submitted 25 November, 2017; originally announced November 2017.

    Comments: Submitted to IEEE Transactions on Signal Processing, Nov. 2017

  31. arXiv:1612.03615  [pdf, other]

    cs.LG eess.SP stat.ML

    Kernel-based Reconstruction of Space-time Functions on Dynamic Graphs

    Authors: Daniel Romero, Vassilis N. Ioannidis, Georgios B. Giannakis

    Abstract: Graph-based methods pervade the inference toolkits of numerous disciplines including sociology, biology, neuroscience, physics, chemistry, and engineering. A challenging problem encountered in this context pertains to determining the attributes of a set of vertices given those of another subset at possibly different time instants. Leveraging spatiotemporal dynamics can drastically reduce the numbe…

    Submitted 20 May, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: Submitted to IEEE Journal of Selected Topics in Signal Processing, Oct. 2016