Skip to main content

Showing 1–32 of 32 results for author: Galkin, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09639  [pdf, other

    cs.LG cs.SI

    TGB 2.0: A Benchmark for Learning on Temporal Knowledge Graphs and Heterogeneous Graphs

    Authors: Julia Gastinger, Shenyang Huang, Mikhail Galkin, Erfan Loghmani, Ali Parviz, Farimah Poursafaei, Jacob Danovitch, Emanuele Rossi, Ioannis Koutis, Heiner Stuckenschmidt, Reihaneh Rabbany, Guillaume Rabusseau

    Abstract: Multi-relational temporal graphs are powerful tools for modeling real-world data, capturing the evolving and interconnected nature of entities over time. Recently, many novel models are proposed for ML on such graphs intensifying the need for robust evaluation and standardized benchmark datasets. However, the availability of such resources remains scarce and evaluation faces added complexity due t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 27 pages, 8 figures

  2. arXiv:2405.20445  [pdf, other

    cs.LG cs.SI

    GraphAny: A Foundation Model for Node Classification on Any Graph

    Authors: Jianan Zhao, Hesham Mostafa, Mikhail Galkin, Michael Bronstein, Zhaocheng Zhu, Jian Tang

    Abstract: Foundation models that can perform inference on any new task without requiring specific training have revolutionized machine learning in vision and language applications. However, applications involving graph-structured data remain a tough nut for foundation models, due to challenges in the unique feature- and label spaces associated with each graph. Traditional graph ML models such as graph neura… ▽ More

    Submitted 2 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Preprint. Work in progress

  3. arXiv:2405.05495  [pdf, other

    cs.OH

    PARSAC: Fast, Human-quality Floorplanning for Modern SoCs with Complex Design Constraints

    Authors: Hesham Mostafa, Uday Mallappa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: The floorplanning of Systems-on-a-Chip (SoCs) and of chip sub-systems is a crucial step in the physical design flow as it determines the optimal shapes and locations of the blocks that make up the system. Simulated Annealing (SA) has been the method of choice for tackling classical floorplanning problems where the objective is to minimize wire-length and the total placement area. The goal in indus… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 figures

  4. arXiv:2405.05480  [pdf, other

    cs.AR cs.AI cs.LG

    FloorSet -- a VLSI Floorplanning Dataset with Design Constraints of Real-World SoCs

    Authors: Uday Mallappa, Hesham Mostafa, Mikhail Galkin, Mariano Phielipp, Somdeb Majumdar

    Abstract: Floorplanning for systems-on-a-chip (SoCs) and its sub-systems is a crucial and non-trivial step of the physical design flow. It represents a difficult combinatorial optimization problem. A typical large scale SoC with 120 partitions generates a search-space of nearly 10E250. As novel machine learning (ML) approaches emerge to tackle such problems, there is a growing need for a modern benchmark th… ▽ More

    Submitted 27 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 10 pages, 11 figures

  5. arXiv:2404.07198  [pdf, other

    cs.AI cs.LG

    Zero-shot Logical Query Reasoning on any Knowledge Graph

    Authors: Mikhail Galkin, **cheng Zhou, Bruno Ribeiro, Jian Tang, Zhaocheng Zhu

    Abstract: Complex logical query answering (CLQA) in knowledge graphs (KGs) goes beyond simple KG completion and aims at answering compositional queries comprised of multiple projections and logical operations. Existing CLQA methods that learn parameters bound to certain entity or relation vocabularies can only be applied to the graph they are trained on which requires substantial training time before being… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  6. arXiv:2402.02216  [pdf, other

    cs.LG

    Position: Graph Foundation Models are Already Here

    Authors: Haitao Mao, Zhikai Chen, Wenzhuo Tang, Jianan Zhao, Yao Ma, Tong Zhao, Neil Shah, Mikhail Galkin, Jiliang Tang

    Abstract: Graph Foundation Models (GFMs) are emerging as a significant research topic in the graph domain, aiming to develop graph models trained on extensive and diverse data to enhance their applicability across various tasks and domains. Develo** GFMs presents unique challenges over traditional Graph Neural Networks (GNNs), which are typically trained from scratch for specific tasks on particular datas… ▽ More

    Submitted 30 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: 23 pages, 2 figures

  7. arXiv:2310.18777  [pdf, other

    cs.LG cs.AI

    Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings

    Authors: Yi Ren, Samuel Lavoie, Mikhail Galkin, Danica J. Sutherland, Aaron Courville

    Abstract: Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process, ``iterated learning,'' to help explain how human language developed this ability; the theory rests on simultaneous pressures towards compressibility (when an ignorant a… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  8. arXiv:2310.04562  [pdf, other

    cs.CL cs.AI

    Towards Foundation Models for Knowledge Graph Reasoning

    Authors: Mikhail Galkin, Xinyu Yuan, Hesham Mostafa, Jian Tang, Zhaocheng Zhu

    Abstract: Foundation models in language and vision have the ability to run inference on any textual and visual inputs thanks to the transferable representations such as a vocabulary of tokens in language. Knowledge graphs (KGs) have different entity and relation vocabularies that generally do not overlap. The key challenge of designing foundation models on KGs is to learn such transferable representations t… ▽ More

    Submitted 9 April, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  9. arXiv:2309.05934  [pdf, other

    cond-mat.mtrl-sci cs.AI

    MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials Modeling

    Authors: Kin Long Kelvin Lee, Carmelo Gonzales, Marcel Nassar, Matthew Spellings, Mikhail Galkin, Santiago Miret

    Abstract: We propose MatSci ML, a novel benchmark for modeling MATerials SCIence using Machine Learning (MatSci ML) methods focused on solid-state materials with periodic crystal structures. Applying machine learning methods to solid-state materials is a nascent field with substantial fragmentation largely driven by the great variety of datasets used to develop machine learning models. This fragmentation ma… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  10. arXiv:2308.06585  [pdf, other

    cs.LG cs.AI cs.DB cs.LO cs.NE

    Approximate Answering of Graph Queries

    Authors: Michael Cochez, Dimitrios Alivanistos, Erik Arakelyan, Max Berrendorf, Daniel Daza, Mikhail Galkin, Pasquale Minervini, Mathias Niepert, Hongyu Ren

    Abstract: Knowledge graphs (KGs) are inherently incomplete because of incomplete world knowledge and bias in what is the input to the KG. Additionally, world knowledge constantly expands and evolves, making existing facts deprecated or introducing new ones. However, we would still want to be able to answer queries as if the graph were complete. In this chapter, we will give an overview of several methods wh… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Preprint of Ch. 17 "Approximate Answering of Graph Queries" in "Compendium of Neurosymbolic Artificial Intelligence", https://ebooks.iospress.nl/ISBN/978-1-64368-406-2

  11. arXiv:2303.14617  [pdf, other

    cs.DB cs.AI cs.LG

    Neural Graph Reasoning: Complex Logical Query Answering Meets Graph Databases

    Authors: Hongyu Ren, Mikhail Galkin, Michael Cochez, Zhaocheng Zhu, Jure Leskovec

    Abstract: Complex logical query answering (CLQA) is a recently emerged task of graph machine learning that goes beyond simple one-hop link prediction and solves a far more complex task of multi-hop logical reasoning over massive, potentially incomplete graphs in a latent space. The task received a significant traction in the community; numerous works expanded the field along theoretical and practical axes t… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

  12. arXiv:2302.04181  [pdf, other

    cs.LG cs.AI cs.NE

    Attending to Graph Transformers

    Authors: Luis Müller, Mikhail Galkin, Christopher Morris, Ladislav Rampášek

    Abstract: Recently, transformer architectures for graphs emerged as an alternative to established techniques for machine learning with graphs, such as (message-passing) graph neural networks. So far, they have shown promising empirical results, e.g., on molecular prediction datasets, often attributed to their ability to circumvent graph neural networks' shortcomings, such as over-smoothing and over-squashin… ▽ More

    Submitted 28 March, 2024; v1 submitted 8 February, 2023; originally announced February 2023.

  13. arXiv:2211.17113  [pdf, other

    cs.LG cs.NE stat.ML

    Weisfeiler and Leman Go Relational

    Authors: Pablo Barcelo, Mikhail Galkin, Christopher Morris, Miguel Romero Orth

    Abstract: Knowledge graphs, modeling multi-relational data, improve numerous applications such as question answering or graph logical reasoning. Many graph neural networks for such data emerged recently, often outperforming shallow architectures. However, the design of such multi-relational graph neural networks is ad-hoc, driven mainly by intuition and empirical insights. Up to now, their expressivity, the… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: Learning on Graphs Conference 2022. arXiv admin note: text overlap with arXiv:2206.11168

  14. arXiv:2210.08008  [pdf, other

    cs.AI cs.LG

    Inductive Logical Query Answering in Knowledge Graphs

    Authors: Mikhail Galkin, Zhaocheng Zhu, Hongyu Ren, Jian Tang

    Abstract: Formulating and answering logical queries is a standard communication interface for knowledge graphs (KGs). Alleviating the notorious incompleteness of real-world KGs, neural methods achieved impressive results in link prediction and complex query answering tasks by learning representations of entities, relations, and queries. Still, most existing query answering methods rely on transductive entit… ▽ More

    Submitted 8 November, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  15. arXiv:2210.07453  [pdf, ps, other

    cs.LG

    Using Graph Algorithms to Pretrain Graph Completion Transformers

    Authors: Jonathan Pilault, Michael Galkin, Bahare Fatemi, Perouz Taslakian, David Vasquez, Christopher Pal

    Abstract: Recent work on Graph Neural Networks has demonstrated that self-supervised pretraining can further enhance performance on downstream graph, link, and node classification tasks. However, the efficacy of pretraining tasks has not been fully investigated for downstream large knowledge graph completion tasks. Using a contextualized knowledge graph embedding approach, we investigate five different pret… ▽ More

    Submitted 27 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

  16. arXiv:2210.00105  [pdf, other

    cs.CL cs.AI

    A Decade of Knowledge Graphs in Natural Language Processing: A Survey

    Authors: Phillip Schneider, Tim Schopf, Juraj Vladika, Mikhail Galkin, Elena Simperl, Florian Matthes

    Abstract: In pace with developments in the research field of artificial intelligence, knowledge graphs (KGs) have attracted a surge of interest from both academia and industry. As a representation of semantic relations between entities, KGs have proven to be particularly relevant for natural language processing (NLP), experiencing a rapid spread and wide adoption within recent years. Given the increasing am… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted to AACL-IJCNLP 2022

  17. arXiv:2206.08164  [pdf, other

    cs.LG

    Long Range Graph Benchmark

    Authors: Vijay Prakash Dwivedi, Ladislav Rampášek, Mikhail Galkin, Ali Parviz, Guy Wolf, Anh Tuan Luu, Dominique Beaini

    Abstract: Graph Neural Networks (GNNs) that are based on the message passing (MP) paradigm generally exchange information between 1-hop neighbors to build node representations at each layer. In principle, such networks are not able to capture long-range interactions (LRI) that may be desired or necessary for learning a given task on graphs. Recently, there has been an increasing interest in development of T… ▽ More

    Submitted 28 November, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Added reference to Tönshoff et al., 2023 in Sec. 4.1; NeurIPS 2022 Track on D&B; Open-sourced at: https://github.com/vijaydwivedi75/lrgb

  18. arXiv:2206.04798  [pdf, other

    cs.AI cs.LG

    A*Net: A Scalable Path-based Reasoning Approach for Knowledge Graphs

    Authors: Zhaocheng Zhu, Xinyu Yuan, Mikhail Galkin, Sophie Xhonneux, Ming Zhang, Maxime Gazeau, Jian Tang

    Abstract: Reasoning on large-scale knowledge graphs has been long dominated by embedding methods. While path-based methods possess the inductive capacity that embeddings lack, their scalability is limited by the exponential number of paths. Here we present A*Net, a scalable path-based method for knowledge graph reasoning. Inspired by the A* algorithm for shortest path problems, our A*Net learns a priority f… ▽ More

    Submitted 8 November, 2023; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2023

  19. arXiv:2205.12454  [pdf, other

    cs.LG

    Recipe for a General, Powerful, Scalable Graph Transformer

    Authors: Ladislav Rampášek, Mikhail Galkin, Vijay Prakash Dwivedi, Anh Tuan Luu, Guy Wolf, Dominique Beaini

    Abstract: We propose a recipe on how to build a general, powerful, scalable (GPS) graph Transformer with linear complexity and state-of-the-art results on a diverse set of benchmarks. Graph Transformers (GTs) have gained popularity in the field of graph representation learning with a variety of recent publications but they lack a common foundation about what constitutes a good positional or structural encod… ▽ More

    Submitted 15 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: In Proceedings of NeurIPS 2022

  20. arXiv:2205.10128  [pdf, other

    cs.AI cs.LG

    Neural-Symbolic Models for Logical Queries on Knowledge Graphs

    Authors: Zhaocheng Zhu, Mikhail Galkin, Zuobai Zhang, Jian Tang

    Abstract: Answering complex first-order logic (FOL) queries on knowledge graphs is a fundamental task for multi-hop reasoning. Traditional symbolic methods traverse a complete knowledge graph to extract the answers, which provides good interpretation for each step. Recent neural methods learn geometric embeddings for complex queries. These methods can generalize to incomplete knowledge graphs, but their rea… ▽ More

    Submitted 6 September, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: ICML 2022

  21. arXiv:2203.07544  [pdf, other

    cs.LG cs.AI

    A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge Graphs

    Authors: Charles Tapley Hoyt, Max Berrendorf, Mikhail Galkin, Volker Tresp, Benjamin M. Gyori

    Abstract: The link prediction task on knowledge graphs without explicit negative triples in the training data motivates the usage of rank-based metrics. Here, we review existing rank-based metrics and propose desiderata for improved metrics to address lack of interpretability and comparability of existing metrics to datasets of different sizes and properties. We introduce a simple theoretical framework for… ▽ More

    Submitted 19 April, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted at the Workshop on Graph Learning Benchmarks @ The WebConf 2022

  22. arXiv:2203.01520  [pdf, other

    cs.LG cs.AI

    An Open Challenge for Inductive Link Prediction on Knowledge Graphs

    Authors: Mikhail Galkin, Max Berrendorf, Charles Tapley Hoyt

    Abstract: An emerging trend in representation learning over knowledge graphs (KGs) moves beyond transductive link prediction tasks over a fixed set of known entities in favor of inductive tasks that imply training on one graph and performing inference over a new graph with unseen entities. In inductive setups, node features are often not available and training shallow entity embedding matrices is meaningles… ▽ More

    Submitted 18 April, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted at the Workshop on Graph Learning Benchmarks @ The WebConf 2022

  23. arXiv:2108.09535  [pdf, other

    eess.IV cs.CV

    Systematic Clinical Evaluation of A Deep Learning Method for Medical Image Segmentation: Radiosurgery Application

    Authors: Boris Shirokikh, Alexandra Dalechina, Alexey Shevtsov, Egor Krivov, Valery Kostjuchenko, Amayak Durgaryan, Mikhail Galkin, Andrey Golanov, Mikhail Belyaev

    Abstract: We systematically evaluate a Deep Learning (DL) method in a 3D medical image segmentation task. Our segmentation method is integrated into the radiosurgery treatment process and directly impacts the clinical workflow. With our method, we address the relative drawbacks of manual segmentation: high inter-rater contouring variability and high time consumption of the contouring process. The main exten… ▽ More

    Submitted 21 August, 2021; originally announced August 2021.

  24. arXiv:2107.04894  [pdf, other

    cs.LG

    Improving Inductive Link Prediction Using Hyper-Relational Facts

    Authors: Mehdi Ali, Max Berrendorf, Mikhail Galkin, Veronika Thost, Tengfei Ma, Volker Tresp, Jens Lehmann

    Abstract: For many years, link prediction on knowledge graphs (KGs) has been a purely transductive task, not allowing for reasoning on unseen entities. Recently, increasing efforts are put into exploring semi- and fully inductive scenarios, enabling inference over unseen and emerging entities. Still, all these approaches only consider triple-based \glspl{kg}, whereas their richer counterparts, hyper-relatio… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

  25. arXiv:2106.12144  [pdf, other

    cs.CL cs.AI cs.LG

    NodePiece: Compositional and Parameter-Efficient Representations of Large Knowledge Graphs

    Authors: Mikhail Galkin, Etienne Denis, Jiapeng Wu, William L. Hamilton

    Abstract: Conventional representation learning algorithms for knowledge graphs (KG) map each entity to a unique embedding vector. Such a shallow lookup results in a linear growth of memory consumption for storing the embedding matrix and incurs high computational costs when working with real-world KGs. Drawing parallels with subword tokenization commonly used in NLP, we explore the landscape of more paramet… ▽ More

    Submitted 1 February, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Accepted to ICLR 2022

  26. arXiv:2106.08166  [pdf, other

    cs.AI cs.DB cs.IR cs.LG

    Query Embedding on Hyper-relational Knowledge Graphs

    Authors: Dimitrios Alivanistos, Max Berrendorf, Michael Cochez, Mikhail Galkin

    Abstract: Multi-hop logical reasoning is an established problem in the field of representation learning on knowledge graphs (KGs). It subsumes both one-hop link prediction as well as other more complex types of logical queries. Existing algorithms operate only on classical, triple-based graphs, whereas modern KGs often employ a hyper-relational modeling paradigm. In this paradigm, typed edges may have sever… ▽ More

    Submitted 6 September, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Presented at ICLR2022. https://openreview.net/forum?id=4rLw09TgRw9

  27. arXiv:2009.10847  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Message Passing for Hyper-Relational Knowledge Graphs

    Authors: Mikhail Galkin, Priyansh Trivedi, Gaurav Maheshwari, Ricardo Usbeck, Jens Lehmann

    Abstract: Hyper-relational knowledge graphs (KGs) (e.g., Wikidata) enable associating additional key-value pairs along with the main triple to disambiguate, or restrict the validity of a fact. In this work, we propose a message passing based graph encoder - StarE capable of modeling such hyper-relational KGs. Unlike existing approaches, StarE can encode an arbitrary number of additional information (qualifi… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020

  28. arXiv:2007.01955  [pdf, other

    cs.CL

    El Departamento de Nosotros: How Machine Translated Corpora Affects Language Models in MRC Tasks

    Authors: Maria Khvalchik, Mikhail Galkin

    Abstract: Pre-training large-scale language models (LMs) requires huge amounts of text corpora. LMs for English enjoy ever growing corpora of diverse language resources. However, less resourced languages and their mono- and multilingual LMs often struggle to obtain bigger datasets. A typical approach in this case implies using machine translation of English corpora to a target language. In this work, we stu… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

  29. arXiv:2006.13365  [pdf, other

    cs.LG cs.AI stat.ML

    Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework

    Authors: Mehdi Ali, Max Berrendorf, Charles Tapley Hoyt, Laurent Vermue, Mikhail Galkin, Sahand Sharifzadeh, Asja Fischer, Volker Tresp, Jens Lehmann

    Abstract: The heterogeneity in recently published knowledge graph embedding models' implementations, training, and evaluation has made fair and thorough comparisons difficult. In order to assess the reproducibility of previously published results, we re-implemented and evaluated 21 interaction models in the PyKEEN software package. Here, we outline which results could be reproduced with their reported hyper… ▽ More

    Submitted 1 November, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

  30. arXiv:1909.02799  [pdf, other

    eess.IV cs.CV physics.med-ph

    Deep Learning for Brain Tumor Segmentation in Radiosurgery: Prospective Clinical Evaluation

    Authors: Boris Shirokikh, Alexandra Dalechina, Alexey Shevtsov, Egor Krivov, Valery Kostjuchenko, Amayak Durgaryan, Mikhail Galkin, Ivan Osinov, Andrey Golanov, Mikhail Belyaev

    Abstract: Stereotactic radiosurgery is a minimally-invasive treatment option for a large number of patients with intracranial tumors. As part of the therapy treatment, accurate delineation of brain tumors is of great importance. However, slice-by-slice manual segmentation on T1c MRI could be time-consuming (especially for multiple metastases) and subjective. In our work, we compared several deep convolution… ▽ More

    Submitted 18 December, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

  31. arXiv:1902.08295  [pdf, other

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob , et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  32. arXiv:1503.06598  [pdf, other

    cs.IR

    Identifying Web Tables - Supporting a Neglected Type of Content on the Web

    Authors: Mikhail Galkin, Dmitry Mouromtsev, Sören Auer

    Abstract: The abundance of the data in the Internet facilitates the improvement of extraction and processing tools. The trend in the open data publishing encourages the adoption of structured formats like CSV and RDF. However, there is still a plethora of unstructured data on the Web which we assume contain semantics. For this reason, we propose an approach to derive semantics from web tables which are stil… ▽ More

    Submitted 23 March, 2015; originally announced March 2015.

    Comments: 9 pages, 4 figures