Skip to main content

Showing 1–50 of 55 results for author: Pan, J Z

Searching in archive cs. Search in all archives.
.
  1. Start from Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

    Authors: Wen Zhang, Ya**g Xu, Peng Ye, Zhiwei Huang, Zezhong Xu, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graph (KG) completion aims to find out missing triples in a KG. Some tasks, such as link prediction and instance completion, have been proposed for KG completion. They are triple-level tasks with some elements in a missing triple given to predict the missing element of the triple. However, knowing some elements of the missing triple in advance is not always a realistic setting. In this p… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Paper accepted by TKDE in 2024

  2. arXiv:2406.14282  [pdf, other

    cs.CL cs.AI

    Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

    Authors: Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, **jie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

    Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fin… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Work in progress

  3. arXiv:2406.05130  [pdf, other

    cs.CL

    An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models

    Authors: Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Multimodal large language models (MLLMs) fine-tuned with multimodal instruction datasets have demonstrated remarkable capabilities in multimodal tasks. However, fine-tuning all parameters of MLLMs has become challenging as they usually contain billions of parameters. To address this issue, we study parameter-efficient fine-tuning (PEFT) methods for MLLMs. We aim to identify effective methods for e… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACL finding 2024

  4. arXiv:2405.15984  [pdf, other

    cs.CL cs.AI

    Evaluating the Adversarial Robustness of Retrieval-Based In-Context Learning for Large Language Models

    Authors: Simon Chi Lok Yu, Jie He, Pasquale Minervini, Jeff Z. Pan

    Abstract: With the emergence of large language models, such as LLaMA and OpenAI GPT-3, In-Context Learning (ICL) gained significant attention due to its effectiveness and efficiency. However, ICL is very sensitive to the choice, order, and verbaliser used to encode the demonstrations in the prompt. Retrieval-Augmented ICL methods try to address this problem by leveraging retrievers to extract semantically r… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 29 pages, 6 figures

  5. arXiv:2405.13602  [pdf, other

    cs.AI cs.CL cs.LG

    COTET: Cross-view Optimal Transport for Knowledge Graph Entity Ty**

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Knowledge graph entity ty** (KGET) aims to infer missing entity type instances in knowledge graphs. Previous research has predominantly centered around leveraging contextual information associated with entities, which provides valuable clues for inference. However, they have long ignored the dual nature of information inherent in entities, encompassing both high-level coarse-grained cluster know… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  6. arXiv:2405.06524  [pdf, other

    cs.CL

    Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts

    Authors: Wenyu Huang, Guancheng Zhou, Mirella Lapata, Pavlos Vougiouklis, Sebastien Montella, Jeff Z. Pan

    Abstract: Although Large Language Models (LLMs) are effective in performing various NLP tasks, they still struggle to handle tasks that require extensive, real-world knowledge, especially when dealing with long-tail facts (facts related to long-tail entities). This limitation highlights the need to supplement LLMs with non-parametric knowledge. To address this issue, we analysed the effects of different typ… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  7. arXiv:2404.17590  [pdf, other

    cs.IR cs.AI

    Leveraging Intra-modal and Inter-modal Interaction for Multi-Modal Entity Alignment

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Multi-modal entity alignment (MMEA) aims to identify equivalent entity pairs across different multi-modal knowledge graphs (MMKGs). Existing approaches focus on how to better encode and aggregate information from different modalities. However, it is not trivial to leverage multi-modal knowledge in entity alignment due to the modal heterogeneity. In this paper, we propose a Multi-Grained Interactio… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  8. arXiv:2404.09848  [pdf, other

    cs.AI cs.LG

    HyperMono: A Monotonicity-aware Approach to Hyper-Relational Knowledge Representation

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: In a hyper-relational knowledge graph (HKG), each fact is composed of a main triple associated with attribute-value qualifiers, which express additional factual knowledge. The hyper-relational knowledge graph completion (HKGC) task aims at inferring plausible missing links in a HKG. Most existing approaches to HKGC focus on enhancing the communication between qualifier pairs and main triples, whil… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  9. arXiv:2404.01253  [pdf, other

    cs.CL

    UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

    Authors: Yijun Yang, Jie He, Pinzhen Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Several recent papers have investigated the potential of language models as knowledge bases as well as the existence of severe biases when extracting factual knowledge. In this work, we focus on the factual probing performance over unseen prompts from tuning, and using a probabilistic view we show the inherent misalignment between pre-training and downstream tuning objectives in language models fo… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  10. arXiv:2402.14901  [pdf, other

    cs.CL cs.AI

    A Usage-centric Take on Intent Understanding in E-Commerce

    Authors: Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan

    Abstract: Identifying and understanding user intents is a pivotal task for E-Commerce. Despite its popularity, intent understanding has not been consistently defined or accurately benchmarked. In this paper, we focus on predicative user intents as "how a customer uses a product", and pose intent understanding as a natural language reasoning task, independent of product ontologies. We identify two weaknesses… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  11. arXiv:2402.12554  [pdf, other

    cs.CL

    Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning

    Authors: Danna Zheng, Mirella Lapata, Jeff Z. Pan

    Abstract: We present Archer, a challenging bilingual text-to-SQL dataset specific to complex reasoning, including arithmetic, commonsense and hypothetical reasoning. It contains 1,042 English questions and 1,042 Chinese questions, along with 521 unique SQL queries, covering 20 English databases across 20 domains. Notably, this dataset demonstrates a significantly higher level of complexity compared to exist… ▽ More

    Submitted 24 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  12. arXiv:2402.12545  [pdf, other

    cs.CL

    TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

    Authors: Danna Zheng, Danyang Liu, Mirella Lapata, Jeff Z. Pan

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities across various domains, prompting a surge in their practical applications. However, concerns have arisen regarding the trustworthiness of LLMs outputs, particularly in closed-book question-answering tasks, where non-experts may struggle to identify inaccuracies due to the absence of contextual or ground truth information. This… ▽ More

    Submitted 6 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  13. arXiv:2402.05391  [pdf, other

    cs.AI cs.CV cs.IR cs.LG

    Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

    Authors: Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

    Abstract: Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the semantic web community's exploration into multi-modal dimensions unlocking new avenues for innovation. In this survey, we carefully review over 300 articles, focusing on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, where KGs support multi-modal tasks, and Multi-Modal Kno… ▽ More

    Submitted 26 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Ongoing work; 41 pages (Main Text), 55 pages (Total), 11 Tables, 13 Figures, 619 citations; Paper list is available at https://github.com/zjukg/KG-MM-Survey

  14. arXiv:2401.15820  [pdf, other

    cs.CV cs.AI

    Knowledge-Aware Neuron Interpretation for Scene Classification

    Authors: Yong Guan, Freddy Lecue, Jiaoyan Chen, Ru Li, Jeff Z. Pan

    Abstract: Although neural models have achieved remarkable performance, they still encounter doubts due to the intransparency. To this end, model prediction explanation is attracting more and more attentions. However, current methods rarely incorporate external knowledge and still suffer from three limitations: (1) Neglecting concept completeness. Merely selecting concepts may not sufficient for prediction.… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024

  15. arXiv:2401.14640  [pdf, other

    cs.CL

    Benchmarking Large Language Models in Complex Question Answering Attribution using Knowledge Graphs

    Authors: Nan Hu, Jiaoyan Chen, Yike Wu, Guilin Qi, Sheng Bi, Tongtong Wu, Jeff Z. Pan

    Abstract: The attribution of question answering is to provide citations for supporting generated statements, and has attracted wide research attention. The current methods for automatically evaluating the attribution, which are often based on Large Language Models (LLMs), are still inadequate, particularly in recognizing subtle differences between attributions, and complex relationships between citations an… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 13 pages, 5 figures

  16. arXiv:2401.13256  [pdf, other

    cs.CL cs.AI

    UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

    Authors: Hongru Wang, Wenyu Huang, Yang Deng, Rui Wang, Zezhong Wang, Yufei Wang, Fei Mi, Jeff Z. Pan, Kam-Fai Wong

    Abstract: Large Language Models (LLMs) has shown exceptional capabilities in many natual language understanding and generation tasks. However, the personalization issue still remains a much-coveted property, especially when it comes to the multiple sources involved in the dialogue system. To better plan and incorporate the use of multiple sources in generating personalized response, we firstly decompose it… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  17. arXiv:2312.01837  [pdf, other

    cs.CL

    Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model

    Authors: Yuxia Geng, Jiaoyan Chen, Yuhang Zeng, Zhuo Chen, Wen Zhang, Jeff Z. Pan, Yuxiang Wang, Xiaoliang Xu

    Abstract: Both graph structures and textual information play a critical role in Knowledge Graph Completion (KGC). With the success of Pre-trained Language Models (PLMs) such as BERT, they have been applied for text encoding for KGC. However, the current methods mostly prefer to fine-tune PLMs, leading to huge training costs and limited scalability to larger PLMs. In contrast, we propose to utilize prompts a… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: under review

  18. arXiv:2310.14050  [pdf, other

    cs.CL

    Code-Switching with Word Senses for Pretraining in Neural Machine Translation

    Authors: Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Z. Pan, Roberto Navigli

    Abstract: Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022). The same holds for the NMT pretraining paradigm of denoising synthetic "code-switched" text (Pan et al., 2021; Iyer et al., 2023), where word senses are ignored in the noising stage -- leading… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP (Findings) 2023 Long Paper

  19. arXiv:2310.12008  [pdf, other

    cs.CL cs.AI

    Multi-view Contrastive Learning for Entity Ty** over Knowledge Graphs

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Knowledge graph entity ty** (KGET) aims at inferring plausible types of entities in knowledge graphs. Existing approaches to KGET focus on how to better encode the knowledge provided by the neighbors and types of an entity into its representation. However, they ignore the semantic knowledge provided by the way in which types can be clustered together. In this paper, we propose a novel method cal… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Main

  20. arXiv:2310.05128  [pdf, other

    cs.CL cs.AI cs.LG

    Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification

    Authors: Simon Yu, Jie He, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Hierarchical multi-label text classification (HMTC) aims at utilizing a label hierarchy in multi-label classification. Recent approaches to HMTC deal with the problem of imposing an over-constrained premise on the output space by using contrastive learning on generated samples in a semi-supervised manner to bring text and label embeddings closer. However, the generation of samples tends to introdu… ▽ More

    Submitted 19 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 18 pages; 10 figures. Published as a conference paper at EMNLP 2023 Findings (Long Paper). Code and data available at https://github.com/simonucl/HJCL

  21. arXiv:2308.06512  [pdf, other

    cs.AI cs.CL

    HyperFormer: Enhancing Entity and Relation Interaction for Hyper-Relational Knowledge Graph Completion

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Hyper-relational knowledge graphs (HKGs) extend standard knowledge graphs by associating attribute-value qualifiers to triples, which effectively represent additional fine-grained information about its associated triple. Hyper-relational knowledge graph completion (HKGC) aims at inferring unknown triples while considering its qualifiers. Most existing approaches to HKGC exploit a global-level grap… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Accepted at CIKM'23

  22. arXiv:2308.06374  [pdf, other

    cs.AI cs.CL

    Large Language Models and Knowledge Graphs: Opportunities and Challenges

    Authors: Jeff Z. Pan, Simon Razniewski, Jan-Christoph Kalo, Sneha Singhania, Jiaoyan Chen, Stefan Dietze, Hajira Jabeen, Janna Omeliyanenko, Wen Zhang, Matteo Lissandrini, Russa Biswas, Gerard de Melo, Angela Bonifati, Edlira Vakaj, Mauro Dragoni, Damien Graux

    Abstract: Large Language Models (LLMs) have taken Knowledge Representation -- and the world -- by storm. This inflection point marks a shift from explicit knowledge representation to a renewed focus on the hybrid representation of both explicit knowledge and parametric knowledge. In this position paper, we will discuss some of the common debate points within the community on LLMs (parametric knowledge) and… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 30 pages

  23. arXiv:2307.16210  [pdf, other

    cs.AI cs.CV cs.LG cs.MM

    Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

    Authors: Zhuo Chen, Lingbing Guo, Yin Fang, Yichi Zhang, Jiaoyan Chen, Jeff Z. Pan, Yangning Li, Huajun Chen, Wen Zhang

    Abstract: As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information. However, existing MMEA approaches primarily concentrate on the fusion paradigm of multi-modal entity features, while neglecting the challenges presented by the pervasive phenomenon of missing an… ▽ More

    Submitted 1 August, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: International Semantic Web Conference '23 (ISWC 2023), https://github.com/zjukg/UMAEA

  24. arXiv:2305.15932  [pdf, other

    cs.CL

    BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering

    Authors: Jie He, Simon Chi Lok U, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Unsupervised commonsense reasoning (UCR) is becoming increasingly popular as the construction of commonsense reasoning datasets is expensive, and they are inevitably limited in their scope. A popular approach to UCR is to fine-tune language models with external knowledge (e.g., knowledge graphs), but this usually requires a large number of training examples. In this paper, we propose to transform… ▽ More

    Submitted 7 June, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL2023

  25. arXiv:2305.11527  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    InstructIE: A Bilingual Instruction-based Information Extraction Dataset

    Authors: Honghao Gui, Shuofei Qiao, **tian Zhang, Hongbin Ye, Mengshu Sun, Lei Liang, Jeff Z. Pan, Huajun Chen, Ningyu Zhang

    Abstract: Large language models can perform well on general natural language tasks, but their effectiveness is still not optimal for information extraction. Recent works indicate that the main reason lies in the lack of extensive data on information extraction instructions. Note that the existing datasets on information extraction instructions not only have limited coverage but also involve high constructio… ▽ More

    Submitted 18 April, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Work in progress; project homepage: https://www.zjukg.org/project/InstructIE/ dataset: https://huggingface.co/datasets/zjunlp/InstructIE

  26. arXiv:2303.10368  [pdf, other

    cs.CL

    An Empirical Study of Pre-trained Language Models in Simple Knowledge Graph Question Answering

    Authors: Nan Hu, Yike Wu, Guilin Qi, Dehai Min, Jiaoyan Chen, Jeff Z. Pan, Zafar Ali

    Abstract: Large-scale pre-trained language models (PLMs) such as BERT have recently achieved great success and become a milestone in natural language processing (NLP). It is now the consensus of the NLP community to adopt PLMs as the backbone for downstream tasks. In recent works on knowledge graph question answering (KGQA), BERT or its variants have become necessary in their KGQA models. However, there is… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted by World Wide Web Journal

  27. arXiv:2303.09189  [pdf, other

    cs.SI cs.CY

    Wiki-based Communities of Interest: Demographics and Outliers

    Authors: Hiba Arnaout, Simon Razniewski, Jeff Z. Pan

    Abstract: In this paper, we release data about demographic information and outliers of communities of interest. Identified from Wiki-based sources, mainly Wikidata, the data covers 7.5k communities, such as members of the White House Coronavirus Task Force, and 345k subjects, e.g., Deborah Birx. We describe the statistical inference methodology adopted to mine such data. We release subject-centric and group… ▽ More

    Submitted 17 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted to ICWSM 2023. For demo, see https://wikiknowledge.onrender.com/demographics/ and for dataset see https://doi.org/10.5281/zenodo.7410436

  28. arXiv:2302.01859  [pdf, other

    cs.CL

    Generalizing to Unseen Elements: A Survey on Knowledge Extrapolation for Knowledge Graphs

    Authors: Mingyang Chen, Wen Zhang, Yuxia Geng, Zezhong Xu, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graphs (KGs) have become valuable knowledge resources in various applications, and knowledge graph embedding (KGE) methods have garnered increasing attention in recent years. However, conventional KGE methods still face challenges when it comes to handling unseen entities or relations during model testing. To address this issue, much effort has been devoted to various fields of KGs. In t… ▽ More

    Submitted 16 December, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted to IJCAI 2023 Survey Track

  29. arXiv:2302.01849  [pdf, other

    cs.CL

    Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph Embedding

    Authors: Mingyang Chen, Wen Zhang, Zhen Yao, Yushan Zhu, Yang Gao, Jeff Z. Pan, Huajun Chen

    Abstract: We propose an entity-agnostic representation learning method for handling the problem of inefficient parameter storage costs brought by embedding knowledge graphs. Conventional knowledge graph embedding methods map elements in a knowledge graph, including entities and relations, into continuous vector spaces by assigning them one or multiple specific embeddings (i.e., vector representations). Thus… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted to AAAI 2023 conference

  30. arXiv:2212.14454  [pdf, other

    cs.AI cs.CL

    MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid

    Authors: Zhuo Chen, Jiaoyan Chen, Wen Zhang, Lingbing Guo, Yin Fang, Yufeng Huang, Yichi Zhang, Yuxia Geng, Jeff Z. Pan, Wenting Song, Huajun Chen

    Abstract: Multi-modal entity alignment (MMEA) aims to discover identical entities across different knowledge graphs (KGs) whose entities are associated with relevant images. However, current MMEA algorithms rely on KG-level modality fusion strategies for multi-modal entity representation, which ignores the variations of modality preferences of different entities, thus compromising robustness against noise i… ▽ More

    Submitted 30 July, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: ACM Multimedia 2023 Accpeted, Repo: https://github.com/zjukg/MEAformer

    Journal ref: ACM MM 2023

  31. arXiv:2210.11151  [pdf, other

    cs.AI

    Transformer-based Entity Ty** in Knowledge Graphs

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: We investigate the knowledge graph entity ty** task which aims at inferring plausible entity types. In this paper, we propose a novel Transformer-based Entity Ty** (TET) approach, effectively encoding the content of neighbors of an entity. More precisely, TET is composed of three different mechanisms: a local transformer allowing to infer missing types of an entity by independently encoding th… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Paper accepted at EMNLP 2022

  32. arXiv:2210.03994  [pdf, other

    cs.AI

    Relational Message Passing for Fully Inductive Knowledge Graph Completion

    Authors: Yuxia Geng, Jiaoyan Chen, Jeff Z. Pan, Mingyang Chen, Song Jiang, Wen Zhang, Huajun Chen

    Abstract: In knowledge graph completion (KGC), predicting triples involving emerging entities and/or relations, which are unseen when the KG embeddings are learned, has become a critical challenge. Subgraph reasoning with message passing is a promising and popular solution. Some recent methods have achieved good performance, but they (i) usually can only predict triples involving unseen entities alone, fail… ▽ More

    Submitted 30 December, 2022; v1 submitted 8 October, 2022; originally announced October 2022.

    Comments: Accepted by ICDE 2023

  33. arXiv:2209.15214  [pdf, other

    cs.AI cs.CL cs.IR cs.LG

    Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph

    Authors: Shumin Deng, Chengming Wang, Zhoubo Li, Ningyu Zhang, Zelin Dai, Hehong Chen, Feiyu Xiong, Ming Yan, Qiang Chen, Mosha Chen, Jiaoyan Chen, Jeff Z. Pan, Bryan Hooi, Huajun Chen

    Abstract: Business Knowledge Graphs (KGs) are important to many enterprises today, providing factual knowledge and structured data that steer many products and make them more intelligent. Despite their promising benefits, building business KG necessitates solving prohibitive issues of deficient structure and multiple modalities. In this paper, we advance the understanding of the practical challenges related… ▽ More

    Submitted 19 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: OpenBG. Accepted by ICDE 2023. The project is released at https://github.com/OpenBGBenchmark/OpenBG . Website: https://kg.alibaba.com/ , Leaderboard: https://tianchi.aliyun.com/dataset/dataDetail?dataId=122271

  34. arXiv:2208.12539  [pdf, other

    cs.CL

    Task-specific Pre-training and Prompt Decomposition for Knowledge Graph Population with Language Models

    Authors: Tianyi Li, Wenyu Huang, Nikos Papasarantopoulos, Pavlos Vougiouklis, Jeff Z. Pan

    Abstract: We present a system for knowledge graph population with Language Models, evaluated on the Knowledge Base Construction from Pre-trained Language Models (LM-KBC) challenge at ISWC 2022. Our system involves task-specific pre-training to improve LM representation of the masked object tokens, prompt decomposition for progressive generation of candidate objects, among other methods for higher-quality re… ▽ More

    Submitted 31 August, 2022; v1 submitted 26 August, 2022; originally announced August 2022.

    Comments: To appear in ISWC 2022

  35. arXiv:2208.09417  [pdf, other

    cs.CV cs.AI

    Target-oriented Sentiment Classification with Sequential Cross-modal Semantic Graph

    Authors: Yufeng Huang, Zhuo Chen, Jiaoyan Chen, Jeff Z. Pan, Zhen Yao, Wen Zhang

    Abstract: Multi-modal aspect-based sentiment classification (MABSC) is task of classifying the sentiment of a target entity mentioned in a sentence and an image. However, previous methods failed to account for the fine-grained semantic association between the image and the text, which resulted in limited identification of fine-grained image aspects and opinions. To address these limitations, in this paper w… ▽ More

    Submitted 23 July, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: ICANN 2023, https://github.com/zjukg/SeqCSG

  36. UnCommonSense: Informative Negative Knowledge about Everyday Concepts

    Authors: Hiba Arnaout, Simon Razniewski, Gerhard Weikum, Jeff Z. Pan

    Abstract: Commonsense knowledge about everyday concepts is an important asset for AI applications, such as question answering and chatbots. Recently, we have seen an increasing interest in the construction of structured commonsense knowledge bases (CSKBs). An important part of human commonsense is about properties that do not apply to concepts, yet existing CSKBs only store positive statements. Moreover, si… ▽ More

    Submitted 5 September, 2022; v1 submitted 19 August, 2022; originally announced August 2022.

  37. arXiv:2207.01328  [pdf, other

    cs.CV cs.AI

    DUET: Cross-modal Semantic Grounding for Contrastive Zero-shot Learning

    Authors: Zhuo Chen, Yufeng Huang, Jiaoyan Chen, Yuxia Geng, Wen Zhang, Yin Fang, Jeff Z. Pan, Huajun Chen

    Abstract: Zero-shot learning (ZSL) aims to predict unseen classes whose samples have never appeared during training. One of the most effective and widely used semantic information for zero-shot image classification are attributes which are annotations for class-level visual characteristics. However, the current methods often fail to discriminate those subtle visual distinctions between images due to not onl… ▽ More

    Submitted 16 February, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: AAAI 2023 (Oral). Repository: https://github.com/zjukg/DUET

  38. arXiv:2206.03739  [pdf, other

    cs.AI cs.CV cs.LG

    Disentangled Ontology Embedding for Zero-shot Learning

    Authors: Yuxia Geng, Jiaoyan Chen, Wen Zhang, Ya**g Xu, Zhuo Chen, Jeff Z. Pan, Yufeng Huang, Feiyu Xiong, Huajun Chen

    Abstract: Knowledge Graph (KG) and its variant of ontology have been widely used for knowledge representation, and have shown to be quite effective in augmenting Zero-shot Learning (ZSL). However, existing ZSL methods that utilize KGs all neglect the intrinsic complexity of inter-class relationships represented in KGs. One typical feature is that a class is often related to other classes in different semant… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted by KDD'22

  39. arXiv:2205.00782  [pdf, other

    cs.AI cs.LG

    Type-aware Embeddings for Multi-Hop Reasoning over Knowledge Graphs

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Xiaoli Li, Ru Li, Jeff Z. Pan

    Abstract: Multi-hop reasoning over real-life knowledge graphs (KGs) is a highly challenging problem as traditional subgraph matching methods are not capable to deal with noise and missing information. To address this problem, it has been recently introduced a promising approach based on jointly embedding logical queries and KGs into a low-dimensional space to identify answer entities. However, existing prop… ▽ More

    Submitted 4 July, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted to IJCAI-ECAI 2022

  40. arXiv:2202.07412  [pdf, ps, other

    cs.AI

    Knowledge Graph Reasoning with Logics and Embeddings: Survey and Perspective

    Authors: Wen Zhang, Jiaoyan Chen, Juan Li, Zezhong Xu, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graph (KG) reasoning is becoming increasingly popular in both academia and industry. Conventional KG reasoning based on symbolic logic is deterministic, with reasoning results being explainable, while modern embedding-based reasoning can deal with uncertainty and predict plausible knowledge, often with high efficiency via vector computation. A promising direction is to integrate both log… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: This is a survey of Knowledge Graph Reasoning with Logics and Embeddings. We discuss methods from diverse perspectives

  41. arXiv:2112.10006  [pdf, other

    cs.LG cs.AI

    Zero-shot and Few-shot Learning with Knowledge Graphs: A Comprehensive Survey

    Authors: Jiaoyan Chen, Yuxia Geng, Zhuo Chen, Jeff Z. Pan, Yuan He, Wen Zhang, Ian Horrocks, Huajun Chen

    Abstract: Machine learning especially deep neural networks have achieved great success but many of them often rely on a number of labeled samples for supervision. As sufficient labeled training data are not always ready due to e.g., continuously emerging prediction targets and costly sample annotation in real world applications, machine learning with sample shortage is now being widely investigated. Among a… ▽ More

    Submitted 3 December, 2022; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: A survey on Zero-shot and Few-shot Learning with Knowledge Graph. It has collected 96 ZSL and FSL papers on this topic, with 11 figures and 4 tables

  42. arXiv:2108.12239  [pdf, ps, other

    cs.LO cs.AI

    Geometric Models for (Temporally) Attributed Description Logics

    Authors: Camille Bourgaux, Ana Ozaki, Jeff Z. Pan

    Abstract: In the search for knowledge graph embeddings that could capture ontological knowledge, geometric models of existential rules have been recently introduced. It has been shown that convex geometric regions capture the so-called quasi-chained rules. Attributed description logics (DL) have been defined to bridge the gap between DL languages and knowledge graphs, whose facts often come with various kin… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Long version of a DL 2021 paper. 24 pages

  43. arXiv:2107.05348  [pdf, other

    cs.AI

    Zero-shot Visual Question Answering using Knowledge Graph

    Authors: Zhuo Chen, Jiaoyan Chen, Yuxia Geng, Jeff Z. Pan, Zonggang Yuan, Huajun Chen

    Abstract: Incorporating external knowledge to Visual Question Answering (VQA) has become a vital practical need. Existing methods mostly adopt pipeline approaches with different components for knowledge matching and extraction, feature learning, etc.However, such pipeline approaches suffer when some component does not perform well, which leads to error propagation and poor overall performance. Furthermore,… ▽ More

    Submitted 17 October, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: accepted at the International Semantic Web Conference '21 (ISWC 2021)

  44. arXiv:2106.15047  [pdf, other

    cs.AI

    Benchmarking Knowledge-driven Zero-shot Learning

    Authors: Yuxia Geng, Jiaoyan Chen, Xiang Zhuang, Zhuo Chen, Jeff Z. Pan, Juan Li, Zonggang Yuan, Huajun Chen

    Abstract: External knowledge (a.k.a. side information) plays a critical role in zero-shot learning (ZSL) which aims to predict with unseen classes that have never appeared in training data. Several kinds of external knowledge, such as text and attribute, have been widely investigated, but they alone are limited with incomplete semantics. Some very recent studies thus propose to use Knowledge Graph (KG) due… ▽ More

    Submitted 27 October, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Published in Journal of Web Semantics, 2022. Final version please refer to our Github repository!

  45. arXiv:2103.00070  [pdf, ps, other

    cs.AI cs.LG

    Knowledge-aware Zero-Shot Learning: Survey and Perspective

    Authors: Jiaoyan Chen, Yuxia Geng, Zhuo Chen, Ian Horrocks, Jeff Z. Pan, Huajun Chen

    Abstract: Zero-shot learning (ZSL) which aims at predicting classes that have never appeared during the training using external knowledge (a.k.a. side information) has been widely investigated. In this paper we present a literature review towards ZSL in the perspective of external knowledge, where we categorize the external knowledge, review their methods and compare different external knowledge. With the l… ▽ More

    Submitted 10 May, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: Accepted by IJCAI'21 Survey Track

  46. arXiv:2102.07339  [pdf, other

    cs.AI

    OntoZSL: Ontology-enhanced Zero-shot Learning

    Authors: Yuxia Geng, Jiaoyan Chen, Zhuo Chen, Jeff Z. Pan, Zhiquan Ye, Zonggang Yuan, Yantao Jia, Huajun Chen

    Abstract: Zero-shot Learning (ZSL), which aims to predict for those classes that have never appeared in the training data, has arisen hot research interests. The key of implementing ZSL is to leverage the prior knowledge of classes which builds the semantic relationship between classes and enables the transfer of the learned models (e.g., features) from training classes (i.e., seen classes) to unseen classe… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: Accepted to The Web Conference (WWW) 2021

  47. arXiv:2006.16917  [pdf, other

    cs.AI cs.LG

    Ontology-guided Semantic Composition for Zero-Shot Learning

    Authors: Jiaoyan Chen, Freddy Lecue, Yuxia Geng, Jeff Z. Pan, Huajun Chen

    Abstract: Zero-shot learning (ZSL) is a popular research problem that aims at predicting for those classes that have never appeared in the training stage by utilizing the inter-class relationship with some side information. In this study, we propose to model the compositional and expressive semantics of class labels by an OWL (Web Ontology Language) ontology, and further develop a new ZSL framework with ont… ▽ More

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Accepted by KR 2020 - 17th International Conference on Principles of Knowledge Representation and Reasoning

  48. arXiv:2005.09170  [pdf, other

    cs.LG stat.ML

    On Intrinsic Dataset Properties for Adversarial Machine Learning

    Authors: Jeffrey Z. Pan, Nicholas Zufelt

    Abstract: Deep neural networks (DNNs) have played a key role in a wide range of machine learning applications. However, DNN classifiers are vulnerable to human-imperceptible adversarial perturbations, which can cause them to misclassify inputs with high confidence. Thus, creating robust DNNs which can defend against malicious examples is critical in applications where security plays a major role. In this pa… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 12 pages, 9 figures

  49. arXiv:2001.04425  [pdf, other

    cs.IR cs.AI cs.CL cs.DB

    Negative Statements Considered Useful

    Authors: Hiba Arnaout, Simon Razniewski, Gerhard Weikum, Jeff Z. Pan

    Abstract: Knowledge bases (KBs) about notable entities and their properties are an important asset in applications such as search, question answering and dialogue. All popular KBs capture virtually only positive statements, and abstain from taking any stance on statements not stored in the KB. This paper makes the case for explicitly stating salient statements that do not hold. Negative statements are usefu… ▽ More

    Submitted 25 September, 2021; v1 submitted 13 January, 2020; originally announced January 2020.

    Journal ref: Journal of Web Semantics (JWS), Volume 71, 2021

  50. arXiv:1907.01183  [pdf, other

    cs.IR cs.DB

    A Framework for Evaluating Snippet Generation for Dataset Search

    Authors: Xiaxia Wang, **chi Chen, Shuxin Li, Gong Cheng, Jeff Z. Pan, Evgeny Kharlamov, Yuzhong Qu

    Abstract: Reusing existing datasets is of considerable significance to researchers and developers. Dataset search engines help a user find relevant datasets for reuse. They can present a snippet for each retrieved dataset to explain its relevance to the user's data needs. This emerging problem of snippet generation for dataset search has not received much research attention. To provide a basis for future re… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: 17 pages, to appear at the research track of the 18th International Semantic Web Conference (ISWC 2019)