Skip to main content

Showing 1–50 of 53 results for author: Xiong, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01178  [pdf, other

    cs.CL cs.AI cs.LG

    $\text{Memory}^3$: Language Modeling with Explicit Memory

    Authors: Hongkang Yang, Zehao Lin, Wen** Wang, Hao Wu, Zhiyu Li, Bo Tang, Wenqiang Wei, **bo Wang, Zeyun Tang, Shichao Song, Chenyang Xi, Yu Yu, Kai Chen, Feiyu Xiong, Linpeng Tang, Weinan E

    Abstract: The training and inference of large language models (LLMs) are together a costly process that transports knowledge from raw data to meaningful computation. Inspired by the memory hierarchy of the human brain, we reduce this cost by equip** LLMs with explicit memory, a memory format cheaper than model parameters and text retrieval-augmented generation (RAG). Conceptually, with most of its knowled… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2407.00668  [pdf, other

    cs.CL

    HRDE: Retrieval-Augmented Large Language Models for Chinese Health Rumor Detection and Explainability

    Authors: Yanfang Chen, Ding Chen, Shichao Song, Simin Niu, Hanyu Wang, Zeyun Tang, Feiyu Xiong, Zhiyu Li

    Abstract: As people increasingly prioritize their health, the speed and breadth of health information dissemination on the internet have also grown. At the same time, the presence of false health information (health rumors) intermingled with genuine content poses a significant potential threat to public health. However, current research on Chinese health rumors still lacks a large-scale, public, and open-so… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.16069  [pdf, other

    cs.CL cs.AI

    FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

    Authors: Junyi Zhu, Shuochen Liu, Yu Yu, Bo Tang, Yibo Yan, Zhiyu Li, Feiyu Xiong, Tong Xu, Matthew B. Blaschko

    Abstract: Large language models (LLMs) excel in generating coherent text, but they often struggle with context awareness, leading to inaccuracies in tasks requiring faithful adherence to provided information. We introduce FastMem, a novel method designed to enhance instruction fine-tuned LLMs' context awareness through fast memorization of the prompt. FastMem maximizes the likelihood of the prompt before in… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2405.20763  [pdf, other

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Haotian He, **bo Wang, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 35 pages

  5. arXiv:2405.16933  [pdf, other

    cs.CL cs.IR

    Empowering Large Language Models to Set up a Knowledge Retrieval Indexer via Self-Learning

    Authors: Xun Liang, Simin Niu, Zhiyu li, Sensen Zhang, Shichao Song, Hanyu Wang, Jiawei Yang, Feiyu Xiong, Bo Tang, Chenyang Xi

    Abstract: Retrieval-Augmented Generation (RAG) offers a cost-effective approach to injecting real-time knowledge into large language models (LLMs). Nevertheless, constructing and validating high-quality knowledge repositories require considerable effort. We propose a pre-retrieval framework named Pseudo-Graph Retrieval-Augmented Generation (PG-RAG), which conceptualizes LLMs as students by providing them wi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.11874  [pdf, other

    cs.CL

    xFinder: Robust and Pinpoint Answer Extraction for Large Language Models

    Authors: Qingchen Yu, Zifan Zheng, Shichao Song, Zhiyu Li, Feiyu Xiong, Bo Tang, Ding Chen

    Abstract: The continuous advancement of large language models (LLMs) has brought increasing attention to the critical issue of develo** fair and reliable methods for evaluating their performance. Particularly, the emergence of subjective or non-subjective cheating phenomena, such as test set leakage and prompt format overfitting, poses significant challenges to the reliable evaluation of LLMs. Since evalu… ▽ More

    Submitted 23 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 37 Pages

  7. arXiv:2405.01726  [pdf, ps, other

    eess.IV cs.CV cs.LG

    SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising

    Authors: Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou

    Abstract: Denoising is a crucial preprocessing step for hyperspectral images (HSIs) due to noise arising from intraimaging mechanisms and environmental factors. Long-range spatial-spectral correlation modeling is beneficial for HSI denoising but often comes with high computational complexity. Based on the state space model (SSM), Mamba is known for its remarkable long-range dependency modeling capabilities… ▽ More

    Submitted 20 June, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  8. arXiv:2404.06926  [pdf, other

    cs.RO

    Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting

    Authors: Xiaolei Lang, Laijian Li, Hang Zhang, Feng Xiong, Mu Xu, Yong Liu, Xingxing Zuo, Jiajun Lv

    Abstract: We present a real-time LiDAR-Inertial-Camera SLAM system with 3D Gaussian Splatting as the map** backend. Leveraging robust pose estimates from our LiDAR-Inertial-Camera odometry, Coco-LIC, an incremental photo-realistic map** system is proposed in this paper. We initialize 3D Gaussians from colorized LiDAR points and optimize them using differentiable rendering powered by 3D Gaussian Splattin… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Submitted to IROS 2024

  9. arXiv:2403.12839  [pdf, other

    cs.CV

    Global-guided Focal Neural Radiance Field for Large-scale Scene Rendering

    Authors: Mingqi Shao, Feng Xiong, Hang Zhang, Shuang Yang, Mu Xu, Wei Bian, Xueqian Wang

    Abstract: Neural radiance fields~(NeRF) have recently been applied to render large-scale scenes. However, their limited model capacity typically results in blurred rendering results. Existing large-scale NeRFs primarily address this limitation by partitioning the scene into blocks, which are subsequently handled by separate sub-NeRFs. These sub-NeRFs, trained from scratch and processed independently, lead t… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  10. arXiv:2403.04283  [pdf, other

    cs.CL cs.AI cs.LG

    Proxy-RLHF: Decoupling Generation and Alignment in Large Language Model with Proxy

    Authors: Yu Zhu, Chuxiong Sun, Wenfei Yang, Wenqiang Wei, Bo Tang, Tianzhu Zhang, Zhiyu Li, Shifeng Zhang, Feiyu Xiong, Jie Hu, Mingchuan yang

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is the prevailing approach to ensure Large Language Models (LLMs) align with human values. However, existing RLHF methods require a high computational cost, one main reason being that RLHF assigns both the generation and alignment tasks to the LLM simultaneously. In this paper, we introduce Proxy-RLHF, which decouples the generation and alignment p… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  11. arXiv:2403.00862  [pdf, other

    cs.CL cs.AI

    NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism

    Authors: Miao Li, Ming-Bin Chen, Bo Tang, Shengbin Hou, Pengyu Wang, Haiying Deng, Zhiyu Li, Feiyu Xiong, Keming Mao, Peng Cheng, Yi Luo

    Abstract: We present NewsBench, a novel evaluation framework to systematically assess the capabilities of Large Language Models (LLMs) for editorial capabilities in Chinese journalism. Our constructed benchmark dataset is focused on four facets of writing proficiency and six facets of safety adherence, and it comprises manually and carefully designed 1,267 test samples in the types of multiple choice questi… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Long paper, ACL 2024 Main

  12. arXiv:2402.11218  [pdf, other

    cs.CL

    Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs

    Authors: Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu Li, Feiyu Xiong, Bo Tang

    Abstract: Controlled Text Generation (CTG) aims to produce texts that exhibit specific desired attributes. In this study, we introduce a pluggable CTG framework for Large Language Models (LLMs) named Dynamic Attribute Graphs-based controlled text generation (DATG). This framework utilizes an attribute scorer to evaluate the attributes of sentences generated by LLMs and constructs dynamic attribute graphs. D… ▽ More

    Submitted 24 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 18 Pages, Accepted by ACL 2024 Findings

  13. arXiv:2402.07744  [pdf, other

    cs.AI cs.CL cs.LG

    Towards Unified Alignment Between Agents, Humans, and Environment

    Authors: Zonghan Yang, An Liu, Zijun Liu, Kaiming Liu, Fangzhou Xiong, Yile Wang, Zeyuan Yang, Qingyuan Hu, Xinrui Chen, Zhenhe Zhang, Fuwen Luo, Zhicheng Guo, Peng Li, Yang Liu

    Abstract: The rapid progress of foundation models has led to the prosperity of autonomous agents, which leverage the universal capabilities of foundation models to conduct reasoning, decision-making, and environmental interaction. However, the efficacy of agents remains limited when operating in intricate, realistic environments. In this work, we introduce the principles of $\mathbf{U}$nified $\mathbf{A}$li… ▽ More

    Submitted 14 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Project webpage: https://agent-force.github.io/unified-alignment-for-agents.html

  14. arXiv:2401.17043  [pdf, other

    cs.CL

    CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models

    Authors: Yuanjie Lyu, Zhiyu Li, Simin Niu, Feiyu Xiong, Bo Tang, Wen** Wang, Hao Wu, Huanyong Liu, Tong Xu, Enhong Chen, Yi Luo, Peng Cheng, Haiying Deng, Zhonghao Wang, Zijia Lu

    Abstract: Retrieval-Augmented Generation (RAG) is a technique that enhances the capabilities of large language models (LLMs) by incorporating external knowledge sources. This method addresses common LLM limitations, including outdated information and the tendency to produce inaccurate "hallucinated" content. However, the evaluation of RAG systems is challenging, as existing benchmarks are limited in scope a… ▽ More

    Submitted 18 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 26 Pages

  15. arXiv:2401.12326  [pdf, other

    cs.CL cs.AI

    Fine-tuning Large Language Models for Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection

    Authors: Feng Xiong, Thanet Markchom, Ziwei Zheng, Subin Jung, Varun Ojha, Huizhi Liang

    Abstract: SemEval-2024 Task 8 introduces the challenge of identifying machine-generated texts from diverse Large Language Models (LLMs) in various languages and domains. The task comprises three subtasks: binary classification in monolingual and multilingual (Subtask A), multi-class classification (Subtask B), and mixed text detection (Subtask C). This paper focuses on Subtask A & B. Each subtask is support… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  16. Hyperspectral Image Denoising via Spatial-Spectral Recurrent Transformer

    Authors: Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Jiantao Zhou, Yuntao Qian

    Abstract: Hyperspectral images (HSIs) often suffer from noise arising from both intra-imaging mechanisms and environmental factors. Leveraging domain knowledge specific to HSIs, such as global spectral correlation (GSC) and non-local spatial self-similarity (NSS), is crucial for effective denoising. Existing methods tend to independently utilize each of these knowledge components with multiple blocks, overl… ▽ More

    Submitted 8 January, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

  17. arXiv:2401.03385  [pdf, other

    cs.CL

    Grimoire is All You Need for Enhancing Large Language Models

    Authors: Ding Chen, Shichao Song, Qingchen Yu, Zhiyu Li, Wen** Wang, Feiyu Xiong, Bo Tang

    Abstract: In-context Learning (ICL) is one of the key methods for enhancing the performance of large language models on specific tasks by providing a set of few-shot examples. However, the ICL capability of different types of models shows significant variation due to factors such as model architecture, volume of learning data, and the size of parameters. Generally, the larger the model's parameter size and… ▽ More

    Submitted 10 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: 9 pages

  18. arXiv:2311.15296  [pdf, other

    cs.CL

    UHGEval: Benchmarking the Hallucination of Chinese Large Language Models via Unconstrained Generation

    Authors: Xun Liang, Shichao Song, Simin Niu, Zhiyu Li, Feiyu Xiong, Bo Tang, Yezhaohui Wang, Dawei He, Peng Cheng, Zhonghao Wang, Haiying Deng

    Abstract: Large language models (LLMs) have emerged as pivotal contributors in contemporary natural language processing and are increasingly being applied across a diverse range of industries. However, these large-scale probabilistic statistical models cannot currently ensure the requisite quality in professional content generation. These models often produce hallucinated text, compromising their practical… ▽ More

    Submitted 23 May, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: Accepted by ACL 2024

  19. arXiv:2304.09048  [pdf, other

    cs.CL cs.AI cs.IR cs.LG cs.SE

    CodeKGC: Code Language Model for Generative Knowledge Graph Construction

    Authors: Zhen Bi, **g Chen, Yinuo Jiang, Feiyu Xiong, Wei Guo, Huajun Chen, Ningyu Zhang

    Abstract: Current generative knowledge graph construction approaches usually fail to capture structural knowledge by simply flattening natural language into serialized texts or a specification language. However, large generative language model trained on structured data such as code has demonstrated impressive capability in understanding natural language for structural prediction and reasoning tasks. Intuit… ▽ More

    Submitted 18 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: ACM Transactions on Asian and Low-Resource Language Information Processing

  20. arXiv:2304.06925   

    cs.CV cs.AI

    YOLO-Drone:Airborne real-time detection of dense small objects from high-altitude perspective

    Authors: Li Zhu, Jiahui Xiong, Feng Xiong, Hanzheng Hu, Zhengnan Jiang

    Abstract: Unmanned Aerial Vehicles (UAVs), specifically drones equipped with remote sensing object detection technology, have rapidly gained a broad spectrum of applications and emerged as one of the primary research focuses in the field of computer vision. Although UAV remote sensing systems have the ability to detect various objects, small-scale objects can be challenging to detect reliably due to factors… ▽ More

    Submitted 10 October, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Some contributing authors are not signed

  21. arXiv:2303.02959  [pdf, other

    cs.CV cs.MM eess.IV

    Butterfly: Multiple Reference Frames Feature Propagation Mechanism for Neural Video Compression

    Authors: Feng Wang, Haihang Ruan, Fei Xiong, Jiayu Yang, Litian Li, Ronggang Wang

    Abstract: Using more reference frames can significantly improve the compression efficiency in neural video compression. However, in low-latency scenarios, most existing neural video compression frameworks usually use the previous one frame as reference. Or a few frameworks which use the previous multiple frames as reference only adopt a simple multi-reference frames propagation mechanism. In this paper, we… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted by DCC 2023

  22. arXiv:2211.07504  [pdf, other

    cs.CL cs.AI cs.CV cs.IR cs.LG

    On Analyzing the Role of Image for Visual-enhanced Relation Extraction

    Authors: Lei Li, Xiang Chen, Shuofei Qiao, Feiyu Xiong, Huajun Chen, Ningyu Zhang

    Abstract: Multimodal relation extraction is an essential task for knowledge graph construction. In this paper, we take an in-depth empirical analysis that indicates the inaccurate information in the visual scene graph leads to poor modal alignment weights, further degrading performance. Moreover, the visual shuffle experiments illustrate that the current approaches may not take full advantage of visual info… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023 (Student Abstract)

  23. arXiv:2209.15214  [pdf, other

    cs.AI cs.CL cs.IR cs.LG

    Construction and Applications of Billion-Scale Pre-Trained Multimodal Business Knowledge Graph

    Authors: Shumin Deng, Chengming Wang, Zhoubo Li, Ningyu Zhang, Zelin Dai, Hehong Chen, Feiyu Xiong, Ming Yan, Qiang Chen, Mosha Chen, Jiaoyan Chen, Jeff Z. Pan, Bryan Hooi, Huajun Chen

    Abstract: Business Knowledge Graphs (KGs) are important to many enterprises today, providing factual knowledge and structured data that steer many products and make them more intelligent. Despite their promising benefits, building business KG necessitates solving prohibitive issues of deficient structure and multiple modalities. In this paper, we advance the understanding of the practical challenges related… ▽ More

    Submitted 19 March, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: OpenBG. Accepted by ICDE 2023. The project is released at https://github.com/OpenBGBenchmark/OpenBG . Website: https://kg.alibaba.com/ , Leaderboard: https://tianchi.aliyun.com/dataset/dataDetail?dataId=122271

  24. arXiv:2207.07790  [pdf, other

    cs.LG cs.IR

    BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion

    Authors: Fanglin Chen, Xiao Liu, Bo Tang, Feiyu Xiong, Serim Hwang, Guomian Zhuang

    Abstract: We utilize an offline reinforcement learning (RL) model for sequential targeted promotion in the presence of budget constraints in a real-world business environment. In our application, the mobile app aims to boost customer retention by sending cash bonuses to customers and control the costs of such cash bonuses during each time period. To achieve the multi-task goal, we propose the Budget Constra… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: 8 pages, DRL4IR@SIGIR

  25. arXiv:2206.03739  [pdf, other

    cs.AI cs.CV cs.LG

    Disentangled Ontology Embedding for Zero-shot Learning

    Authors: Yuxia Geng, Jiaoyan Chen, Wen Zhang, Ya**g Xu, Zhuo Chen, Jeff Z. Pan, Yufeng Huang, Feiyu Xiong, Huajun Chen

    Abstract: Knowledge Graph (KG) and its variant of ontology have been widely used for knowledge representation, and have shown to be quite effective in augmenting Zero-shot Learning (ZSL). However, existing ZSL methods that utilize KGs all neglect the intrinsic complexity of inter-class relationships represented in KGs. One typical feature is that a class is often related to other classes in different semant… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted by KDD'22

  26. arXiv:2205.10852  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Relphormer: Relational Graph Transformer for Knowledge Graph Representations

    Authors: Zhen Bi, Siyuan Cheng, **g Chen, Xiaozhuan Liang, Feiyu Xiong, Ningyu Zhang

    Abstract: Transformers have achieved remarkable performance in widespread fields, including natural language processing, computer vision and graph mining. However, vanilla Transformer architectures have not yielded promising improvements in the Knowledge Graph (KG) representations, where the translational distance paradigm dominates this area. Note that vanilla Transformer architectures struggle to capture… ▽ More

    Submitted 21 November, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Neurocomputing 2023

  27. arXiv:2205.10362  [pdf, ps, other

    cs.LG cs.AI

    FIND:Explainable Framework for Meta-learning

    Authors: Xinyue Shao, Hongzhi Wang, Xiao Zhu, Feng Xiong

    Abstract: Meta-learning is used to efficiently enable the automatic selection of machine learning models by combining data and prior knowledge. Since the traditional meta-learning technique lacks explainability, as well as shortcomings in terms of transparency and fairness, achieving explainability for meta-learning is crucial. This paper proposes FIND, an interpretable meta-learning framework that not only… ▽ More

    Submitted 12 June, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  28. arXiv:2205.05889  [pdf, other

    cs.CL cs.AI cs.DB

    Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction

    Authors: Tianshu Wang, Hongyu Lin, Cheng Fu, Xianpei Han, Le Sun, Feiyu Xiong, Hui Chen, Minlong Lu, Xiuwen Zhu

    Abstract: Entity matching (EM) is the most critical step for entity resolution (ER). While current deep learningbased methods achieve very impressive performance on standard EM benchmarks, their realworld application performance is much frustrating. In this paper, we highlight that such the gap between reality and ideality stems from the unreasonable benchmark construction process, which is inconsistent wit… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted to IJCAI2022

  29. arXiv:2202.12571  [pdf, other

    cs.LG cs.AI cs.CL

    NeuralKG: An Open Source Library for Diverse Representation Learning of Knowledge Graphs

    Authors: Wen Zhang, Xiangnan Chen, Zhen Yao, Mingyang Chen, Yushan Zhu, Hongtao Yu, Yufeng Huang, Zezhong Xu, Ya**g Xu, Ningyu Zhang, Zonggang Yuan, Feiyu Xiong, Huajun Chen

    Abstract: NeuralKG is an open-source Python-based library for diverse representation learning of knowledge graphs. It implements three different series of Knowledge Graph Embedding (KGE) methods, including conventional KGEs, GNN-based KGEs, and Rule-based KGEs. With a unified framework, NeuralKG successfully reproduces link prediction results of these methods on benchmarks, freeing users from the laborious… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: work in progress

  30. arXiv:2202.02113  [pdf, other

    cs.CL cs.AI cs.DB cs.IR cs.LG

    From Discrimination to Generation: Knowledge Graph Completion with Generative Transformer

    Authors: Xin Xie, Ningyu Zhang, Zhoubo Li, Shumin Deng, Hui Chen, Feiyu Xiong, Mosha Chen, Huajun Chen

    Abstract: Knowledge graph completion aims to address the problem of extending a KG with missing triples. In this paper, we provide an approach GenKGC, which converts knowledge graph completion to sequence-to-sequence generation task with the pre-trained language model. We further introduce relation-guided demonstration and entity-aware hierarchical decoding for better representation learning and fast infere… ▽ More

    Submitted 14 March, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: Accepted by WWW 2022 Poster

  31. Building time-surfaces by exploiting the complex volatility of an ECRAM memristor

    Authors: Marco Rasetto, Qingzhou Wan, Himanshu Akolkar, Feng Xiong, Bertram Shi, Ryad Benosman

    Abstract: Memristors have emerged as a promising technology for efficient neuromorphic architectures owing to their ability to act as programmable synapses, combining processing and memory into a single device. Although they are most commonly used for static encoding of synaptic weights, recent work has begun to investigate the use of their dynamical properties, such as Short Term Plasticity (STP), to integ… ▽ More

    Submitted 15 April, 2024; v1 submitted 29 January, 2022; originally announced January 2022.

  32. arXiv:2201.11332  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Ontology-enhanced Prompt-tuning for Few-shot Learning

    Authors: Hongbin Ye, Ningyu Zhang, Shumin Deng, Xiang Chen, Hui Chen, Feiyu Xiong, Xi Chen, Huajun Chen

    Abstract: Few-shot Learning (FSL) is aimed to make predictions based on a limited number of samples. Structured data such as knowledge graphs and ontology libraries has been leveraged to benefit the few-shot setting in various tasks. However, the priors adopted by the existing methods suffer from challenging knowledge missing, knowledge noise, and knowledge heterogeneity, which hinder the performance for fe… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Accepted by WWW2022

  33. arXiv:2201.06206  [pdf, other

    cs.CL cs.SI

    SQUIRE: A Sequence-to-sequence Framework for Multi-hop Knowledge Graph Reasoning

    Authors: Yushi Bai, Xin Lv, Juanzi Li, Lei Hou, Yincen Qu, Zelin Dai, Feiyu Xiong

    Abstract: Multi-hop knowledge graph (KG) reasoning has been widely studied in recent years to provide interpretable predictions on missing links with evidential paths. Most previous works use reinforcement learning (RL) based methods that learn to navigate the path towards the target entity. However, these methods suffer from slow and poor convergence, and they may fail to infer a certain path when there is… ▽ More

    Submitted 31 October, 2022; v1 submitted 16 January, 2022; originally announced January 2022.

    Comments: EMNLP 2022. Code is available at https://github.com/bys0318/SQUIRE

  34. arXiv:2201.03335  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    DeepKE: A Deep Learning Based Knowledge Extraction Toolkit for Knowledge Base Population

    Authors: Ningyu Zhang, Xin Xu, Liankuan Tao, Haiyang Yu, Hongbin Ye, Shuofei Qiao, Xin Xie, Xiang Chen, Zhoubo Li, Lei Li, Xiaozhuan Liang, Yunzhi Yao, Shumin Deng, Peng Wang, Wen Zhang, Zhenru Zhang, Chuanqi Tan, Qiang Chen, Feiyu Xiong, Fei Huang, Guozhou Zheng, Huajun Chen

    Abstract: We present an open-source and extensible knowledge extraction toolkit DeepKE, supporting complicated low-resource, document-level and multimodal scenarios in the knowledge base population. DeepKE implements various information extraction tasks, including named entity recognition, relation extraction and attribute extraction. With a unified framework, DeepKE allows developers and researchers to cus… ▽ More

    Submitted 18 September, 2023; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: Accepted by EMNLP 2022 System Demonstrations and the project website is http://deepke.zjukg.cn/

  35. arXiv:2112.08589  [pdf, other

    cs.AI

    Knowledge Graph Embedding in E-commerce Applications: Attentive Reasoning, Explanations, and Transferable Rules

    Authors: Wen Zhang, Shumin Deng, Mingyang Chen, Liang Wang, Qiang Chen, Feiyu Xiong, Xiangwen Liu, Huajun Chen

    Abstract: Knowledge Graphs (KGs), representing facts as triples, have been widely adopted in many applications. Reasoning tasks such as link prediction and rule induction are important for the development of KGs. Knowledge Graph Embeddings (KGEs) embedding entities and relations of a KG into continuous vector spaces, have been proposed for these reasoning tasks and proven to be efficient and robust. But the… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted at IJCKG2021

  36. arXiv:2108.03989  [pdf, other

    cs.AI

    Spatial-Temporal Deep Intention Destination Networks for Online Travel Planning

    Authors: Yu Li, Fei Xiong, Ziyi Wang, Zulong Chen, Chuanfei Xu, Yuyu Yin, Li Zhou

    Abstract: Nowadays, artificial neural networks are widely used for users' online travel planning. Personalized travel planning has many real applications and is affected by various factors, such as transportation type, intention destination estimation, budget limit and crowdness prediction. Among those factors, users' intention destination prediction is an essential task in online travel platforms. The reas… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  37. Anomaly Detection in Dynamic Graphs via Transformer

    Authors: Yixin Liu, Shirui Pan, Yu Guang Wang, Fei Xiong, Liang Wang, Qingfeng Chen, Vincent CS Lee

    Abstract: Detecting anomalies for dynamic graphs has drawn increasing attention due to their wide applications in social networks, e-commerce, and cybersecurity. Recent deep learning-based approaches have shown promising results over shallow methods. However, they fail to address two core challenges of anomaly detection in dynamic graphs: the lack of informative encoding for unattributed nodes and the diffi… ▽ More

    Submitted 27 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: 13 pages, 5 figures

  38. arXiv:2105.05473  [pdf, other

    cs.LG cs.AI

    Interpretable performance analysis towards offline reinforcement learning: A dataset perspective

    Authors: Chenyang Xi, Bo Tang, Jiajun Shen, Xinfu Liu, Feiyu Xiong, Xueying Li

    Abstract: Offline reinforcement learning (RL) has increasingly become the focus of the artificial intelligent research due to its wide real-world applications where the collection of data may be difficult, time-consuming, or costly. In this paper, we first propose a two-fold taxonomy for existing offline RL algorithms from the perspective of exploration and exploitation tendency. Secondly, we derive the exp… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  39. arXiv:2012.01829  [pdf, other

    eess.IV cs.CV

    SMDS-Net: Model Guided Spectral-Spatial Network for Hyperspectral Image Denoising

    Authors: Fengchao Xiong, Shuyin Tao, Jun Zhou, Jianfeng Lu, Jiantao Zhou, Yuntao Qian

    Abstract: Deep learning (DL) based hyperspectral images (HSIs) denoising approaches directly learn the nonlinear map** between observed noisy images and underlying clean images. They normally do not consider the physical characteristics of HSIs, therefore making them lack of interpretability that is key to understand their denoising mechanism.. In order to tackle this problem, we introduce a novel model g… ▽ More

    Submitted 14 November, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: The experimental settings have been updated

  40. arXiv:2007.05720  [pdf, other

    cs.CV

    ECML: An Ensemble Cascade Metric Learning Mechanism towards Face Verification

    Authors: Fu Xiong, Yang Xiao, Zhiguo Cao, Yancheng Wang, Joey Tianyi Zhou, Jianxi Wu

    Abstract: Face verification can be regarded as a 2-class fine-grained visual recognition problem. Enhancing the feature's discriminative power is one of the key problems to improve its performance. Metric learning technology is often applied to address this need, while achieving a good tradeoff between underfitting and overfitting plays the vital role in metric learning. Hence, we propose a novel ensemble c… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: Accepted to IEEE Transaction on Cybernetics

  41. arXiv:2005.10402  [pdf, other

    cs.LG stat.ML

    Studying Product Competition Using Representation Learning

    Authors: Fanglin Chen, Xiao Liu, Davide Proserpio, Isamar Troncoso, Feiyu Xiong

    Abstract: Studying competition and market structure at the product level instead of brand level can provide firms with insights on cannibalization and product line optimization. However, it is computationally challenging to analyze product-level competition for the millions of products available on e-commerce platforms. We introduce Product2Vec, a method based on the representation learning algorithm Word2V… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: 8 pages, to be published in SIGIR '20

  42. arXiv:2005.05501  [pdf, other

    cs.CV

    3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

    Authors: Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

    Abstract: To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) is proposed as a novel 3D motion representation. With 3D space voxelization, the key idea of 3DV is to encode 3D motion information within depth video into a regular voxel set (i.e., 3DV) compactly, via temporal rank pooling. Each available 3DV voxel intrinsically involves 3D spatial and motion feature jointly. 3DV is then abs… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: Accepted by CVPR2020

  43. arXiv:2003.13764  [pdf, other

    cs.CV

    Measuring Generalisation to Unseen Viewpoints, Articulations, Shapes and Objects for 3D Hand Pose Estimation under Hand-Object Interaction

    Authors: Anil Armagan, Guillermo Garcia-Hernando, Seungryul Baek, Shreyas Hampali, Mahdi Rad, Zhaohui Zhang, Shipeng Xie, MingXiu Chen, Boshen Zhang, Fu Xiong, Yang Xiao, Zhiguo Cao, Junsong Yuan, Pengfei Ren, Weiting Huang, Haifeng Sun, Marek Hrúz, Jakub Kanis, Zdeněk Krňoul, Qingfu Wan, Shile Li, Linlin Yang, Dongheui Lee, Angela Yao, Weiguo Zhou , et al. (10 additional authors not shown)

    Abstract: We study how well different types of approaches generalise in the task of 3D hand pose estimation under single hand scenarios and hand-object interaction. We show that the accuracy of state-of-the-art methods can drop, and that they fail mostly on poses absent from the training set. Unfortunately, since the space of hand poses is highly dimensional, it is inherently not feasible to cover the whole… ▽ More

    Submitted 10 September, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: European Conference on Computer Vision (ECCV), 2020

  44. arXiv:1911.03079  [pdf

    cs.IT

    Maximum Likelihood Decoding of Convolutionally Coded Noncoherent ASK Signals in AWGN Channels

    Authors: Arafat Al-Dweik, Fuqin Xiong

    Abstract: In this work we develop the maximum likelihood detection (MLD) algorithm for noncoherent amplitude shift keying (NCASK) systems in additive white Gaussian noise (AWGN) channels. The developed algorithm was used to investigate the performance of the NCASK system with convolutional coding and soft-decision Viterbi decoding. Tight and simple upper bounds have been derived to describe the system perfo… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Journal ref: Proceedings of the IASTED International Conference WIRELESS AND OPTICAL COMMUNICATIONS, June 2001, Banff, Canada,

  45. arXiv:1909.09998  [pdf, other

    cs.CV

    Double Anchor R-CNN for Human Detection in a Crowd

    Authors: Kevin Zhang, Feng Xiong, Peize Sun, Li Hu, Boxun Li, Gang Yu

    Abstract: Detecting human in a crowd is a challenging problem due to the uncertainties of occlusion patterns. In this paper, we propose to handle the crowd occlusion problem in human detection by leveraging the head part. Double Anchor RPN is developed to capture body and head parts in pairs. A proposal crossover strategy is introduced to generate high-quality proposals for both parts as a training augmenta… ▽ More

    Submitted 22 September, 2019; originally announced September 2019.

  46. arXiv:1908.09999  [pdf, other

    cs.CV

    A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image

    Authors: Fu Xiong, Boshen Zhang, Yang Xiao, Zhiguo Cao, Taidong Yu, Joey Tianyi Zhou, Junsong Yuan

    Abstract: For 3D hand and body pose estimation task in depth image, a novel anchor-based approach termed Anchor-to-Joint regression network (A2J) with the end-to-end learning ability is proposed. Within A2J, anchor points able to capture global-local spatial context information are densely set on depth image as local regressors for the joints. They contribute to predict the positions of the joints in ensemb… ▽ More

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: Accepted by ICCV2019

  47. arXiv:1812.04179  [pdf, other

    cs.CV cs.AI

    Material Based Object Tracking in Hyperspectral Videos: Benchmark and Algorithms

    Authors: Fengchao Xiong, Jun Zhou, Yuntao Qian

    Abstract: Traditional color images only depict color intensities in red, green and blue channels, often making object trackers fail in challenging scenarios, e.g., background clutter and rapid changes of target appearance. Alternatively, material information of targets contained in a large amount of bands of hyperspectral images (HSI) is more robust to these difficult conditions. In this paper, we conduct a… ▽ More

    Submitted 10 July, 2019; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: Update results

  48. arXiv:1810.11819  [pdf, other

    cs.CV

    Object Tracking in Hyperspectral Videos with Convolutional Features and Kernelized Correlation Filter

    Authors: Kun Qian, Jun Zhou, Fengchao Xiong, Huixin Zhou, Juan Du

    Abstract: Target tracking in hyperspectral videos is a new research topic. In this paper, a novel method based on convolutional network and Kernelized Correlation Filter (KCF) framework is presented for tracking objects of interest in hyperspectral videos. We extract a set of normalized three-dimensional cubes from the target region as fixed convolution filters which contain spectral information surrounding… ▽ More

    Submitted 28 October, 2018; originally announced October 2018.

    Comments: Accepted by ICSM 2018

  49. arXiv:1808.01616  [pdf, other

    cs.CY cs.LG

    Predicting Learning Status in MOOCs using LSTM

    Authors: Zhemin Liu, Feng Xiong, Kaifa Zou, Hongzhi Wang

    Abstract: Real-time and open online course resources of MOOCs have attracted a large number of learners in recent years. However, many new questions were emerging about the high dropout rate of learners. For MOOCs platform, predicting the learning status of MOOCs learners in real time with high accuracy is the crucial task, and it also help improve the quality of MOOCs teaching. The prediction task in this… ▽ More

    Submitted 5 August, 2018; originally announced August 2018.

    Comments: arXiv admin note: text overlap with arXiv:1402.1128 by other authors

  50. arXiv:1807.11042  [pdf, other

    cs.CV

    Towards Good Practices on Building Effective CNN Baseline Model for Person Re-identification

    Authors: Fu Xiong, Yang Xiao, Zhiguo Cao, Kaicheng Gong, Zhiwen Fang, Joey Tianyi Zhou

    Abstract: Person re-identification is indeed a challenging visual recognition task due to the critical issues of human pose variation, human body occlusion, camera view variation, etc. To address this, most of the state-of-the-art approaches are proposed based on deep convolutional neural network (CNN), being leveraged by its strong feature learning power and classification boundary fitting capacity. Althou… ▽ More

    Submitted 29 July, 2018; originally announced July 2018.