Showing 1–50 of 312 results for author: Chen, E

Searching in archive cs.
  1. arXiv:2406.19622  [pdf, other]

    cs.LG cs.AI

    Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee

    Abstract: The security and robustness of deep neural networks (DNNs) have become increasingly concerning. This paper aims to provide both a theoretical foundation and a practical solution to ensure the reliability of DNNs. We explore the concept of Lipschitz continuity to certify the robustness of DNNs against adversarial attacks, which aim to mislead the network with adding imperceptible perturbations into…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.14979  [pdf, other]

    cs.CL

    Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation

    Authors: Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen

    Abstract: Despite the significant progress of large language models (LLMs) in various tasks, they often produce factual errors due to their limited internal knowledge. Retrieval-Augmented Generation (RAG), which enhances LLMs with external knowledge sources, offers a promising solution. However, these methods can be misled by irrelevant paragraphs in retrieved documents. Due to the inherent uncertainty in L…

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.13618  [pdf, other]

    cs.CL

    In-Context Former: Lightning-fast Compressing Context for Large Language Model

    Authors: Xiangfeng Wang, Zaiyi Chen, Zheyong Xie, Tong Xu, Yongyi He, Enhong Chen

    Abstract: With the rising popularity of Transformer-based large language models (LLMs), reducing their high inference costs has become a significant research focus. One effective approach is to compress the long input contexts. Existing methods typically leverage the self-attention mechanism of the LLM itself for context compression. While these methods have achieved notable results, the compression process…

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.12020  [pdf, other]

    cs.IR cs.AI

    When Box Meets Graph Neural Network in Tag-aware Recommendation

    Authors: Fake Lin, Ziwei Zhao, Xi Zhu, Da Zhang, Shitian Shen, Xueying Li, Tong Xu, Suojuan Zhang, Enhong Chen

    Abstract: Last year has witnessed the re-flourishment of tag-aware recommender systems supported by the LLM-enriched tags. Unfortunately, though large efforts have been made, current solutions may fail to describe the diversity and uncertainty inherent in user preferences with only tag-driven profiles. Recently, with the development of geometry-based techniques, e.g., box embedding, diversity of user prefer…

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2406.08358  [pdf, other]

    cs.CV cs.AI

    From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition

    Authors: Shiwei Wu, Chao Zhang, Joya Chen, Tong Xu, Likang Wu, Yao Hu, Enhong Chen

    Abstract: People's social relationships are often manifested through their surroundings, with certain objects or interactions acting as symbols for specific relationships, e.g., wedding rings, roses, hugs, or holding hands. This brings unique challenges to recognizing social relationships, requiring understanding and capturing the essence of these contexts from visual appearances. However, current methods o…

    Submitted 12 June, 2024; originally announced June 2024.

  6. arXiv:2406.03085  [pdf, other]

    cs.LG cs.IR

    Exploring User Retrieval Integration towards Large Language Models for Cross-Domain Sequential Recommendation

    Authors: Tingjia Shen, Hao Wang, Jiaqing Zhang, Sirui Zhao, Liangyue Li, Zulong Chen, Defu Lian, Enhong Chen

    Abstract: Cross-Domain Sequential Recommendation (CDSR) aims to mine and transfer users' sequential preferences across different domains to alleviate the long-standing cold-start issue. Traditional CDSR models capture collaborative information through user and item modeling while overlooking valuable semantic information. Recently, Large Language Model (LLM) has demonstrated powerful semantic reasoning capa…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures

    ACM Class: I.2.7

  7. arXiv:2406.01276  [pdf, other]

    cs.CL

    EduNLP: Towards a Unified and Modularized Library for Educational Resources

    Authors: Zhenya Huang, Yuting Ning, Longhu Qin, Shiwei Tong, Shangzi Xue, Tong Xiao, Xin Lin, Jiayu Liu, Qi Liu, Enhong Chen, Shijing Wang

    Abstract: Educational resource understanding is vital to online learning platforms, which have demonstrated growing applications recently. However, researchers and developers always struggle with using existing general natural language toolkits or domain-specific models. The issue raises a need to develop an effective and easy-to-use one that benefits AI education-related research and applications. To bridg…

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2405.21075  [pdf, other]

    cs.CV cs.CL

    Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    Authors: Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun

    Abstract: In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding. The potential of MLLMs in processing sequential visual data is still insufficiently explored, highlighting the absence of a comprehensive, high-quality…

    Submitted 16 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Project Page: https://video-mme.github.io

  9. arXiv:2405.18708  [pdf, other]

    cs.AI cs.IR cs.NE

    Cognitive Evolutionary Learning to Select Feature Interactions for Recommender Systems

    Authors: Runlong Yu, Qixiang Shao, Qi Liu, Huan Liu, Enhong Chen

    Abstract: Feature interaction selection is a fundamental problem in commercial recommender systems. Most approaches equally enumerate all features and interactions by the same pre-defined operation under expert guidance. Their recommendation is unsatisfactory sometimes due to the following issues: (1) They cannot ensure the learning abilities of models because their architectures are poorly adaptable to tas…

    Submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.17795  [pdf, other]

    cs.IR

    Dataset Regeneration for Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: The sequential recommender (SR) system is a crucial component of modern recommender systems, as it aims to capture the evolving preferences of users. Significant efforts have been made to enhance the capabilities of SR systems. These methods typically follow the model-centric paradigm, which involves developing effective models based on fixed datasets. However, this approach often overlooks potent…

    Submitted 3 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  11. arXiv:2405.16789  [pdf, other]

    cs.IR

    NoteLLM-2: Multimodal Large Representation Models for Recommendation

    Authors: Chao Zhang, Haoxin Zhang, Shiwei Wu, Di Wu, Tong Xu, Yan Gao, Yao Hu, Enhong Chen

    Abstract: Large Language Models (LLMs) have demonstrated exceptional text understanding. Existing works explore their application in text embedding tasks. However, there are few works utilizing LLMs to assist multimodal representation tasks. In this work, we investigate the potential of LLMs to enhance multimodal representation in multimodal item-to-item (I2I) recommendations. One feasible method is the tra…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 19 pages, 5 figures

  12. arXiv:2405.12473  [pdf, other]

    cs.IR cs.AI

    Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Zhi Li, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: Cross-domain sequential recommendation (CDSR) aims to uncover and transfer users' sequential preferences across multiple recommendation domains. While significant endeavors have been made, they primarily concentrated on developing advanced transfer modules and aligning user representations using self-supervised learning techniques. However, the problem of aligning item representations has received…

    Submitted 3 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  13. arXiv:2405.11531  [pdf, other]

    cs.IR cs.AI

    Knowledge Graph Pruning for Recommendation

    Authors: Fake Lin, Xi Zhu, Ziwei Zhao, Deqiang Huang, Yu Yu, Xueying Li, Tong Xu, Enhong Chen

    Abstract: Recent years have witnessed the prosperity of knowledge graph based recommendation system (KGRS), which enriches the representation of users, items, and entities by structural knowledge with striking improvement. Nevertheless, its unaffordable computational cost still limits researchers from exploring more sophisticated models. We observe that the bottleneck for training efficiency arises from the…

    Submitted 19 May, 2024; originally announced May 2024.

  14. arXiv:2405.07580  [pdf, other]

    cs.IR cs.AI

    DynLLM: When Large Language Models Meet Dynamic Graph Recommendation

    Authors: Ziwei Zhao, Fake Lin, Xi Zhu, Zhi Zheng, Tong Xu, Shitian Shen, Xueying Li, Zikai Yin, Enhong Chen

    Abstract: Last year has witnessed the considerable interest of Large Language Models (LLMs) for their potential applications in recommender systems, which may mitigate the persistent issue of data sparsity. Though large efforts have been made for user-item graph augmentation with better graph-based recommendation performance, they may fail to deal with the dynamic graph recommendation task, which involves b…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  15. arXiv:2405.07518  [pdf, other]

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain, et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert…

    Submitted 13 May, 2024; originally announced May 2024.

  16. arXiv:2405.06232  [pdf, other]

    cs.AI

    Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process

    Authors: Tong Xiao, Jiayu Liu, Zhenya Huang, Jinze Wu, Jing Sha, Shijin Wang, Enhong Chen

    Abstract: Geometry Problem Solving (GPS), which is a classic and challenging math problem, has attracted much attention in recent years. It requires a solver to comprehensively understand both text and diagram, master essential geometry knowledge, and appropriately apply it in reasoning. However, existing works follow a paradigm of neural machine translation and only focus on enhancing the capability of enc…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024 Accepted

  17. arXiv:2405.05811  [pdf, other]

    cs.CV

    Parallel Cross Strip Attention Network for Single Image Dehazing

    Authors: Lihan Tong, Yun Liu, Tian Ye, Weijia Li, Liyuan Chen, Erkang Chen

    Abstract: The objective of single image dehazing is to restore hazy images and produce clear, high-quality visuals. Traditional convolutional models struggle with long-range dependencies due to their limited receptive field size. While Transformers excel at capturing such dependencies, their quadratic computational complexity in relation to feature map resolution makes them less suitable for pixel-to-pixel…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 10 pages, 4 figures, CTISC'24

    Report number: C052

  18. arXiv:2405.02287  [pdf, other]

    cs.CL cs.AI cs.CV

    Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models

    Authors: Piotr Padlewski, Max Bain, Matthew Henderson, Zhongkai Zhu, Nishant Relan, Hai Pham, Donovan Ong, Kaloyan Aleksiev, Aitor Ormazabal, Samuel Phua, Ethan Yeo, Eugenie Lamprecht, Qi Liu, Yuqi Wang, Eric Chen, Deyu Fu, Lei Li, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Mikel Artetxe, Yi Tay

    Abstract: We introduce Vibe-Eval: a new open benchmark and framework for evaluating multimodal chat models. Vibe-Eval consists of 269 visual understanding prompts, including 100 of hard difficulty, complete with gold-standard responses authored by experts. Vibe-Eval is open-ended and challenging with dual objectives: (i) vibe checking multimodal chat models for day-to-day tasks and (ii) rigorously testing a…

    Submitted 3 May, 2024; originally announced May 2024.

  19. arXiv:2405.00291  [pdf, other]

    cs.CL cs.AI cs.HC

    How Can I Improve? Using GPT to Highlight the Desired and Undesired Parts of Open-ended Responses

    Authors: Jionghao Lin, Eason Chen, Zeifei Han, Ashish Gurung, Danielle R. Thomas, Wei Tan, Ngoc Dang Nguyen, Kenneth R. Koedinger

    Abstract: Automated explanatory feedback systems play a crucial role in facilitating learning for a large cohort of learners by offering feedback that incorporates explanations, significantly enhancing the learning process. However, delivering such explanatory feedback in real-time poses challenges, particularly when high classification accuracy for domain-specific, nuanced responses is essential. Our study…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, full research paper, EDM 2024

  20. arXiv:2404.16587  [pdf, other]

    cs.CL cs.AI

    Understanding Privacy Risks of Embeddings Induced by Large Language Models

    Authors: Zhihao Zhu, Ninglu Shao, Defu Lian, Chenwang Wu, Zheng Liu, Yi Yang, Enhong Chen

    Abstract: Large language models (LLMs) show early signs of artificial general intelligence but struggle with hallucinations. One promising solution to mitigate these hallucinations is to store external knowledge as embeddings, aiding LLMs in retrieval-augmented generation. However, such a solution risks compromising privacy, as recent studies experimentally showed that the original text can be partially rec…

    Submitted 25 April, 2024; originally announced April 2024.

  21. arXiv:2404.15881  [pdf, other]

    cs.CV cs.AI

    Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee

    Abstract: Latency attacks against object detection represent a variant of adversarial attacks that aim to inflate the inference time by generating additional ghost objects in a target image. However, generating ghost objects in the black-box scenario remains a challenge since information about these unqualified objects remains opaque. In this study, we demonstrate the feasibility of generating ghost objects…

    Submitted 24 April, 2024; originally announced April 2024.

  22. arXiv:2404.15209  [pdf, other]

    cs.LG stat.ME stat.ML

    Data-Driven Knowledge Transfer in Batch $Q^*$ Learning

    Authors: Elynn Chen, Xi Chen, Wenbo Jing

    Abstract: In data-driven decision-making in marketing, healthcare, and education, it is desirable to utilize a large amount of data from existing ventures to navigate high-dimensional feature spaces and address data scarcity in new ventures. We explore knowledge transfer in dynamic decision-making by concentrating on batch stationary environments and formally defining task discrepancies through the lens of…

    Submitted 31 March, 2024; originally announced April 2024.

  23. arXiv:2404.12387  [pdf, other]

    cs.CL cs.CV

    Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models

    Authors: Reka Team, Aitor Ormazabal, Che Zheng, Cyprien de Masson d'Autume, Dani Yogatama, Deyu Fu, Donovan Ong, Eric Chen, Eugenie Lamprecht, Hai Pham, Isaac Ong, Kaloyan Aleksiev, Lei Li, Matthew Henderson, Max Bain, Mikel Artetxe, Nishant Relan, Piotr Padlewski, Qi Liu, Ren Chen, Samuel Phua, Yazheng Yang, Yi Tay, Yuqi Wang, Zhongkai Zhu, et al. (1 additional authors not shown)

    Abstract: We introduce Reka Core, Flash, and Edge, a series of powerful multimodal language models trained from scratch by Reka. Reka models are able to process and reason with text, images, video, and audio inputs. This technical report discusses details of training some of these models and provides comprehensive evaluation results. We show that Reka Edge and Reka Flash are not only state-of-the-art but al…

    Submitted 18 April, 2024; originally announced April 2024.

  24. arXiv:2404.07456  [pdf, other]

    cs.AI cs.MA

    WESE: Weak Exploration to Strong Exploitation for LLM Agents

    Authors: Xu Huang, Weiwen Liu, Xiaolong Chen, Xingmei Wang, Defu Lian, Yasheng Wang, Ruiming Tang, Enhong Chen

    Abstract: Recently, large language models (LLMs) have demonstrated remarkable potential as an intelligent agent. However, existing researches mainly focus on enhancing the agent's reasoning or decision-making abilities through well-designed prompt engineering or task-specific fine-tuning, ignoring the procedure of exploration and exploitation. When addressing complex tasks within open-world interactive envi…

    Submitted 10 April, 2024; originally announced April 2024.

  25. arXiv:2404.04242  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Physical Property Understanding from Language-Embedded Feature Fields

    Authors: Albert J. Zhai, Yuan Shen, Emily Y. Chen, Gloria X. Wang, Xinlei Wang, Sheng Wang, Kaiyu Guan, Shenlong Wang

    Abstract: Can computers perceive the physical properties of objects solely through vision? Research in cognitive science and vision science has shown that humans excel at identifying materials and estimating their physical properties based purely on visual appearance. In this paper, we present a novel approach for dense prediction of the physical properties of objects using a collection of images. Inspired…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project page (with code): https://ajzhai.github.io/NeRF2Physics/

  26. arXiv:2404.03845  [pdf]

    cs.CR cs.HC

    Buck You: Designing Easy-to-Onboard Blockchain Applications with Zero-Knowledge Login and Sponsored Transactions on Sui

    Authors: Eason Chen, Zimo Xiao, Justa Liang, Damien Chen, Pierce Hung, Kostas Kryptos Chalkias

    Abstract: In this paper, we developed a blockchain application to demonstrate the functionality of Sui's recent innovations: Zero Knowledge Login and Sponsored Transactions. Zero Knowledge Login allows users to create and access their blockchain wallets just with their OAuth accounts (e.g., Google, Facebook, Twitch), while Sponsored Transactions eliminate the need for users to prepare transaction fees, as t…

    Submitted 4 April, 2024; originally announced April 2024.

  27. arXiv:2404.00712  [pdf, other]

    cs.LG cs.AI cs.CY cs.IR

    Survey of Computerized Adaptive Testing: A Machine Learning Perspective

    Authors: Qi Liu, Yan Zhuang, Haoyang Bi, Zhenya Huang, Weizhe Huang, Jiatong Li, Junhao Yu, Zirui Liu, Zirui Hu, Yuting Hong, Zachary A. Pardos, Haiping Ma, Mengxiao Zhu, Shijin Wang, Enhong Chen

    Abstract: Computerized Adaptive Testing (CAT) provides an efficient and tailored method for assessing the proficiency of examinees, by dynamically adjusting test questions based on their performance. Widely adopted across diverse fields like education, healthcare, sports, and sociology, CAT has revolutionized testing practices. While traditional methods rely on psychometrics and statistics, the increasing c…

    Submitted 4 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  28. arXiv:2404.00268  [pdf, other]

    cs.IR

    A Unified Framework for Adaptive Representation Enhancement and Inversed Learning in Cross-Domain Recommendation

    Authors: Luankang Zhang, Hao Wang, Suojuan Zhang, Mingjia Yin, Yongqiang Han, Jiaqing Zhang, Defu Lian, Enhong Chen

    Abstract: Cross-domain recommendation (CDR), aiming to extract and transfer knowledge across domains, has attracted wide attention for its efficacy in addressing data sparsity and cold-start problems. Despite significant advances in representation disentanglement to capture diverse user preferences, existing methods usually neglect representation enhancement and lack rigorous decoupling constraints, thereby…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted by DASFAA 2024

  29. arXiv:2403.18227  [pdf, other]

    cs.IR

    One Backpropagation in Two Tower Recommendation Models

    Authors: Erjia Chen, Bang Wang

    Abstract: Recent years have witnessed extensive researches on developing two tower recommendation models for relieving information overload. Four building modules can be identified in such models, namely, user-item encoding, negative sampling, loss computing and back-propagation updating. To the best of our knowledge, existing algorithms have researched only on the first three modules, yet neglecting the ba…

    Submitted 27 May, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 10 pages, 8 figures

  30. arXiv:2403.17603  [pdf, other]

    cs.IR

    END4Rec: Efficient Noise-Decoupling for Multi-Behavior Sequential Recommendation

    Authors: Yongqiang Han, Hao Wang, Kefan Wang, Likang Wu, Zhi Li, Wei Guo, Yong Liu, Defu Lian, Enhong Chen

    Abstract: In recommendation systems, users frequently engage in multiple types of behaviors, such as clicking, adding to a cart, and purchasing. However, with diversified behavior data, user behavior sequences will become very long in the short term, which brings challenges to the efficiency of the sequence recommendation model. Meanwhile, some behavior data will also bring inevitable noise to the modeling…

    Submitted 26 March, 2024; originally announced March 2024.

  31. arXiv:2403.14676  [pdf, other]

    cs.CY cs.AI cs.LG

    Unified Uncertainty Estimation for Cognitive Diagnosis Models

    Authors: Fei Wang, Qi Liu, Enhong Chen, Chuanren Liu, Zhenya Huang, Jinze Wu, Shijin Wang

    Abstract: Cognitive diagnosis models have been widely used in different areas, especially intelligent education, to measure users' proficiency levels on knowledge concepts, based on which users can get personalized instructions. As the measurement is not always reliable due to the weak links of the models and data, the uncertainty of measurement also offers important information for decisions. However, the…

    Submitted 9 March, 2024; originally announced March 2024.

  32. arXiv:2403.14409  [pdf, other]

    cs.CL cs.AI

    Locating and Mitigating Gender Bias in Large Language Models

    Authors: Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

    Abstract: Large language models (LLM) are pre-trained on extensive corpora to learn facts and human cognition which contain human preferences. However, this process can inadvertently lead to these models acquiring biases and stereotypes prevalent in society. Prior research has typically tackled the issue of bias through a one-dimensional perspective, concentrating either on locating or mitigating it. This li…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 23 pages, 5 figures

  33. arXiv:2403.14381  [pdf, other]

    cs.CL cs.AI

    Editing Knowledge Representation of Language Model via Rephrased Prefix Prompts

    Authors: Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

    Abstract: Neural language models (LMs) have been extensively trained on vast corpora to store factual knowledge about various aspects of the world described in texts. Current technologies typically employ knowledge editing methods or specific prompts to modify LM outputs. However, existing knowledge editing methods are costly and inefficient, struggling to produce appropriate text. Additionally, prompt engi…

    Submitted 11 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 19 pages, 3 figures

  34. A Knowledge-Injected Curriculum Pretraining Framework for Question Answering

    Authors: Xin Lin, Tianhuang Su, Zhenya Huang, Shangzi Xue, Haifeng Liu, Enhong Chen

    Abstract: Knowledge-based question answering (KBQA) is a key task in NLP research, and also an approach to access the web data and knowledge, which requires exploiting knowledge graphs (KGs) for reasoning. In the literature, one promising solution for KBQA is to incorporate the pretrained language model (LM) with KGs by generating KG-centered pretraining corpus, which has shown its superiority. However, the…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024

  35. arXiv:2403.08887  [pdf, other]

    cs.CV

    Federated Data Model

    Authors: Xiao Chen, Shunan Zhang, Eric Z. Chen, Yikang Liu, Lin Zhao, Terrence Chen, Shanhui Sun

    Abstract: In artificial intelligence (AI), especially deep learning, data diversity and volume play a pivotal role in model development. However, training a robust deep learning model often faces challenges due to data privacy, regulations, and the difficulty of sharing data between different locations, especially for medical applications. To address this, we developed a method called the Federated Data Mod…

    Submitted 13 March, 2024; originally announced March 2024.

  36. arXiv:2403.08826  [pdf, other]

    cs.HC cs.LG

    A Dataset for the Validation of Truth Inference Algorithms Suitable for Online Deployment

    Authors: Fei Wang, Haoyu Liu, Haoyang Bi, Xiangzhuang Shen, Renyu Zhu, Runze Wu, Minmin Lin, Tangjie Lv, Changjie Fan, Qi Liu, Zhenya Huang, Enhong Chen

    Abstract: For the purpose of efficient and cost-effective large-scale data labeling, crowdsourcing is increasingly being utilized. To guarantee the quality of data labeling, multiple annotations need to be collected for each data sample, and truth inference algorithms have been developed to accurately infer the true labels. Despite previous studies having released public datasets to evaluate the efficacy of…

    Submitted 10 March, 2024; originally announced March 2024.

  37. arXiv:2403.08758  [pdf]

    eess.IV cs.CV

    Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI

    Authors: Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun

    Abstract: Current deep learning reconstruction for accelerated cardiac cine MRI suffers from spatial and temporal blurring. We aim to improve image sharpness and motion delineation for cine MRI under high undersampling rates. A spatiotemporal diffusion enhancement model conditional on an existing deep learning reconstruction along with a novel paired sampling strategy was developed. The diffusion model prov…

    Submitted 13 March, 2024; originally announced March 2024.

  38. arXiv:2403.08749  [pdf]

    eess.IV cs.CV

    Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI

    Authors: Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun

    Abstract: The currently limited quality of accelerated cardiac cine reconstruction may potentially be improved by the emerging diffusion models, but the clinically unacceptable long processing time poses a challenge. We aim to develop a clinically feasible diffusion-model-based reconstruction pipeline to improve the image quality of cine MRI. A multi-in multi-out diffusion enhancement model together with fa…

    Submitted 13 March, 2024; originally announced March 2024.

  39. Towards Personalized Evaluation of Large Language Models with An Anonymous Crowd-Sourcing Platform

    Authors: Mingyue Cheng, Hao Zhang, Jiqian Yang, Qi Liu, Li Li, Xin Huang, Liwei Song, Zhi Li, Zhenya Huang, Enhong Chen

    Abstract: Large language model evaluation plays a pivotal role in the enhancement of its capacity. Previously, numerous methods for evaluating large language models have been proposed in this area. Despite their effectiveness, these existing works mainly focus on assessing objective questions, overlooking the capability to evaluate subjective questions which is extremely common for large language models. Ad…

    Submitted 13 March, 2024; originally announced March 2024.

  40. arXiv:2403.07623  [pdf, other]

    cs.IR

    Empowering Sequential Recommendation from Collaborative Signals and Semantic Relatedness

    Authors: Mingyue Cheng, Hao Zhang, Qi Liu, Fajie Yuan, Zhi Li, Zhenya Huang, Enhong Chen, Jun Zhou, Longfei Li

    Abstract: Sequential recommender systems (SRS) could capture dynamic user preferences by modeling historical behaviors ordered in time. Despite effectiveness, focusing only on the collaborative signals from behaviors does not fully grasp user interests. It is also significant to model the semantic relatedness reflected in content features, e.g., images and text. Towards that end, in this p…

    Submitted 12 March, 2024; originally announced March 2024.

  41. arXiv:2403.06071  [pdf, other]

    cs.CV cs.IR

    Bit-mask Robust Contrastive Knowledge Distillation for Unsupervised Semantic Hashing

    Authors: Liyang He, Zhenya Huang, Jiayu Liu, Enhong Chen, Fei Wang, Jing Sha, Shijin Wang

    Abstract: Unsupervised semantic hashing has emerged as an indispensable technique for fast image search, which aims to convert images into binary hash codes without relying on labels. Recent advancements in the field demonstrate that employing large-scale backbones (e.g., ViT) in unsupervised semantic hashing models can yield substantial improvements. However, the inference delay has become increasingly dif…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 12 pages, 19 figures, Proceedings of the ACM Web Conference 2024 (WWW '24)

  42. arXiv:2403.01972  [pdf, other]

    cs.CL

    Multi-perspective Improvement of Knowledge Graph Completion with Large Language Models

    Authors: Derong Xu, Ziheng Zhang, Zhenxi Lin, Xian Wu, Zhihong Zhu, Tong Xu, Xiangyu Zhao, Yefeng Zheng, Enhong Chen

    Abstract: Knowledge graph completion (KGC) is a widely used method to tackle incompleteness in knowledge graphs (KGs) by making predictions for missing links. Description-based KGC leverages pre-trained language models to learn entity and relation representations with their names or descriptions, which shows promising results. However, the performance of description-based KGC is still limited by the quality…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  43. arXiv:2403.01744  [pdf, other]

    cs.IR

    NoteLLM: A Retrievable Large Language Model for Note Recommendation

    Authors: Chao Zhang, Shiwei Wu, Haoxin Zhang, Tong Xu, Yan Gao, Yao Hu, Di Wu, Enhong Chen

    Abstract: People enjoy sharing "notes" including their experiences within online communities. Therefore, recommending notes aligned with user interests has become a crucial task. Existing online methods only input notes into BERT-based models to generate note embeddings for assessing similarity. However, they may underutilize some important cues, e.g., hashtags or categories, which represent the key concept…

    Submitted 25 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Published as a WWW'24 full paper

  44. arXiv:2402.18252  [pdf, other]

    cs.CL cs.AI

    Towards Generalist Prompting for Large Language Models by Mental Models

    Authors: Haoxiang Guan, Jiyan He, Shuxin Zheng, En-Hong Chen, Weiming Zhang, Nenghai Yu

    Abstract: Large language models (LLMs) have demonstrated impressive performance on many tasks. However, to achieve optimal performance, specially designed prompting methods are still needed. These methods either rely on task-specific few-shot examples that require a certain level of domain knowledge, or are designed to be simple but only perform well on a few types of tasks. In this work, we attempt to intr…

    Submitted 28 February, 2024; originally announced February 2024.

  45. arXiv:2402.18099  [pdf, other]

    cs.CL cs.AI

    Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models

    Authors: Derong Xu, Ziheng Zhang, Zhihong Zhu, Zhenxi Lin, Qidong Liu, Xian Wu, Tong Xu, Wanyu Wang, Yuyang Ye, Xiangyu Zhao, Yefeng Zheng, Enhong Chen

    Abstract: Model editing aims to precisely alter the behaviors of large language models (LLMs) in relation to specific knowledge, while leaving unrelated knowledge intact. This approach has proven effective in addressing issues of hallucination and outdated information in LLMs. However, the potential of using model editing to modify knowledge in the medical field remains largely unexplored, even though resol…

    Submitted 4 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  46. arXiv:2402.16312  [pdf, other]

    cs.LG cs.AI

    Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users

    Authors: Hantao Yang, Xutong Liu, Zhiyong Wang, Hong Xie, John C. S. Lui, Defu Lian, Enhong Chen

    Abstract: We study the problem of federated contextual combinatorial cascading bandits, where $|\mathcal{U}|$ agents collaborate under the coordination of a central server to provide tailored recommendations to the $|\mathcal{U}|$ corresponding users. Existing works consider either a synchronous framework, necessitating full agent participation and global synchronization, or assume user homogeneity with ide…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024

  47. arXiv:2402.14601  [pdf, other]

    cs.CY cs.AI cs.HC cs.LG

    Bringing Generative AI to Adaptive Learning in Education

    Authors: Hang Li, Tianlong Xu, Chaoli Zhang, Eason Chen, Jing Liang, Xing Fan, Haoyang Li, Jiliang Tang, Qingsong Wen

    Abstract: The recent surge in generative AI technologies, such as large language models and diffusion models, has boosted the development of AI applications in various domains, including science, finance, and education. Concurrently, adaptive learning, a concept that has gained substantial interest in the educational sphere, has proven its efficacy in enhancing students' learning efficiency. In this positio…

    Submitted 28 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  48. arXiv:2402.14594  [pdf, other]

    cs.CY cs.AI cs.CL cs.HC cs.IR

    Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation

    Authors: Zifei FeiFei Han, Jionghao Lin, Ashish Gurung, Danielle R. Thomas, Eason Chen, Conrad Borchers, Shivang Gupta, Kenneth R. Koedinger

    Abstract: One-on-one tutoring is an effective instructional method for enhancing learning, yet its efficacy hinges on tutor competencies. Novice math tutors often prioritize content-specific guidance, neglecting aspects such as social-emotional learning. Social-emotional learning promotes equity and inclusion and nurturing relationships with students, which is crucial for holistic student development. Asses…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 11 page Workshop paper, AAAI2024 Workshop on AI for Education - Bridging Innovation and Responsibility, Large Language Model, Personalized Tutor Training, Automatic Assessment

  49. arXiv:2402.11857  [pdf, other]

    cs.LG cs.DC

    Communication-Efficient Distributed Learning with Local Immediate Error Compensation

    Authors: Yifei Cheng, Li Shen, Linli Xu, Xun Qian, Shiwei Wu, Yiming Zhou, Tie Zhang, Dacheng Tao, Enhong Chen

    Abstract: Gradient compression with error compensation has attracted significant attention with the target of reducing the heavy communication overhead in distributed learning. However, existing compression methods either perform only unidirectional compression in one iteration with higher communication cost, or bidirectional compression with slower convergence rate. In this work, we propose the Local Immed…

    Submitted 19 February, 2024; originally announced February 2024.

  50. arXiv:2402.09694  [pdf, other]

    cs.CV

    Seed Optimization with Frozen Generator for Superior Zero-shot Low-light Enhancement

    Authors: Yuxuan Gu, Yi Jin, Ben Wang, Zhixiang Wei, Xiaoxiao Ma, Pengyang Ling, Haoxuan Wang, Huaian Chen, Enhong Chen

    Abstract: In this work, we observe that the generators, which are pre-trained on massive natural images, inherently hold the promising potential for superior low-light image enhancement against varying scenarios. Specifically, we embed a pre-trained generator to Retinex model to produce reflectance maps with enhanced detail and vividness, thereby recovering features degraded by low-light conditions. Taking on…

    Submitted 14 February, 2024; originally announced February 2024.