Showing 1–50 of 77 results for author: Gui, L

  1. arXiv:2406.18245

    cs.CL

    Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems

    Authors: Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He

    Abstract: The inherent ambiguity of cause and effect boundaries poses a challenge in evaluating causal event extraction tasks. Traditional metrics like Exact Match and BertScore poorly reflect model performance, so we trained evaluation models to approximate human evaluation, achieving high agreement. We used them to perform Reinforcement Learning with extraction models to align them with human preference,…

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures, 6 tables

  2. arXiv:2406.17969

    cs.CL cs.AI

    Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective

    Authors: Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He

    Abstract: To better interpret the intrinsic mechanism of large language models (LLMs), recent studies focus on monosemanticity on its basic units. A monosemantic neuron is dedicated to a single and specific concept, which forms a one-to-one correlation between neurons and concepts. Despite extensive research in monosemanticity probing, it remains unclear whether monosemanticity is beneficial or harmful to m…

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.16074

    eess.IV cs.CV

    CAVM: Conditional Autoregressive Vision Model for Contrast-Enhanced Brain Tumor MRI Synthesis

    Authors: Lujun Gui, Chuyang Ye, Tianyi Yan

    Abstract: Contrast-enhanced magnetic resonance imaging (MRI) is pivotal in the pipeline of brain tumor segmentation and analysis. Gadolinium-based contrast agents, as the most commonly used contrast agents, are expensive and may have potential side effects, and it is desired to obtain contrast-enhanced brain tumor MRI scans without the actual use of contrast agents. Deep learning methods have been applied t…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: The work has been accepted by MICCAI 2024

  4. Multi-Layer Ranking with Large Language Models for News Source Recommendation

    Authors: Wenjia Zhang, Lin Gui, Rob Procter, Yulan He

    Abstract: To seek reliable information sources for news events, we introduce a novel task of expert recommendation, which aims to identify trustworthy sources based on their previously quoted statements. To achieve this, we built a novel dataset, called NewsQuote, consisting of 23,571 quote-speaker pairs sourced from a collection of news articles. We formulate the recommendation task as the retrieval of exp…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by the SIGIR 2024. arXiv admin note: text overlap with arXiv:2305.04825

  5. arXiv:2406.07544

    cs.CV cs.AI cs.CL cs.LG

    Situational Awareness Matters in 3D Vision Language Reasoning

    Authors: Yunze Man, Liang-Yan Gui, Yu-Xiong Wang

    Abstract: Being able to carry out complicated vision language reasoning tasks in 3D space represents a significant milestone in developing household robots and human-centered embodied AI. In this work, we demonstrate that a critical and distinct challenge in 3D vision language reasoning is situational awareness, which incorporates two key components: (1) The autonomous agent grounds its self-location based…

    Submitted 26 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024. Project Page: https://yunzeman.github.io/situation3d

  6. arXiv:2406.00832

    cs.CL cs.LG

    BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling

    Authors: Lin Gui, Cristina Gârbacea, Victor Veitch

    Abstract: This paper concerns the problem of aligning samples from large language models to human preferences using best-of-$n$ sampling, where we draw $n$ samples, rank them, and return the best one. We consider two fundamental problems. First: what is the relationship between best-of-$n$ and approaches to alignment that train LLMs to output samples with a high expected reward (e.g., RLHF or DPO)? To answe…

    Submitted 5 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  7. arXiv:2404.17662

    cs.CL

    PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games

    Authors: Qinglin Zhu, Runcong Zhao, Jinhua Du, Lin Gui, Yulan He

    Abstract: We propose PLAYER*, a novel framework that addresses the limitations of existing agent-based approaches built on Large Language Models (LLMs) in handling complex questions and understanding interpersonal relationships in dynamic environments. PLAYER* enhances path planning in Murder Mystery Games (MMGs) using an anytime sampling-based planner and a questioning-driven search framework. By equipping…

    Submitted 17 June, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2404.12386

    cs.CV cs.LG

    SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

    Authors: Shengcao Cao, Jiuxiang Gu, Jason Kuen, Hao Tan, Ruiyi Zhang, Handong Zhao, Ani Nenkova, Liang-Yan Gui, Tong Sun, Yu-Xiong Wang

    Abstract: Open-world entity segmentation, as an emerging computer vision task, aims at segmenting entities in images without being restricted by pre-defined classes, offering impressive generalization capabilities on unseen images and concepts. Despite its promise, existing entity segmentation methods like Segment Anything Model (SAM) rely heavily on costly expert annotators. This work presents Self-supervi…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: ICLR 2024

  9. arXiv:2404.01258

    cs.CV cs.AI

    Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

    Authors: Ruohong Zhang, Liangke Gui, Zhiqing Sun, Yihao Feng, Keyang Xu, Yuanhan Zhang, Di Fu, Chunyuan Li, Alexander Hauptmann, Yonatan Bisk, Yiming Yang

    Abstract: Preference modeling techniques, such as direct preference optimization (DPO), have proven effective in enhancing the generalization abilities of large language models (LLMs). However, in tasks involving video instruction-following, providing informative feedback, especially for detecting hallucinations in generated responses, remains a significant challenge. Previous studies have explored using large…

    Submitted 2 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  10. arXiv:2403.19652

    cs.CV cs.AI

    InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction

    Authors: Sirui Xu, Ziyin Wang, Yu-Xiong Wang, Liang-Yan Gui

    Abstract: Text-conditioned human motion generation has experienced significant advancements with diffusion models trained on extensive motion capture data and corresponding textual annotations. However, extending such success to 3D dynamic human-object interaction (HOI) generation faces notable challenges, primarily due to the lack of large-scale interaction data and comprehensive descriptions that align wi…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Project Page: https://sirui-xu.github.io/InterDreamer/

  11. arXiv:2402.18189

    cs.CR

    VulMCI: Code Splicing-based Pixel-row Oversampling for More Continuous Vulnerability Image Generation

    Authors: Tao Peng, Ling Gui, Yi Sun

    Abstract: In recent years, the rapid development of deep learning technology has brought new prospects to the field of vulnerability detection. Many vulnerability detection methods involve converting source code into images for detection, yet they often overlook the quality of the generated images. Due to the fact that vulnerability images lack clear and continuous contours, unlike images used in object det…

    Submitted 16 April, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  12. arXiv:2402.15637

    cs.CL

    Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models

    Authors: Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He

    Abstract: In-context learning has become a popular paradigm in natural language processing. However, its performance can be significantly influenced by the order of in-context demonstration examples. In this paper, we found that causal language models (CausalLMs) are more sensitive to this order compared to prefix language models (PrefixLMs). We attribute this phenomenon to the auto-regressive attention mas…

    Submitted 6 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  13. arXiv:2402.15309

    cs.LG cs.CL

    Counterfactual Generation with Identifiability Guarantees

    Authors: Hanqi Yan, Lingjing Kong, Lin Gui, Yuejie Chi, Eric Xing, Yulan He, Kun Zhang

    Abstract: Counterfactual generation lies at the core of various machine learning tasks, including image translation and controllable text generation. This generation process usually requires the identification of the disentangled latent representations, such as content and style, that underlie the observed data. However, it becomes more challenging when faced with a scarcity of paired data and labeling info…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2023. Controllable generation from a causal perspective, with a case study of ChatGPT; sheds light on theory-guaranteed alignment in language models

  14. arXiv:2402.14963

    cs.CL cs.AI

    Mirror: A Multiple-perspective Self-Reflection Method for Knowledge-rich Reasoning

    Authors: Hanqi Yan, Qinglin Zhu, Xinyu Wang, Lin Gui, Yulan He

    Abstract: While Large language models (LLMs) have the capability to iteratively reflect on their own outputs, recent studies have observed their struggles with knowledge-rich problems without access to external resources. In addition to the inefficiency of LLMs in self-assessment, we also observe that LLMs struggle to revisit their predictions despite receiving explicit negative feedback. Therefore, we prop…

    Submitted 24 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ACL24, Main Conference, long paper. Code is available at https://github.com/hanqi-qi/Mirror.git

  15. arXiv:2402.14522

    cs.CL cs.LG

    Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond

    Authors: Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He

    Abstract: Task embedding, a meta-learning technique that captures task-specific information, has become prevalent, especially in areas such as multi-task learning, model editing, and interpretability. However, it faces challenges with the emergence of prompt-guided Large Language Models (LLMs) operating in a gradient-free manner. Existing task embedding methods rely on fine-tuned, task-specific language mode…

    Submitted 22 February, 2024; originally announced February 2024.

  16. arXiv:2402.14298

    cs.CL

    Multi-modal Stance Detection: New Datasets and Model

    Authors: Bin Liang, Ang Li, Jingqian Zhao, Lin Gui, Min Yang, Yue Yu, Kam-Fai Wong, Ruifeng Xu

    Abstract: Stance detection is a challenging task that aims to identify public opinion from social media platforms with respect to specific targets. Previous work on stance detection largely focused on pure texts. In this paper, we study multi-modal stance detection for tweets consisting of texts and images, which are prevalent in today's fast-growing social media platforms where people often post multi-moda…

    Submitted 6 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: ACL'24 Findings

  17. arXiv:2402.14296

    cs.CL

    Mitigating Biases of Large Language Models in Stance Detection with Calibration

    Authors: Ang Li, Jingqian Zhao, Bin Liang, Lin Gui, Hui Wang, Xi Zeng, Xingwei Liang, Kam-Fai Wong, Ruifeng Xu

    Abstract: Large language models (LLMs) have achieved remarkable progress in many natural language processing tasks. However, our experiment reveals that, in stance detection tasks, LLMs may generate biased stances due to sentiment-stance spurious correlations and preference towards certain individuals and topics, thus harming their performance. Therefore, in this paper, we propose to Mitigate Biases of LLMs…

    Submitted 16 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  18. arXiv:2402.14228

    cs.LG cs.AI

    COPR: Continual Human Preference Learning via Optimal Policy Regularization

    Authors: Han Zhang, Lin Gui, Yu Lei, Yuanzhao Zhai, Yehong Zhang, Yulan He, Hui Wang, Yue Yu, Kam-Fai Wong, Bin Liang, Ruifeng Xu

    Abstract: Reinforcement Learning from Human Feedback (RLHF) is commonly utilized to improve the alignment of Large Language Models (LLMs) with human preferences. Given the evolving nature of human preferences, continual alignment becomes more crucial and practical in comparison to traditional static alignment. Nevertheless, making RLHF compatible with Continual Learning (CL) is challenging due to its comple…

    Submitted 27 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  19. arXiv:2402.11051

    cs.CL cs.AI

    Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives

    Authors: Runcong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li, Yuxiang Zhou, Yulan He, Lin Gui

    Abstract: Existing datasets for narrative understanding often fail to represent the complexity and uncertainty of relationships in real-life social scenarios. To address this gap, we introduce a new benchmark, Conan, designed for extracting and analysing intricate character relation graphs from detective narratives. Specifically, we designed hierarchical relationship categories and manually extracted and an…

    Submitted 16 February, 2024; originally announced February 2024.

  20. arXiv:2402.03311

    cs.CV cs.AI cs.LG

    HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

    Authors: Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang

    Abstract: The human visual perception system demonstrates exceptional capabilities in learning without explicit supervision and understanding the part-to-whole composition of objects. Drawing inspiration from these two abilities, we propose Hierarchical Adaptive Self-Supervised Object Detection (HASSOD), a novel approach that learns to detect objects and understand their compositions without human supervisi…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2023

  21. arXiv:2312.14154

    cs.CV

    Virtual Pets: Animatable Animal Generation in 3D Scenes

    Authors: Yen-Chi Cheng, Chieh Hubert Lin, Chaoyang Wang, Yash Kant, Sergey Tulyakov, Alexander Schwing, Liangyan Gui, Hsin-Ying Lee

    Abstract: Toward unlocking the potential of generative models in immersive 4D experiences, we introduce Virtual Pet, a novel pipeline to model realistic and diverse motions for target animal species within a 3D environment. To circumvent the limited availability of 3D motion data aligned with environmental geometry, we leverage monocular internet videos and extract deformable NeRF representations for the fo…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Preprint. Project page: https://yccyenchicheng.github.io/VirtualPets/

  22. arXiv:2311.00237

    cs.CL

    The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis

    Authors: Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He

    Abstract: Understanding in-context learning (ICL) capability that enables large language models (LLMs) to excel in proficiency through demonstration examples is of utmost importance. This importance stems not only from the better utilization of this capability across various tasks, but also from the proactive identification and mitigation of potential risks, including concerns regarding truthfulness, bias,…

    Submitted 15 February, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  23. arXiv:2310.18783

    cs.CL

    Are NLP Models Good at Tracing Thoughts: An Overview of Narrative Understanding

    Authors: Lixing Zhu, Runcong Zhao, Lin Gui, Yulan He

    Abstract: Narrative understanding involves capturing the author's cognitive processes, providing insights into their knowledge, intentions, beliefs, and desires. Although large language models (LLMs) excel in generating grammatically coherent text, their ability to comprehend the author's thoughts remains uncertain. This limitation hinders the practical applications of narrative understanding. In this paper…

    Submitted 28 October, 2023; originally announced October 2023.

  24. arXiv:2310.18073

    cs.CL

    A Scalable Framework for Table of Contents Extraction from Complex ESG Annual Reports

    Authors: Xinyu Wang, Lin Gui, Yulan He

    Abstract: Table of contents (ToC) extraction centres on structuring documents in a hierarchical manner. In this paper, we propose a new dataset, ESGDoc, comprising 1,093 ESG annual reports from 563 companies spanning from 2001 to 2022. These reports pose significant challenges due to their diverse structures and extensive length. To address these challenges, we propose a new framework for ToC extraction, co…

    Submitted 27 October, 2023; originally announced October 2023.

  25. arXiv:2310.15694

    cs.LG cs.CL

    COPR: Continual Learning Human Preference through Optimal Policy Regularization

    Authors: Han Zhang, Lin Gui, Yuanzhao Zhai, Hui Wang, Yu Lei, Ruifeng Xu

    Abstract: The technique of Reinforcement Learning from Human Feedback (RLHF) is a commonly employed method to improve pre-trained Language Models (LM), enhancing their ability to conform to human preferences. Nevertheless, the current RLHF-based LMs necessitate full retraining each time novel queries or feedback are introduced, which becomes a challenging task because human preferences can vary between diff…

    Submitted 26 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  26. arXiv:2310.01459

    cs.CL cs.AI cs.HC

    NarrativePlay: Interactive Narrative Understanding

    Authors: Runcong Zhao, Wenjia Zhang, Jiazheng Li, Lixing Zhu, Yanran Li, Yulan He, Lin Gui

    Abstract: In this paper, we introduce NarrativePlay, a novel system that allows users to role-play a fictional character and interact with other characters in narratives such as novels in an immersive environment. We leverage Large Language Models (LLMs) to generate human-like responses, guided by personality traits extracted from narratives. The system incorporates auto-generated visual display of narrativ…

    Submitted 2 October, 2023; originally announced October 2023.

  27. arXiv:2309.14525

    cs.CV cs.CL

    Aligning Large Multimodal Models with Factually Augmented RLHF

    Authors: Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Yan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell

    Abstract: Large Multimodal Models (LMM) are built across modalities and the misalignment between two modalities can result in "hallucination", generating textual outputs that are not grounded by the multimodal information in context. To address the multimodal misalignment issue, we adapt the Reinforcement Learning from Human Feedback (RLHF) from the text domain to the task of vision-language alignment, wher…

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Preprint

  28. arXiv:2308.16905

    cs.CV cs.AI cs.GR

    InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion

    Authors: Sirui Xu, Zhengyuan Li, Yu-Xiong Wang, Liang-Yan Gui

    Abstract: This paper addresses a novel task of anticipating 3D human-object interactions (HOIs). Most existing research on HOI synthesis lacks comprehensive whole-body interactions with dynamic objects, e.g., often limited to manipulating small or static objects. Our task is significantly more challenging, as it requires modeling dynamic objects with various shapes, capturing whole-body motion, and ensuring…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: ICCV 2023; Project Page: https://sirui-xu.github.io/InterDiff/

  29. arXiv:2308.09105

    cs.CV cs.LG

    Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation

    Authors: Shengcao Cao, Mengtian Li, James Hays, Deva Ramanan, Yi-Xiong Wang, Liang-Yan Gui

    Abstract: Resource-constrained perception systems such as edge computing and vision-for-robotics require vision models to be both accurate and lightweight in computation and memory usage. While knowledge distillation is a proven strategy to enhance the performance of lightweight classification models, its application to structured outputs like object detection and instance segmentation remains a complicated…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: ICML 2023

  30. arXiv:2307.14603

    eess.IV cs.CV

    A Weakly Supervised Segmentation Network Embedding Cross-scale Attention Guidance and Noise-sensitive Constraint for Detecting Tertiary Lymphoid Structures of Pancreatic Tumors

    Authors: Bingxue Wang, Liwen Zou, Jun Chen, Yingying Cao, Zhenghua Cai, Yudong Qiu, Liang Mao, Zhongqiu Wang, Jingya Chen, Luying Gui, ** Yang

    Abstract: The presence of tertiary lymphoid structures (TLSs) on pancreatic pathological images is an important prognostic indicator of pancreatic tumors. Therefore, TLSs detection on pancreatic pathological images plays a crucial role in diagnosis and treatment for patients with pancreatic tumors. However, fully supervised detection algorithms based on deep learning usually require a large number of manual…

    Submitted 26 July, 2023; originally announced July 2023.

  31. arXiv:2306.05421

    cs.CV

    Stochastic Multi-Person 3D Motion Forecasting

    Authors: Sirui Xu, Yu-Xiong Wang, Liang-Yan Gui

    Abstract: This paper aims to deal with the ignored real-world complexities in prior work on human motion forecasting, emphasizing the social properties of multi-person motion, the diversity of motion and social interactions, and the complexity of articulated motion. To this end, we introduce a novel task of stochastic multi-person 3D motion forecasting. We propose a dual-level generative modeling framework…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: ICLR 2023 (Top 25% Paper); Project Page: https://sirui-xu.github.io/DuMMF

  32. arXiv:2306.03598

    cs.CL

    CUE: An Uncertainty Interpretation Framework for Text Classifiers Built on Pre-Trained Language Models

    Authors: Jiazheng Li, Zhaoyue Sun, Bin Liang, Lin Gui, Yulan He

    Abstract: Text classifiers built on Pre-trained Language Models (PLMs) have achieved remarkable progress in various tasks including sentiment analysis, natural language inference, and question-answering. However, the occurrence of uncertain predictions by these classifiers poses a challenge to their reliability when deployed in practical applications. Much effort has been devoted to designing various probes…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted to UAI 2023

  33. arXiv:2305.18926

    cs.CL

    Document-Level Multi-Event Extraction with Event Proxy Nodes and Hausdorff Distance Minimization

    Authors: Xinyu Wang, Lin Gui, Yulan He

    Abstract: Document-level multi-event extraction aims to extract the structural information from a given document automatically. Most recent approaches usually involve two steps: (1) modeling entity interactions; (2) decoding entity interactions into events. However, such approaches ignore a global view of inter-dependency of multiple events. Moreover, an event is decoded by iteratively merging its related e…

    Submitted 30 May, 2023; originally announced May 2023.

  34. arXiv:2305.14973

    cs.CL

    OverPrompt: Enhancing ChatGPT through Efficient In-Context Learning

    Authors: Jiazheng Li, Runcong Zhao, Yongxin Yang, Yulan He, Lin Gui

    Abstract: The remarkable performance of pre-trained large language models has revolutionised various natural language processing applications. Due to huge parameter sizes and extensive running costs, companies or organisations tend to transfer the models to the target task by zero-shot prompting techniques. However, the prohibitive costs of tokens and time have hindered their adoption in applications. We pro…

    Submitted 14 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 R0-FoMo Workshop

  35. arXiv:2305.12962

    cs.CL

    Distilling ChatGPT for Explainable Automated Student Answer Assessment

    Authors: Jiazheng Li, Lin Gui, Yuxiang Zhou, David West, Cesare Aloisi, Yulan He

    Abstract: Providing explainable and faithful feedback is crucial for automated student answer assessment. In this paper, we introduce a novel framework that explores using ChatGPT, a cutting-edge large language model, for the concurrent tasks of student answer scoring and rationale generation. We identify the appropriate instructions by prompting ChatGPT with different templates to collect the rationales, w…

    Submitted 24 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted EMNLP 2023

  36. arXiv:2305.05331

    cs.IR cs.CL

    Explainable Recommender with Geometric Information Bottleneck

    Authors: Hanqi Yan, Lin Gui, Menghan Wang, Kun Zhang, Yulan He

    Abstract: Explainable recommender systems can explain their recommendation decisions, enhancing user trust in the systems. Most explainable recommender systems either rely on human-annotated rationales to train models for explanation generation or leverage the attention mechanism to extract important text spans from reviews as explanations. The extracted rationales are often confined to an individual review…

    Submitted 5 January, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted by TKDE

  37. arXiv:2305.04825

    cs.IR cs.HC cs.LG

    NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking

    Authors: Wenjia Zhang, Lin Gui, Rob Procter, Yulan He

    Abstract: To enhance the ability to find credible evidence in news articles, we propose a novel task of expert recommendation, which aims to identify trustworthy experts on a specific news topic. To achieve the aim, we describe the construction of a novel NewsQuote dataset consisting of 24,031 quote-speaker pairs that appeared on a COVID-19 news corpus. We demonstrate an automatic pipeline for speaker and q…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures. 17th International AAAI Conference on Web and Social Media; Mediate 2023: News Media and Computational Journalism Workshop

    ACM Class: I.2.7; H.3.3

  38. arXiv:2305.04599

    cs.CL

    Cone: Unsupervised Contrastive Opinion Extraction

    Authors: Runcong Zhao, Lin Gui, Yulan He

    Abstract: Contrastive opinion extraction aims to extract a structured summary or key points organised as positive and negative viewpoints towards a common aspect or topic. Most recent work on unsupervised key point extraction is largely built on sentence clustering or opinion summarisation based on the popularity of opinions expressed in text. However, these methods tend to generate aspect clusters with i…

    Submitted 8 May, 2023; originally announced May 2023.

  39. arXiv:2305.04522

    cs.CL

    Event Knowledge Incorporation with Posterior Regularization for Event-Centric Question Answering

    Authors: Junru Lu, Gabriele Pergola, Lin Gui, Yulan He

    Abstract: We propose a simple yet effective strategy to incorporate event knowledge extracted from event trigger annotations via posterior regularization to improve the event reasoning capability of mainstream question-answering (QA) models for event-centric QA. In particular, we define event-related knowledge constraints based on the event trigger annotations in the QA datasets, and subsequently use them t…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Work in progress

  40. arXiv:2305.03724

    cs.CV cs.AI cs.RO

    DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception

    Authors: Yunze Man, Liang-Yan Gui, Yu-Xiong Wang

    Abstract: Closing the domain gap between training and deployment and incorporating multiple sensor modalities are two challenging yet critical topics for self-driving. Existing work focuses on only one of the above topics, overlooking the simultaneous domain and modality shift which pervasively exists in real-world scenarios. A model trained with multi-sensor data collected in Europe may need to run…

    Submitted 11 June, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: IROS 2023. Project website: https://yunzeman.github.io/DualCross

  41. arXiv:2305.03034

    cs.CV

    Contrastive Mean Teacher for Domain Adaptive Object Detectors

    Authors: Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang

    Abstract: Object detectors often suffer from the domain gap between training (source domain) and real-world applications (target domain). Mean-teacher self-training is a powerful paradigm in unsupervised domain adaptation for object detection, but it struggles with low-quality pseudo-labels. In this work, we identify the intriguing alignment and synergy between mean-teacher self-training and contrastive lea…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: CVPR 2023

  42. arXiv:2303.04147

    cs.AI math.OC

    Heuristics for Vehicle Routing Problem: A Survey and Recent Advances

    Authors: Fei Liu, Chengyu Lu, Lin Gui, Qingfu Zhang, Xialiang Tong, Mingxuan Yuan

    Abstract: Vehicle routing is a well-known optimization research topic with significant practical importance. Among different approaches to solving vehicle routing, heuristics can produce a satisfactory solution at a reasonable computational cost. Consequently, much effort has been made in the past decades to develop vehicle routing heuristics. In this article, we systematically survey the existing vehicle r…

    Submitted 1 March, 2023; originally announced March 2023.

  43. arXiv:2303.02944

    cs.CV

    CTG-Net: An Efficient Cascaded Framework Driven by Terminal Guidance Mechanism for Dilated Pancreatic Duct Segmentation

    Authors: Liwen Zou, Zhenghua Cai, Yudong Qiu, Luying Gui, Liang Mao, ** Yang

    Abstract: Pancreatic duct dilation indicates a high risk of various pancreatic diseases. Segmentation of dilated pancreatic ducts on computed tomography (CT) images shows potential to assist early diagnosis, surgical planning and prognosis. Because of the ducts' tiny sizes, slender tubular structures and the surrounding distractions, most current studies on pancreatic duct segmentation achieve lo…

    Submitted 6 March, 2023; originally announced March 2023.

  44. arXiv:2303.01241  [pdf, other]

    cs.CL cs.LG

    PANACEA: An Automated Misinformation Detection System on COVID-19

    Authors: Runcong Zhao, Miguel Arana-Catania, Lixing Zhu, Elena Kochkina, Lin Gui, Arkaitz Zubiaga, Rob Procter, Maria Liakata, Yulan He

    Abstract: In this demo, we introduce a web-based misinformation detection system PANACEA on COVID-19 related claims, which has two modules, fact-checking and rumour detection. Our fact-checking module, which is supported by novel natural language inference methods with a self-attention network, outperforms state-of-the-art approaches. It is also able to give automated veracity assessment and ranked supporti…

    Submitted 28 February, 2023; originally announced March 2023.

  45. arXiv:2302.06198  [pdf, other]

    cs.CL

    Distinguishability Calibration to In-Context Learning

    Authors: Hongjing Li, Hanqi Yan, Yanran Li, Li Qian, Yulan He, Lin Gui

    Abstract: Recent years have witnessed increasing interest in prompt-based learning in which models can be trained on only a few annotated instances, making them suitable for low-resource settings. When using prompt-based learning for text classification, the goal is to use a pre-trained language model (PLM) to predict a missing token in a pre-defined template given an input text, which can be mapped to a cl…

    Submitted 10 May, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted by EACL23-Findings

  46. Diverse Human Motion Prediction Guided by Multi-Level Spatial-Temporal Anchors

    Authors: Sirui Xu, Yu-Xiong Wang, Liang-Yan Gui

    Abstract: Predicting diverse human motions given a sequence of historical poses has received increasing attention. Despite rapid progress, existing work captures the multi-modal nature of human motions primarily through likelihood-based sampling, where mode collapse has been widely observed. In this paper, we propose a simple yet effective approach that disentangles randomly sampled codes with a determi…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: ECCV 2022 (Oral); Project Page: https://sirui-xu.github.io/STARS/

  47. arXiv:2302.03693  [pdf, other]

    cs.CL cs.LG stat.ML

    Concept Algebra for (Score-Based) Text-Controlled Generative Models

    Authors: Zihao Wang, Lin Gui, Jeffrey Negrea, Victor Veitch

    Abstract: This paper concerns the structure of learned representations in text-guided generative models, focusing on score-based models. A key property of such models is that they can compose disparate concepts in a 'disentangled' manner. This suggests these models have internal representations that encode concepts in a 'disentangled' manner. Here, we focus on the idea that concepts are encoded as subspaces…

    Submitted 7 February, 2024; v1 submitted 7 February, 2023; originally announced February 2023.

  48. arXiv:2301.07183  [pdf, other]

    cs.IR cs.LG

    Tracking Brand-Associated Polarity-Bearing Topics in User Reviews

    Authors: Runcong Zhao, Lin Gui, Hanqi Yan, Yulan He

    Abstract: Monitoring online customer reviews is important for business organisations to measure customer satisfaction and better manage their reputations. In this paper, we propose a novel dynamic Brand-Topic Model (dBTM) which is able to automatically detect and track brand-associated sentiment scores and polarity-bearing topics from product reviews organised in temporally-ordered time intervals. dBTM mode…

    Submitted 3 January, 2023; originally announced January 2023.

  49. arXiv:2212.04493  [pdf, other]

    cs.CV cs.LG

    SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation

    Authors: Yen-Chi Cheng, Hsin-Ying Lee, Sergey Tulyakov, Alexander Schwing, Liangyan Gui

    Abstract: In this work, we present a novel framework built to simplify 3D asset generation for amateur users. To enable interactive generation, our method supports a variety of input modalities that can be easily provided by a human, including images, text, partially observed shapes and combinations of these, while also allowing the strength of each input to be adjusted. At the core of our approach is an encoder-de…

    Submitted 21 March, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: In CVPR 2023. Project page and code is available at: https://yccyenchicheng.github.io/SDFusion/. Fix some typos

  50. arXiv:2210.12902  [pdf, other]

    cs.CL

    Event-Centric Question Answering via Contrastive Learning and Invertible Event Transformation

    Authors: Junru Lu, Xingwei Tan, Gabriele Pergola, Lin Gui, Yulan He

    Abstract: Human reading comprehension often requires reasoning about event semantic relations in narratives, represented by Event-centric Question-Answering (QA). To address event-centric QA, we propose a novel QA model with contrastive learning and invertible event transformation, called TranCLR. Our proposed model utilizes an invertible transformation matrix to project semantic vectors of events into a common…

    Submitted 13 December, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Findings of EMNLP 2022