Showing 1–7 of 7 results for author: Ru, D

Searching in archive cs.
  1. arXiv:2405.14486

    cs.CL

    RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models

    Authors: Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang

    Abstract: Large Language Models (LLMs) have shown impressive capabilities but also a concerning tendency to hallucinate. This paper presents RefChecker, a framework that introduces claim-triplets to represent claims in LLM responses, aiming to detect fine-grained hallucinations. In RefChecker, an extractor generates claim-triplets from a response, which are then evaluated by a checker against a reference. W…

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2310.12874

    cs.CL

    StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding

    Authors: Cheng Jiayang, Lin Qiu, Tsz Ho Chan, Tianqing Fang, Weiqi Wang, Chunkit Chan, Dongyu Ru, Qipeng Guo, Hongming Zhang, Yangqiu Song, Yue Zhang, Zheng Zhang

    Abstract: Analogy-making between narratives is crucial for human reasoning. In this paper, we evaluate the ability to identify and generate analogies by constructing a first-of-its-kind large-scale story-level analogy corpus, StoryAnalogy, which contains 24K story pairs from diverse domains with human annotations on two similarities from the extended Structure-Mapping Theory. We design a set of tes…

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 main conference

  3. arXiv:2306.10658

    cs.CL

    Distributed Marker Representation for Ambiguous Discourse Markers and Entangled Relations

    Authors: Dongyu Ru, Lin Qiu, Xipeng Qiu, Yue Zhang, Zheng Zhang

    Abstract: Discourse analysis is an important task because it models intrinsic semantic structures between sentences in a document. Discourse markers are natural representations of discourse in our daily language. One challenge is that the markers as well as pre-defined and human-labeled discourse relations can be ambiguous when describing the semantics between sentences. We believe that a better approach is…

    Submitted 18 June, 2023; originally announced June 2023.

  4. arXiv:2111.05407

    cs.CL

    Learning Logic Rules for Document-level Relation Extraction

    Authors: Dongyu Ru, Changzhi Sun, Jiangtao Feng, Lin Qiu, Hao Zhou, Weinan Zhang, Yong Yu, Lei Li

    Abstract: Document-level relation extraction aims to identify relations between entities in a whole document. Prior efforts to capture long-range dependencies have relied heavily on implicitly powerful representations learned through (graph) neural networks, which makes the model less transparent. To tackle this challenge, in this paper, we propose LogiRE, a novel probabilistic model for document-level rela…

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Appears at EMNLP 2021 main conference

  5. arXiv:2004.08046

    cs.CL cs.LG

    Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space

    Authors: Dongyu Ru, Jiangtao Feng, Lin Qiu, Hao Zhou, Mingxuan Wang, Weinan Zhang, Yong Yu, Lei Li

    Abstract: Active learning for sentence understanding aims at discovering informative unlabeled data for annotation and therefore reducing the demand for labeled data. We argue that the typical uncertainty sampling method for active learning is time-consuming and can hardly work in real-time, which may lead to ineffective sample selection. We propose adversarial uncertainty sampling in discrete space (AUSDS)…

    Submitted 28 October, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted to EMNLP 2020 Findings

  6. arXiv:1805.08939

    cs.LG stat.ML

    Approximate Random Dropout

    Authors: Zhuoran Song, Ru Wang, Dongyu Ru, Hongru Huang, Zhenghao Peng, Jing Ke, Xiaoyao Liang, Li Jiang

    Abstract: The training phase of deep neural networks (DNNs) consumes enormous processing time and energy. Compression techniques utilizing the sparsity of DNNs can effectively accelerate the inference phase of DNNs. However, they can hardly be used in the training phase because the training phase involves dense matrix multiplication using General Purpose Computation on Graphics Processors (GPGPU), which endors…

    Submitted 14 December, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: 7 pages, 6 figures, conference

  7. QA4IE: A Question Answering based Framework for Information Extraction

    Authors: Lin Qiu, Hao Zhou, Yanru Qu, Weinan Zhang, Suoheng Li, Shu Rong, Dongyu Ru, Lihua Qian, Kewei Tu, Yong Yu

    Abstract: Information Extraction (IE) refers to automatically extracting structured relation tuples from unstructured texts. Common IE solutions, including Relation Extraction (RE) and open IE systems, can hardly handle cross-sentence tuples, and are severely restricted by limited relation types as well as informal relation specifications (e.g., free-text based relation tuples). In order to overcome these w…

    Submitted 28 January, 2019; v1 submitted 10 April, 2018; originally announced April 2018.

    Comments: Accepted by 17th International Semantic Web Conference (ISWC'18)