Showing 1–44 of 44 results for author: Ouyang, S

  1. arXiv:2406.17517  [pdf, other]

    cs.LG cs.AI

    Preserving Node Distinctness in Graph Autoencoders via Similarity Distillation

    Authors: Ge Chen, Yulan Hu, Sheng Ouyang, Yong Liu, Cuicui Luo

    Abstract: Graph autoencoders (GAEs), as a kind of generative self-supervised learning approach, have shown great potential in recent years. GAEs typically rely on distance-based criteria, such as mean-square-error (MSE), to reconstruct the input graph. However, relying solely on a single reconstruction criterion may lead to a loss of distinctiveness in the reconstructed graph, causing nodes to collapse into…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.16486  [pdf, other]

    cs.AI

    Towards Comprehensive Preference Data Collection for Reward Modeling

    Authors: Yulan Hu, Qingyang Li, Sheng Ouyang, Ge Chen, Kaihui Chen, Lijun Mei, Xucheng Ye, Fuzheng Zhang, Yong Liu

    Abstract: Reinforcement Learning from Human Feedback (RLHF) facilitates the alignment of large language models (LLMs) with human preferences, thereby enhancing the quality of responses generated. A critical component of RLHF is the reward model, which is trained on preference data and outputs a scalar reward during the inference stage. However, the collection of preference data still lacks thorough investig…

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.01168  [pdf]

    econ.GN cs.AI cs.CY cs.ET cs.HC

    How Ethical Should AI Be? How AI Alignment Shapes the Risk Preferences of LLMs

    Authors: Shumiao Ouyang, Hayong Yun, Xingjian Zheng

    Abstract: This study explores the risk preferences of Large Language Models (LLMs) and how the process of aligning them with human ethical standards influences their economic decision-making. By analyzing 30 LLMs, we uncover a broad range of inherent risk profiles ranging from risk-averse to risk-seeking. We then explore how different types of AI alignment, a process that ensures models act according to hum…

    Submitted 3 June, 2024; originally announced June 2024.

  4. arXiv:2404.19316  [pdf, other]

    cs.CL

    QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

    Authors: Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao

    Abstract: Extractive Question Answering (EQA) in Machine Reading Comprehension (MRC) often faces the challenge of dealing with semantically identical but format-variant inputs. Our work introduces a novel approach, called the "Query Latent Semantic Calibrator (QLSC)", designed as an auxiliary module for existing MRC models. We propose a unique scaling strategy to capture latent semantic center features of…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted by the 2024 International Joint Conference on Neural Networks (IJCNN 2024)

  5. arXiv:2403.14340  [pdf, other]

    cs.LG cs.AI

    Exploring Task Unification in Graph Representation Learning via Generative Approach

    Authors: Yulan Hu, Sheng Ouyang, Zhirui Yang, Ge Chen, Junchen Wan, Xiao Wang, Yong Liu

    Abstract: Graphs are ubiquitous in real-world scenarios and encompass a diverse range of tasks, from node-, edge-, and graph-level tasks to transfer learning. However, designing specific tasks for each type of graph data is often costly and lacks generalizability. Recent endeavors under the "Pre-training + Fine-tuning" or "Pre-training + Prompt" paradigms aim to design a unified framework capable of general…

    Submitted 21 March, 2024; originally announced March 2024.

  6. arXiv:2402.16843  [pdf, other]

    cs.CV cs.AI cs.CL cs.GR cs.LG

    Multi-LoRA Composition for Image Generation

    Authors: Ming Zhong, Yelong Shen, Shuohang Wang, Yadong Lu, Yizhu Jiao, Siru Ouyang, Donghan Yu, Jiawei Han, Weizhu Chen

    Abstract: Low-Rank Adaptation (LoRA) is extensively utilized in text-to-image models for the accurate rendition of specific elements like distinct characters or unique styles in generated images. Nonetheless, existing methods face challenges in effectively composing multiple LoRAs, especially as the number of LoRAs to be integrated grows, thus hindering the creation of complex imagery. In this paper, we stu…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Project Website: https://maszhongming.github.io/Multi-LoRA-Composition/

  7. arXiv:2402.14315  [pdf, other]

    q-bio.BM cs.LG

    Structure-Based Drug Design via 3D Molecular Generative Pre-training and Sampling

    Authors: Yuwei Yang, Siqi Ouyang, Xueyu Hu, Mingyue Zheng, Hao Zhou, Lei Li

    Abstract: Structure-based drug design aims at generating high affinity ligands with prior knowledge of 3D target structures. Existing methods either use conditional generative model to learn the distribution of 3D ligands given target binding sites, or iteratively modify molecules to optimize a structure-based activity estimator. The former is highly constrained by data quantity and quality, which leaves op…

    Submitted 15 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  8. arXiv:2402.14293  [pdf, other]

    cs.CL

    Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education

    Authors: Rui Yang, Boming Yang, Sixun Ouyang, Tianwei She, Aosong Feng, Yuang Jiang, Freddy Lecue, Jinghui Lu, Irene Li

    Abstract: In the domain of Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated promise in text-generation tasks. However, their educational applications, particularly for domain-specific queries, remain underexplored. This study investigates LLMs' capabilities in educational scenarios, focusing on concept graph recovery and question-answering (QA). We assess LLMs' zero-shot per…

    Submitted 22 February, 2024; originally announced February 2024.

  9. arXiv:2402.05035   

    cs.CV

    A Survey on Domain Generalization for Medical Image Analysis

    Authors: Ziwei Niu, Shuyi Ouyang, Shiao Xie, Yen-wei Chen, Lanfen Lin

    Abstract: Medical Image Analysis (MedIA) has emerged as a crucial tool in computer-aided diagnosis systems, particularly with the advancement of deep learning (DL) in recent years. However, well-trained deep models often experience significant performance degradation when deployed in different medical sites, modalities, and sequences, known as a domain shift issue. In light of this, Domain Generalization (D…

    Submitted 13 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: This is a withdrawn submission and will be considered invalid. Due to some errors and overlap with published papers, we have chosen to withdraw it

  10. arXiv:2401.10608  [pdf, other]

    cs.CV cs.MM

    M2ORT: Many-To-One Regression Transformer for Spatial Transcriptomics Prediction from Histopathology Images

    Authors: Hongyi Wang, Xiuju Du, Jing Liu, Shuyi Ouyang, Yen-Wei Chen, Lanfen Lin

    Abstract: The advancement of Spatial Transcriptomics (ST) has facilitated the spatially-aware profiling of gene expressions based on histopathology images. Although ST data offers valuable insights into the micro-environment of tumors, its acquisition cost remains expensive. Therefore, directly predicting the ST expressions from digital pathology images is desired. Current methods usually adopt existing reg…

    Submitted 24 January, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  11. arXiv:2401.06059  [pdf, other]

    cs.CL cs.AI cs.LG

    Investigating Data Contamination for Pre-training Language Models

    Authors: Minhao Jiang, Ken Ziyu Liu, Ming Zhong, Rylan Schaeffer, Siru Ouyang, Jiawei Han, Sanmi Koyejo

    Abstract: Language models pre-trained on web-scale corpora demonstrate impressive capabilities on diverse downstream tasks. However, there is increasing concern whether such capabilities might arise from evaluation datasets being included in the pre-training corpus -- a phenomenon known as data contamination -- in a manner that artificially increases performance. There has been little understanding…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

  12. arXiv:2312.04515  [pdf, other]

    cs.CL

    Efficient Monotonic Multihead Attention

    Authors: Xutai Ma, Anna Sun, Siqi Ouyang, Hirofumi Inaguma, Paden Tomasello

    Abstract: We introduce the Efficient Monotonic Multihead Attention (EMMA), a state-of-the-art simultaneous translation model with numerically-stable and unbiased monotonic alignment estimation. In addition, we present improved training and inference strategies, including simultaneous fine-tuning from an offline translation model and reduction of monotonic alignment variance. The experimental results demonst…

    Submitted 7 December, 2023; originally announced December 2023.

  13. arXiv:2311.09656  [pdf, other]

    cs.CL cs.AI

    Structured Chemistry Reasoning with Large Language Models

    Authors: Siru Ouyang, Zhuosheng Zhang, Bing Yan, Xuan Liu, Yejin Choi, Jiawei Han, Lianhui Qin

    Abstract: Large Language Models (LLMs) excel in diverse areas, yet struggle with complex scientific reasoning, especially in the field of chemistry. Different from the simple chemistry tasks (e.g., molecule classification) addressed in previous studies, complex chemistry problems require not only vast knowledge and precise calculation, but also compositional reasoning about rich dynamic interactions of diff…

    Submitted 9 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Work in progress

  14. arXiv:2311.01191  [pdf, other]

    cs.LG cs.AI

    VIGraph: Generative Self-supervised Learning for Class-Imbalanced Node Classification

    Authors: Yulan Hu, Sheng Ouyang, Zhirui Yang, Yong Liu

    Abstract: Class imbalance in graph data presents significant challenges for node classification. While existing methods, such as SMOTE-based approaches, partially mitigate this issue, they still exhibit limitations in constructing imbalanced graphs. Generative self-supervised learning (SSL) methods, exemplified by graph autoencoders (GAEs), offer a promising solution by directly generating minority nodes fr…

    Submitted 27 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  15. arXiv:2310.16040  [pdf, other]

    cs.CL cs.AI

    Instruct and Extract: Instruction Tuning for On-Demand Information Extraction

    Authors: Yizhu Jiao, Ming Zhong, Sha Li, Ruining Zhao, Siru Ouyang, Heng Ji, Jiawei Han

    Abstract: Large language models with instruction-following capabilities open the door to a wider group of users. However, when it comes to information extraction - a classic task in natural language processing - most task-specific systems cannot align well with long-tail ad hoc extraction use cases for non-expert users. To address this, we propose a novel paradigm, termed On-Demand Information Extraction, t…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  16. arXiv:2310.14525  [pdf, other]

    cs.LG cs.AI

    Graph Ranking Contrastive Learning: A Extremely Simple yet Efficient Method

    Authors: Yulan Hu, Sheng Ouyang, Jingyu Liu, Ge Chen, Zhirui Yang, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Yong Liu

    Abstract: Graph contrastive learning (GCL) has emerged as a representative graph self-supervised method, achieving significant success. The currently prevalent optimization objective for GCL is InfoNCE. Typically, it employs augmentation techniques to obtain two views, where a node in one view acts as the anchor, the corresponding node in the other view serves as the positive sample, and all other nodes are…

    Submitted 21 March, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

  17. arXiv:2310.12418  [pdf, other]

    cs.CL

    The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions

    Authors: Siru Ouyang, Shuohang Wang, Yang Liu, Ming Zhong, Yizhu Jiao, Dan Iter, Reid Pryzant, Chenguang Zhu, Heng Ji, Jiawei Han

    Abstract: Recent progress in Large Language Models (LLMs) has produced models that exhibit remarkable performance across a variety of NLP tasks. However, it remains unclear whether the existing focus of NLP research accurately captures the genuine requirements of human users. This paper provides a comprehensive analysis of the divergence between current NLP research and the needs of real-world NLP applicati…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  18. arXiv:2310.11102  [pdf, other]

    cs.LG cs.AI

    Refining Latent Representations: A Generative SSL Approach for Heterogeneous Graph Learning

    Authors: Yulan Hu, Zhirui Yang, Sheng Ouyang, Yong Liu

    Abstract: Self-Supervised Learning (SSL) has shown significant potential and has garnered increasing interest in graph learning. However, particularly for generative SSL methods, its potential in Heterogeneous Graph Learning (HGL) remains relatively underexplored. Generative SSL utilizes an encoder to map the input graph into a latent representation and a decoder to recover the input graph from the latent r…

    Submitted 20 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  19. arXiv:2310.07795  [pdf, other]

    cs.CL

    Ontology Enrichment for Effective Fine-grained Entity Typing

    Authors: Siru Ouyang, Jiaxin Huang, Pranav Pillai, Yunyi Zhang, Yu Zhang, Jiawei Han

    Abstract: Fine-grained entity typing (FET) is the task of identifying specific entity types at a fine-grained level for entity mentions based on their contextual information. Conventional methods for FET require extensive human annotation, which is time-consuming and costly. Recent studies have been developing weakly supervised or zero-shot approaches. We study the setting of zero-shot FET where only an ont…

    Submitted 11 October, 2023; originally announced October 2023.

  20. arXiv:2308.03423  [pdf, other]

    cs.CL cs.AI

    Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism

    Authors: Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao

    Abstract: Chinese Automatic Speech Recognition (ASR) error correction presents significant challenges due to the Chinese language's unique features, including a large character set and borderless, morpheme-based structure. Current mainstream models often struggle with effectively utilizing word-level features and phonetic information. This paper introduces a novel approach that incorporates a dynamic error…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted by 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)

  21. arXiv:2308.02828  [pdf, other]

    cs.SE

    LLM is Like a Box of Chocolates: the Non-determinism of ChatGPT in Code Generation

    Authors: Shuyin Ouyang, Jie M. Zhang, Mark Harman, Meng Wang

    Abstract: There has been a recent explosion of research on Large Language Models (LLMs) for software engineering tasks, in particular code generation. However, results from LLMs can be highly unstable; nondeterministically returning very different codes for the same prompt. Non-determinism is a potential menace to scientific conclusion validity. When non-determinism is high, scientific conclusions simply ca…

    Submitted 5 August, 2023; originally announced August 2023.

  22. arXiv:2307.01448  [pdf, other]

    cs.CL

    ReactIE: Enhancing Chemical Reaction Extraction with Weak Supervision

    Authors: Ming Zhong, Siru Ouyang, Minhao Jiang, Vivian Hu, Yizhu Jiao, Xuan Wang, Jiawei Han

    Abstract: Structured chemical reaction information plays a vital role for chemists engaged in laboratory work and advanced endeavors such as computer-aided drug design. Despite the importance of extracting structured reactions from scientific literature, data annotation for this purpose is cost-prohibitive due to the significant labor required from domain experts. Consequently, the scarcity of sufficient tr…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: Findings of ACL 2023, Short Paper

  23. arXiv:2305.15064  [pdf, other]

    cs.CL

    AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models

    Authors: Siqi Ouyang, Lei Li

    Abstract: Recent large language models (LLMs) are promising for making decisions in grounded environments. However, LLMs frequently fail in complex decision-making tasks due to the misalignment between the pre-trained knowledge in LLMs and the actual rules in the environment. Existing methods require either costly gradient computation or lengthy in-context demonstrations. In this paper, we propose AutoPlan,…

    Submitted 26 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Findings

  24. arXiv:2212.09359  [pdf, other]

    cs.CL cs.SD eess.AS

    WACO: Word-Aligned Contrastive Learning for Speech Translation

    Authors: Siqi Ouyang, Rong Ye, Lei Li

    Abstract: End-to-end Speech Translation (E2E ST) aims to directly translate source speech into target text. Existing ST methods perform poorly when only extremely small speech-text data are available for training. We observe that an ST model's performance closely correlates with its embedding similarity between speech and source transcript. In this paper, we propose Word-Aligned COntrastive learning (WACO),…

    Submitted 7 July, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: ACL 2023 Poster

  25. arXiv:2212.06950  [pdf, other]

    cs.CL

    Pre-trained Language Models Can be Fully Zero-Shot Learners

    Authors: Xuandong Zhao, Siqi Ouyang, Zhiguo Yu, Ming Wu, Lei Li

    Abstract: How can we extend a pre-trained model to many language understanding tasks, without labeled or additional unlabeled data? Pre-trained language models (PLMs) have been effective for a wide range of NLP tasks. However, existing approaches either require fine-tuning on downstream labeled datasets or manually constructing proper prompts. In this paper, we propose nonparametric prompting PLM (NPPrompt)…

    Submitted 26 May, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  26. arXiv:2210.07113  [pdf, other]

    cs.CL cs.AI cs.HC cs.IR cs.LG

    Towards End-to-End Open Conversational Machine Reading

    Authors: Sizhe Zhou, Siru Ouyang, Zhuosheng Zhang, Hai Zhao

    Abstract: In open-retrieval conversational machine reading (OR-CMR) task, machines are required to do multi-turn question answering given dialogue history and a textual knowledge base. Existing works generally utilize two independent modules to approach this problem's two successive sub-tasks: first with a hard-label decision making and second with a question generation aided by various entailment reasoning…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 10 pages, 2 figures, 10 tables

  27. arXiv:2209.08284  [pdf, ps, other]

    cs.CL

    Structured Knowledge Grounding for Question Answering

    Authors: Yujie Lu, Siqi Ouyang, Kairui Zhou

    Abstract: Can language models (LM) ground question-answering (QA) tasks in the knowledge base via inherent relational reasoning ability? While previous models that use only LMs have seen some success on many QA tasks, more recent methods include knowledge graphs (KG) to complement LMs with their more logic-driven implicit knowledge. However, effectively extracting information from structured data, like KGs,…

    Submitted 5 June, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

  28. On the Impact of Noises in Crowd-Sourced Data for Speech Translation

    Authors: Siqi Ouyang, Rong Ye, Lei Li

    Abstract: Training speech translation (ST) models requires large and high-quality datasets. MuST-C is one of the most widely used ST benchmark datasets. It contains around 400 hours of speech-transcript-translation data for each of the eight translation directions. This dataset passes several quality-control filters during creation. However, we find that MuST-C still suffers from three major quality issues:…

    Submitted 30 June, 2022; v1 submitted 28 June, 2022; originally announced June 2022.

    Comments: Accepted to IWSLT 2022 as a scientific paper

  29. arXiv:2205.00281  [pdf, other]

    cs.LG stat.ML

    Understanding the Generalization Performance of Spectral Clustering Algorithms

    Authors: Shaojie Li, Sheng Ouyang, Yong Liu

    Abstract: The theoretical analysis of spectral clustering mainly focuses on consistency, while there is relatively little research on its generalization performance. In this paper, we study the excess risk bounds of the popular spectral clustering algorithms: relaxed RatioCut and relaxed NCut. Firstly, we show that their excess risk bounds between the empirical continuous optimal solution and…

    Submitted 17 July, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

  30. arXiv:2204.06142  [pdf]

    cs.IR cs.AI

    Retrieval of Scientific and Technological Resources for Experts and Scholars

    Authors: Suyu Ouyang, Yingxia Shao, Ang Li

    Abstract: Institutions of higher learning, research institutes and other scientific research units have abundant scientific and technological resources of experts and scholars, and these talents with great scientific and technological innovation ability are an important force to promote industrial upgrading. The scientific and technological resources of experts and scholars are mainly composed of basic attr…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: 9 pages

  31. arXiv:2203.17079  [pdf]

    cs.CL cs.AI

    Scientific and Technological Text Knowledge Extraction Method of based on Word Mixing and GRU

    Authors: Suyu Ouyang, Yingxia Shao, Junping Du, Ang Li

    Abstract: The knowledge extraction task is to extract triple relations (head entity-relation-tail entity) from unstructured text data. The existing knowledge extraction methods are divided into "pipeline" method and joint extraction method. The "pipeline" method is to separate named entity recognition and entity relationship extraction and use their own modules to extract them. Although this method has bett…

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: 8 pages,2 figures

  32. arXiv:2108.12599  [pdf, other]

    cs.CL cs.AI cs.HC cs.IR

    Smoothing Dialogue States for Open Conversational Machine Reading

    Authors: Zhuosheng Zhang, Siru Ouyang, Hai Zhao, Masao Utiyama, Eiichiro Sumita

    Abstract: Conversational machine reading (CMR) requires machines to communicate with humans through multi-turn interactions between two salient dialogue states of decision making and question generation processes. In open CMR settings, as the more realistic scenario, the retrieved background knowledge would be noisy, which results in severe challenges in the information transmission. Existing studies common…

    Submitted 2 September, 2021; v1 submitted 28 August, 2021; originally announced August 2021.

    Comments: Accepted by EMNLP 2021 Main Conference

  33. arXiv:2106.10796  [pdf, other]

    cs.LG cs.DC

    CD-SGD: Distributed Stochastic Gradient Descent with Compression and Delay Compensation

    Authors: Enda Yu, Dezun Dong, Yemao Xu, Shuo Ouyang, Xiangke Liao

    Abstract: Communication overhead is the key challenge for distributed training. Gradient compression is a widely used approach to reduce communication traffic. When combining with parallel communication mechanism method like pipeline, gradient compression technique can greatly alleviate the impact of communication overhead. However, there exists two problems of gradient compression technique to be solved. F…

    Submitted 6 September, 2021; v1 submitted 20 June, 2021; originally announced June 2021.

    Comments: 12 pages

  34. arXiv:2105.10334  [pdf, other]

    cs.CL cs.AI

    Fact-driven Logical Reasoning for Machine Reading Comprehension

    Authors: Siru Ouyang, Zhuosheng Zhang, Hai Zhao

    Abstract: Recent years have witnessed an increasing interest in training machines with reasoning ability, which deeply relies on accurately and clearly presented clue forms. The clues are usually modeled as entity-aware knowledge in existing studies. However, those entity-aware clues are primarily focused on commonsense, making them insufficient for tasks that require knowledge of temporary facts or events,…

    Submitted 26 May, 2023; v1 submitted 21 May, 2021; originally announced May 2021.

  35. arXiv:2105.06635  [pdf, other]

    cs.AI cs.RO

    An Extension of BIM Using AI: a Multi Working-Machines Pathfinding Solution

    Authors: Yusheng Xiang, Kailun Liu, Tianqing Su, Jun Li, Shirui Ouyang, Samuel S. Mao, Marcus Geimer

    Abstract: Multi working-machines pathfinding solution enables more mobile machines simultaneously to work inside of a working site so that the productivity can be expected to increase evolutionary. To date, the potential cooperation conflicts among construction machinery limit the amount of construction machinery investment in a concrete working site. To solve the cooperation problem, civil engineers optimi…

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: 17 pages, 12 figures

  36. arXiv:2012.14827  [pdf, other]

    cs.CL cs.AI

    Dialogue Graph Modeling for Conversational Machine Reading

    Authors: Siru Ouyang, Zhuosheng Zhang, Hai Zhao

    Abstract: Conversational Machine Reading (CMR) aims at answering questions in a complicated manner. Machine needs to answer questions through interactions with users based on given rule document, user scenario and dialogue history, and ask questions to clarify if necessary. In this paper, we propose a dialogue graph modeling framework to improve the understanding and reasoning ability of machine on CMR task…

    Submitted 29 May, 2021; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: Findings of ACL: ACL 2021

  37. arXiv:2010.02451  [pdf]

    cs.CV cs.AI

    Collaboratively boosting data-driven deep learning and knowledge-guided ontological reasoning for semantic segmentation of remote sensing imagery

    Authors: Yansheng Li, Song Ouyang, Yongjun Zhang

    Abstract: As one kind of architecture from the deep learning family, deep semantic segmentation network (DSSN) achieves a certain degree of success on the semantic segmentation task and obviously outperforms the traditional methods based on hand-crafted features. As a classic data-driven technique, DSSN can be trained by an end-to-end mechanism and competent for employing the low-level and mid-level cues (i…

    Submitted 5 October, 2020; originally announced October 2020.

  38. Communication optimization strategies for distributed deep neural network training: A survey

    Authors: Shuo Ouyang, Dezun Dong, Yemao Xu, Liquan Xiao

    Abstract: Recent trends in high-performance computing and deep learning have led to the proliferation of studies on large-scale deep neural network training. However, the frequent communication requirements among computation nodes drastically slows the overall training speeds, which causes bottlenecks in distributed training, particularly in clusters with limited network bandwidths. To mitigate the drawback…

    Submitted 22 November, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Journal ref: Journal of Parallel and Distributed Computing 149 (2021) pp. 52-65

  39. arXiv:1912.01795  [pdf, other]

    cs.CL cs.AI

    Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets

    Authors: Fanchao Qi, Liang Chang, Maosong Sun, Sicong Ouyang, Zhiyuan Liu

    Abstract: A sememe is defined as the minimum semantic unit of human languages. Sememe knowledge bases (KBs), which contain words annotated with sememes, have been successfully applied to many NLP tasks. However, existing sememe KBs are built on only a few languages, which hinders their widespread utilization. To address the issue, we propose to build a unified sememe KB for multiple languages based on Babel…

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted by AAAI Conference on Artificial Intelligence 2020 for oral presentation

  40. arXiv:1910.08910  [pdf, other]

    cs.CL cs.LG eess.AS

    Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes

    Authors: Yujia Qin, Fanchao Qi, Sicong Ouyang, Zhiyuan Liu, Cheng Yang, Yasheng Wang, Qun Liu, Maosong Sun

    Abstract: Sememes, the minimum semantic units of human languages, have been successfully utilized in various natural language processing applications. However, most existing studies exploit sememes in specific tasks and few efforts are made to utilize sememes more fundamentally. In this paper, we propose to incorporate sememes into recurrent neural networks (RNNs) to improve their sequence modeling ability,…

    Submitted 19 August, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP). 10 pages, 2 figures

  41. arXiv:1807.11141  [pdf, other]

    cs.IR

    KB4Rec: A Dataset for Linking Knowledge Bases with Recommender Systems

    Authors: Wayne Xin Zhao, Gaole He, Hongjian Dou, Jin Huang, Siqi Ouyang, Ji-Rong Wen

    Abstract: To develop a knowledge-aware recommender system, a key data problem is how we can obtain rich and structured knowledge information for recommender system (RS) items. Existing datasets or methods either use side information from original recommender systems (containing very few kinds of useful information) or utilize private knowledge base (KB). In this paper, we present the first public linked KB…

    Submitted 27 December, 2020; v1 submitted 29 July, 2018; originally announced July 2018.

  42. arXiv:1807.06978  [pdf, other]

    cs.IR cs.AI cs.CL

    Improving Explainable Recommendations with Synthetic Reviews

    Authors: Sixun Ouyang, Aonghus Lawlor, Felipe Costa, Peter Dolog

    Abstract: An important task for a recommender system is to provide interpretable explanations for the user. This is important for the credibility of the system. Current interpretable recommender systems tend to focus on certain features known to be important to the user and offer their explanations in a structured form. It is well known that user generated reviews and feedback from reviewers have strong levera…

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: Recsys DLRS 2018

  43. arXiv:1707.01561  [pdf, other]

    cs.CL cs.LG

    Automatic Generation of Natural Language Explanations

    Authors: Felipe Costa, Sixun Ouyang, Peter Dolog, Aonghus Lawlor

    Abstract: An important task for recommender system is to generate explanations according to a user's preferences. Most of the current methods for explainable recommendations use structured sentences to provide descriptions along with the recommendations they produce. However, those methods have neglected the review-oriented way of writing a text, even though it is known that these reviews have a strong infl…

    Submitted 4 July, 2017; originally announced July 2017.

    Comments: 7 pages, 5 figures, 2nd workshop on Deep Learning for Recommender Systems

  44. arXiv:1409.5114  [pdf, other]

    cs.CV

    A Survey on Heterogeneous Face Recognition: Sketch, Infra-red, 3D and Low-resolution

    Authors: Shuxin Ouyang, Timothy Hospedales, Yi-Zhe Song, Xueming Li

    Abstract: Heterogeneous face recognition (HFR) refers to matching face imagery across different domains. It has received much interest from the research community as a result of its profound implications in law enforcement. A wide variety of new invariant features, cross-modality matching models and heterogeneous datasets have been established in recent years. This survey provides a comprehensive review of esta…

    Submitted 10 October, 2014; v1 submitted 17 September, 2014; originally announced September 2014.

    Comments: survey paper(35 pages)

    ACM Class: A.1; I.4.9; I.5.4