Showing 1–33 of 33 results for author: Ou, J

Searching in archive cs.
  1. arXiv:2406.16299  [pdf, other]

    cs.CL cs.AI

    Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other

    Authors: Yifei Gao, Jie Ou, Lei Wang, Yuting Xiao, Zhiyuan Xiang, Ruiting Dai, Jun Cheng

    Abstract: Emergent Large Language Models (LLMs) use their extraordinary performance and powerful deduction capacity to discern from traditional language models. However, the expenses of computational resources and storage for these LLMs are stunning, quantization then arises as a trending conversation. To address accuracy decay caused by quantization, two streams of works in post-training quantization metho…

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Efficient quantization method

    MSC Class: F.2.3

  2. arXiv:2406.03736  [pdf, other]

    cs.LG cs.CL

    Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

    Authors: Jingyang Ou, Shen Nie, Kaiwen Xue, Fengqi Zhu, Jiacheng Sun, Zhenguo Li, Chongxuan Li

    Abstract: Discrete diffusion models with absorbing processes have shown promise in language modeling. The key quantities to be estimated are the ratios between the marginal probabilities of two transitive states at all timesteps, called the concrete score. In this paper, we reveal that the concrete score in absorbing diffusion can be expressed as conditional probabilities of clean data, multiplied by a time…

    Submitted 6 June, 2024; originally announced June 2024.

  3. arXiv:2405.03952  [pdf, other]

    cs.SD cs.CL eess.AS

    HAFFormer: A Hierarchical Attention-Free Framework for Alzheimer's Disease Detection From Spontaneous Speech

    Authors: Zhongren Dong, Zixing Zhang, Weixiang Xu, Jing Han, Jianjun Ou, Björn W. Schuller

    Abstract: Automatically detecting Alzheimer's Disease (AD) from spontaneous speech plays an important role in its early diagnosis. Recent approaches highly rely on the Transformer architectures due to its efficiency in modelling long-range context dependencies. However, the quadratic increase in computational complexity associated with self-attention and the length of audio poses a challenge when deploying…

    Submitted 6 May, 2024; originally announced May 2024.

    Journal ref: published at ICASSP 2024

  4. arXiv:2404.18669  [pdf, other]

    cs.GR cs.AI cs.CV

    Bootstrap 3D Reconstructed Scenes from 3D Gaussian Splatting

    Authors: Yifei Gao, Jie Ou, Lei Wang, Jun Cheng

    Abstract: Recent developments in neural rendering techniques have greatly enhanced the rendering of photo-realistic 3D scenes across both academic and commercial fields. The latest method, known as 3D Gaussian Splatting (3D-GS), has set new benchmarks for rendering quality and speed. Nevertheless, the limitations of 3D-GS become pronounced in synthesizing new viewpoints, especially for views that greatly de…

    Submitted 12 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    MSC Class: I.4.8

  5. arXiv:2404.11095  [pdf, other]

    cs.CL cs.AI

    Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues

    Authors: Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai

    Abstract: Aligning large language models (LLMs) with human expectations requires high-quality instructional dialogues, which can be achieved by raising diverse, in-depth, and insightful instructions that deepen interactions. Existing methods target instructions from real instruction dialogues as a learning goal and fine-tune a user simulator for posing instructions. However, the user simulator struggles to…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 27 pages, 3 figures, 12 tables

  6. arXiv:2404.08698  [pdf, other]

    cs.CL cs.LG

    Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

    Authors: Jie Ou, Yueming Chen, Wenhong Tian

    Abstract: While Large Language Models (LLMs) have shown remarkable abilities, they are hindered by significant resource consumption and considerable latency due to autoregressive processing. In this study, we introduce Adaptive N-gram Parallel Decoding (ANPD), an innovative and lossless approach that accelerates inference by allowing the simultaneous generation of multiple tokens. ANPD incorporates a two-st…

    Submitted 10 April, 2024; originally announced April 2024.

  7. arXiv:2404.06365  [pdf, other]

    cs.CV cs.MM

    Dynamic Resolution Guidance for Facial Expression Recognition

    Authors: Jie Ou, Xu Li, Tianxiang Jiang, Yuanlun Xie

    Abstract: Facial expression recognition (FER) is vital for human-computer interaction and emotion analysis, yet recognizing expressions in low-resolution images remains challenging. This paper introduces a practical method called Dynamic Resolution Guidance for Facial Expression Recognition (DRGFER) to effectively recognize facial expressions in images with varying resolutions without compromising FER model…

    Submitted 9 April, 2024; originally announced April 2024.

  8. arXiv:2402.10618   

    cs.CL

    Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement

    Authors: Yihong Tang, Jiao Ou, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai

    Abstract: The advent of Large Language Models (LLMs) has propelled dialogue generation into new realms, particularly in the field of role-playing systems (RPSs). While enhanced with ordinary role-relevant training dialogues, existing LLM-based RPSs still struggle to align with roles when handling intricate and trapped queries in boundary scenarios. In this paper, we design the Modular ORchestrated Trap-sett…

    Submitted 15 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: After our paper was submitted to the conference for review, it was found that there were major problems, so it was revised by more than 80%, which can basically be regarded as new work

  9. arXiv:2401.07579  [pdf, other]

    cs.CV

    PMFSNet: Polarized Multi-scale Feature Self-attention Network For Lightweight Medical Image Segmentation

    Authors: Jiahui Zhong, Wenhong Tian, Yuanlun Xie, Zhijia Liu, Jie Ou, Taoran Tian, Lei Zhang

    Abstract: Current state-of-the-art medical image segmentation methods prioritize accuracy but often at the expense of increased computational demands and larger model sizes. Applying these large-scale models to the relatively limited scale of medical image datasets tends to induce redundant computation, complicating the process without the necessary benefits. This approach not only adds complexity but also…

    Submitted 15 January, 2024; originally announced January 2024.

  10. arXiv:2311.01677  [pdf, other]

    cs.CL cs.AI

    DialogBench: Evaluating LLMs as Human-like Dialogue Systems

    Authors: Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Kun Gai

    Abstract: Large language models (LLMs) have achieved remarkable breakthroughs in new dialogue capabilities by leveraging instruction tuning, which refreshes human impressions of dialogue systems. The long-standing goal of dialogue systems is to be human-like enough to establish long-term connections with users. Therefore, there has been an urgent need to evaluate LLMs as human-like dialogue systems. In this…

    Submitted 29 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted at NAACL 2024 (main conference)

  11. arXiv:2308.15002  [pdf, other]

    cs.AI

    Exploring the Limits of Historical Information for Temporal Knowledge Graph Extrapolation

    Authors: Yi Xu, Junjie Ou, Hui Xu, Luoyi Fu, Lei Zhou, Xinbing Wang, Chenghu Zhou

    Abstract: Temporal knowledge graphs, representing the dynamic relationships and interactions between entities over time, have been identified as a promising approach for event forecasting. However, a limitation of most temporal knowledge graph reasoning methods is their heavy reliance on the recurrence or periodicity of events, which brings challenges to inferring future events related to entities that lack…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: Extended version of AAAI paper arXiv:2211.10904

  12. arXiv:2306.08818  [pdf, other]

    cs.CL

    Pragmatic Inference with a CLIP Listener for Contrastive Captioning

    Authors: Jiefu Ou, Benno Krojer, Daniel Fried

    Abstract: We propose a simple yet effective and robust method for contrastive captioning: generating discriminative captions that distinguish target images from very similar alternative distractor images. Our approach is built on a pragmatic inference procedure that formulates captioning as a reference game between a speaker, which produces possible captions describing the target, and a listener, which sele…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Findings of ACL 2023, fixed some references

  13. arXiv:2303.02618  [pdf, other]

    cs.LG cs.AI cs.NE

    Ensemble Reinforcement Learning: A Survey

    Authors: Yanjie Song, P. N. Suganthan, Witold Pedrycz, Junwei Ou, Yongming He, Yingwu Chen, Yutong Wu

    Abstract: Reinforcement Learning (RL) has emerged as a highly effective technique for addressing various scientific and applied problems. Despite its success, certain complex tasks remain challenging to be addressed solely with a single model and algorithm. In response, ensemble reinforcement learning (ERL), a promising approach that combines the benefits of both RL and ensemble learning (EL), has gained wi…

    Submitted 13 December, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: 34 pages

  14. arXiv:2302.04197  [pdf, ps, other]

    cs.CL

    Hierarchical Event Grounding

    Authors: Jiefu Ou, Adithya Pratapa, Rishubh Gupta, Teruko Mitamura

    Abstract: Event grounding aims at linking mention references in text corpora to events from a knowledge base (KB). Previous work on this task focused primarily on linking to a single KB event, thereby overlooking the hierarchical aspects of events. Events in documents are typically described at various levels of spatio-temporal granularity (Glavas et al. 2014). These hierarchical relations are utilized in d…

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: Accepted to AAAI 2023

  15. arXiv:2211.10904  [pdf, other]

    cs.AI cs.LG

    Temporal Knowledge Graph Reasoning with Historical Contrastive Learning

    Authors: Yi Xu, Junjie Ou, Hui Xu, Luoyi Fu

    Abstract: Temporal knowledge graph, serving as an effective way to store and model dynamic relations, shows promising prospects in event forecasting. However, most temporal knowledge graph reasoning methods are highly dependent on the recurrence or periodicity of events, which brings challenges to inferring future events related to entities that lack historical interaction. In fact, the current moment is of…

    Submitted 2 December, 2022; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023

  16. arXiv:2210.16838  [pdf, other]

    cs.CL cs.AI

    Counterfactual Data Augmentation via Perspective Transition for Open-Domain Dialogues

    Authors: Jiao Ou, Jinchao Zhang, Yang Feng, Jie Zhou

    Abstract: The construction of open-domain dialogue systems requires high-quality dialogue datasets. The dialogue data admits a wide variety of responses for a given dialogue history, especially responses with different semantics. However, collecting high-quality such a dataset in most scenarios is labor-intensive and time-consuming. In this paper, we propose a data augmentation method to automatically augme…

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022 (main conference)

  17. arXiv:2202.03893  [pdf, other]

    physics.soc-ph cs.SI stat.ME

    Influence maximization under limited network information: Seeding high-degree neighbors

    Authors: Jiamin Ou, Vincent Buskens, Arnout Van De Rijt, Debabrata Panja

    Abstract: The diffusion of information, norms, and practices across a social network can be initiated by compelling a small number of seed individuals to adopt first. Strategies proposed in previous work either assume full network information or large degree of control over what information is collected. However, privacy settings on the Internet and high non-response in surveys often severely limit availabl…

    Submitted 8 February, 2022; originally announced February 2022.

    Comments: 28 pages, 9 figures

  18. arXiv:2112.11444  [pdf, other]

    cs.AI cs.CL

    ESAN: Efficient Sentiment Analysis Network of A-Shares Research Reports for Stock Price Prediction

    Authors: Tuo Sun, Wanrong Zheng, Shufan Yu, Mengxun Li, Jiarui Ou

    Abstract: In this paper, we are going to develop a natural language processing model to help us to predict stocks in the long term. The whole network includes two modules. The first module is a natural language processing model which seeks out reliable factors from input reports. While the other is a time-series forecasting model which takes the factors as input and aims to predict stocks earnings yield. To…

    Submitted 2 December, 2021; originally announced December 2021.

  19. arXiv:2110.10217  [pdf, other]

    cs.NE cs.CV

    An Adaptive Sampling and Edge Detection Approach for Encoding Static Images for Spiking Neural Networks

    Authors: Peyton Chandarana, Junlin Ou, Ramtin Zand

    Abstract: Current state-of-the-art methods of image classification using convolutional neural networks are often constrained by both latency and power consumption. This places a limit on the devices, particularly low-power edge devices, that can employ these methods. Spiking neural networks (SNNs) are considered to be the third generation of artificial neural networks which aim to address these latency and…

    Submitted 19 October, 2021; originally announced October 2021.

  20. arXiv:2110.00959  [pdf, other]

    cs.LG cs.AI

    Boost Neural Networks by Checkpoints

    Authors: Feng Wang, Guoyizhe Wei, Qiao Liu, Jinxiang Ou, Xian Wei, Hairong Lv

    Abstract: Training multiple deep neural networks (DNNs) and averaging their outputs is a simple way to improve the predictive performance. Nevertheless, the multiplied training cost prevents this ensemble method to be practical and efficient. Several recent works attempt to save and ensemble the checkpoints of DNNs, which only requires the same computational cost as training a single network. However, these…

    Submitted 25 October, 2021; v1 submitted 3 October, 2021; originally announced October 2021.

  21. arXiv:2109.07779  [pdf, other]

    cs.CL

    Constructing Emotion Consensus and Utilizing Unpaired Data for Empathetic Dialogue Generation

    Authors: Lei Shen, Jinchao Zhang, Jiao Ou, Xiaofang Zhao, Jie Zhou

    Abstract: Researches on dialogue empathy aim to endow an agent with the capacity of accurate understanding and proper responding for emotions. Existing models for empathetic dialogue generation focus on the emotion flow in one direction, that is, from the context to response. We argue that conducting an empathetic conversation is a bidirectional process, where empathy occurs when the emotions of two interlo…

    Submitted 18 September, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted by EMNLP 2021 Findings

  22. arXiv:2107.01651  [pdf, other]

    physics.soc-ph cs.SI

    Hidden dependence of spreading vulnerability on topological complexity

    Authors: Mark M. Dekker, Raoul D. Schram, Jiamin Ou, Debabrata Panja

    Abstract: Many dynamical phenomena in complex systems concern spreading that plays out on top of networks with changing architecture over time -- commonly known as temporal networks. A complex system's proneness to facilitate spreading phenomena, which we abbreviate as its `spreading vulnerability', is often surmised to be related to the topology of the temporal network featured by the system. Yet, cleanly…

    Submitted 14 April, 2022; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: 15 pages, 9 figures, to appear in Phys. Rev. E

    Journal ref: Phys. Rev. E 105, 054301 (2022)

  23. arXiv:2107.01443  [pdf, other]

    physics.soc-ph cs.SI

    Quantifying agent impacts on contact sequences in social interactions

    Authors: Mark M. Dekker, Tessa F. Blanken, Fabian Dablander, Jiamin Ou, Denny Borsboom, Debabrata Panja

    Abstract: Human social behavior plays a crucial role in how pathogens like SARS-CoV-2 or fake news spread in a population. Social interactions determine the contact network among individuals, while spreading, requiring individual-to-individual transmission, takes place on top of the network. Studying the topological aspects of a contact network, therefore, not only has the potential of leading to valuable i…

    Submitted 14 April, 2022; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: 14 pages, 4 figures, separate supplementary information in pdf

    Journal ref: Scientific Reports 12, 3483 (2022)

  24. arXiv:2106.00976  [pdf, other]

    cs.CL

    Exploring Discourse Structures for Argument Impact Classification

    Authors: Xin Liu, Jiefu Ou, Yangqiu Song, Xin Jiang

    Abstract: Discourse relations among arguments reveal logical structures of a debate conversation. However, no prior work has explicitly studied how the sequence of discourse relations influence a claim's impact. This paper empirically shows that the discourse relations between two arguments along the context path are essential factors for identifying the persuasive power of an argument. We further propose D…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2021

  25. arXiv:2106.00566  [pdf, other]

    cs.CV

    Full-Resolution Encoder-Decoder Networks with Multi-Scale Feature Fusion for Human Pose Estimation

    Authors: Jie Ou, Mingjian Chen, Hong Wu

    Abstract: To achieve more accurate 2D human pose estimation, we extend the successful encoder-decoder network, simple baseline network (SBN), in three ways. To reduce the quantization errors caused by the large output stride size, two more decoder modules are appended to the end of the simple baseline network to get full output resolution. Then, the global context blocks (GCBs) are added to the encoder and…

    Submitted 1 June, 2021; originally announced June 2021.

  26. arXiv:2104.12977  [pdf, other]

    cs.CL

    SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer

    Authors: Jicheng Li, Yang Feng, Jiao Ou

    Abstract: Text style transfer aims to change the style of sentences while preserving the semantic meanings. Due to the lack of parallel data, the Denoising Auto-Encoder (DAE) is widely used in this task to model distributions of different sentence styles. However, because of the conflict between the target of the conventional denoising procedure and the target of style transfer task, the vanilla DAE can not…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted by the 2021 International Joint Conference on Neural Networks (IJCNN 2021)

  27. arXiv:2104.02137  [pdf, other]

    cs.AI cs.CL

    ASER: Towards Large-scale Commonsense Knowledge Acquisition via Higher-order Selectional Preference over Eventualities

    Authors: Hongming Zhang, Xin Liu, Haojie Pan, Haowen Ke, Jiefu Ou, Tianqing Fang, Yangqiu Song

    Abstract: Commonsense knowledge acquisition and reasoning have long been a core artificial intelligence problem. However, in the past, there has been a lack of scalable methods to collect commonsense knowledge. In this paper, we propose to develop principles for collecting commonsense knowledge based on selectional preference. We generalize the definition of selectional preference from one-hop linguistic sy…

    Submitted 16 January, 2022; v1 submitted 5 April, 2021; originally announced April 2021.

  28. arXiv:2103.04941  [pdf, other]

    cs.CL

    InFillmore: Frame-Guided Language Generation with Bidirectional Context

    Authors: Jiefu Ou, Nathaniel Weir, Anton Belyy, Felix Yu, Benjamin Van Durme

    Abstract: We propose a structured extension to bidirectional-context conditional language generation, or "infilling," inspired by Frame Semantic theory (Fillmore, 1976). Guidance is provided through two approaches: (1) model fine-tuning, conditioning directly on observed symbolic frames, and (2) a novel extension to disjunctive lexically constrained decoding that leverages frame semantic lexical units. Auto…

    Submitted 22 March, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Appearing in *SEM 2021

  29. arXiv:2103.01782  [pdf, other]

    cs.SE

    MicroHECL: High-Efficient Root Cause Localization in Large-Scale Microservice Systems

    Authors: Dewei Liu, Chuan He, Xin Peng, Fan Lin, Chenxi Zhang, Shengfang Gong, Ziang Li, Jiayu Ou, Zheshun Wu

    Abstract: Availability issues of industrial microservice systems (e.g., drop of successfully placed orders and processed transactions) directly affect the running of the business. These issues are usually caused by various types of service anomalies which propagate along service dependencies. Accurate and high-efficient root cause localization is thus a critical challenge for large-scale industrial microser…

    Submitted 1 March, 2021; originally announced March 2021.

  30. arXiv:2012.03316  [pdf, other]

    cs.CV

    Efficient Human Pose Estimation with Depthwise Separable Convolution and Person Centroid Guided Joint Grouping

    Authors: Jie Ou, Hong Wu

    Abstract: In this paper, we propose efficient and effective methods for 2D human pose estimation. A new ResBlock is proposed based on depthwise separable convolution and is utilized instead of the original one in Hourglass network. It can be further enhanced by replacing the vanilla depthwise convolution with a mixed depthwise convolution. Based on it, we propose a bottom-up multi-person pose estimation met…

    Submitted 6 December, 2020; originally announced December 2020.

  31. arXiv:2004.12617  [pdf, other]

    cs.CL

    On the Importance of Word and Sentence Representation Learning in Implicit Discourse Relation Classification

    Authors: Xin Liu, Jiefu Ou, Yangqiu Song, Xin Jiang

    Abstract: Implicit discourse relation classification is one of the most difficult parts in shallow discourse parsing as the relation prediction without explicit connectives requires the language understanding at both the text span level and the sentence level. Previous studies mainly focus on the interactions between two arguments. We argue that a powerful contextualized representation module, a bilateral m…

    Submitted 28 April, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: Accepted by IJCAI 2020

  32. arXiv:2003.13954  [pdf, other]

    cs.CV

    FGN: Fully Guided Network for Few-Shot Instance Segmentation

    Authors: Zhibo Fan, Jin-Gang Yu, Zhihao Liang, Jiarong Ou, Changxin Gao, Gui-Song Xia, Yuanqing Li

    Abstract: Few-shot instance segmentation (FSIS) conjoins the few-shot learning paradigm with general instance segmentation, which provides a possible way of tackling instance segmentation in the lack of abundant labeled data for training. This paper presents a Fully Guided Network (FGN) for few-shot instance segmentation. FGN perceives FSIS as a guided model where a so-called support set is encoded and util…

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: Accepted by CVPR 2020, 10 pages, 6 figures

  33. arXiv:2002.03510  [pdf, other]

    eess.SY cs.RO

    Autonomous quadrotor obstacle avoidance based on dueling double deep recurrent Q-learning with monocular vision

    Authors: Jiajun Ou, Xiao Guo, Ming Zhu, Wenjie Lou

    Abstract: The rapid development of unmanned aerial vehicles (UAV) puts forward a higher requirement for autonomous obstacle avoidance. Due to the limited payload and power supply, small UAVs such as quadrotors usually carry simple sensors and computation units, which makes traditional methods more challenging to implement. In this paper, a novel framework is demonstrated to control a quadrotor flying throug…

    Submitted 2 March, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: 23 pages, 10 figures