Showing 1–50 of 150 results for author: Su, X

  1. arXiv:2407.00934  [pdf, other

    cs.CL

    CLEME2.0: Towards More Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction

    Authors: Jingheng Ye, Zishan Xu, Yinghui Li, Xuxin Cheng, Linlin Song, Qingyu Zhou, Hai-Tao Zheng, Ying Shen, Xin Su

    Abstract: The paper focuses on improving the interpretability of Grammatical Error Correction (GEC) metrics, which receives little attention in previous studies. To bridge the gap, we propose CLEME2.0, a reference-based evaluation strategy that can describe four elementary dimensions of GEC systems, namely hit-correction, error-correction, under-correction, and over-correction. They collectively contribute… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 16 pages, 8 tables, 2 figures. Under review

  2. arXiv:2406.19593  [pdf, other

    cs.CL cs.CV

    SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

    Authors: Xin Su, Man Luo, Kris W Pan, Tien Pei Chou, Vasudev Lal, Phillip Howard

    Abstract: Synthetic data generation has gained significant attention recently for its utility in training large vision and language models. However, the application of synthetic data to the training of multimodal context-augmented generation systems has been relatively unexplored. This gap in existing work is important because existing vision and language models (VLMs) are not trained specifically for conte… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2406.16054  [pdf, ps, other

    cs.LO

    On the Relative Completeness of Satisfaction-based Probabilistic Hoare Logic With While Loop

    Authors: Xin Sun, Xingchi Su, Xiaoning Bian, Anran Cui

    Abstract: Probabilistic Hoare logic (PHL) is an extension of Hoare logic and is specifically useful in verifying randomized programs. It allows researchers to formally reason about the behavior of programs with stochastic elements, ensuring the desired probabilistic properties are upheld. The relative completeness of satisfaction-based PHL has been an open problem ever since the birth of the first PHL in 19… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 13 pages. arXiv admin note: text overlap with arXiv:2405.01940

    MSC Class: 03B70 Logic in computer science; ACM Class: F.3

  4. arXiv:2406.05723  [pdf, other

    cs.CV

    Binarized Diffusion Model for Image Super-Resolution

    Authors: Zheng Chen, Haotong Qin, Yong Guo, Xiongfei Su, Xin Yuan, Linghe Kong, Yulun Zhang

    Abstract: Advanced diffusion models (DMs) perform impressively in image super-resolution (SR), but the high memory and computational costs hinder their deployment. Binarization, an ultra-compression algorithm, offers the potential for effectively accelerating DMs. Nonetheless, due to the model structure and the multi-step iterative attribute of DMs, existing binarization methods result in significant perfor… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Code is available at https://github.com/zhengchen1999/BI-DiffSR

  5. arXiv:2406.04025  [pdf

    cs.CL

    The syntax-semantics interface in a child's path: A study of 3- to 11-year-olds' elicited production of Mandarin recursive relative clauses

    Authors: Caimei Yang, Qihang Yang, Xingzhi Su, Chenxi Fu, Xiaoyi Wang, Ying Yan, Zaijiang Man

    Abstract: There have been apparently conflicting claims over the syntax-semantics relationship in child acquisition. However, few of them have assessed the child's path toward the acquisition of recursive relative clauses (RRCs). The authors of the current paper did experiments to investigate 3- to 11-year-olds' most-structured elicited production of eight Mandarin RRCs in a 4 (syntactic types)*2 (semantic… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  6. arXiv:2406.03459  [pdf, other

    cs.CV

    LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection

    Authors: Qiang Chen, Xiangbo Su, Xinyu Zhang, Jian Wang, Jiahui Chen, Yunpeng Shen, Chuchu Han, Ziliang Chen, Weixiang Xu, Fanrong Li, Shan Zhang, Kun Yao, Errui Ding, Gang Zhang, Jingdong Wang

    Abstract: In this paper, we present a light-weight detection transformer, LW-DETR, which outperforms YOLOs for real-time object detection. The architecture is a simple stack of a ViT encoder, a projector, and a shallow DETR decoder. Our approach leverages recent advanced techniques, such as training-effective techniques, e.g., improved loss and pretraining, and interleaved window and global attentions for r… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2405.19207  [pdf

    cs.IR cs.AI

    A Multi-Source Retrieval Question Answering Framework Based on RAG

    Authors: Ridong Wu, Shuhong Chen, Xiangbiao Su, Yuankai Zhu, Yifei Liao, Jianming Wu

    Abstract: With the rapid development of large-scale language models, Retrieval-Augmented Generation (RAG) has been widely adopted. However, existing RAG paradigms are inevitably influenced by erroneous retrieval information, thereby reducing the reliability and correctness of generated results. Therefore, to improve the relevance of retrieval information, this study proposes a method that replaces tradition… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 4 pages, 3 figures

  8. arXiv:2405.07089  [pdf, other

    cs.HC

    SonifyAR: Context-Aware Sound Generation in Augmented Reality

    Authors: Xia Su, Jon E. Froehlich, Eunyee Koh, Chang Xiao

    Abstract: Sound plays a crucial role in enhancing user experience and immersiveness in Augmented Reality (AR). However, current platforms lack support for AR sound authoring due to limited interaction types, challenges in collecting and specifying context information, and difficulty in acquiring matching sound assets. We present SonifyAR, an LLM-based AR sound authoring system that generates context-aware s… ▽ More

    Submitted 15 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: 12 pages, 12 figures

  9. arXiv:2405.04434  [pdf, other

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  10. arXiv:2405.03712  [pdf, other

    cs.LG cs.AI cs.CR cs.NE

    Your Network May Need to Be Rewritten: Network Adversarial Based on High-Dimensional Function Graph Decomposition

    Authors: Xiaoyan Su, Yinghao Zhu, Run Li

    Abstract: In the past, research on a single low dimensional activation function in networks has led to internal covariate shift and gradient deviation problems. A relatively small research area is how to use function combinations to provide property completion for a single activation function application. We propose a network adversarial method to address the aforementioned challenges. This is the first met… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  11. arXiv:2405.01940  [pdf, other

    cs.LO

    On the Relative Completeness of Satisfaction-based Quantum Hoare Logic

    Authors: Xin Sun, Xingchi Su, Xiaoning Bian, Huiwen Wu

    Abstract: Quantum Hoare logic (QHL) is a formal verification tool specifically designed to ensure the correctness of quantum programs. There has been an ongoing challenge to achieve a relatively complete satisfaction-based QHL with while-loop since its inception in 2006. This paper presents a solution by proposing the first relatively complete satisfaction-based QHL with while-loop. The completeness is prov… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 35 pages

    MSC Class: 03B70 Logic in computer science; ACM Class: F.3

  12. arXiv:2404.09155  [pdf, other

    cs.LG cs.AI cs.CL

    Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding

    Authors: Jiang Li, Xiangdong Su, Yeyun Gong, Guanglai Gao

    Abstract: Recent studies have highlighted the effectiveness of tensor decomposition methods in the Temporal Knowledge Graphs Embedding (TKGE) task. However, we found that inherent heterogeneity among factor tensors in tensor decomposition significantly hinders the tensor fusion process and further limits the performance of link prediction. To overcome this limitation, we introduce a novel method that maps f… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  13. arXiv:2404.07479  [pdf, other

    cs.HC

    RASSAR: Room Accessibility and Safety Scanning in Augmented Reality

    Authors: Xia Su, Han Zhang, Kaiming Cheng, Jaewook Lee, Qiaochu Liu, Wyatt Olson, Jon Froehlich

    Abstract: The safety and accessibility of our homes is critical to quality of life and evolves as we age, become ill, host guests, or experience life events such as having children. Researchers and health professionals have created assessment instruments such as checklists that enable homeowners and trained experts to identify and mitigate safety and access issues. With advances in computer vision, augmente… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: To Appear in CHI 2024

  14. arXiv:2404.04886  [pdf, other

    cs.CR cs.AI

    PagPassGPT: Pattern Guided Password Guessing via Generative Pretrained Transformer

    Authors: Xingyu Su, Xiaojie Zhu, Yang Li, Yong Li, Chi Chen, Paulo Esteves-Veríssimo

    Abstract: Amidst the surge in deep learning-based password guessing models, challenges of generating high-quality passwords and reducing duplicate passwords persist. To address these challenges, we present PagPassGPT, a password guessing model constructed on Generative Pretrained Transformer (GPT). It can perform pattern guided guessing by incorporating pattern structure information as background knowledge,… ▽ More

    Submitted 17 June, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Accepted by DSN 2024

  15. arXiv:2404.04875  [pdf, other

    cs.CV

    NeRF2Points: Large-Scale Point Cloud Generation From Street Views' Radiance Field Optimization

    Authors: Peng Tu, Xun Zhou, Mingming Wang, Xiaojun Yang, Bo Peng, ** Chen, Xiu Su, Yawen Huang, Yefeng Zheng, Chang Xu

    Abstract: Neural Radiance Fields (NeRF) have emerged as a paradigm-shifting methodology for the photorealistic rendering of objects and environments, enabling the synthesis of novel viewpoints with remarkable fidelity. This is accomplished through the strategic utilization of object-centric camera poses characterized by significant inter-frame overlap. This paper explores a compelling, alternative utility o… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: 18 pages

  16. arXiv:2404.04856  [pdf, other

    cs.CV cs.AI

    Msmsfnet: a multi-stream and multi-scale fusion net for edge detection

    Authors: Chenguang Liu, Chisheng Wang, Feifei Dong, Xin Su, Chuanhua Zhu, Dejin Zhang, Qingquan Li

    Abstract: Edge detection is a long-standing problem in computer vision. Recent deep learning based algorithms achieve state-of-the-art performance in publicly available datasets. Despite the efficiency of these algorithms, their performance, however, relies heavily on the pretrained weights of the backbone network on the ImageNet dataset. This limits heavily the design space of deep learning based edge dete… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  17. arXiv:2403.11838  [pdf, other

    cs.CL cs.AI

    Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models

    Authors: Yi Luo, Zhenghao Lin, Yuhao Zhang, Jiashuo Sun, Chen Lin, Chengjin Xu, Xiangdong Su, Yelong Shen, Jian Guo, Yeyun Gong

    Abstract: Large Language Models (LLMs) exhibit impressive capabilities but also present risks such as biased content generation and privacy issues. One of the current alignment techniques includes principle-driven integration, but it faces challenges arising from the imprecision of manually crafted rules and inadequate risk perception in models without safety training. To address these, we introduce Guide-A… ▽ More

    Submitted 23 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024 main conference

  18. arXiv:2403.03015  [pdf, other

    cs.IT eess.SP

    Low Complexity Channel Estimation for RIS-Assisted THz Systems with Beam Split

    Authors: Xin Su, Ruisi He, Peng Zhang, Bo Ai

    Abstract: To support extremely high data rates, reconfigurable intelligent surface (RIS)-assisted terahertz (THz) communication is considered to be a promising technology for future sixth-generation networks. However, due to the typical employment of hybrid beamforming architecture in THz systems, as well as the passive nature of RIS which lacks the capability to process pilot signals, obtaining channel sta… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  19. arXiv:2402.17363  [pdf, other

    cs.RO cs.LG

    CGGM: A conditional graph generation model with adaptive sparsity for node anomaly detection in IoT networks

    Authors: Xianshi Su, Munan Li, Tongbang Jiang, Hao Long

    Abstract: Dynamic graphs are extensively employed for detecting anomalous behavior in nodes within the Internet of Things (IoT). Generative models are often used to address the issue of imbalanced node categories in dynamic graphs. Nevertheless, the constraints it faces include the monotonicity of adjacency relationships, the difficulty in constructing multi-dimensional features for nodes, and the lack of a… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 19 figures

  20. arXiv:2402.03815  [pdf, other

    cs.LG

    Expediting In-Network Federated Learning by Voting-Based Consensus Model Compression

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, Song Guo

    Abstract: Recently, federated learning (FL) has gained momentum because of its capability in preserving data privacy. To conduct model training by FL, multiple clients exchange model updates with a parameter server via Internet. To accelerate the communication speed, it has been explored to deploy a programmable switch (PS) in lieu of the parameter server to coordinate clients. The challenge to deploy the P… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications (INFOCOM 2024)

  21. arXiv:2402.03770  [pdf, other

    cs.LG

    Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes

    Authors: Xiaoxin Su, Yipeng Zhou, Laizhong Cui, John C. S. Lui, Jiangchuan Liu

    Abstract: In Federated Learning (FL) paradigm, a parameter server (PS) concurrently communicates with distributed participating clients for model collection, update aggregation, and model distribution over multiple rounds, without touching private data owned by individual clients. FL is appealing in preserving data privacy; yet the communication between the PS and scattered clients can be a severe bottlenec… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in 2024 IEEE International Conference on Computer Communications (INFOCOM 2024)

  22. arXiv:2401.17916  [pdf, other

    cs.CV

    Source-free Domain Adaptive Object Detection in Remote Sensing Images

    Authors: Weixing Liu, Jun Liu, Xin Su, Han Nie, Bin Luo

    Abstract: Recent studies have used unsupervised domain adaptive object detection (UDAOD) methods to bridge the domain gap in remote sensing (RS) images. However, UDAOD methods typically assume that the source domain data can be accessed during the domain adaptation process. This setting is often impractical in the real world due to RS data privacy and transmission difficulty. To address this challenge, we p… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 14 pages, 11 figures

  23. Understanding users negative emotions and continuous usage intention in short video platforms

    Authors: Xusen Cheng, Xiaowei Su, Bo Yang, Alex Zarifis, Jian Mou

    Abstract: While short videos bring a lot of information and happiness to users, they also occupy users' time, and short videos gradually change people's living habits. This paper studies the negative effects and negative emotions of users caused by using short video platforms, as well as the users' intention to continue using the short video platform when they have negative emotions. Therefore, this study uses… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Electronic Commerce Research and Applications (2023)

    ACM Class: H.0; A.0; K.6; K.4

  24. arXiv:2401.10518  [pdf, other

    cs.LG

    Spatial-temporal Forecasting for Regions without Observations

    Authors: Xinyu Su, Jianzhong Qi, Egemen Tanin, Yanchuan Chang, Majid Sarvi

    Abstract: Spatial-temporal forecasting plays an important role in many real-world applications, such as traffic forecasting, air pollutant forecasting, crowd-flow forecasting, and so on. State-of-the-art spatial-temporal forecasting models take data-driven approaches and rely heavily on data availability. Such models suffer from accuracy issues when data is incomplete, which is common in reality due to the… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted by EDBT 2024

  25. arXiv:2401.09083  [pdf

    cs.CV

    Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual Models

    Authors: Haonan Guo, Xin Su, Chen Wu, Bo Du, Liangpei Zhang, Deren Li

    Abstract: Recently, the flourishing large language models (LLMs), especially ChatGPT, have shown exceptional performance in language understanding, reasoning, and interaction, attracting users and researchers from multiple fields and domains. Although LLMs have shown great capacity to perform human-like task accomplishment in natural language and natural image, their potential in handling remote sensing inter… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: The manuscript is submitted to IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2024). Looking forward to seeing you in July!

  26. arXiv:2401.02954  [pdf, other

    cs.CL cs.AI cs.LG

    DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Authors: DeepSeek-AI, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, et al. (63 additional authors not shown)

    Abstract: The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  27. arXiv:2312.13307  [pdf, other

    cs.LG cs.AI cs.CV

    Not All Steps are Equal: Efficient Generation with Progressive Diffusion Models

    Authors: Wenhao Li, Xiu Su, Shan You, Tao Huang, Fei Wang, Chen Qian, Chang Xu

    Abstract: Diffusion models have demonstrated remarkable efficacy in various generative tasks with the predictive prowess of denoising model. Currently, these models employ a uniform denoising approach across all timesteps. However, the inherent variations in noisy latents at each timestep lead to conflicts during training, constraining the potential of diffusion models. To address this challenge, we propose… ▽ More

    Submitted 1 January, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  28. arXiv:2312.12091  [pdf, other

    cs.DC

    Model-Heterogeneous Federated Learning for Internet of Things: Enabling Technologies and Future Directions

    Authors: Boyu Fan, Siyang Jiang, Xiang Su, Pan Hui

    Abstract: Internet of Things (IoT) interconnects a massive amount of devices, generating heterogeneous data with diverse characteristics. IoT data emerges as a vital asset for data-intensive IoT applications, such as healthcare, smart city and predictive maintenance, harnessing the vast volume of heterogeneous data to its maximum advantage. These applications leverage different Artificial Intelligence (AI)… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  29. arXiv:2311.08505  [pdf, other

    cs.CL

    Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning

    Authors: Xin Su, Tiep Le, Steven Bethard, Phillip Howard

    Abstract: An important open question in the use of large language models for knowledge-intensive tasks is how to effectively integrate knowledge from three sources: the model's parametric memory, external structured knowledge, and external unstructured knowledge. Most existing prompting methods either rely on one or two of these sources, or require repeatedly invoking large language models to generate simil… ▽ More

    Submitted 1 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 main conference

  30. arXiv:2311.03799  [pdf, other

    cs.CV

    Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models

    Authors: Yichao Cao, Qingfei Tang, Xiu Su, Chen Song, Shan You, Xiaobo Lu, Chang Xu

    Abstract: Human-object interaction (HOI) detection aims to comprehend the intricate relationships between humans and objects, predicting $<human, action, object>$ triplets, and serving as the foundation for numerous computer vision tasks. The complexity and diversity of human-object interactions in the real world, however, pose significant challenges for both annotation and recognition, particularly in reco… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  31. arXiv:2311.03311  [pdf, other

    cs.CL cs.CY

    Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance

    Authors: Thiemo Wambsganss, Xiaotian Su, Vinitra Swamy, Seyed Parsa Neshaei, Roman Rietsche, Tanja Käser

    Abstract: Large Language Models (LLMs) are increasingly utilized in educational tasks such as providing writing suggestions to students. Despite their potential, LLMs are known to harbor inherent biases which may negatively impact learners. Previous studies have investigated bias in models and data representations separately, neglecting the potential impact of LLM bias on human writing. In this paper, we in… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted as a full paper at EMNLP Findings 2023

  32. arXiv:2310.19536  [pdf, other

    cs.LG cs.IR

    Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation

    Authors: Jialin Liu, Xinyan Su, Zeyu He, Xiangyu Zhao, Jun Li

    Abstract: Rewards serve as a measure of user satisfaction and act as a limiting factor in interactive recommender systems. In this research, we focus on the problem of learning to reward (LTR), which is fundamental to reinforcement learning. Previous approaches either introduce additional procedures for learning to reward, thereby increasing the complexity of optimization, or assume that user-agent interact… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  33. arXiv:2310.19519  [pdf, other

    cs.LG cs.AI cs.IR stat.ME

    A General Neural Causal Model for Interactive Recommendation

    Authors: Jialin Liu, Xinyan Su, Peng Zhou, Xiangyu Zhao, Jun Li

    Abstract: Survivor bias in observational data leads the optimization of recommender systems towards local optima. Currently, most solutions re-mine existing human-system collaboration patterns to maximize longer-term satisfaction by reinforcement learning. However, from the causal perspective, mitigating survivor effects requires answering a counterfactual problem, which is generally unidentifiable and ines… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  34. arXiv:2310.19292  [pdf, other

    cs.CL

    Fusing Temporal Graphs into Transformers for Time-Sensitive Question Answering

    Authors: Xin Su, Phillip Howard, Nagib Hakim, Steven Bethard

    Abstract: Answering time-sensitive questions from long documents requires temporal reasoning over the times in questions and documents. An important open question is whether large language models can perform such reasoning solely using a provided text document, or whether they can benefit from additional temporal information extracted using other systems. We address this research question by applying existi… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  35. arXiv:2310.16123  [pdf, other

    cs.LG

    Anchor Space Optimal Transport: Accelerating Batch Processing of Multiple OT Problems

    Authors: Jianming Huang, Xun Su, Zhongxi Fang, Hiroyuki Kasai

    Abstract: The optimal transport (OT) theory provides an effective way to compare probability distributions on a defined metric space, but it suffers from cubic computational complexity. Although Sinkhorn's algorithm greatly reduces the computational complexity of OT solutions, the solutions of multiple OT problems are still time-consuming and memory-consuming in practice. However, many works on the comp… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 26 pages, 4 figures, 6 tables

  36. arXiv:2310.15486  [pdf, other

    cs.IT

    RIS-based IMT-2030 Testbed for MmWave Multi-stream Ultra-massive MIMO Communications

    Authors: Shuhao Zeng, Boya Di, Hongliang Zhang, Jiahao Gao, Shaohua Yue, Xinyuan Hu, Rui Fu, Jiaqi Zhou, Xu Liu, Haobo Zhang, Yuhan Wang, Shaohui Sun, Haichao Qin, Xin Su, Mengjun Wang, Lingyang Song

    Abstract: As one enabling technique of the future sixth generation (6G) network, ultra-massive multiple-input-multiple-output (MIMO) can support high-speed data transmissions and cell coverage extension. However, it is hard to realize the ultra-massive MIMO via traditional phased arrays due to unacceptable power consumption. To address this issue, reconfigurable intelligent surface-based (RIS-based) antenna… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures, to be published in IEEE Wireless Communications

  37. arXiv:2310.14532  [pdf, other

    cs.CV

    Practical Deep Dispersed Watermarking with Synchronization and Fusion

    Authors: Hengchang Guo, Qilong Zhang, Junwei Luo, Feng Guo, Wenbin Zhang, Xiaodong Su, Minglei Li

    Abstract: Deep learning based blind watermarking works have gradually emerged and achieved impressive performance. However, previous deep watermarking studies mainly focus on fixed low-resolution images while paying less attention to arbitrary resolution images, especially widespread high-resolution images nowadays. Moreover, most works usually demonstrate robustness against typical non-geometric attacks (\… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted by ACM MM 2023

  38. arXiv:2310.11088  [pdf, other

    cs.IR cs.AI cs.LG

    MeKB-Rec: Personal Knowledge Graph Learning for Cross-Domain Recommendation

    Authors: Xin Su, Yao Zhou, Zifei Shan, Qian Chen

    Abstract: It is a long-standing challenge in modern recommender systems to effectively make recommendations for new users, namely the cold-start problem. Cross-Domain Recommendation (CDR) has been proposed to address this challenge, but current ways to represent users' interests across systems are still severely limited. We introduce Personal Knowledge Graph (PKG) as a domain-invariant interest representati… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 13 pages, 4 figures, conference

  39. arXiv:2310.09025  [pdf, other

    cs.NI eess.SP

    Survey on Near-Space Information Networks: Channel Modeling, Networking, and Transmission Perspectives

    Authors: Xianbin Cao, Peng Yang, Xiaoning Su

    Abstract: Near-space information networks (NSINs) composed of high-altitude platforms (HAPs) and high- and low-altitude unmanned aerial vehicles (UAVs) are a new regime for providing quick, robust, and cost-efficient sensing and communication services. Precipitated by innovations and breakthroughs in manufacturing, materials, communications, electronics, and control techniques, NSINs have been envisioned as… ▽ More

    Submitted 13 May, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

  40. arXiv:2310.04750  [pdf, other

    cs.AI cs.CV cs.LG

    DiffNAS: Bootstrapping Diffusion Models by Prompting for Better Architectures

    Authors: Wenhao Li, Xiu Su, Shan You, Fei Wang, Chen Qian, Chang Xu

    Abstract: Diffusion models have recently exhibited remarkable performance on synthetic data. After a diffusion path is selected, a base model, such as UNet, operates as a denoising autoencoder, primarily predicting noises that need to be eliminated step by step. Consequently, it is crucial to employ a model that aligns with the expected budgets to facilitate superior synthetic performance. In this paper, we… ▽ More

    Submitted 9 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

  41. arXiv:2309.12626  [pdf

    cs.AI cs.CL

    Construction contract risk identification based on knowledge-augmented language model

    Authors: Saika Wong, Chunmo Zheng, Xing Su, Yinqiu Tang

    Abstract: Contract review is an essential step in construction projects to prevent potential losses. However, the current methods for reviewing construction contracts lack effectiveness and reliability, leading to time-consuming and error-prone processes. While large language models (LLMs) have shown promise in revolutionizing natural language processing (NLP) tasks, they struggle with domain-specific knowl… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  42. arXiv:2309.12132  [pdf

    cs.AI

    A knowledge representation approach for construction contract knowledge modeling

    Authors: Chunmo Zheng, Saika Wong, Xing Su, Yinqiu Tang

    Abstract: The emergence of large language models (LLMs) presents an unprecedented opportunity to automate construction contract management, reducing human errors and saving significant time and costs. However, LLMs may produce convincing yet inaccurate and misleading content due to a lack of domain expertise. To address this issue, expert-driven contract knowledge can be represented in a structured manner t… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  43. arXiv:2309.09179  [pdf, other]

    cs.CV cs.AI

    Syntax Tree Constrained Graph Network for Visual Question Answering

    Authors: Xiangrui Su, Qi Zhang, Chongyang Shi, Jiachang Liu, Liang Hu

    Abstract: Visual Question Answering (VQA) aims to automatically answer natural language questions related to given image content. Existing VQA methods integrate vision modeling and language understanding to explore the deep semantics of the question. However, these methods ignore the significant syntax information of the question, which plays a vital role in understanding the essential semantics of the ques…

    Submitted 17 September, 2023; originally announced September 2023.

  44. arXiv:2309.02692  [pdf, other]

    cs.SI

    Hy-DeFake: Hypergraph Neural Networks for Detecting Fake News in Online Social Networks

    Authors: Xing Su, Jian Yang, Jia Wu, Zitai Qiu

    Abstract: Nowadays social media is the primary platform for people to obtain news and share information. Combating online fake news has become an urgent task to reduce the damage it causes to society. Existing methods typically improve their fake news detection performances by utilizing textual auxiliary information (such as relevant retweets and comments) or simple structural information (i.e., graph const…

    Submitted 22 December, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  45. SAAN: Similarity-aware attention flow network for change detection with VHR remote sensing images

    Authors: Haonan Guo, Xin Su, Chen Wu, Bo Du, Liangpei Zhang

    Abstract: Change detection (CD) is a fundamental and important task for monitoring the land surface dynamics in the earth observation field. Existing deep learning-based CD methods typically extract bi-temporal image features using a weight-sharing Siamese encoder network and identify change regions using a decoder network. These CD methods, however, still perform far from satisfactorily as we observe that…

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 15 pages, 13 figures

    Journal ref: IEEE Transactions on Image Processing, vol. 33, pp. 2599-2613, 2024

  46. arXiv:2308.10761  [pdf, other]

    cs.CV

    CoNe: Contrast Your Neighbours for Supervised Image Classification

    Authors: Mingkai Zheng, Shan You, Lang Huang, Xiu Su, Fei Wang, Chen Qian, Xiaogang Wang, Chang Xu

    Abstract: Image classification is a longstanding problem in computer vision and machine learning research. Most recent works (e.g. SupCon, Triplet, and max-margin) mainly focus on grouping the intra-class samples aggressively and compactly, with the assumption that all intra-class samples should be pulled tightly towards their class centers. However, such an objective will be very hard to achieve since it…

    Submitted 21 August, 2023; originally announced August 2023.

  47. arXiv:2308.08449  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    Improving CTC-AED model with integrated-CTC and auxiliary loss regularization

    Authors: Daobin Zhu, Xiangdong Su, Hongbin Zhang

    Abstract: Connectionist temporal classification (CTC) and attention-based encoder decoder (AED) joint training has been widely applied in automatic speech recognition (ASR). Unlike most hybrid models that separately calculate the CTC and AED losses, our proposed integrated-CTC utilizes the attention mechanism of AED to guide the output of CTC. In this paper, we employ two fusion methods, namely direct addit…

    Submitted 14 August, 2023; originally announced August 2023.

  48. arXiv:2308.07313  [pdf, other]

    cs.CV

    Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation

    Authors: Huan Liu, Qiang Chen, Zichang Tan, Jiang-Jiang Liu, Jian Wang, Xiangbo Su, Xiaolong Li, Kun Yao, Junyu Han, Errui Ding, Yao Zhao, Jingdong Wang

    Abstract: In this paper, we study the problem of end-to-end multi-person pose estimation. State-of-the-art solutions adopt the DETR-like framework, and mainly develop the complex decoder, e.g., regarding pose estimation as keypoint box detection and combining with human detection in ED-Pose, hierarchically predicting with pose decoder and joint (keypoint) decoder in PETR. We present a simple yet effective t…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  49. arXiv:2308.04813  [pdf, other]

    cs.CL

    CLEVA: Chinese Language Models EVAluation Platform

    Authors: Yanyang Li, Jianqiao Zhao, Duo Zheng, Zi-Yuan Hu, Zhi Chen, Xiaohui Su, Yongfeng Huang, Shijia Huang, Dahua Lin, Michael R. Lyu, Liwei Wang

    Abstract: With the continuous emergence of Chinese Large Language Models (LLMs), how to evaluate a model's capabilities has become an increasingly significant issue. The absence of a comprehensive Chinese benchmark that thoroughly assesses a model's performance, the unstandardized and incomparable prompting procedure, and the prevalent risk of contamination pose major challenges in the current evaluation of…

    Submitted 16 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: EMNLP 2023 System Demonstrations camera-ready

  50. arXiv:2308.03313  [pdf]

    cs.SI cs.CY

    Quantifying the Impact of Large Language Models on Collective Opinion Dynamics

    Authors: Chao Li, Xing Su, Haoying Han, Cong Xue, Chunmo Zheng, Chao Fan

    Abstract: The process of opinion expression and exchange is a critical component of democratic societies. As people interact with large language models (LLMs) in the opinion shaping process different from traditional media, the impacts of LLMs are increasingly recognized and being concerned. However, the knowledge about how LLMs affect the process of opinion expression and exchange of social opinion network…

    Submitted 25 August, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: 21 pages, 4 figures, 2 tables