Showing 1–49 of 49 results for author: Cui, G

Searching in archive cs.

  1. arXiv:2406.11721  [pdf, other]

    cs.CL cs.AI cs.LG

    Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity

    Authors: Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Huan-ang Gao, Huimin Chen, Zhiyuan Liu, Maosong Sun

    Abstract: Understanding alignment techniques begins with comprehending zero-shot generalization brought by instruction tuning, but little of the mechanism has been understood. Existing work has largely been confined to the task level, without considering that tasks are artificially defined and, to LLMs, merely consist of tokens and representations. This line of research has been limited to examining transfe…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 33 pages, 14 figures

  2. arXiv:2406.03949  [pdf, other]

    cs.CL

    UltraMedical: Building Specialized Generalists in Biomedicine

    Authors: Kaiyan Zhang, Sihang Zeng, Ermo Hua, Ning Ding, Zhang-Ren Chen, Zhiyuan Ma, Haoxin Li, Ganqu Cui, Biqing Qi, Xuekai Zhu, Xingtai Lv, Hu Jinfang, Zhiyuan Liu, Bowen Zhou

    Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across various domains and are moving towards more specialized areas. Recent advanced proprietary models such as GPT-4 and Gemini have achieved significant advancements in biomedicine, which have also raised privacy and security challenges. The construction of specialized generalists hinges largely on high-quality datasets, enh…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Datasets and models are available at https://github.com/TsinghuaC3I/UltraMedical

  3. arXiv:2405.17220  [pdf, other]

    cs.CL

    RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

    Authors: Tianyu Yu, Haoye Zhang, Yuan Yao, Yunkai Dang, Da Chen, Xiaoman Lu, Ganqu Cui, Taiwen He, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun

    Abstract: Learning from feedback reduces the hallucination of multimodal large language models (MLLMs) by aligning them with human preferences. While traditional methods rely on labor-intensive and time-consuming manual labeling, recent approaches employing models as automatic labelers have shown promising results without human intervention. However, these methods heavily rely on costly proprietary models l…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Project Website: https://github.com/RLHF-V/RLAIF-V

  4. arXiv:2404.06395  [pdf, other]

    cs.CL cs.LG

    MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

    Authors: Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zheng Leng Thai, Kaihuo Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

    Abstract: The burgeoning interest in developing Large Language Models (LLMs) with up to trillion parameters has been met with concerns regarding resource efficiency and practical expense, particularly given the immense cost of experimentation. This scenario underscores the importance of exploring the potential of Small Language Models (SLMs) as a resource-efficient alternative. In this context, we introduce…

    Submitted 3 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: revise according to peer review

  5. arXiv:2404.02078  [pdf, other]

    cs.AI cs.CL cs.LG

    Advancing LLM Reasoning Generalists with Preference Trees

    Authors: Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun

    Abstract: We introduce Eurus, a suite of large language models (LLMs) optimized for reasoning. Finetuned from Mistral-7B and CodeLlama-70B, Eurus models achieve state-of-the-art results among open-source models on a diverse set of benchmarks covering mathematics, code generation, and logical reasoning problems. Notably, Eurus-70B beats GPT-3.5 Turbo in reasoning through a comprehensive benchmarking across 1…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Models and data are available at https://github.com/OpenBMB/Eurus

  6. arXiv:2403.11038  [pdf, other]

    cs.CV math.NA

    Texture Edge detection by Patch consensus (TEP)

    Authors: Guangyu Cui, Sung Ha Kang

    Abstract: We propose Texture Edge detection using Patch consensus (TEP) which is a training-free method to detect the boundary of texture. We propose a new simple way to identify the texture edge location, using the consensus of segmented local patch information. While on the boundary, even using local patch information, the distinction between textures are typically not clear, but using neighbor consensus…

    Submitted 16 March, 2024; originally announced March 2024.

  7. arXiv:2403.08281  [pdf, other]

    cs.CL cs.AI

    Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

    Authors: Ning Ding, Yulin Chen, Ganqu Cui, Xingtai Lv, Weilin Zhao, Ruobing Xie, Bowen Zhou, Zhiyuan Liu, Maosong Sun

    Abstract: Underlying data distributions of natural language, programming code, and mathematical symbols vary vastly, presenting a complex challenge for large language models (LLMs) that strive to achieve high performance across all three domains simultaneously. Achieving a very high level of proficiency for an LLM within a specific domain often requires extensive training with relevant corpora, which is typ…

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  8. arXiv:2403.04180  [pdf, ps, other]

    cs.LG cs.IR

    RATSF: Empowering Customer Service Volume Management through Retrieval-Augmented Time-Series Forecasting

    Authors: Tianfeng Wang, Gaojie Cui

    Abstract: An efficient customer service management system hinges on precise forecasting of service volume. In this scenario, where data non-stationarity is pronounced, successful forecasting heavily relies on identifying and leveraging similar historical data rather than merely summarizing periodic patterns. Existing models based on RNN or Transformer architectures may struggle with this flexible and effect…

    Submitted 16 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  9. arXiv:2402.19085  [pdf, other]

    cs.CL cs.AI eess.SY

    Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment

    Authors: Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Jiexin Wang, Huimin Chen, Bowen Sun, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun

    Abstract: Alignment in artificial intelligence pursues the consistency between model responses and human preferences as well as values. In practice, the multifaceted nature of human preferences inadvertently introduces what is known as the "alignment tax", a compromise where enhancements in alignment within one objective (e.g., harmlessness) can diminish performance in others (e.g., helpfulness). However, exi…

    Submitted 29 February, 2024; originally announced February 2024.

  10. arXiv:2402.05369  [pdf, other]

    cs.LG cs.CL

    Noise Contrastive Alignment of Language Models with Explicit Rewards

    Authors: Huayu Chen, Guande He, Lifan Yuan, Ganqu Cui, Hang Su, Jun Zhu

    Abstract: User intentions are typically formalized as evaluation rewards to be maximized when fine-tuning language models (LMs). Existing alignment methods, such as Direct Preference Optimization (DPO), are mainly tailored for pairwise preference data where rewards are implicitly defined rather than explicitly given. In this paper, we introduce a general framework for LM alignment, leveraging Noise Contrast…

    Submitted 3 July, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  11. arXiv:2402.05079  [pdf, other]

    eess.IV cs.CV

    Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

    Authors: Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei Li

    Abstract: In recent advancements in medical image analysis, Convolutional Neural Networks (CNN) and Vision Transformers (ViT) have set significant benchmarks. While the former excels in capturing local features through its convolution operations, the latter achieves remarkable global context understanding by leveraging self-attention mechanisms. However, both architectures exhibit limitations in efficiently…

    Submitted 30 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  12. arXiv:2402.03456  [pdf, other]

    cs.CV

    Constrained Multiview Representation for Self-supervised Contrastive Learning

    Authors: Siyuan Dai, Kai Ye, Kun Zhao, Ge Cui, Haoteng Tang, Liang Zhan

    Abstract: Representation learning constitutes a pivotal cornerstone in contemporary deep learning paradigms, offering a conduit to elucidate distinctive features within the latent space and interpret the deep models. Nevertheless, the inherent complexity of anatomical patterns and the random nature of lesion distribution in medical image segmentation pose significant challenges to the disentanglement of rep…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 11 pages, 9 figures, 2 algorithms

  13. arXiv:2312.07075  [pdf, other]

    cs.RO

    Motion Planning and Control of A Morphing Quadrotor in Restricted Scenarios

    Authors: Guiyang Cui, Ruihao Xia, Xin Jin, Yang Tang

    Abstract: Morphing quadrotors with four external actuators can adapt to different restricted scenarios by changing their geometric structure. However, previous works mainly focus on the improvements in structures and controllers, and existing planning algorithms don't consider the morphological modifications, which leads to safety and dynamic feasibility issues. In this paper, we propose a unified planning…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 8 pages, 9 figures

  14. arXiv:2312.00849  [pdf, other]

    cs.CL cs.CV

    RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

    Authors: Tianyu Yu, Yuan Yao, Haoye Zhang, Taiwen He, Yifeng Han, Ganqu Cui, Jinyi Hu, Zhiyuan Liu, Hai-Tao Zheng, Maosong Sun, Tat-Seng Chua

    Abstract: Multimodal Large Language Models (MLLMs) have recently demonstrated impressive capabilities in multimodal understanding, reasoning, and interaction. However, existing MLLMs prevalently suffer from serious hallucination problems, generating text that is not factually grounded in associated images. The problem makes existing MLLMs untrustworthy and thus impractical in real-world (especially high-sta…

    Submitted 8 March, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR 2024

  15. arXiv:2311.11340  [pdf, other]

    cs.RO

    RflyMAD: A Dataset for Multicopter Fault Detection and Health Assessment

    Authors: Xiangli Le, Bo Jin, Gen Cui, Xunhua Dai, Quan Quan

    Abstract: This paper presents an open-source dataset RflyMAD, a Multicopter Abnormal Dataset developed by Reliable Flight Control (Rfly) Group aiming to promote the development of research fields like fault detection and isolation (FDI) or health assessment (HA). The entire 114 GB dataset includes 11 types of faults under 6 flight statuses which are adapted from ADS-33 file to cover more occasions in which t…

    Submitted 11 January, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

  16. arXiv:2311.09868  [pdf, other]

    cs.SE cs.AI

    INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair

    Authors: Hanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding, Zhiyuan Liu, Ge Yu

    Abstract: This paper introduces INTERVENOR (INTERactiVE chaiN Of Repair), a system designed to emulate the interactive code repair processes observed in humans, encompassing both code diagnosis and code repair. INTERVENOR prompts Large Language Models (LLMs) to play distinct roles during the code repair process, functioning as both a Code Learner and a Code Teacher. Specifically, the Code Learner is tasked…

    Submitted 12 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: 27 pages, 19 figures, 10 tables

  17. arXiv:2310.01377  [pdf, other]

    cs.CL cs.AI cs.LG

    UltraFeedback: Boosting Language Models with High-quality Feedback

    Authors: Ganqu Cui, Lifan Yuan, Ning Ding, Guanming Yao, Wei Zhu, Yuan Ni, Guotong Xie, Zhiyuan Liu, Maosong Sun

    Abstract: Reinforcement learning from human feedback (RLHF) has become a pivot technique in aligning large language models (LLMs) with human preferences. In RLHF practice, preference data plays a crucial role in bridging human proclivity and LLMs. However, the scarcity of diverse, naturalistic datasets of human preferences on LLM outputs at scale poses a great challenge to RLHF as well as feedback learning…

    Submitted 2 October, 2023; originally announced October 2023.

  18. arXiv:2307.11973  [pdf, other]

    cs.CV

    Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

    Authors: Yao Liu, Gangfeng Cui, Jiahui Luo, Xiaojun Chang, Lina Yao

    Abstract: As a fundamental aspect of human life, two-person interactions contain meaningful information about people's activities, relationships, and social settings. Human action recognition serves as the foundation for many smart applications, with a strong focus on personal privacy. However, recognizing two-person interactions poses more challenges due to increased body occlusion and overlap compared to…

    Submitted 14 May, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

  19. arXiv:2306.04618  [pdf, other]

    cs.CL cs.CR cs.LG

    Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations

    Authors: Lifan Yuan, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Fangyuan Zou, Xingyi Cheng, Heng Ji, Zhiyuan Liu, Maosong Sun

    Abstract: This paper reexamines the research on out-of-distribution (OOD) robustness in the field of NLP. We find that the distribution shift settings in previous studies commonly lack adequate challenges, hindering the accurate evaluation of OOD robustness. To address these issues, we propose a benchmark construction protocol that ensures clear differentiation and challenging distribution shifts. Then we i…

    Submitted 26 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 Dataset and Benchmark Track. Code is available at https://github.com/lifan-yuan/OOD_NLP

  20. arXiv:2305.18503  [pdf, other]

    cs.CL cs.CR cs.LG

    From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework

    Authors: Yangyi Chen, Hongcheng Gao, Ganqu Cui, Lifan Yuan, Dehan Kong, Hanlu Wu, Ning Shi, Bo Yuan, Longtao Huang, Hui Xue, Zhiyuan Liu, Maosong Sun, Heng Ji

    Abstract: Textual adversarial attacks can discover models' weaknesses by adding semantic-preserved but misleading perturbations to the inputs. The long-lasting adversarial attack-and-defense arms race in Natural Language Processing (NLP) is algorithm-centric, providing valuable techniques for automatic robustness evaluation. However, the existing practice of robustness evaluation may exhibit issues of incom…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL 2023

  21. arXiv:2304.08354  [pdf, other]

    cs.CL cs.AI cs.LG

    Tool Learning with Foundation Models

    Authors: Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, et al. (16 additional authors not shown)

    Abstract: Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced a…

    Submitted 15 June, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  22. arXiv:2212.12418  [pdf]

    cs.RO

    Dynamic Speed Guidance for CAV Ramp Merging in Non-Cooperative Environment: An On-Site Experiment

    Authors: Wei Ji, Yechi Ma, Guangzhang Cui, Xiaotian Qin, Wei Hua

    Abstract: Ramp merging is a typical application of cooperative intelligent transportation system (C-ITS). Vehicle trajectories perceived by roadside sensors are an important complement to the limited visual field of on-board perception. Vehicle tracking and trajectory denoising algorithm is proposed in this paper to take full advantage of roadside cameras for vehicle trajectory and speed profile estimation.…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: This work has been submitted to IFAC for possible publication

  23. arXiv:2212.08408  [pdf, other]

    cs.CL cs.LG

    Decoder Tuning: Efficient Language Understanding as Decoding

    Authors: Ganqu Cui, Wentao Li, Ning Ding, Longtao Huang, Zhiyuan Liu, Maosong Sun

    Abstract: With the evergrowing sizes of pre-trained models (PTMs), it has been an emerging practice to only provide the inference APIs for users, namely model-as-a-service (MaaS) setting. To adapt PTMs with model parameters frozen, most current approaches focus on the input side, seeking for powerful prompts to stimulate models for correct answers. However, we argue that input-side adaptation could be arduo…

    Submitted 24 May, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: ACL 2023 main conference. Code: https://github.com/thunlp/DecT

  24. arXiv:2211.05319  [pdf, other]

    cs.LG cs.CL cs.CV

    Few-shot Classification with Hypersphere Modeling of Prototypes

    Authors: Ning Ding, Yulin Chen, Ganqu Cui, Xiaobin Wang, Hai-Tao Zheng, Zhiyuan Liu, Pengjun Xie

    Abstract: Metric-based meta-learning is one of the de facto standards in few-shot learning. It composes of representation learning and metrics calculation designs. Previous works construct class representations in different ways, varying from mean output embedding to covariance and distributions. However, using embeddings in space lacks expressivity and cannot capture class information robustly, while stati…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: preprint

  25. arXiv:2211.00151  [pdf, other]

    cs.CL cs.LG

    A Close Look into the Calibration of Pre-trained Language Models

    Authors: Yangyi Chen, Lifan Yuan, Ganqu Cui, Zhiyuan Liu, Heng Ji

    Abstract: Pre-trained language models (PLMs) may fail in giving reliable estimates of their predictive uncertainty. We take a close look into this problem, aiming to answer two questions: (1) Do PLMs learn to become calibrated in the training process? (2) How effective are existing calibration methods? For the first question, we conduct fine-grained control experiments to study the dynamic change in PLMs' c…

    Submitted 8 May, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted to ACL 2023 main conference. Code is available at: https://github.com/lifan-yuan/PLMCalibration

  26. arXiv:2210.10683  [pdf, other]

    cs.CL cs.CR cs.LG

    Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP

    Authors: Yangyi Chen, Hongcheng Gao, Ganqu Cui, Fanchao Qi, Longtao Huang, Zhiyuan Liu, Maosong Sun

    Abstract: Textual adversarial samples play important roles in multiple subfields of NLP research, including security, evaluation, explainability, and data augmentation. However, most work mixes all these roles, obscuring the problem definitions and research goals of the security role that aims to reveal the practical concerns of NLP models. In this paper, we rethink the research paradigm of textual adversar…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022, main conference

  27. arXiv:2208.08149  [pdf, ps, other]

    cs.AI

    A Concept and Argumentation based Interpretable Model in High Risk Domains

    Authors: Haixiao Chi, Dawei Wang, Gaojie Cui, Feng Mao, Beishui Liao

    Abstract: Interpretability has become an essential topic for artificial intelligence in some high-risk domains such as healthcare, bank and security. For commonly-used tabular data, traditional methods trained end-to-end machine learning models with numerical and categorical data only, and did not leverage human understandable knowledge such as data descriptions. Yet mining human-level knowledge from tabula…

    Submitted 17 August, 2022; originally announced August 2022.

  28. arXiv:2208.05263  [pdf, ps, other]

    cs.IT

    Enhanced Low-Redundancy Restricted Array for Direction of Arrival Estimation

    Authors: Shidong Zhang, Zhengchun Zhou, Guolong Cui, Xiaohu Tang, Pingzhi Fan

    Abstract: Sensor arrays play a significant role in direction of arrival (DOA) estimation. Specifically, arrays with low redundancy and reduced mutual coupling are desirable. In this paper, we investigate a sensor array configuration that has a restricted sensor spacing and propose a closed-form expression. We also propose several classes of low redundancy (LR) arrays. Interestingly, compared with super nest…

    Submitted 13 November, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

  29. arXiv:2206.08514  [pdf, other]

    cs.LG cs.CL cs.CR

    A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks

    Authors: Ganqu Cui, Lifan Yuan, Bingxiang He, Yangyi Chen, Zhiyuan Liu, Maosong Sun

    Abstract: Textual backdoor attacks are a kind of practical threat to NLP systems. By injecting a backdoor in the training phase, the adversary could control model predictions via predefined triggers. As various attack and defense models have been proposed, it is of great significance to perform rigorous evaluations. However, we highlight two issues in previous backdoor learning evaluations: (1) The differen…

    Submitted 1 November, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 Datasets & Benchmarks; Toolkits available at https://github.com/thunlp/OpenBackdoor

  30. arXiv:2204.05239  [pdf, other]

    cs.CL

    Exploring the Universal Vulnerability of Prompt-based Learning Paradigm

    Authors: Lei Xu, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Zhiyuan Liu

    Abstract: Prompt-based learning paradigm bridges the gap between pre-training and fine-tuning, and works effectively under the few-shot setting. However, we find that this learning paradigm inherits the vulnerability from the pre-training stage, where model predictions can be misled by inserting certain triggers into the text. In this paper, we explore this universal vulnerability by either injecting backdo…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to Findings of NAACL 2022

  31. arXiv:2203.14101   

    cs.LG cs.AI cs.CL

    A Roadmap for Big Model

    Authors: Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, et al. (75 additional authors not shown)

    Abstract: With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm. Researchers have achieved various outcomes in the construction of BMs and the BM application in many fields. At present, there is a lack of research work that sorts out the overall progress of BMs and guides the follow-up research. In this paper, we cover not only the BM…

    Submitted 20 April, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

    Comments: This report has been withdrawn by the authors due to critical issues in Section 2.3.1 of Article 2

  32. arXiv:2203.09770  [pdf, other]

    cs.CL cs.LG

    Prototypical Verbalizer for Prompt-based Few-shot Tuning

    Authors: Ganqu Cui, Shengding Hu, Ning Ding, Longtao Huang, Zhiyuan Liu

    Abstract: Prompt-based tuning for pre-trained language models (PLMs) has shown its effectiveness in few-shot learning. Typically, prompt-based tuning wraps the input text into a cloze question. To make predictions, the model maps the output words to labels via a verbalizer, which is either manually designed or automatically built. However, manual verbalizers heavily depend on domain-specific prior knowledge…

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: 11 pages. ACL 2022 main conference

  33. arXiv:2202.10017  [pdf, other]

    eess.AS cs.SD

    The PCG-AIID System for L3DAS22 Challenge: MIMO and MISO convolutional recurrent Network for Multi Channel Speech Enhancement and Speech Recognition

    Authors: Jingdong Li, Yuanyuan Zhu, Dawei Luo, Yun Liu, Guohui Cui, Zhaoxia Li

    Abstract: This paper described the PCG-AIID system for L3DAS22 challenge in Task 1: 3D speech enhancement in office reverberant environment. We proposed a two-stage framework to address multi-channel speech denoising and dereverberation. In the first stage, a multiple input and multiple output (MIMO) network is applied to remove background noise while maintaining the spatial characteristics of multi-channel…

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: To appear at ICASSP 2022 (Accepted)

  34. arXiv:2202.04312  [pdf, other]

    cs.NI eess.SP

    Using 5G in Smart Cities: A Systematic Mapping Study

    Authors: Chen Yang, Peng Liang, Liming Fu, Guorui Cui, Fei Huang, Feng Teng, Yawar Abbas Bangash

    Abstract: 5G is the fifth generation wireless network, with a set of characteristics, e.g., high bandwidth and data rates. The scenarios of using 5G include enhanced Mobile Broadband (eMBB), massive Machine Type Communications (mMTC), and ultra-Reliable and Low-Latency Communications (uRLLC). 5G is expected to support a wide variety of applications. We conducted a systematic mapping study that covers the li…

    Submitted 15 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Preprint accepted for publication in Intelligent Systems with Applications, 2022

  35. arXiv:2201.12994  [pdf, other]

    cs.LG cs.SI

    MGNN: Graph Neural Networks Inspired by Distance Geometry Problem

    Authors: Guanyu Cui, Zhewei Wei

    Abstract: Graph Neural Networks (GNNs) have emerged as a prominent research topic in the field of machine learning. Existing GNN models are commonly categorized into two types: spectral GNNs, which are designed based on polynomial graph filters, and spatial GNNs, which utilize a message-passing scheme as the foundation of the model. For the expressive power and universality of spectral GNNs, a natural appro…

    Submitted 30 August, 2023; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Accepted by KDD 2023

  36. arXiv:2201.03199  [pdf, other]

    cs.RO cs.AI

    Task planning and explanation with virtual actions

    Authors: Guowei Cui, Xiaoping Chen

    Abstract: One of the challenges of task planning is to find out what causes the planning failure and how to handle the failure intelligently. This paper shows how to achieve this. The idea is inspired by the connected graph: each vertex represents a set of compatible states, and each edge represents an action. For any given initial states and goals, we construct virtual actions to ensure…

    Submitted 10 January, 2022; originally announced January 2022.

  37. arXiv:2106.08171  [pdf, other]

    cs.LG stat.ML

    Evaluating Modules in Graph Contrastive Learning

    Authors: Ganqu Cui, Yufeng Du, Cheng Yang, Jie Zhou, Liang Xu, Xing Zhou, Xingyi Cheng, Zhiyuan Liu

    Abstract: The recent emergence of contrastive learning approaches facilitates the application on graph representation learning (GRL), introducing graph contrastive learning (GCL) into the literature. These methods contrast semantically similar and dissimilar sample pairs to encode the semantics into node or graph embeddings. However, most existing works only performed model-level evaluation, and di…

    Submitted 2 June, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

  38. arXiv:2011.00621  [pdf, other]

    cs.RO

    Semantic Task Planning for Service Robots in Open World

    Authors: Guowei Cui, Wei Shuai, Xiaoping Chen

    Abstract: In this paper, we present a planning system based on semantic reasoning for a general-purpose service robot, which is aimed at behaving more intelligently in domains that contain incomplete information, under-specified goals, and dynamic changes. First, two kinds of data are generated by Natural Language Processing module from the speech: (i) action frames and their relationships; (ii) the modifie…

    Submitted 1 November, 2020; originally announced November 2020.

  39. arXiv:2010.13297  [pdf, other]

    cs.LG

    Discriminatively Constrained Semi-supervised Multi-view Nonnegative Matrix Factorization with Graph Regularization

    Authors: Guosheng Cui, Ruxin Wang, Dan Wu, Ye Li

    Abstract: In recent years, semi-supervised multi-view nonnegative matrix factorization (MVNMF) algorithms have achieved promising performances for multi-view clustering. While most of semi-supervised MVNMFs have failed to effectively consider discriminative information among clusters and feature alignment from multiple views simultaneously. In this paper, a novel Discriminatively Constrained Semi-Supervised…

    Submitted 25 October, 2020; originally announced October 2020.

  40. arXiv:2007.08547  [pdf, other]

    cs.CV cs.GR

    Talking-head Generation with Rhythmic Head Motion

    Authors: Lele Chen, Guofeng Cui, Celong Liu, Zhong Li, Ziyi Kou, Yi Xu, Chenliang Xu

    Abstract: When people deliver a speech, they naturally move heads, and this rhythmic head motion conveys prosodic information. However, generating a lip-synced video while moving head naturally is challenging. While remarkably successful, existing works either generate still talkingface videos or rely on landmark/video frames as sparse/dense mapping guidance to generate head movements, which leads to unreal…

    Submitted 16 July, 2020; originally announced July 2020.

  41. Adaptive Graph Encoder for Attributed Graph Embedding

    Authors: Ganqu Cui, Jie Zhou, Cheng Yang, Zhiyuan Liu

    Abstract: Attributed graph embedding, which learns vector representations from graph topology and node features, is a challenging task for graph analysis. Recently, methods based on graph convolutional networks (GCNs) have made great progress on this task. However, existing GCN-based methods have three major drawbacks. Firstly, our experiments indicate that the entanglement of graph convolutional filters and…

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: To appear in KDD 2020

  42. arXiv:2005.03201  [pdf, other]

    cs.CV eess.IV

    What comprises a good talking-head video generation?: A Survey and Benchmark

    Authors: Lele Chen, Guofeng Cui, Ziyi Kou, Haitian Zheng, Chenliang Xu

    Abstract: Over the years, performance evaluation has become essential in computer vision, enabling tangible progress in many sub-fields. While talking-head video generation has become an emerging research topic, existing evaluations on this topic present many limitations. For example, most approaches use human subjects (e.g., via Amazon MTurk) to evaluate their research claims directly. This subjective eval…

    Submitted 6 May, 2020; originally announced May 2020.

  43. arXiv:1911.07160  [pdf, other]

    cs.CV

    Improve CAM with Auto-adapted Segmentation and Co-supervised Augmentation

    Authors: Ziyi Kou, Guofeng Cui, Shaojie Wang, Wentian Zhao, Chenliang Xu

    Abstract: Weakly Supervised Object Localization (WSOL) methods generate both classification and localization results by learning from only image category labels. Previous methods usually utilize class activation map (CAM) to obtain target object regions. However, most of them only focus on improving foreground object parts in CAM, but ignore the important effect of its background contents. In this paper, we…

    Submitted 13 January, 2021; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: Accepted by WACV2021. Equal contribution for the first two authors

  44. arXiv:1909.03619  [pdf, other]

    cs.CV

    Weakly Supervised Localization Using Background Images

    Authors: Ziyi Kou, Wentian Zhao, Guofeng Cui, Shaojie Wang

    Abstract: Weakly Supervised Object Localization (WSOL) methods usually rely on fully convolutional networks in order to obtain class activation maps (CAMs) of targeted labels. However, these networks always highlight the most discriminative parts to perform the task, so the located areas are much smaller than the entire targeted objects. In this work, we propose a novel end-to-end model to enlarge CAMs generated from…

    Submitted 10 September, 2019; v1 submitted 8 September, 2019; originally announced September 2019.

    Comments: Course project of CSC577, University of Rochester

  45. arXiv:1907.11580  [pdf, other]

    cs.DC

    Edge User Allocation with Dynamic Quality of Service

    Authors: Phu Lai, Qiang He, Guangming Cui, Xiaoyu Xia, Mohamed Abdelrazek, Feifei Chen, John Hosking, John Grundy, Yun Yang

    Abstract: In edge computing, edge servers are placed in close proximity to end-users. App vendors can deploy their services on edge servers to reduce network latency experienced by their app users. The edge user allocation (EUA) problem challenges service providers with the objective to maximize the number of allocated app users with hired computing resources on edge servers while ensuring their fixed quali…

    Submitted 26 July, 2019; originally announced July 2019.

    Comments: This manuscript has been accepted for publication at the 17th International Conference on Service-Oriented Computing and may be published in the book series Lecture Notes in Computer Science. All copyrights reserved to Springer Nature Switzerland AG, Gewerbestrasse 11, 6330 Cham, Switzerland

  46. arXiv:1812.08434  [pdf]

    cs.LG cs.AI stat.ML

    Graph Neural Networks: A Review of Methods and Applications

    Authors: Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, Maosong Sun

    Abstract: Lots of learning tasks require dealing with graph data which contains rich relation information among elements. Modeling physics systems, learning molecular fingerprints, predicting protein interface, and classifying diseases demand a model to learn from graph inputs. In other domains such as learning from non-structural data like texts and images, reasoning on extracted structures (like the depen…

    Submitted 6 October, 2021; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: Published at AI Open 2021

  47. arXiv:1804.08066  [pdf, other]

    cs.LG stat.ML

    MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server

    Authors: Guoxin Cui, Jun Xu, Wei Zeng, Yanyan Lan, Jiafeng Guo, Xueqi Cheng

    Abstract: One of the most significant bottlenecks in training large scale machine learning models on parameter server (PS) is the communication overhead, because it needs to frequently exchange the model gradients between the workers and servers during the training iterations. Gradient quantization has been proposed as an effective approach to reducing the communication volume. One key issue in gradient quan…

    Submitted 22 April, 2018; originally announced April 2018.

    Comments: 7 pages, 5 figures

  48. arXiv:1803.02256  [pdf, other]

    cs.CV

    Depth Information Guided Crowd Counting for Complex Crowd Scenes

    Authors: Mingliang Xu, Zhaoyang Ge, Xiaoheng Jiang, Gaoge Cui, Pei Lv, Bing Zhou, Changsheng Xu

    Abstract: It is important to monitor and analyze crowd events for the sake of city safety. In an EDOF (extended depth of field) image with a crowded scene, the distribution of people is highly imbalanced. People far away from the camera look much smaller and often occlude each other heavily, while people close to the camera look larger. In such a case, it is difficult to accurately estimate the number of pe…

    Submitted 23 April, 2018; v1 submitted 3 March, 2018; originally announced March 2018.

    Comments: 9 pages, 8 figures. The paper is under consideration at Pattern Recognition Letters

  49. arXiv:1701.05700  [pdf, ps, other]

    cs.IT

    Antenna Deployment Method for MIMO Radar under the Situation of Multiple Interference Regions

    Authors: Tianxian Zhang, Jiadong Liang, Yichuan Yang, Guolong Cui, Lingjiang Kong, Xiaobo Yang

    Abstract: In this paper, considering multiple interference regions simultaneously, an optimal antenna deployment problem for distributed Multi-Input Multi-Output (MIMO) radar is investigated. The optimal antenna deployment problem is solved by proposing an antenna deployment method based on Multi-Objective Particle Swarm Optimization (MOPSO). Firstly, we construct a multi-objective optimization problem for…

    Submitted 20 January, 2017; originally announced January 2017.

    Comments: 12 pages