Skip to main content

Showing 1–50 of 161 results for author: Qin, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00924  [pdf, other

    cs.CL

    EXCGEC: A Benchmark of Edit-wise Explainable Chinese Grammatical Error Correction

    Authors: **gheng Ye, Shang Qin, Yinghui Li, Xuxin Cheng, Libo Qin, Hai-Tao Zheng, Peng Xing, Zishan Xu, Guo Cheng, Zhao Wei

    Abstract: Existing studies explore the explainability of Grammatical Error Correction (GEC) in a limited scenario, where they ignore the interaction between corrections and explanations. To bridge the gap, this paper introduces the task of EXplainable GEC (EXGEC), which focuses on the integral role of both correction and explanation tasks. To facilitate the task, we propose EXCGEC, a tailored benchmark for… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 22 pages, 10 tables, 9 figures. Under review

  2. arXiv:2406.17233  [pdf, other

    cs.SE cs.CL

    Self-Constructed Context Decompilation with Fined-grained Alignment Enhancement

    Authors: Yunlong Feng, Yang Xu, Dechuan Teng, Honglin Mu, Xiao Xu, Libo Qin, Wanxiang Che, Qingfu Zhu

    Abstract: Decompilation transforms compiled code back into a high-level programming language for analysis when source code is unavailable. Previous work has primarily focused on enhancing decompilation performance by increasing the scale of model parameters or training data for pre-training. Based on the characteristics of the decompilation task, we propose two methods: (1) Without fine-tuning, the Self-Con… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Under Review

  3. arXiv:2406.16562  [pdf, other

    cs.CV cs.CL

    EVALALIGN: Supervised Fine-Tuning Multimodal LLMs with Human-Aligned Data for Evaluating Text-to-Image Models

    Authors: Zhiyu Tan, Xiaomeng Yang, Luozheng Qin, Meng** Yang, Cheng Zhang, Hao Li

    Abstract: The recent advancements in text-to-image generative models have been remarkable. Yet, the field suffers from a lack of evaluation metrics that accurately reflect the performance of these models, particularly lacking fine-grained metrics that can guide the optimization of the models. In this paper, we propose EvalAlign, a metric characterized by its accuracy, stability, and fine granularity. Our ap… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Github Repository: https://github.com/SAIS-FUXI/EvalAlign

  4. arXiv:2406.16268  [pdf, other

    cs.DB

    Efficient Antagonistic k-plex Enumeration in Signed Graphs

    Authors: Lantian Xu, Rong-Hua Li, Dong Wen, Qiangqiang Dai, Guoren Wang, Lu Qin

    Abstract: A signed graph is a graph where each edge receives a sign, positive or negative. The signed graph model has been used in many real applications, such as protein complex discovery and social network analysis. Finding cohesive subgraphs in signed graphs is a fundamental problem. A k-plex is a common model for cohesive subgraphs in which every vertex is adjacent to all but at most k vertices within t… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  5. arXiv:2406.13940  [pdf, other

    cs.CL

    AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought

    Authors: Yongheng Zhang, Qiguang Chen, Min Li, Wanxiang Che, Libo Qin

    Abstract: Cross-lingual chain-of-thought can effectively complete reasoning tasks across languages, which gains increasing attention. Recently, dominant approaches in the literature improve cross-lingual alignment capabilities by integrating reasoning knowledge from different languages. Despite achieving excellent performance, current methods still have two main challenges: (1) Manual language specification… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL2024 Findings

  6. arXiv:2406.11217  [pdf, other

    cs.AI cs.CL cs.CV physics.ao-ph

    WeatherQA: Can Multimodal Language Models Reason about Severe Weather?

    Authors: Chengqian Ma, Zhanxiang Hua, Alexandra Anderson-Frey, Vikram Iyer, Xin Liu, Lianhui Qin

    Abstract: Severe convective weather events, such as hail, tornadoes, and thunderstorms, often occur quickly yet cause significant damage, costing billions of dollars every year. This highlights the importance of forecasting severe weather threats hours in advance to better prepare meteorologists and residents in at-risk areas. Can modern large foundation models perform such forecasting? Existing weather ben… ▽ More

    Submitted 23 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 26 pages, 9 figures

  7. arXiv:2406.10861  [pdf, other

    cs.LG cs.DC

    Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions

    Authors: Laiqiao Qin, Tianqing Zhu, Wanlei Zhou, Philip S. Yu

    Abstract: Federated Learning (FL) is a distributed and privacy-preserving machine learning paradigm that coordinates multiple clients to train a model while kee** the raw data localized. However, this traditional FL poses some challenges, including privacy risks, data heterogeneity, communication bottlenecks, and system heterogeneity issues. To tackle these challenges, knowledge distillation (KD) has been… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  8. arXiv:2406.10505  [pdf, other

    cs.CL

    CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding

    Authors: Libo Qin, Fuxuan Wei, Qiguang Chen, **gxuan Zhou, Shijue Huang, Jiasheng Si, Wenpeng Lu, Wanxiang Che

    Abstract: Slot filling and intent detection are two highly correlated tasks in spoken language understanding (SLU). Recent SLU research attempts to explore zero-shot prompting techniques in large language models to alleviate the data scarcity problem. Nevertheless, the existing prompting work ignores the cross-task interaction information for SLU, which leads to sub-optimal performance. To solve this proble… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  9. arXiv:2406.05673  [pdf, other

    cs.AI cs.CL

    Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

    Authors: Fangxu Yu, Lai Jiang, Haoqiang Kang, Shibo Hao, Lianhui Qin

    Abstract: Divergent thinking, the cognitive process of generating diverse solutions, is a hallmark of human creativity and problem-solving. For machines, sampling diverse solution trajectories in complex reasoning problems is crucial for robust outcomes, data augmentation, and enhanced model generalization. Large language models (LLMs) often struggle with generating high-quality, diverse reasoning. While su… ▽ More

    Submitted 24 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  10. arXiv:2406.01276  [pdf, other

    cs.CL

    EduNLP: Towards a Unified and Modularized Library for Educational Resources

    Authors: Zhenya Huang, Yuting Ning, Longhu Qin, Shiwei Tong, Shangzi Xue, Tong Xiao, Xin Lin, Jiayu Liu, Qi Liu, Enhong Chen, Shi**g Wang

    Abstract: Educational resource understanding is vital to online learning platforms, which have demonstrated growing applications recently. However, researchers and developers always struggle with using existing general natural language toolkits or domain-specific models. The issue raises a need to develop an effective and easy-to-use one that benefits AI education-related research and applications. To bridg… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  11. arXiv:2406.01070  [pdf, other

    cs.CL

    Guiding ChatGPT to Generate Salient Domain Summaries

    Authors: Jun Gao, Ziqiang Cao, Shaoyao Huang, Luozheng Qin, Chunhui Ai

    Abstract: ChatGPT is instruct-tuned to generate general and human-expected content to align with human preference through Reinforcement Learning from Human Feedback (RLHF), meanwhile resulting in generated responses not salient enough. Therefore, in this case, ChatGPT may fail to satisfy domain requirements in zero-shot settings, leading to poor ROUGE scores. Inspired by the In-Context Learning (ICL) and re… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  12. arXiv:2405.16473  [pdf, other

    cs.CV cs.AI cs.CL

    M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought

    Authors: Qiguang Chen, Libo Qin, ** Zhang, Zhi Chen, Xiao Xu, Wanxiang Che

    Abstract: Multi-modal Chain-of-Thought (MCoT) requires models to leverage knowledge from both textual and visual modalities for step-by-step reasoning, which gains increasing attention. Nevertheless, the current MCoT benchmark still faces some challenges: (1) absence of visual modal reasoning, (2) single-step visual modal reasoning, and (3) Domain missing, thereby hindering the development of MCoT. Motivate… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted at ACL2024 Main Conference

  13. arXiv:2405.16471  [pdf, other

    cs.NI

    Performance Optimization in RSMA-assisted Uplink xURLLC IIoT Networks with Statistical QoS Provisioning

    Authors: Yuang Chen, Hancheng Lu, Chang Wu, Langtian Qin, Xiaobo Guo

    Abstract: Industry 5.0 and beyond networks have driven the emergence of numerous mission-critical applications, prompting contemplation of the neXt-generation ultra-reliable low-latency communication (xURLLC). To guarantee low-latency requirements, xURLLC heavily relies on short-blocklength packets with sporadic arrival traffic. As a disruptive multi-access technique, rate-splitting multiple access (RSMA) h… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 13 pages, 8 figures, submitted to IEEE Transactions for potential publication

  14. arXiv:2405.14399  [pdf, other

    cs.LG cs.CY

    Endowing Interpretability for Neural Cognitive Diagnosis by Efficient Kolmogorov-Arnold Networks

    Authors: Shangshang Yang, Linrui Qin, Xiaoshan Yu

    Abstract: In the realm of intelligent education, cognitive diagnosis plays a crucial role in subsequent recommendation tasks attributed to the revealed students' proficiency in knowledge concepts. Although neural network-based neural cognitive diagnosis models (CDMs) have exhibited significantly better performance than traditional models, neural cognitive diagnosis is criticized for the poor model interpret… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Leverage Kolmogorov-Arnold Networks (KANs) for cognitive diagnosis, enhancing the model interpretability. The diagnosis performance is also improved

    MSC Class: 68T30 ACM Class: I.2.4

  15. arXiv:2405.12914  [pdf, other

    cs.CV

    An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation

    Authors: Zhiyu Tan, Meng** Yang, Luozheng Qin, Hao Yang, Ye Qian, Qiang Zhou, Cheng Zhang, Hao Li

    Abstract: One critical prerequisite for faithful text-to-image generation is the accurate understanding of text inputs. Existing methods leverage the text encoder of the CLIP model to represent input prompts. However, the pre-trained CLIP model can merely encode English with a maximum token length of 77. Moreover, the model capacity of the text encoder from CLIP is relatively limited compared to Large Langu… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Technical report. Project page: https://github.com/llm-conditioned-diffusion/llm-conditioned-diffusion

  16. arXiv:2405.12871  [pdf, other

    cs.DB

    Efficient Influence Minimization via Node Blocking

    Authors: **ghao Wang, Yan** Wu, Xiaoyang Wang, Ying Zhang, Lu Qin, Wenjie Zhang, Xuemin Lin

    Abstract: Given a graph G, a budget k and a misinformation seed set S, Influence Minimization (IMIN) via node blocking aims to find a set of k nodes to be blocked such that the expected spread of S is minimized. This problem finds important applications in suppressing the spread of misinformation and has been extensively studied in the literature. However, existing solutions for IMIN still incur significant… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  17. arXiv:2405.12819  [pdf, other

    cs.CL cs.AI

    Large Language Models Meet NLP: A Survey

    Authors: Libo Qin, Qiguang Chen, Xiachong Feng, Yang Wu, Yongheng Zhang, Yinghui Li, Min Li, Wanxiang Che, Philip S. Yu

    Abstract: While large language models (LLMs) like ChatGPT have shown impressive capabilities in Natural Language Processing (NLP) tasks, a systematic investigation of their potential in this field remains largely unexplored. This study aims to address this gap by exploring the following questions: (1) How are LLMs currently applied to NLP tasks in the literature? (2) Have traditional NLP tasks already been… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  18. arXiv:2405.06217  [pdf, other

    cs.CV cs.MM

    DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding

    Authors: Ting Liu, Xuyang Liu, Siteng Huang, Honggang Chen, Quanjun Yin, Long Qin, Donglin Wang, Yue Hu

    Abstract: Visual grounding (VG) is a challenging task to localize an object in an image based on a textual description. Recent surge in the scale of VG models has substantially improved performance, but also introduced a significant burden on computational costs during fine-tuning. In this paper, we explore applying parameter-efficient transfer learning (PETL) to efficiently transfer the pre-trained vision-… ▽ More

    Submitted 8 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted by ICME 2024 (Oral)

  19. arXiv:2404.15597  [pdf, other

    cs.NE cs.AI cs.LG cs.MA

    GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL

    Authors: Lang Qin, Ziming Wang, Runhao Jiang, Rui Yan, Hua** Tang

    Abstract: Spiking neural networks (SNNs) are widely applied in various fields due to their energy-efficient and fast-inference capabilities. Applying SNNs to reinforcement learning (RL) can significantly reduce the computational resource requirements for agents and improve the algorithm's performance under resource-constrained conditions. However, in current spiking reinforcement learning (SRL) algorithms,… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2404.07963  [pdf, other

    cs.CY cs.AI cs.CL cs.HC cs.LG

    EduAgent: Generative Student Agents in Learning

    Authors: Songlin Xu, Xinyu Zhang, Lianhui Qin

    Abstract: Student simulation in online education is important to address dynamic learning behaviors of students with diverse backgrounds. Existing simulation models based on deep learning usually need massive training data, lacking prior knowledge in educational contexts. Large language models (LLMs) may contain such prior knowledge since they are pre-trained from a large corpus. However, because student be… ▽ More

    Submitted 23 March, 2024; originally announced April 2024.

  21. arXiv:2404.07017  [pdf, other

    cs.CL cs.AI

    Improving Language Model Reasoning with Self-motivated Learning

    Authors: Yunlong Feng, Yang Xu, Libo Qin, Yasheng Wang, Wanxiang Che

    Abstract: Large-scale high-quality training data is important for improving the performance of models. After trained with data that has rationales (reasoning steps), models gain reasoning capability. However, the dataset with high-quality rationales is relatively scarce due to the high annotation cost. To address this issue, we propose \textit{Self-motivated Learning} framework. The framework motivates the… ▽ More

    Submitted 30 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-COLING 2024

  22. arXiv:2404.05019  [pdf, other

    cs.LG cs.CL cs.DC

    Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts

    Authors: Weilin Cai, Juyong Jiang, Le Qin, Junwei Cui, Sunghun Kim, Jiayi Huang

    Abstract: Expert parallelism has been introduced as a strategy to distribute the computational workload of sparsely-gated mixture-of-experts (MoE) models across multiple computing devices, facilitating the execution of these increasingly large-scale models. However, the All-to-All communication intrinsic to expert parallelism constitutes a significant overhead, diminishing the MoE models' efficiency. Curren… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  23. arXiv:2404.04925  [pdf, other

    cs.CL

    Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers

    Authors: Libo Qin, Qiguang Chen, Yuhang Zhou, Zhi Chen, Yinghui Li, Lizi Liao, Min Li, Wanxiang Che, Philip S. Yu

    Abstract: Multilingual Large Language Models are capable of using powerful Large Language Models to handle and respond to queries in multiple languages, which achieves remarkable success in multilingual natural language processing tasks. Despite these breakthroughs, there still remains a lack of a comprehensive survey to summarize existing approaches and recent developments in this field. To this end, in th… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  24. arXiv:2404.03647  [pdf, other

    math.OC cs.AI cs.LG

    Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

    Authors: Darioush Kevian, Usman Syed, Xingang Guo, Aaron Havens, Geir Dullerud, Peter Seiler, Lianhui Qin, Bin Hu

    Abstract: In this paper, we explore the capabilities of state-of-the-art large language models (LLMs) such as GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra in solving undergraduate-level control problems. Controls provides an interesting case study for LLM reasoning due to its combination of mathematical theory and engineering design. We introduce ControlBench, a benchmark dataset tailored to reflect the bread… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  25. arXiv:2403.20005  [pdf, other

    cs.CL

    Large Language Model based Situational Dialogues for Second Language Learning

    Authors: Shuyao Xu, Long Qin, Tianyang Chen, Zhenzhou Zha, Bingxue Qiu, Weizhi Wang

    Abstract: In second language learning, scenario-based conversation practice is important for language learners to achieve fluency in speaking, but students often lack sufficient opportunities to practice their conversational skills with qualified instructors or native speakers. To bridge this gap, we propose situational dialogue models for students to engage in conversational practice. Our situational dialo… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 14 pages, 6 figures

  26. arXiv:2403.15464  [pdf, other

    cs.CL cs.AI cs.LG cs.MA

    LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction

    Authors: Hejie Cui, Zhuocheng Shen, Jieyu Zhang, Hui Shao, Lianhui Qin, Joyce C. Ho, Carl Yang

    Abstract: Electronic health records (EHRs) contain valuable patient data for health-related prediction tasks, such as disease prediction. Traditional approaches rely on supervised learning methods that require large labeled datasets, which can be expensive and challenging to obtain. In this study, we investigate the feasibility of applying Large Language Models (LLMs) to convert structured patient visit dat… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    ACM Class: J.3; I.2.7

  27. arXiv:2403.15059  [pdf, other

    cs.CV cs.AI

    MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration

    Authors: Zhichao Wei, Qingkun Su, Long Qin, Weizhi Wang

    Abstract: Recent advances in tuning-free personalized image generation based on diffusion models are impressive. However, to improve subject fidelity, existing methods either retrain the diffusion model or infuse it with dense visual embeddings, both of which suffer from poor generalization and efficiency. Also, these methods falter in multi-subject image generation due to the unconstrained cross-attention… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  28. arXiv:2403.12574  [pdf, other

    cs.CV cs.AI cs.NE

    EAS-SNN: End-to-End Adaptive Sampling and Representation for Event-based Detection with Recurrent Spiking Neural Networks

    Authors: Ziming Wang, Ziling Wang, Huaning Li, Lang Qin, Runhao Jiang, De Ma, Hua** Tang

    Abstract: Event cameras, with their high dynamic range and temporal resolution, are ideally suited for object detection, especially under scenarios with motion blur and challenging lighting conditions. However, while most existing approaches prioritize optimizing spatiotemporal representations with advanced detection backbones and early aggregation functions, the crucial issue of adaptive event sampling rem… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  29. arXiv:2403.11568  [pdf, other

    cs.CV

    EffiVED:Efficient Video Editing via Text-instruction Diffusion Models

    Authors: Zhenghao Zhang, Zuozhuo Dai, Long Qin, Weizhi Wang

    Abstract: Large-scale text-to-video models have shown remarkable abilities, but their direct application in video editing remains challenging due to limited available datasets. Current video editing methods commonly require per-video fine-tuning of diffusion models or specific inversion optimization to ensure high-fidelity edits. In this paper, we introduce EffiVED, an efficient diffusion-based model that d… ▽ More

    Submitted 5 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  30. arXiv:2403.09500  [pdf, other

    cs.CV

    Faceptor: A Generalist Model for Face Perception

    Authors: Lixiong Qin, Mei Wang, Xuannan Liu, Yuhang Zhang, Wei Deng, Xiaoshuai Song, Weiran Xu, Weihong Deng

    Abstract: With the comprehensive research conducted on various face analysis tasks, there is a growing interest among researchers to develop a unified approach to face perception. Existing methods mainly discuss unified representation and training, which lack task extensibility and application efficiency. To tackle this issue, we focus on the unified model structure, exploring a face generalist model. As an… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  31. arXiv:2403.04659  [pdf, other

    cs.CR cs.HC

    "Did They F***ing Consent to That?": Safer Digital Intimacy via Proactive Protection Against Image-Based Sexual Abuse

    Authors: Lucy Qin, Vaughn Hamilton, Sharon Wang, Yigit Aydinalp, Marin Scarlett, Elissa M. Redmiles

    Abstract: As many as 8 in 10 adults share intimate content such as nude or lewd images. Sharing such content has significant benefits for relationship intimacy and body image, and can offer employment. However, stigmatizing attitudes and a lack of technological mitigations put those sharing such content at risk of sexual violence. An estimated 1 in 3 people have been subjected to image-based sexual abuse (I… ▽ More

    Submitted 13 June, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  32. arXiv:2403.01988  [pdf, other

    cs.CL

    FakeNewsGPT4: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs

    Authors: Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

    Abstract: The massive generation of multimodal fake news exhibits substantial distribution discrepancies, prompting the need for generalized detectors. However, the insulated nature of training within specific domains restricts the capability of classical detectors to obtain open-world facts. In this paper, we propose FakeNewsGPT4, a novel framework that augments Large Vision-Language Models (LVLMs) with fo… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  33. arXiv:2402.11420  [pdf, other

    cs.CL

    Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

    Authors: Yinghui Li, Shang Qin, **gheng Ye, Shirong Ma, Yangning Li, Libo Qin, Xuming Hu, Wenhao Jiang, Hai-Tao Zheng, Philip S. Yu

    Abstract: Recently, Large Language Models (LLMs) have been widely studied by researchers for their roles in various downstream NLP tasks. As a fundamental task in the NLP field, Chinese Grammatical Error Correction (CGEC) aims to correct all potential grammatical errors in the input sentences. Previous studies have shown that LLMs' performance as correctors on CGEC remains unsatisfactory due to its challeng… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  34. arXiv:2402.10691  [pdf, other

    cs.CL

    Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts

    Authors: Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che

    Abstract: Program of Thoughts (PoT) is an approach characterized by its executable intermediate steps, which ensure the accuracy of the logical calculations in the reasoning process. Currently, PoT primarily uses Python. However, relying solely on a single language may result in suboptimal solutions and overlook the potential benefits of other programming languages. In this paper, we conduct comprehensive e… ▽ More

    Submitted 16 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: under review

  35. arXiv:2402.08679  [pdf, other

    cs.LG cs.AI cs.CL

    COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

    Authors: Xingang Guo, Fangxu Yu, Huan Zhang, Lianhui Qin, Bin Hu

    Abstract: Jailbreaks on large language models (LLMs) have recently received increasing attention. For a comprehensive assessment of LLM safety, it is essential to consider jailbreaks with diverse attributes, such as contextual coherence and sentiment/stylistic variations, and hence it is beneficial to study controllable jailbreaking, i.e. how to enforce control on LLM attacks. In this paper, we formally for… ▽ More

    Submitted 6 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  36. arXiv:2402.06609  [pdf, other

    cs.CY cs.CR

    You Still See Me: How Data Protection Supports the Architecture of ML Surveillance

    Authors: Rui-Jie Yew, Lucy Qin, Suresh Venkatasubramanian

    Abstract: Human data forms the backbone of machine learning. Data protection laws thus have strong bearing on how ML systems are governed. Given that most requirements in data protection laws accompany the processing of personal data, organizations have an incentive to keep their data out of legal scope. This makes the development and application of certain privacy-preserving techniques--data protection tec… ▽ More

    Submitted 18 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: A version of this work was accepted at the 2023 NeurIPS Workshop on Regulatable ML

  37. arXiv:2402.03900  [pdf, other

    cs.CL

    Pro-HAN: A Heterogeneous Graph Attention Network for Profile-Based Spoken Language Understanding

    Authors: Dechuan Teng, Chunlin Lu, Xiao Xu, Wanxiang Che, Libo Qin

    Abstract: Recently, Profile-based Spoken Language Understanding (SLU) has gained increasing attention, which aims to incorporate various types of supplementary profile information (i.e., Knowledge Graph, User Profile, Context Awareness) to eliminate the prevalent ambiguities in user utterances. However, existing approaches can only separately model different profile information, without considering their in… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted at ICASSP 2024

  38. arXiv:2401.13692  [pdf, ps, other

    cs.CR

    Local Privacy-preserving Mechanisms and Applications in Machine Learning

    Authors: Likun Qin, Tianshuo Qiu

    Abstract: The emergence and evolution of Local Differential Privacy (LDP) and its various adaptations play a pivotal role in tackling privacy issues related to the vast amounts of data generated by intelligent devices, which are crucial for data-informed decision-making in the realm of crowdsensing. Utilizing these extensive datasets can provide critical insights but also introduces substantial privacy conc… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.00861

  39. arXiv:2401.12507  [pdf, other

    cs.CV

    Open-Set Facial Expression Recognition

    Authors: Yuhang Zhang, Yue Yao, Xuannan Liu, Lixiong Qin, Wen**g Wang, Weihong Deng

    Abstract: Facial expression recognition (FER) models are typically trained on datasets with a fixed number of seven basic classes. However, recent research works point out that there are far more expressions than the basic ones. Thus, when these models are deployed in the real world, they may encounter unknown classes, such as compound expressions that cannot be classified into existing basic classes. To ad… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI2024

  40. arXiv:2401.00424  [pdf, other

    cs.CL

    SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection

    Authors: Shijue Huang, Libo Qin, Bingbing Wang, Geng Tu, Ruifeng Xu

    Abstract: Multi-modal intent detection aims to utilize various modalities to understand the user's intentions, which is essential for the deployment of dialogue systems in real-world scenarios. The two core challenges for multi-modal intent detection are (1) how to effectively align and fuse different features of modalities and (2) the limited labeled multi-modal intent training data. In this work, we intro… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  41. arXiv:2312.16499  [pdf, other

    cs.CV

    Camera calibration for the surround-view system: a benchmark and dataset

    Authors: L Qin, C Lin, S Huang, S Yang, Y Zhao

    Abstract: Surround-view system (SVS) is widely used in the Advanced Driver Assistance System (ADAS). SVS uses four fisheye lenses to monitor real-time scenes around the vehicle. However, accurate intrinsic and extrinsic parameter estimation is required for the proper functioning of the system. At present, the intrinsic calibration can be pipeline by utilizing checkerboard algorithm, while extrinsic calibrat… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  42. arXiv:2312.15252  [pdf, other

    q-bio.BM cs.LG

    DTIAM: A unified framework for predicting drug-target interactions, binding affinities and activation/inhibition mechanisms

    Authors: Zhangli Lu, Chuqi Lei, Kaili Wang, Libo Qin, **g Tang, Min Li

    Abstract: Accurate and robust prediction of drug-target interactions (DTIs) plays a vital role in drug discovery. Despite extensive efforts have been invested in predicting novel DTIs, existing approaches still suffer from insufficient labeled data and cold start problems. More importantly, there is currently a lack of studies focusing on elucidating the mechanism of action (MoA) between drugs and targets.… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  43. arXiv:2312.01499  [pdf, other

    eess.SY cs.DC eess.SP

    Towards Decentralized Task Offloading and Resource Allocation in User-Centric Mobile Edge Computing

    Authors: Langtian Qin, Hancheng Lu, Yuang Chen, Baolin Chong, Feng Wu

    Abstract: In the traditional cellular-based mobile edge computing (MEC), users at the edge of the cell are prone to suffer severe inter-cell interference and signal attenuation, leading to low throughput even transmission interruptions. Such edge effect severely obstructs offloading of tasks to MEC servers. To address this issue, we propose user-centric mobile edge computing (UCMEC), a novel MEC architectur… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 16 pages, 13 figures

  44. From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models

    Authors: Zachary Englhardt, Chengqian Ma, Margaret E. Morris, Xuhai "Orson" Xu, Chun-Cheng Chang, Lianhui Qin, Daniel McDuff, Xin Liu, Shwetak Patel, Vikram Iyer

    Abstract: Passively collected behavioral health data from ubiquitous sensors holds significant promise to provide mental health professionals insights from patient's daily lives; however, develo** analysis tools to use this data in clinical practice requires addressing challenges of generalization across devices and weak or ambiguous correlations between the measured signals and an individual's mental hea… ▽ More

    Submitted 25 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

  45. arXiv:2311.12886  [pdf, other

    cs.CV

    AnimateAnything: Fine-Grained Open Domain Image Animation with Motion Guidance

    Authors: Zuozhuo Dai, Zhenghao Zhang, Yao Yao, Bingxue Qiu, Siyu Zhu, Long Qin, Weizhi Wang

    Abstract: Image animation is a key task in computer vision which aims to generate dynamic visual content from static image. Recent image animation methods employ neural based rendering technique to generate realistic animations. Despite these advancements, achieving fine-grained and controllable image animation guided by text remains challenging, particularly for open-domain images captured in diverse real… ▽ More

    Submitted 4 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

  46. arXiv:2311.11429  [pdf, other

    cs.LG

    Fast Heavy Inner Product Identification Between Weights and Inputs in Neural Network Training

    Authors: Lianke Qin, Saayan Mitra, Zhao Song, Yuanyuan Yang, Tianyi Zhou

    Abstract: In this paper, we consider a heavy inner product identification problem, which generalizes the Light Bulb problem~(\cite{prr89}): Given two sets $A \subset \{-1,+1\}^d$ and $B \subset \{-1,+1\}^d$ with $|A|=|B| = n$, if there are exact $k$ pairs whose inner product passes a certain threshold, i.e., $\{(a_1, b_1), \cdots, (a_k, b_k)\} \subset A \times B$ such that… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: IEEE BigData 2023

  47. arXiv:2311.09879  [pdf, ps, other

    cs.IT cs.NI

    Cross-Layer Optimization for Statistical QoS Provision in C-RAN with Finite-Length Coding

    Authors: Chang Wu, Hancheng Lu, Yuang Chen, Langtian Qin

    Abstract: The cloud radio access network (C-RAN) has become the foundational structure for various emerging communication paradigms, leveraging the flexible deployment of distributed access points (APs) and centralized task processing. In this paper, we propose a cross-layer optimization framework based on a practical finite-length coding communication system in C-RAN, aiming at maximizing bandwidth efficie… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 13 pages, 17 figures

  48. arXiv:2311.09682  [pdf, other

    cs.CL cs.AI

    MacGyver: Are Large Language Models Creative Problem Solvers?

    Authors: Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Ye** Choi, Thomas L. Griffiths, Faeze Brahman

    Abstract: We explore the creative problem-solving capabilities of modern LLMs in a novel constrained setting. To this end, we create MACGYVER, an automatically generated dataset consisting of over 1,600 real-world problems deliberately designed to trigger innovative usage of objects and necessitate out-of-the-box thinking. We then present our collection to both LLMs and humans to compare and contrast their… ▽ More

    Submitted 27 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  49. arXiv:2311.09656  [pdf, other

    cs.CL cs.AI

    Structured Chemistry Reasoning with Large Language Models

    Authors: Siru Ouyang, Zhuosheng Zhang, Bing Yan, Xuan Liu, Ye** Choi, Jiawei Han, Lianhui Qin

    Abstract: Large Language Models (LLMs) excel in diverse areas, yet struggle with complex scientific reasoning, especially in the field of chemistry. Different from the simple chemistry tasks (e.g., molecule classification) addressed in previous studies, complex chemistry problems require not only vast knowledge and precise calculation, but also compositional reasoning about rich dynamic interactions of diff… ▽ More

    Submitted 9 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Work in progress

  50. arXiv:2311.09008  [pdf, other

    cs.CL

    End-to-end Task-oriented Dialogue: A Survey of Tasks, Methods, and Future Directions

    Authors: Libo Qin, Wenbo Pan, Qiguang Chen, Lizi Liao, Zhou Yu, Yue Zhang, Wanxiang Che, Min Li

    Abstract: End-to-end task-oriented dialogue (EToD) can directly generate responses in an end-to-end fashion without modular training, which attracts escalating popularity. The advancement of deep neural networks, especially the successful use of large pre-trained models, has further led to significant progress in EToD research in recent years. In this paper, we present a thorough review and provide a unifie… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP2023