Showing 1–50 of 103 results for author: Qi, S

  1. arXiv:2407.02073  [pdf, other]

    cs.LG

    Contribution Evaluation of Heterogeneous Participants in Federated Learning via Prototypical Representations

    Authors: Qi Guo, Minghao Yao, Zhen Tian, Saiyu Qi, Yong Qi, Yun Lin, Jin Song Dong

    Abstract: Contribution evaluation in federated learning (FL) has become a pivotal research area due to its applicability across various domains, such as detecting low-quality datasets, enhancing model robustness, and designing incentive mechanisms. Existing contribution evaluation methods, which primarily rely on data volume, model similarity, and auxiliary test datasets, have shown success in diverse scena…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.00567  [pdf, other]

    cs.AI cs.LG

    A Contextual Combinatorial Bandit Approach to Negotiation

    Authors: Yexin Li, Zhancun Mu, Siyuan Qi

    Abstract: Learning effective negotiation strategies poses two key challenges: the exploration-exploitation dilemma and dealing with large action spaces. However, there is an absence of learning-based approaches that effectively address these challenges in negotiation. This paper introduces a comprehensive formulation to tackle various negotiation problems. Our approach leverages contextual combinatorial mul…

    Submitted 29 June, 2024; originally announced July 2024.

  3. arXiv:2407.00132  [pdf, other]

    cs.SE cs.AI

    ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents

    Authors: Haiyang Shen, Yue Li, Desong Meng, Dongqi Cai, Sheng Qi, Li Zhang, Mengwei Xu, Yun Ma

    Abstract: Recent advancements in integrating large language models (LLMs) with application programming interfaces (APIs) have gained significant interest in both academia and industry. These API-based agents, leveraging the strong autonomy and planning capabilities of LLMs, can efficiently solve problems requiring multi-step actions. However, their ability to handle multi-dimensional difficulty levels, dive…

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2406.15073  [pdf, other]

    cs.AI cs.DB

    KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning

    Authors: Jiahan Chen, Shuhan Qi, Yifan Li, Zeyu Dong, Mingfeng Ding, Yulin Wu, Xuan Wang

    Abstract: Databases are fundamental to contemporary information systems, yet traditional rule-based configuration methods struggle to manage the complexity of real-world applications with hundreds of tunable parameters. Deep reinforcement learning (DRL), which combines perception and decision-making, presents a potential solution for intelligent database configuration tuning. However, due to black-box prope…

    Submitted 21 June, 2024; originally announced June 2024.

  5. arXiv:2406.11194  [pdf, other]

    cs.CL

    In-Context Editing: Learning Knowledge from Self-Induced Distributions

    Authors: Siyuan Qi, Bangcheng Yang, Kailin Jiang, Xiaobo Wang, Jiaqi Li, Yifan Zhong, Yaodong Yang, Zilong Zheng

    Abstract: The existing fine-tuning paradigm for language models is brittle in knowledge editing scenarios, where the model must incorporate new information without extensive retraining. This brittleness often results in overfitting, reduced performance, and unnatural language generation. To address this, we propose Consistent In-Context Editing (ICE), a novel approach that leverages the model's in-context l…

    Submitted 17 June, 2024; originally announced June 2024.

  6. arXiv:2405.10968  [pdf, other]

    cs.DC cs.LG

    LIFL: A Lightweight, Event-driven Serverless Platform for Federated Learning

    Authors: Shixiong Qi, K. K. Ramakrishnan, Myungjin Lee

    Abstract: Federated Learning (FL) typically involves a large-scale, distributed system with individual user devices/servers training models locally and then aggregating their model updates on a trusted central server. Existing systems for FL often use an always-on server for model aggregation, which can be inefficient in terms of resource utilization. They may also be inelastic in their resource management.…

    Submitted 5 May, 2024; originally announced May 2024.

  7. arXiv:2405.10098  [pdf, other]

    cs.NE

    When Large Language Model Meets Optimization

    Authors: Sen Huang, Kaixiang Yang, Sheng Qi, Rui Wang

    Abstract: Optimization algorithms and large language models (LLMs) enhance decision-making in dynamic environments by integrating artificial intelligence with traditional techniques. LLMs, with extensive domain knowledge, facilitate intelligent modeling and strategic decision-making in optimization, while optimization algorithms refine LLM architectures and output quality. This synergy offers novel approach…

    Submitted 16 May, 2024; originally announced May 2024.

  8. arXiv:2405.07374  [pdf, other]

    cs.LG cs.AI stat.ML

    Conformalized Survival Distributions: A Generic Post-Process to Increase Calibration

    Authors: Shi-ang Qi, Yakun Yu, Russell Greiner

    Abstract: Discrimination and calibration represent two important properties of survival analysis, with the former assessing the model's ability to accurately rank subjects and the latter evaluating the alignment of predicted outcomes with actual events. With their distinct nature, it is hard for survival models to simultaneously optimize both of them especially as many previous results found improving calib…

    Submitted 2 June, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024; 37 pages, 19 figures

  9. arXiv:2404.12041  [pdf, other]

    cs.CL cs.AI

    Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation

    Authors: Siya Qi, Yulan He, Zheng Yuan

    Abstract: Hallucination in Natural Language Generation (NLG) is like the elephant in the room, obvious but often overlooked until recent achievements significantly improved the fluency and grammaticality of generated text. As the capabilities of text generation models have improved, researchers have begun to pay more attention to the phenomenon of hallucination. Despite significant progress in this field in…

    Submitted 15 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: 16 pages, 2 figures

  10. arXiv:2404.02517  [pdf, other]

    cs.CV

    HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

    Authors: Zhongyu Xia, ZhiWei Lin, Xinhao Wang, Yongtao Wang, Yun Xing, Shengxiang Qi, Nan Dong, Ming-Hsuan Yang

    Abstract: Three-dimensional perception from multi-view cameras is a crucial component in autonomous driving systems, which involves multiple tasks like 3D object detection and bird's-eye-view (BEV) semantic segmentation. To improve perception precision, large image encoders, high-resolution images, and long-term temporal inputs have been adopted in recent 3D perception models, bringing remarkable performanc…

    Submitted 20 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  11. arXiv:2404.00622  [pdf, other]

    cs.MA eess.SY

    OpenMines: A Light and Comprehensive Mining Simulation Environment for Truck Dispatching

    Authors: Shi Meng, Bin Tian, Xiaotong Zhang, Shuangying Qi, Caiji Zhang, Qiang Zhang

    Abstract: Mine fleet management algorithms can significantly reduce operational costs and enhance productivity in mining systems. Most current fleet management algorithms are evaluated based on self-implemented or proprietary simulation environments, posing challenges for replication and comparison. This paper models the simulation environment for mine fleet management from a complex systems perspective. Bu…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: accepted in: 2024 35th IEEE Intelligent Vehicles Symposium (IV) 4 figures, 1 table

  12. arXiv:2403.19531  [pdf, other]

    cs.CR cs.DB cs.SI

    SecGraph: Towards SGX-based Efficient and Confidentiality-Preserving Graph Search

    Authors: Qiuhao Wang, Xu Yang, Saiyu Qi, Yong Qi

    Abstract: Graphs have more expressive power and are widely researched in various search demand scenarios, compared with traditional relational and XML models. Today, many graph search services have been deployed on a third-party server, which can alleviate users from the burdens of maintaining large-scale graphs and huge computation costs. Nevertheless, outsourcing graph search services to the third-party s…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: This paper has been accepted by DASFAA 2024

  13. arXiv:2403.18926  [pdf, other]

    cs.LG cs.CL

    XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection

    Authors: Yuanhang Yang, Shiyi Qi, Wenchao Gu, Chaozheng Wang, Cuiyun Gao, Zenglin Xu

    Abstract: Sparse models, including sparse Mixture-of-Experts (MoE) models, have emerged as an effective approach for scaling Transformer models. However, they often suffer from computational inefficiency since a significant number of parameters are unnecessarily involved in computations via multiplying values by zero or low activation values. To address this issue, we present XMoE, a novel MoE designed to…

    Submitted 24 May, 2024; v1 submitted 27 February, 2024; originally announced March 2024.

    Comments: ACL2024 Findings

  14. arXiv:2403.16687  [pdf]

    cs.CY cs.AI physics.ed-ph

    Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography

    Authors: Jiayue Zhang, Yiheng Liu, Wenqi Cai, Lanlan Wu, Yali Peng, Jingjing Yu, Senqing Qi, Taotao Long, Bao Ge

    Abstract: In recent years, the rapid development of artificial intelligence technology, especially the emergence of large language models (LLMs) such as ChatGPT, has presented significant prospects for application in the field of education. LLMs possess the capability to interpret knowledge, answer questions, and consider context, thus providing support for dialogic teaching to students. Therefore, an exami…

    Submitted 10 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  15. arXiv:2403.16440  [pdf, other]

    cs.CV

    RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection

    Authors: Zhiwei Lin, Zhe Liu, Zhongyu Xia, Xinhao Wang, Yongtao Wang, Shengxiang Qi, Yang Dong, Nan Dong, Le Zhang, Ce Zhu

    Abstract: Three-dimensional object detection is one of the key tasks in autonomous driving. To reduce costs in practice, low-cost multi-view cameras for 3D object detection are proposed to replace the expensive LiDAR sensors. However, relying solely on cameras makes it difficult to achieve highly accurate and robust 3D object detection. An effective solution to this issue is combining multi-view cameras with the…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  16. arXiv:2403.00869  [pdf, other]

    cs.LG stat.ML

    Enhancing Multivariate Time Series Forecasting with Mutual Information-driven Cross-Variable and Temporal Modeling

    Authors: Shiyi Qi, Liangjian Wen, Yiduo Li, Yuanhang Yang, Zhe Li, Zhongwen Rao, Lujia Pan, Zenglin Xu

    Abstract: Recent advancements have underscored the impact of deep learning techniques on multivariate time series forecasting (MTSF). Generally, these techniques are bifurcated into two categories: Channel-independence and Channel-mixing approaches. Although Channel-independence methods typically yield better results, Channel-mixing could theoretically offer improvements by leveraging inter-variable correla…

    Submitted 29 February, 2024; originally announced March 2024.

  17. arXiv:2402.16913  [pdf, other]

    cs.LG

    PDETime: Rethinking Long-Term Multivariate Time Series Forecasting from the perspective of partial differential equations

    Authors: Shiyi Qi, Zenglin Xu, Yiduo Li, Liangjian Wen, Qingsong Wen, Qifan Wang, Yuan Qi

    Abstract: Recent advancements in deep learning have led to the development of various models for long-term multivariate time-series forecasting (LMTF), many of which have shown promising results. Generally, the focus has been on historical-value-based models, which rely on past observations to predict future series. Notably, a new trend has emerged with time-index-based models, offering a more nuanced under…

    Submitted 25 February, 2024; originally announced February 2024.

  18. arXiv:2402.15430  [pdf, other]

    cs.CV cs.LG

    Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales

    Authors: Shuren Qi, Yushu Zhang, Chao Wang, Zhihua Xia, Xiaochun Cao, Jian Weng

    Abstract: Developing robust and interpretable vision systems is a crucial step towards trustworthy artificial intelligence. In this regard, a promising paradigm considers embedding task-required invariant structures, e.g., geometric invariance, in the fundamental image representation. However, such invariant representations typically exhibit limited discriminability, limiting their applications in larger-sc…

    Submitted 11 April, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  19. arXiv:2402.02030  [pdf, other]

    cs.CL

    Panacea: Pareto Alignment via Preference Adaptation for LLMs

    Authors: Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Haojun Chen, Qingfu Zhang, Siyuan Qi, Yaodong Yang

    Abstract: Current methods for large language model alignment typically use scalar human preference labels. However, this convention tends to oversimplify the multi-dimensional and heterogeneous nature of human preferences, leading to reduced expressivity and even misalignment. This paper presents Panacea, an innovative approach that reframes alignment as a multi-dimensional preference optimization problem.…

    Submitted 23 May, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

  20. arXiv:2401.12902  [pdf, other]

    cs.CV

    Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?

    Authors: Cheng Han, Qifan Wang, Yiming Cui, Wenguan Wang, Lifu Huang, Siyuan Qi, Dongfang Liu

    Abstract: As the scale of vision models continues to grow, the emergence of Visual Prompt Tuning (VPT) as a parameter-efficient transfer learning technique has gained attention due to its superior performance compared to traditional full-finetuning. However, the conditions favoring VPT (the "when") and the underlying rationale (the "why") remain unclear. In this paper, we conduct a comprehensive analysis…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 29 pages, 19 figures

  21. arXiv:2401.10568  [pdf, other]

    cs.AI

    CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

    Authors: Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu

    Abstract: The generalization of decision-making agents encompasses two fundamental elements: learning from past experiences and reasoning in novel contexts. However, the predominant emphasis in most interactive environments is on learning, often at the expense of complexity in reasoning. In this paper, we introduce CivRealm, an environment inspired by the Civilization game. Civilization's profound alignment…

    Submitted 12 March, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

  22. arXiv:2312.17076  [pdf, other]

    cs.RO

    Minimally-intrusive Navigation in Dense Crowds with Integrated Macro and Micro-level Dynamics

    Authors: Tong Zhou, Senmao Qi, Guangdu Cen, Ziqi Zha, Erli Lyu, Jiaole Wang, Max Q. -H. Meng

    Abstract: In mobile robot navigation, despite advancements, the generation of optimal paths often disrupts pedestrian areas. To tackle this, we propose three key contributions to improve human-robot coexistence in shared spaces. Firstly, we have established a comprehensive framework to understand disturbances at individual and flow levels. Our framework provides specialized computational strategies for in-d…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 23 pages, 13 figures

    MSC Class: 68T40 ACM Class: I.2.9

  23. arXiv:2311.08583  [pdf]

    cs.DC cs.NE

    MOSAIC: A Multi-Objective Optimization Framework for Sustainable Datacenter Management

    Authors: Sirui Qi, Dejan Milojicic, Cullen Bash, Sudeep Pasricha

    Abstract: In recent years, cloud service providers have been building and hosting datacenters across multiple geographical locations to provide robust services. However, the geographical distribution of datacenters introduces growing pressure to both local and global environments, particularly when it comes to water usage and carbon emissions. Unfortunately, efforts to reduce the environmental impact of suc…

    Submitted 14 November, 2023; originally announced November 2023.

  24. arXiv:2311.04769  [pdf]

    eess.IV cs.CV

    An attention-based deep learning network for predicting Platinum resistance in ovarian cancer

    Authors: Haoming Zhuang, Beibei Li, Jingtong Ma, Patrice Monkam, Shouliang Qi, Wei Qian, Dianning He

    Abstract: Background: Ovarian cancer is among the three most frequent gynecologic cancers globally. High-grade serous ovarian cancer (HGSOC) is the most common and aggressive histological type. Guided treatment for HGSOC typically involves platinum-based combination chemotherapy, necessitating an assessment of whether the patient is platinum-resistant. The purpose of this study is to propose a deep learning…

    Submitted 8 November, 2023; originally announced November 2023.

  25. arXiv:2310.19675  [pdf, other]

    eess.IV cs.CV

    A Principled Hierarchical Deep Learning Approach to Joint Image Compression and Classification

    Authors: Siyu Qi, Achintha Wijesinghe, Lahiru D. Chamain, Zhi Ding

    Abstract: Among applications of deep learning (DL) involving low cost sensors, remote image classification involves a physical channel that separates edge sensors and cloud classifiers. Traditional DL models must be divided between an encoder for the sensor and the decoder + classifier at the edge server. An important challenge is to effectively train such distributed models when the connecting channels hav…

    Submitted 30 October, 2023; originally announced October 2023.

  26. arXiv:2310.01320  [pdf, other]

    cs.AI cs.CL cs.CY cs.LG cs.MA

    Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation

    Authors: Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, Gao Huang

    Abstract: Recent breakthroughs in large language models (LLMs) have brought remarkable success in the field of LLM-as-Agent. Nevertheless, a prevalent assumption is that the information processed by LLMs is consistently honest, neglecting the pervasive deceptive or misleading information in human society and AI-generated content. This oversight makes LLMs susceptible to malicious manipulations, potentially…

    Submitted 24 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 40 pages

  27. arXiv:2309.15079  [pdf, other]

    cs.RO

    Towards High Efficient Long-horizon Planning with Expert-guided Motion-encoding Tree Search

    Authors: Tong Zhou, Erli Lyu, Jiaole Wang, Guangdu Cen, Ziqi Zha, Senmao Qi, Max Q. -H. Meng

    Abstract: Autonomous driving holds promise for increased safety, optimized traffic management, and a new level of convenience in transportation. While model-based reinforcement learning approaches such as MuZero enable long-term planning, the exponential increase in the number of search nodes as the tree goes deeper significantly affects the searching efficiency. To deal with this problem, in this paper w…

    Submitted 30 September, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: 7 pages, 5 figures

    MSC Class: 68T40 ACM Class: I.2.9

  28. arXiv:2309.09435  [pdf, other]

    cs.CR

    Security and Privacy on Generative Data in AIGC: A Survey

    Authors: Tao Wang, Yushu Zhang, Shuren Qi, Ruoyu Zhao, Zhihua Xia, Jian Weng

    Abstract: The advent of artificial intelligence-generated content (AIGC) represents a pivotal moment in the evolution of information technology. With AIGC, it can be effortless to generate high-quality data that is challenging for the public to distinguish. Nevertheless, the proliferation of generative data across cyberspace brings security and privacy issues, including privacy leakages of individuals and m…

    Submitted 17 December, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

  29. arXiv:2309.07967  [pdf, other]

    cs.IR

    iHAS: Instance-wise Hierarchical Architecture Search for Deep Learning Recommendation Models

    Authors: Yakun Yu, Shi-ang Qi, Jiuding Yang, Liyao Jiang, Di Niu

    Abstract: Current recommender systems employ large-sized embedding tables with uniform dimensions for all features, leading to overfitting, high computational cost, and suboptimal generalizing performance. Many techniques aim to solve this issue by feature selection or embedding dimension search. However, these techniques typically select a fixed subset of features or embedding dimensions for all instances…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Accepted as CIKM23 Long paper

  30. arXiv:2309.01940  [pdf, other]

    cs.CL cs.AI

    CodeApex: A Bilingual Programming Evaluation Benchmark for Large Language Models

    Authors: Lingyue Fu, Huacan Chai, Shuang Luo, Kounianhua Du, Weiming Zhang, Longteng Fan, Jiayi Lei, Renting Rui, Jianghao Lin, Yuchen Fang, Yifan Liu, Jingkuan Wang, Siyuan Qi, Kangning Zhang, Weinan Zhang, Yong Yu

    Abstract: With the emergence of Large Language Models (LLMs), there has been a significant improvement in the programming capabilities of models, attracting growing attention from researchers. Evaluating the programming capabilities of LLMs is crucial as it reflects the multifaceted abilities of LLMs, and it has numerous downstream applications. In this paper, we propose CodeApex, a bilingual benchmark data…

    Submitted 11 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 33 pages

  31. arXiv:2308.13086  [pdf]

    cs.DC cs.LG cs.NE

    SHIELD: Sustainable Hybrid Evolutionary Learning Framework for Carbon, Wastewater, and Energy-Aware Data Center Management

    Authors: Sirui Qi, Dejan Milojicic, Cullen Bash, Sudeep Pasricha

    Abstract: Today's cloud data centers are often distributed geographically to provide robust data services. But these geo-distributed data centers (GDDCs) have a significant associated environmental impact due to their increasing carbon emissions and water usage, which needs to be curtailed. Moreover, the energy costs of operating these data centers continue to rise. This paper proposes a novel framework to…

    Submitted 24 August, 2023; originally announced August 2023.

  32. arXiv:2308.05870  [pdf, other]

    cs.LG eess.SP

    UFed-GAN: A Secure Federated Learning Framework with Constrained Computation and Unlabeled Data

    Authors: Achintha Wijesinghe, Songyang Zhang, Siyu Qi, Zhi Ding

    Abstract: To satisfy the broad applications and insatiable hunger for deploying low latency multimedia data classification and data privacy in a cloud-based setting, federated learning (FL) has emerged as an important learning paradigm. For the practical cases involving limited computational power and only unlabeled data in many wireless communications applications, this work investigates FL paradigm in a r…

    Submitted 10 August, 2023; originally announced August 2023.

  33. arXiv:2308.05692  [pdf, other]

    cs.DC

    Isolated Scheduling for Distributed Training Tasks in GPU Clusters

    Authors: Xinchi Han, Weihao Jiang, Peirui Cao, Qinwei Yang, Yunzhuo Liu, Shuyao Qi, Shengkai Lin, Shizhen Zhao

    Abstract: Distributed machine learning (DML) technology makes it possible to train large neural networks in a reasonable amount of time. Meanwhile, as the computing power grows much faster than network capacity, network communication has gradually become the bottleneck of DML. Current multi-tenant GPU clusters face network contention caused by hash-collision problem which not only further increases the over…

    Submitted 10 August, 2023; originally announced August 2023.

  34. arXiv:2307.13770  [pdf, other]

    cs.CV cs.AI

    E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning

    Authors: Cheng Han, Qifan Wang, Yiming Cui, Zhiwen Cao, Wenguan Wang, Siyuan Qi, Dongfang Liu

    Abstract: As the size of transformer-based models continues to grow, fine-tuning these large-scale pretrained vision models for new tasks has become increasingly parameter-intensive. Parameter-efficient learning has been developed to reduce the number of tunable parameters during fine-tuning. Although these methods show promising results, there is still a significant performance gap compared to full fine-tu…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: 12 pages, 4 figures

  35. arXiv:2307.04350  [pdf, other]

    cs.DB

    The Linked Data Benchmark Council (LDBC): Driving competition and collaboration in the graph data management space

    Authors: Gábor Szárnyas, Brad Bebee, Altan Birler, Alin Deutsch, George Fletcher, Henry A. Gabb, Denise Gosnell, Alastair Green, Zhihui Guo, Keith W. Hare, Jan Hidders, Alexandru Iosup, Atanas Kiryakov, Tomas Kovatchev, Xinsheng Li, Leonid Libkin, Heng Lin, Xiaojian Luo, Arnau Prat-Pérez, David Püroja, Shipeng Qi, Oskar van Rest, Benjamin A. Steer, Dávid Szakállas, Bing Tong , et al. (8 additional authors not shown)

    Abstract: Graph data management is instrumental for several use cases such as recommendation, root cause analysis, financial fraud detection, and enterprise knowledge representation. Efficiently supporting these use cases yields a number of unique requirements, including the need for a concise query language and graph-aware query optimization techniques. The goal of the Linked Data Benchmark Council (LDBC)…

    Submitted 17 August, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    ACM Class: H.2.4

  36. arXiv:2306.15975  [pdf, other]

    cs.DB cs.PF

    The LDBC Financial Benchmark

    Authors: Shipeng Qi, Heng Lin, Zhihui Guo, Gábor Szárnyas, Bing Tong, Yan Zhou, Bin Yang, Jiansong Zhang, Zheng Wang, Youren Shen, Changyuan Wang, Parviz Peiravi, Henry Gabb, Ben Steer

    Abstract: The Linked Data Benchmark Council's Financial Benchmark (LDBC FinBench) is a new effort that defines a graph database benchmark targeting financial scenarios such as anti-fraud and risk control. The benchmark has one workload, the Transaction Workload, currently. It captures OLTP scenario with complex, simple read queries and write queries that continuously insert or delete data in the graph. Comp…

    Submitted 30 June, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: For the source code of this specification, see the ldbc_finbench_docs repository on Github. arXiv admin note: substantial text overlap with arXiv:2001.02299

    ACM Class: H.2.4

  37. arXiv:2306.15796  [pdf, other]

    cs.AI

    ConKI: Contrastive Knowledge Injection for Multimodal Sentiment Analysis

    Authors: Yakun Yu, Mingjun Zhao, Shi-ang Qi, Feiran Sun, Baoxun Wang, Weidong Guo, Xiaoli Wang, Lei Yang, Di Niu

    Abstract: Multimodal Sentiment Analysis leverages multimodal signals to detect the sentiment of a speaker. Previous approaches concentrate on performing multimodal fusion and representation learning based on general knowledge obtained from pretrained models, which neglects the effect of domain-specific knowledge. In this paper, we propose Contrastive Knowledge Injection (ConKI) for multimodal sentiment anal…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL Findings 2023

  38. arXiv:2306.01196  [pdf, other]

    cs.LG cs.AI stat.ML

    An Effective Meaningful Way to Evaluate Survival Models

    Authors: Shi-ang Qi, Neeraj Kumar, Mahtab Farrokh, Weijie Sun, Li-Hao Kuan, Rajesh Ranganath, Ricardo Henao, Russell Greiner

    Abstract: One straightforward metric to evaluate a survival prediction model is based on the Mean Absolute Error (MAE) -- the average of the absolute difference between the time predicted by the model and the true event time, over all subjects. Unfortunately, this is challenging because, in practice, the test set includes (right) censored individuals, meaning we do not know when a censored individual actual…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023

  39. arXiv:2305.17147  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG

    Heterogeneous Value Alignment Evaluation for Large Language Models

    Authors: Zhaowei Zhang, Ceyao Zhang, Nian Liu, Siyuan Qi, Ziqi Rong, Song-Chun Zhu, Shuguang Cui, Yaodong Yang

    Abstract: The emergent capabilities of Large Language Models (LLMs) have made it crucial to align their values with those of humans. However, current methodologies typically attempt to assign value as an attribute to LLMs, yet lack attention to the ability to pursue value and the importance of transferring heterogeneous values in specific practical applications. In this paper, we propose a Heterogeneous Val…

    Submitted 11 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: v3 is the camera-ready version for AAAI-24 Workshop on Public Sector LLMs: Algorithmic and Sociotechnical Design. 7 pages of main content, 1 page of references, 3 pages of appendices, and 7 figures. Our full prompts are released in the repo: https://github.com/zowiezhang/HVAE

  40. arXiv:2305.10856  [pdf, other]

    cs.CV

    Towards an Accurate and Secure Detector against Adversarial Perturbations

    Authors: Chao Wang, Shuren Qi, Zhiqiu Huang, Yushu Zhang, Rushi Lan, Xiaochun Cao

    Abstract: The vulnerability of deep neural networks to adversarial perturbations has been widely perceived in the computer vision community. From a security perspective, it poses a critical risk for modern vision systems, e.g., the popular Deep Learning as a Service (DLaaS) frameworks. For protecting off-the-shelf deep models while not modifying them, current algorithms typically detect adversarial patterns…

    Submitted 24 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  41. arXiv:2305.10721  [pdf, other]

    cs.LG cs.AI

    Revisiting Long-term Time Series Forecasting: An Investigation on Linear Mapping

    Authors: Zhe Li, Shiyi Qi, Yiduo Li, Zenglin Xu

    Abstract: Long-term time series forecasting has gained significant attention in recent years. While there are various specialized designs for capturing temporal dependency, previous studies have demonstrated that a single linear layer can achieve competitive forecasting performance compared to other complex architectures. In this paper, we thoroughly investigate the intrinsic effectiveness of recent approac…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 12 pages, 11 figures

  42. arXiv:2305.10169  [pdf, other]

    cs.MM

    Few-shot Joint Multimodal Aspect-Sentiment Analysis Based on Generative Multimodal Prompt

    Authors: Xiaocui Yang, Shi Feng, Daling Wang, Sun Qi, Wenfang Wu, Yifei Zhang, Pengfei Hong, Soujanya Poria

    Abstract: We have witnessed the rapid proliferation of multimodal data on numerous social media platforms. Conventional studies typically require massive labeled data to train models for Multimodal Aspect-Based Sentiment Analysis (MABSA). However, collecting and annotating fine-grained multimodal data for MABSA is tough. To alleviate the above issue, we perform three MABSA-related tasks with quite a small n…

    Submitted 18 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 13 pages, 7 figures, 6 tables, ACL 2023 Long Paper (Findings)

  43. arXiv:2305.05503  [pdf, other]

    cs.SE

    BadCS: A Backdoor Attack Framework for Code search

    Authors: Shiyi Qi, Yuanhang Yang, Shuzheng Gao, Cuiyun Gao, Zenglin Xu

    Abstract: With the development of deep learning (DL), DL-based code search models have achieved state-of-the-art performance and have been widely used by developers during software development. However, the security issue, e.g., recommending vulnerable code, has not received sufficient attention, which will bring potential harm to software development. Poisoning-based backdoor attack has proven effective in…

    Submitted 9 May, 2023; originally announced May 2023.

  44. arXiv:2305.03899  [pdf, other]

    cs.CV cs.LG eess.IV

    NL-CS Net: Deep Learning with Non-Local Prior for Image Compressive Sensing

    Authors: Shuai Bian, Shouliang Qi, Chen Li, Yudong Yao, Yueyang Teng

    Abstract: Deep learning has been applied to compressive sensing (CS) of images successfully in recent years. However, existing network-based methods are often trained as the black box, in which the lack of prior knowledge is often the bottleneck for further performance improvement. To overcome this drawback, this paper proposes a novel CS method using non-local prior which combines the interpretability of t…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 21 pages, 6 figures

    ACM Class: I.4.7

  45. arXiv:2304.00275  [pdf, ps, other]

    eess.SY cs.RO

    Automated Formation Control Synthesis from Temporal Logic Specifications

    Authors: Shuhao Qi, Zengjie Zhang, Sofie Haesaert, Zhiyong Sun

    Abstract: In many practical scenarios, multi-robot systems are envisioned to support humans in executing complicated tasks within structured environments, such as search-and-rescue tasks. We propose a framework for a multi-robot swarm to fulfill complex tasks represented by temporal logic specifications. Given temporal logic specifications on the swarm formation and navigation, we develop a controller with…

    Submitted 15 September, 2023; v1 submitted 1 April, 2023; originally announced April 2023.

  46. arXiv:2303.09058  [pdf, other]

    cs.AI

    SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning

    Authors: Shuhan Qi, Shuhao Zhang, Qiang Wang, Jiajia Zhang, Jing Xiao, Xuan Wang

    Abstract: Value-decomposition methods, which reduce the difficulty of a multi-agent system by decomposing the joint state-action space into local observation-action spaces, have become popular in cooperative multi-agent reinforcement learning (MARL). However, value-decomposition methods still have the problems of tremendous sample consumption for training and lack of active exploration. In this paper, we pr…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 13 pages, 9 figures

  47. arXiv:2303.06169  [pdf]

    cs.LG cs.AR

    MOELA: A Multi-Objective Evolutionary/Learning Design Space Exploration Framework for 3D Heterogeneous Manycore Platforms

    Authors: Sirui Qi, Yingheng Li, Sudeep Pasricha, Ryan Gary Kim

    Abstract: To enable emerging applications such as deep machine learning and graph processing, 3D network-on-chip (NoC) enabled heterogeneous manycore platforms that can integrate many processing elements (PEs) are needed. However, designing such complex systems with multiple objectives can be challenging due to the huge associated design space and long evaluation times. To optimize such systems, we propose…

    Submitted 10 March, 2023; originally announced March 2023.

  48. arXiv:2303.04404  [pdf, other]

    cs.NI

    MiddleNet: A Unified, High-Performance NFV and Middlebox Framework with eBPF and DPDK

    Authors: Shixiong Qi, Ziteng Zeng, Leslie Monis, K. K. Ramakrishnan

    Abstract: Traditional network resident functions (e.g., firewalls, network address translation) and middleboxes (caches, load balancers) have moved from purpose-built appliances to software-based components. However, L2/L3 network functions (NFs) are being implemented on Network Function Virtualization (NFV) platforms that extensively exploit kernel-bypass technology. They often use DPDK for zero-copy deliv…

    Submitted 30 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

  49. arXiv:2303.00369  [pdf, other]

    cs.CV eess.IV

    Indescribable Multi-modal Spatial Evaluator

    Authors: Lingke Kong, X. Sharon Qi, Qiping Shen, Jiacheng Wang, Jingyi Zhang, Yanle Hu, Qichao Zhou

    Abstract: Multi-modal image registration spatially aligns two images with different distributions. One of its major challenges is that images acquired from different imaging machines have different imaging distributions, making it difficult to focus only on the spatial aspect of the images and ignore differences in distributions. In this study, we developed a self-supervised approach, Indescribable Multi-mo…

    Submitted 1 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  50. Representing Noisy Image Without Denoising

    Authors: Shuren Qi, Yushu Zhang, Chao Wang, Tao Xiang, Xiaochun Cao, Yong Xiang

    Abstract: A long-standing topic in artificial intelligence is the effective recognition of patterns from noisy images. In this regard, the recent data-driven paradigm considers 1) improving the representation robustness by adding noisy samples in training phase (i.e., data augmentation) or 2) pre-processing the noisy image by learning to solve the inverse problem (i.e., image denoising). However, such metho…

    Submitted 19 June, 2024; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024