Skip to main content

Showing 1–50 of 149 results for author: Du, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00128  [pdf, other

    cs.IR cs.AI cs.LG

    When Search Engine Services meet Large Language Models: Visions and Challenges

    Authors: Haoyi Xiong, Jiang Bian, Yuchen Li, Xuhong Li, Mengnan Du, Shuaiqiang Wang, Dawei Yin, Sumi Helal

    Abstract: Combining Large Language Models (LLMs) with search engine services marks a significant shift in the field of services computing, opening up new possibilities to enhance how we search for and retrieve information, understand content, and interact with internet services. This paper conducts an in-depth examination of how integrating LLMs with search engines can mutually benefit both technologies. We… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: Under Review

  2. arXiv:2406.15917  [pdf, other

    cs.RO

    To Err is Robotic: Rapid Value-Based Trial-and-Error during Deployment

    Authors: Maximilian Du, Alexander Khazatsky, Tobias Gerstenberg, Chelsea Finn

    Abstract: When faced with a novel scenario, it can be hard to succeed on the first attempt. In these challenging situations, it is important to know how to retry quickly and meaningfully. Retrying behavior can emerge naturally in robots trained on diverse data, but such robot policies will typically only exhibit undirected retrying behavior and may not terminate a suboptimal approach before an unrecoverable… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  3. Practical, Automated Scenario-based Mobile App Testing

    Authors: Shengcheng Yu, Chunrong Fang, Mingzhe Du, Zimin Ding, Zhenyu Chen, Zhendong Su

    Abstract: The importance of mobile application (app) quality insurance is increasing with the rapid development of the mobile Internet. Automated test generation approaches, as a dominant direction of app quality insurance, follow specific models or strategies, targeting at optimizing the code coverage. Such approaches lead to a huge gap between testing execution and app business logic. Test scripts develop… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE Transaction on Software Engineering in 2024

  4. arXiv:2406.05756  [pdf, other

    cs.AI cs.CL cs.CV cs.MM

    EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models

    Authors: Mengfei Du, Binhao Wu, Zejun Li, Xuan**g Huang, Zhongyu Wei

    Abstract: The recent rapid development of Large Vision-Language Models (LVLMs) has indicated their potential for embodied tasks.However, the critical skill of spatial understanding in embodied environments has not been thoroughly evaluated, leaving the gap between current LVLMs and qualified embodied intelligence unknown. Therefore, we construct EmbSpatial-Bench, a benchmark for evaluating embodied spatial… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 Main

  5. arXiv:2405.20910  [pdf, other

    physics.app-ph cs.AI cs.CV physics.data-an

    Predicting ptychography probe positions using single-shot phase retrieval neural network

    Authors: Ming Du, Tao Zhou, Jun**g Deng, Daniel J. Ching, Steven Henke, Mathew J. Cherukara

    Abstract: Ptychography is a powerful imaging technique that is used in a variety of fields, including materials science, biology, and nanotechnology. However, the accuracy of the reconstructed ptychography image is highly dependent on the accuracy of the recorded probe positions which often contain errors. These errors are typically corrected jointly with phase retrieval through numerical optimization appro… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    MSC Class: 94A08 ACM Class: I.4.0

  6. arXiv:2405.14672  [pdf, other

    cs.CV

    Towards Imperceptible Backdoor Attack in Self-supervised Learning

    Authors: Hanrong Zhang, Zhenting Wang, Tingxu Han, Mingyu **, Chenlu Zhan, Mengnan Du, Hongwei Wang, Shiqing Ma

    Abstract: Self-supervised learning models are vulnerable to backdoor attacks. Existing backdoor attacks that are effective in self-supervised learning often involve noticeable triggers, like colored patches, which are vulnerable to human inspection. In this paper, we propose an imperceptible and effective backdoor attack against self-supervised models. We first find that existing imperceptible triggers desi… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.12754  [pdf, other

    astro-ph.SR cs.AI cs.LG physics.space-ph

    Neural Operator for Accelerating Coronal Magnetic Field Model

    Authors: Yutao Du, Qin Li, Raghav Gnanasambandam, Mengnan Du, Haimin Wang, Bo Shen

    Abstract: Studying the sun's outer atmosphere is challenging due to its complex magnetic fields impacting solar activities. Magnetohydrodynamics (MHD) simulations help model these interactions but are extremely time-consuming (usually on a scale of days). Our research applies the Fourier Neural Operator (FNO) to accelerate the coronal magnetic field modeling, specifically, the Bifrost MHD model. We apply Te… ▽ More

    Submitted 26 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  8. arXiv:2405.12523  [pdf, other

    cs.CV cs.AI

    Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models

    Authors: Jiaqi Li, Qianshan Wei, Chuanyi Zhang, Guilin Qi, Miaozeng Du, Yongrui Chen, Sheng Bi

    Abstract: Machine unlearning empowers individuals with the `right to be forgotten' by removing their private or sensitive information encoded in machine learning models. However, it remains uncertain whether MU can be effectively applied to Multimodal Large Language Models (MLLMs), particularly in scenarios of forgetting the leaked visual data of concepts. To overcome the challenge, we propose an efficient… ▽ More

    Submitted 29 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  9. arXiv:2405.07761  [pdf, other

    cs.LG cs.AI cs.SC math-ph stat.AP

    LLM4ED: Large Language Models for Automatic Equation Discovery

    Authors: Mengge Du, Yuntian Chen, Zhongzheng Wang, Longfeng Nie, Dongxiao Zhang

    Abstract: Equation discovery is aimed at directly extracting physical laws from data and has emerged as a pivotal research domain. Previous methods based on symbolic mathematics have achieved substantial advancements, but often require the design of implementation of complex algorithms. In this paper, we introduce a new framework that utilizes natural language-based prompts to guide large language models (L… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  10. arXiv:2405.06649  [pdf, other

    q-bio.BM cs.LG q-bio.MN

    ProLLM: Protein Chain-of-Thoughts Enhanced LLM for Protein-Protein Interaction Prediction

    Authors: Mingyu **, Haochen Xue, Zhenting Wang, Boming Kang, Ruosong Ye, Kaixiong Zhou, Mengnan Du, Yongfeng Zhang

    Abstract: The prediction of protein-protein interactions (PPIs) is crucial for understanding biological functions and diseases. Previous machine learning approaches to PPI prediction mainly focus on direct physical interactions, ignoring the broader context of nonphysical connections through intermediate proteins, thus limiting their effectiveness. The emergence of Large Language Models (LLMs) provides a ne… ▽ More

    Submitted 30 March, 2024; originally announced May 2024.

  11. arXiv:2404.11553  [pdf, other

    cs.CL cs.AI cs.LG

    Quantifying Multilingual Performance of Large Language Models Across Languages

    Authors: Zihao Li, Yucheng Shi, Zirui Liu, Fan Yang, Ali Payani, Ninghao Liu, Mengnan Du

    Abstract: The development of Large Language Models (LLMs) relies on extensive text corpora, which are often unevenly distributed across languages. This imbalance results in LLMs performing significantly better on high-resource languages like English, German, and French, while their capabilities in low-resource languages remain inadequate. Currently, there is a lack of quantitative methods to evaluate the pe… ▽ More

    Submitted 16 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  12. arXiv:2404.07066  [pdf, other

    cs.CL cs.AI cs.LG

    Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?

    Authors: Mingyu **, Qinkai Yu, **gyuan Huang, Qingcheng Zeng, Zhenting Wang, Wenyue Hua, Haiyan Zhao, Kai Mei, Yanda Meng, Kaize Ding, Fan Yang, Mengnan Du, Yongfeng Zhang

    Abstract: Large language models (LLMs) have shown remarkable performances across a wide range of tasks. However, the mechanisms by which these models encode tasks of varying complexities remain poorly understood. In this paper, we explore the hypothesis that LLMs process concepts of varying complexities in different layers, introducing the idea of "Concept Depth" to suggest that more complex concepts are ty… ▽ More

    Submitted 30 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 12 pages

  13. arXiv:2404.01994  [pdf, other

    cs.CV cs.CL cs.LG

    DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

    Authors: Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuan**g Huang, Zhongyu Wei

    Abstract: Vision-and-Language navigation (VLN) requires an agent to navigate in unseen environment by following natural language instruction. For task completion, the agent needs to align and integrate various navigation modalities, including instruction, observation and navigation history. Existing works primarily concentrate on cross-modal attention at the fusion stage to achieve this objective. Neverthel… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by LREC-COLING 2024

  14. arXiv:2403.13181  [pdf, other

    cs.DB

    Efficient k-step Weighted Reachability Query Processing Algorithms

    Authors: Lian Chen, Junfeng Zhou, Ming Du, Sheng Yu, Xian Tang, Ziyang Chen

    Abstract: Given a data graph G, a source vertex u and a target vertex v of a reachability query, the reachability query is used to answer whether there exists a path from u to v in G. Reachability query processing is one of the fundamental operations in graph data management, which is widely used in biological networks, communication networks, and social networks to assist data analysis. The data graphs in… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  15. arXiv:2403.08946  [pdf, other

    cs.LG cs.CL cs.CY

    Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era

    Authors: Xuansheng Wu, Haiyan Zhao, Yaochen Zhu, Yucheng Shi, Fan Yang, Tianming Liu, Xiaoming Zhai, Wenlin Yao, Jundong Li, Mengnan Du, Ninghao Liu

    Abstract: Explainable AI (XAI) refers to techniques that provide human-understandable insights into the workings of AI models. Recently, the focus of XAI is being extended towards Large Language Models (LLMs) which are often criticized for their lack of transparency. This extension calls for a significant transformation in XAI methodologies because of two reasons. First, many existing XAI methods cannot be… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 38 pages, 4 figures

  16. arXiv:2403.07311  [pdf, other

    cs.CL cs.LG

    Knowledge Graph Large Language Model (KG-LLM) for Link Prediction

    Authors: Dong Shu, Tianle Chen, Mingyu **, Chong Zhang, Mengnan Du, Yongfeng Zhang

    Abstract: The task of multi-hop link prediction within knowledge graphs (KGs) stands as a challenge in the field of knowledge graph analysis, as it requires the model to reason through and understand all intermediate connections before making a prediction. In this paper, we introduce the Knowledge Graph Large Language Model (KG-LLM), a novel framework that leverages large language models (LLMs) for knowledg… ▽ More

    Submitted 28 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 13 pages, 5 figures

  17. arXiv:2402.15159  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Machine Unlearning of Pre-trained Large Language Models

    Authors: ** Yao, Eli Chien, Minxin Du, Xinyao Niu, Tianhao Wang, Zezhou Cheng, Xiang Yue

    Abstract: This study investigates the concept of the `right to be forgotten' within the context of large language models (LLMs). We explore machine unlearning as a pivotal solution, with a focus on pre-trained models--a notably under-researched area. Our research delineates a comprehensive framework for machine unlearning in pre-trained LLMs, encompassing a critical analysis of seven diverse unlearning meth… ▽ More

    Submitted 30 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: ACL 2024 main. Code and data at https://github.com/yao**17/Unlearning_LLM

  18. arXiv:2402.14835  [pdf, other

    cs.CL cs.AI cs.LG

    MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing

    Authors: Jiaqi Li, Miaozeng Du, Chuanyi Zhang, Yongrui Chen, Nan Hu, Guilin Qi, Haiyun Jiang, Siyuan Cheng, Bozhong Tian

    Abstract: Multimodal knowledge editing represents a critical advancement in enhancing the capabilities of Multimodal Large Language Models (MLLMs). Despite its potential, current benchmarks predominantly focus on coarse-grained knowledge, leaving the intricacies of fine-grained (FG) multimodal entity knowledge largely unexplored. This gap presents a notable challenge, as FG entity recognition is pivotal for… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 8 pages

  19. arXiv:2402.13184  [pdf, other

    cs.CL

    What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents

    Authors: Mingyu **, Beichen Wang, Zhaoqian Xue, Suiyuan Zhu, Wenyue Hua, Hua Tang, Kai Mei, Mengnan Du, Yongfeng Zhang

    Abstract: In this study, we introduce "CosmoAgent," an innovative artificial intelligence framework utilizing Large Language Models (LLMs) to simulate complex interactions between human and extraterrestrial civilizations, with a special emphasis on Stephen Hawking's cautionary advice about not sending radio signals haphazardly into the universe. The goal is to assess the feasibility of peaceful coexistence… ▽ More

    Submitted 20 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  20. arXiv:2402.10835  [pdf, other

    cs.CL

    Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities

    Authors: Mingyu **, Hua Tang, Chong Zhang, Qinkai Yu, Chengzhi Liu, Suiyuan Zhu, Yongfeng Zhang, Mengnan Du

    Abstract: Large language models (LLMs) have been applied in many fields with rapid development in recent years. As a classic machine learning task, time series forecasting has recently received a boost from LLMs. However, there is a research gap in the LLMs' preferences in this field. In this paper, by comparing LLMs with traditional models, many properties of LLMs in time series prediction are found. For e… ▽ More

    Submitted 18 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  21. arXiv:2402.10688  [pdf, other

    cs.CL

    Towards Uncovering How Large Language Model Works: An Explainability Perspective

    Authors: Haiyan Zhao, Fan Yang, Bo Shen, Himabindu Lakkaraju, Mengnan Du

    Abstract: Large language models (LLMs) have led to breakthroughs in language tasks, yet the internal mechanisms that enable their remarkable generalization and reasoning abilities remain opaque. This lack of transparency presents challenges such as hallucinations, toxicity, and misalignment with human values, hindering the safe and beneficial deployment of LLMs. This paper aims to uncover the mechanisms und… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages, 2 figures

  22. arXiv:2402.07844  [pdf, other

    cs.SE cs.CL

    Mercury: A Code Efficiency Benchmark for Code Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, Qian Liu, See-Kiong Ng

    Abstract: Amidst the recent strides in evaluating Large Language Models for Code (Code LLMs), existing benchmarks have mainly focused on the functional correctness of generated code, neglecting the importance of their computational efficiency. To fill the gap, we present Mercury, the first code efficiency benchmark for Code LLMs. It comprises 1,889 Python tasks, each accompanied by adequate solutions that s… ▽ More

    Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  23. arXiv:2402.04678  [pdf, other

    cs.CL cs.AI cs.LG

    FaithLM: Towards Faithful Explanations for Large Language Models

    Authors: Yu-Neng Chuang, Guanchu Wang, Chia-Yuan Chang, Ruixiang Tang, Shaochen Zhong, Fan Yang, Mengnan Du, Xuanting Cai, Xia Hu

    Abstract: Large Language Models (LLMs) have become proficient in addressing complex tasks by leveraging their extensive internal knowledge and reasoning capabilities. However, the black-box nature of these models complicates the task of explaining their decision-making processes. While recent advancements demonstrate the potential of leveraging LLMs to self-explain their predictions through natural language… ▽ More

    Submitted 26 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  24. arXiv:2402.00746  [pdf, other

    cs.CL

    Health-LLM: Personalized Retrieval-Augmented Disease Prediction System

    Authors: Mingyu **, Qinkai Yu, Dong Shu, Chong Zhang, Lizhou Fan, Wenyue Hua, Suiyuan Zhu, Yanda Meng, Zhenting Wang, Mengnan Du, Yongfeng Zhang

    Abstract: Recent advancements in artificial intelligence (AI), especially large language models (LLMs), have significantly advanced healthcare applications and demonstrated potentials in intelligent medical treatment. However, there are conspicuous challenges such as vast data volumes and inconsistent symptom characterization standards, preventing full integration of healthcare AI systems with individual pa… ▽ More

    Submitted 19 March, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  25. arXiv:2401.15463  [pdf, other

    cs.CL cs.AI

    DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure

    Authors: Junyi Ye, Mengnan Du, Guiling Wang

    Abstract: This paper introduces DataFrame question answering (QA), a novel task that utilizes large language models (LLMs) to generate Pandas queries for information retrieval and data analysis on dataframes, emphasizing safe and non-revealing data handling. Our method, which solely relies on dataframe column names, not only ensures data privacy but also significantly reduces the context window in the promp… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  26. arXiv:2401.14027  [pdf, other

    cs.LG

    The Risk of Federated Learning to Skew Fine-Tuning Features and Underperform Out-of-Distribution Robustness

    Authors: Mengyao Du, Miao Zhang, Yuwen Pu, Kai Xu, Shouling Ji, Quanjun Yin

    Abstract: To tackle the scarcity and privacy issues associated with domain-specific datasets, the integration of federated learning in conjunction with fine-tuning has emerged as a practical solution. However, our findings reveal that federated learning has the risk of skewing fine-tuning features and compromising the out-of-distribution robustness of the model. By introducing three robustness indicators an… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 12 pages, 10 figures

  27. arXiv:2401.08552  [pdf, other

    cs.LG cs.AI

    Explaining Time Series via Contrastive and Locally Sparse Perturbations

    Authors: Zichuan Liu, Yingying Zhang, Tianchun Wang, Zefan Wang, Dongsheng Luo, Mengnan Du, Min Wu, Yi Wang, Chunlin Chen, Lunting Fan, Qingsong Wen

    Abstract: Explaining multivariate time series is a compound challenge, as it requires identifying important locations in the time series and matching complex temporal patterns. Although previous saliency-based methods addressed the challenges, their perturbation may not alleviate the distribution shift issue, which is inevitable especially in heterogeneous samples. We present ContraLSP, a locally sparse mod… ▽ More

    Submitted 28 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by International Conference on Learning Representations (ICLR 2024)

  28. arXiv:2401.08469  [pdf, other

    eess.IV cs.CV cs.LG

    Explanations of Classifiers Enhance Medical Image Segmentation via End-to-end Pre-training

    Authors: Jiamin Chen, Xuhong Li, Yanwu Xu, Mengnan Du, Haoyi Xiong

    Abstract: Medical image segmentation aims to identify and locate abnormal structures in medical images, such as chest radiographs, using deep neural networks. These networks require a large number of annotated images with fine-grained masks for the regions of interest, making pre-training strategies based on classification datasets essential for sample efficiency. Based on a large-scale medical image classi… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  29. arXiv:2401.04925  [pdf, other

    cs.CL cs.AI

    The Impact of Reasoning Step Length on Large Language Models

    Authors: Mingyu **, Qinkai Yu, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng, Yongfeng Zhang, Mengnan Du

    Abstract: Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correlation between the effectiveness of CoT and the length of reasoning steps in prompts remains largely unknown. To shed light on this, we have conducted several empirical experiments to explore the relations. Specifically, we design experiments that expand and compress the ra… ▽ More

    Submitted 22 June, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: Findings of ACL 2024

  30. arXiv:2401.04374  [pdf, other

    cs.AI cs.LG

    Towards Explainable Artificial Intelligence (XAI): A Data Mining Perspective

    Authors: Haoyi Xiong, Xuhong Li, Xiaofei Zhang, Jiamin Chen, Xinhao Sun, Yuchen Li, Zeyi Sun, Mengnan Du

    Abstract: Given the complexity and lack of transparency in deep neural networks (DNNs), extensive efforts have been made to make these systems more interpretable or explain their behaviors in accessible terms. Unlike most reviews, which focus on algorithmic and model-centric perspectives, this work takes a "data-centric" view, examining how data collection, processing, and analysis contribute to explainable… ▽ More

    Submitted 13 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  31. arXiv:2401.01755  [pdf, other

    cs.SD cs.AI eess.AS

    Incremental FastPitch: Chunk-based High Quality Text to Speech

    Authors: Muyang Du, Chuan Liu, Junjie Lai

    Abstract: Parallel text-to-speech models have been widely applied for real-time speech synthesis, and they offer more controllability and a much faster synthesis process compared with conventional auto-regressive models. Although parallel models have benefits in many aspects, they become naturally unfit for incremental synthesis due to their fully parallel architecture such as transformer. In this work, we… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 5 pages, 4 figures, 1 table

  32. arXiv:2312.15359  [pdf, other

    cs.LG cs.AI cs.CV

    LETA: Learning Transferable Attribution for Generic Vision Explainer

    Authors: Guanchu Wang, Yu-Neng Chuang, Fan Yang, Mengnan Du, Chia-Yuan Chang, Shaochen Zhong, Zirui Liu, Zhaozhuo Xu, Kaixiong Zhou, Xuanting Cai, Xia Hu

    Abstract: Explainable machine learning significantly improves the transparency of deep neural networks~(DNN). However, existing work is constrained to explaining the behavior of individual model predictions, and lacks the ability to transfer the explanation across various models and tasks. This limitation results in explaining various tasks being time- and resource-consuming. To address this problem, we dev… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  33. arXiv:2312.10655  [pdf, other

    cs.SE cs.RO

    Practical Non-Intrusive GUI Exploration Testing with Visual-based Robotic Arms

    Authors: Shengcheng Yu, Chunrong Fang, Mingzhe Du, Yuchen Ling, Zhenyu Chen, Zhendong Su

    Abstract: GUI testing is significant in the SE community. Most existing frameworks are intrusive and only support some specific platforms. With the development of distinct scenarios, diverse embedded systems or customized operating systems on different devices do not support existing intrusive GUI testing frameworks. Some approaches adopt robotic arms to replace the interface invoking of mobile apps under t… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted by the 46th International Conference on Software Engineering (ICSE 2024)

  34. arXiv:2312.08670  [pdf, other

    stat.ME cs.AI cs.LG

    Temporal-Spatial Entropy Balancing for Causal Continuous Treatment-Effect Estimation

    Authors: Tao Hu, Honglong Zhang, Fan Zeng, Min Du, XiangKun Du, Yue Zheng, Quanqi Li, Mengran Zhang, Dan Yang, Jihao Wu

    Abstract: In the field of intracity freight transportation, changes in order volume are significantly influenced by temporal and spatial factors. When building subsidy and pricing strategies, predicting the causal effects of these strategies on order volume is crucial. In the process of calculating causal effects, confounding variables can have an impact. Traditional methods to control confounding variables… ▽ More

    Submitted 18 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 10 pages;

  35. arXiv:2312.00308  [pdf, other

    cs.CV eess.IV stat.AP

    A knowledge-based data-driven (KBDD) framework for all-day identification of cloud types using satellite remote sensing

    Authors: Longfeng Nie, Yuntian Chen, Mengge Du, Changqi Sun, Dongxiao Zhang

    Abstract: Cloud types, as a type of meteorological data, are of particular significance for evaluating changes in rainfall, heatwaves, water resources, floods and droughts, food security and vegetation cover, as well as land use. In order to effectively utilize high-resolution geostationary observations, a knowledge-based data-driven (KBDD) framework for all-day identification of cloud types based on spectr… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  36. arXiv:2311.10349  [pdf, other

    eess.IV cs.CV cs.LG

    Pseudo Label-Guided Data Fusion and Output Consistency for Semi-Supervised Medical Image Segmentation

    Authors: Tao Wang, Yuanbin Chen, Xinlin Zhang, Yuanbo Zhou, Junlin Lan, Bizhe Bai, Tao Tan, Min Du, Qinquan Gao, Tong Tong

    Abstract: Supervised learning algorithms based on Convolutional Neural Networks have become the benchmark for medical image segmentation tasks, but their effectiveness heavily relies on a large amount of labeled data. However, annotating medical image datasets is a laborious and time-consuming process. Inspired by semi-supervised algorithms that use both labeled and unlabeled data for training, we propose t… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

  37. arXiv:2310.14248  [pdf, other

    cs.CL

    From Static to Dynamic: A Continual Learning Framework for Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, See-kiong Ng

    Abstract: The vast number of parameters in large language models (LLMs) endows them with remarkable capabilities, allowing them to excel in a variety of natural language processing tasks. However, this complexity also presents challenges, making LLMs difficult to train and inhibiting their ability to continuously assimilate new knowledge, which may lead to inaccuracies in their outputs. To mitigate these is… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  38. arXiv:2310.12785  [pdf, other

    cs.LG cs.CY

    A Theoretical Approach to Characterize the Accuracy-Fairness Trade-off Pareto Frontier

    Authors: Hua Tang, Lu Cheng, Ninghao Liu, Mengnan Du

    Abstract: While the accuracy-fairness trade-off has been frequently observed in the literature of fair machine learning, rigorous theoretical analyses have been scarce. To demystify this long-standing challenge, this work seeks to develop a theoretical framework by characterizing the shape of the accuracy-fairness trade-off Pareto frontier (FairFrontier), determined by a set of all optimal Pareto classifier… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  39. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  40. arXiv:2310.02569  [pdf, other

    cs.CV

    ReForm-Eval: Evaluating Large Vision Language Models via Unified Re-Formulation of Task-Oriented Benchmarks

    Authors: Zejun Li, Ye Wang, Mengfei Du, Qingwen Liu, Binhao Wu, Jiwen Zhang, Chengxing Zhou, Zhihao Fan, Jie Fu, **g**g Chen, Xuan**g Huang, Zhongyu Wei

    Abstract: Recent years have witnessed remarkable progress in the development of large vision-language models (LVLMs). Benefiting from the strong language backbones and efficient cross-modal alignment strategies, LVLMs exhibit surprising capabilities to perceive visual signals and perform visually grounded reasoning. However, the capabilities of LVLMs have not been comprehensively and quantitatively evaluate… ▽ More

    Submitted 17 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 38 pages, 11 figures, 24 tables

  41. arXiv:2309.15479  [pdf, other

    cs.LG cs.DS

    Fast Locality Sensitive Hashing with Theoretical Guarantee

    Authors: Zongyuan Tan, Hongya Wang, Bo Xu, Minjie Luo, Ming Du

    Abstract: Locality-sensitive hashing (LSH) is an effective randomized technique widely used in many machine learning tasks. The cost of hashing is proportional to data dimensions, and thus often the performance bottleneck when dimensionality is high and the number of hash functions involved is large. Surprisingly, however, little work has been done to improve the efficiency of LSH computation. In this paper… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  42. arXiv:2309.09380  [pdf, other

    cs.CL cs.LG

    Mitigating Shortcuts in Language Models with Soft Label Encoding

    Authors: Zirui He, Huiqi Deng, Haiyan Zhao, Ninghao Liu, Mengnan Du

    Abstract: Recent research has shown that large language models rely on spurious correlations in the data for natural language understanding (NLU) tasks. In this work, we aim to answer the following research question: Can we reduce spurious correlations by modifying the ground truth labels of the training data? Specifically, we propose a simple yet effective debiasing framework, named Soft Label Encoding (So… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

  43. arXiv:2309.08375  [pdf, other

    cs.LG cs.CY

    Boosting Fair Classifier Generalization through Adaptive Priority Reweighing

    Authors: Zhihao Hu, Yiran Xu, Mengnan Du, **dong Gu, Xinmei Tian, Fengxiang He

    Abstract: With the increasing penetration of machine learning applications in critical decision-making areas, calls for algorithmic fairness are more prominent. Although there have been various modalities to improve algorithmic fairness through learning with fairness constraints, their performance does not generalize well in the test set. A performance-promising fair algorithm with better generalizability i… ▽ More

    Submitted 20 May, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  44. arXiv:2309.07672  [pdf

    cs.LG math.NA stat.AP

    Physics-constrained robust learning of open-form partial differential equations from limited and noisy data

    Authors: Mengge Du, Yuntian Chen, Longfeng Nie, Siyu Lou, Dongxiao Zhang

    Abstract: Unveiling the underlying governing equations of nonlinear dynamic systems remains a significant challenge. Insufficient prior knowledge hinders the determination of an accurate candidate library, while noisy observations lead to imprecise evaluations, which in turn result in redundant function terms or erroneous equations. This study proposes a framework to robustly uncover open-form partial diffe… ▽ More

    Submitted 29 April, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  45. DP-Forward: Fine-tuning and Inference on Language Models with Differential Privacy in Forward Pass

    Authors: Minxin Du, Xiang Yue, Sherman S. M. Chow, Tianhao Wang, Chenyu Huang, Huan Sun

    Abstract: Differentially private stochastic gradient descent (DP-SGD) adds noise to gradients in back-propagation, safeguarding training data from privacy leakage, particularly membership inference. It fails to cover (inference-time) threats like embedding inversion and sensitive attribute inference. It is also costly in storage and computation when used to fine-tune large pre-trained language models (LMs).… ▽ More

    Submitted 19 September, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: To appear at ACM CCS '23. This is the full version. The first two authors contribute equally

  46. arXiv:2309.01029  [pdf, other

    cs.CL cs.AI cs.LG

    Explainability for Large Language Models: A Survey

    Authors: Haiyan Zhao, Hanjie Chen, Fan Yang, Ninghao Liu, Huiqi Deng, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Mengnan Du

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language processing. However, their internal mechanisms are still unclear and this lack of transparency poses unwanted risks for downstream applications. Therefore, understanding and explaining these models is crucial for elucidating their behaviors, limitations, and social impacts. In this paper, we introduce a taxo… ▽ More

    Submitted 28 November, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

  47. UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization

    Authors: Rui Zhang, Hongxia Wang, Mingshan Du, Hanqing Liu, Yang Zhou, Qiang Zeng

    Abstract: The emergence of artificial intelligence-generated content (AIGC) has raised concerns about the authenticity of multimedia content in various fields. However, existing research for forgery content detection has focused mainly on binary classification tasks of complete videos, which has limited applicability in industrial settings. To address this gap, we propose UMMAFormer, a novel universal trans… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

    Comments: 11 pages, 8 figures, 66 references. This paper has been accepted for ACM MM 2023

    MSC Class: 68T45 ACM Class: I.4

    Journal ref: Proceedings of the 31st ACM International Conference on Multimedia (MM '23), October 29-November 3, 2023

  48. arXiv:2308.12952  [pdf, other

    cs.RO cs.LG

    BridgeData V2: A Dataset for Robot Learning at Scale

    Authors: Homer Walke, Kevin Black, Abraham Lee, Moo ** Kim, Max Du, Chongyi Zheng, Tony Zhao, Philippe Hansen-Estruch, Quan Vuong, Andre He, Vivek Myers, Kuan Fang, Chelsea Finn, Sergey Levine

    Abstract: We introduce BridgeData V2, a large and diverse dataset of robotic manipulation behaviors designed to facilitate research on scalable robot learning. BridgeData V2 contains 60,096 trajectories collected across 24 environments on a publicly available low-cost robot. BridgeData V2 provides extensive task and environment variability, leading to skills that can generalize across environments, domains,… ▽ More

    Submitted 17 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 9 pages

  49. arXiv:2308.10149  [pdf, other

    cs.CL cs.AI

    A Survey on Fairness in Large Language Models

    Authors: Yingji Li, Mengnan Du, Rui Song, Xin Wang, Ying Wang

    Abstract: Large Language Models (LLMs) have shown powerful performance and development prospects and are widely deployed in the real world. However, LLMs can capture social biases from unprocessed training data and propagate the biases to downstream tasks. Unfair LLM systems have undesirable social impacts and potential harms. In this paper, we provide a comprehensive review of related research on fairness… ▽ More

    Submitted 21 February, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

    Comments: 28 pages, 5 figures, 2 tables, 175 references

  50. arXiv:2308.08181  [pdf, ps, other

    cs.SD cs.CL eess.AS

    ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 2023

    Authors: Mengjie Du, Xiang Fang, Jie Li

    Abstract: This technical report describes ChinaTelecom system for Track 1 (closed) of the VoxCeleb2023 Speaker Recognition Challenge (VoxSRC 2023). Our system consists of several ResNet variants trained only on VoxCeleb2, which were fused for better performance later. Score calibration was also applied for each variant and the fused system. The final submission achieved minDCF of 0.1066 and EER of 1.980%.

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: System description of VoxSRC 2023