
Showing 1–50 of 332 results for author: Jiang, M

Searching in archive cs.
  1. arXiv:2406.19657  [pdf, other]

    cs.LG

    LLMEasyQuant -- An Easy to Use Toolkit for LLM Quantization

    Authors: Dong Liu, Meng Jiang, Kaiser Pister

    Abstract: Currently, there are many quantization methods appeared for LLM quantization, yet few are user-friendly and easy to be deployed locally. Packages like TensorRT and Quanto have many underlying structures and self-invoking internal functions, which are not conducive to developers' personalized development and learning for deployment. Therefore, we develop LLMEasyQuant, it is a package aiming to for e…

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.18345  [pdf, other]

    cs.LG eess.SP

    EmT: A Novel Transformer for Generalized Cross-subject EEG Emotion Recognition

    Authors: Yi Ding, Chengxuan Tong, Shuailei Zhang, Muyun Jiang, Yong Li, Kevin Lim Jun Liang, Cuntai Guan

    Abstract: Integrating prior knowledge of neurophysiology into neural network architecture enhances the performance of emotion decoding. While numerous techniques emphasize learning spatial and short-term temporal patterns, there has been limited emphasis on capturing the vital long-term contextual information associated with emotional cognitive processes. In order to address this discrepancy, we introduce a…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2406.18038  [pdf, other]

    cs.LG

    MT2ST: Adaptive Multi-Task to Single-Task Learning

    Authors: Dong Liu, Meng Jiang

    Abstract: The conventional training approaches often face challenges in balancing the breadth of multi-task learning (MTL) with the depth of single-task learning (STL). To address this issue, we introduce the Multi-Task to Single-Task (MT2ST) framework, a groundbreaking approach that can combine the generalizability of MTL with the precision of STL. Our work include two strategies: 'Diminish' and 'Switch'.…

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.17918  [pdf, other]

    cs.LG cs.DC cs.SI

    GraphSnapShot: Graph Machine Learning Acceleration with Fast Storage and Retrieval

    Authors: Dong Liu, Roger Waleffe, Meng Jiang, Shivaram Venkataraman

    Abstract: In our recent research, we have developed a framework called GraphSnapShot, which has been proven an useful tool for graph learning acceleration. GraphSnapShot is a framework for fast cache, storage, retrieval and computation for graph learning. It can quickly store and update the local topology of graph structure and allows us to track patterns in the structure of graph networks, just like take s…

    Submitted 25 June, 2024; originally announced June 2024.

  5. arXiv:2406.17281  [pdf, other]

    cs.LG

    Distance Recomputator and Topology Reconstructor for Graph Neural Networks

    Authors: Dong Liu, Meng Jiang

    Abstract: This paper introduces novel methodologies, the Distance Recomputator and Topology Reconstructor, aimed at enhancing Graph Neural Networks (GNNs). The Distance Recomputator dynamically recalibrates node distances within k-hop neighborhoods using a dynamic encoding scheme, thereby improving the accuracy and adaptability of node representations. Concurrently, the Topology Reconstructor adjusts local…

    Submitted 25 June, 2024; originally announced June 2024.

  6. arXiv:2406.16173  [pdf, other]

    cs.HC

    Crepe: A Mobile Screen Data Collector Using Graph Query

    Authors: Yuwen Lu, Meng Chen, Qi Zhao, Victor Cox, Yang Yang, Meng Jiang, Jay Brockman, Tamara Kay, Toby Jia-Jun Li

    Abstract: Collecting mobile datasets remains challenging for academic researchers due to limited data access and technical barriers. Commercial organizations often possess exclusive access to mobile data, leading to a "data monopoly" that restricts the independence of academic research. Existing open-source mobile data collection frameworks primarily focus on mobile sensing data rather than screen content,…

    Submitted 23 June, 2024; originally announced June 2024.

  7. arXiv:2406.12921  [pdf, other]

    cs.LG

    WindowMixer: Intra-Window and Inter-Window Modeling for Time Series Forecasting

    Authors: Quangao Liu, Ruiqi Li, Maowei Jiang, Wei Yang, Chen Liang, LongLong Pang, Zhuozhang Zou

    Abstract: Time series forecasting (TSF) is crucial in fields like economic forecasting, weather prediction, traffic flow analysis, and public health surveillance. Real-world time series data often include noise, outliers, and missing values, making accurate forecasting challenging. Traditional methods model point-to-point relationships, which limits their ability to capture complex temporal patterns and inc…

    Submitted 14 June, 2024; originally announced June 2024.

  8. arXiv:2406.12056  [pdf, other]

    cs.LG q-bio.QM

    Learning Molecular Representation in a Cell

    Authors: Gang Liu, Srijit Seal, John Arevalo, Zhenwen Liang, Anne E. Carpenter, Meng Jiang, Shantanu Singh

    Abstract: Predicting drug efficacy and safety in vivo requires information on biological responses (e.g., cell morphology and gene expression) to small molecule perturbations. However, current molecular representation learning methods do not provide a comprehensive view of cell states under these perturbations and struggle to remove noise, hindering model generalization. We introduce the Information Alignme…

    Submitted 22 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 21 pages, 8 tables, 7 figures

  9. arXiv:2406.12050  [pdf, other]

    cs.CL

    Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

    Authors: Zhihan Zhang, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang

    Abstract: Supervised fine-tuning enhances the problem-solving abilities of language models across various mathematical reasoning tasks. To maximize such benefits, existing research focuses on broadening the training set with various data augmentation techniques, which is effective for standard single-round question-answering settings. Our work introduces a novel technique aimed at cultivating a deeper under…

    Submitted 17 June, 2024; originally announced June 2024.

  10. arXiv:2406.10934  [pdf]

    physics.ed-ph cs.HC

    Beyond Answers: Large Language Model-Powered Tutoring System in Physics Education for Deep Learning and Precise Understanding

    Authors: Zhoumingju Jiang, Mengjun Jiang

    Abstract: The integration of artificial intelligence (AI) in education has shown significant promise, yet the effective personalization of learning, particularly in physics education, remains a challenge. This paper proposes Physics-STAR, a framework for large language model (LLM)- powered tutoring system designed to address this gap by providing personalized and adaptive learning experiences for high schoo…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 13 pages, 3 figures, CSCW 2024

  11. arXiv:2406.10471  [pdf, other]

    cs.CL

    Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts

    Authors: Zhaoxuan Tan, Zheyuan Liu, Meng Jiang

    Abstract: Personalized large language models (LLMs) aim to tailor interactions, content, and recommendations to individual user preferences. While parameter-efficient fine-tuning (PEFT) methods excel in performance and generalization, they are costly and limit communal benefits when used individually. To this end, we introduce Personalized Pieces (Per-Pcs), a framework that allows users to safely share and…

    Submitted 14 June, 2024; originally announced June 2024.

  12. arXiv:2406.03777  [pdf, other]

    cs.LG cs.AI

    Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices

    Authors: Ruiyang Qin, Dancheng Liu, Zheyu Yan, Zhaoxuan Tan, Zixuan Pan, Zhenge Jia, Meng Jiang, Ahmed Abbasi, Jinjun Xiong, Yiyu Shi

    Abstract: The scaling laws have become the de facto guidelines for designing large language models (LLMs), but they were studied under the assumption of unlimited computing resources for both training and inference. As LLMs are increasingly used as personalized intelligent assistants, their customization (i.e., learning through fine-tuning) and deployment onto resource-constrained edge devices will become m…

    Submitted 13 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Benchmarking paper

  13. arXiv:2406.00631  [pdf, other]

    cs.CV

    MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging

    Authors: Jiaying Zhou, Mingzhou Jiang, Junde Wu, Jiayuan Zhu, Ziyue Wang, Yueming Jin

    Abstract: Medicine is inherently a multimodal discipline. Medical images can reflect the pathological changes of cancer and tumors, while the expression of specific genes can influence their morphological characteristics. However, most deep learning models employed for these medical tasks are unimodal, making predictions using either image data or genomic data exclusively. In this paper, we propose a multim…

    Submitted 2 June, 2024; originally announced June 2024.

  14. arXiv:2405.20579  [pdf, other]

    cs.RO cs.LG

    HOPE: A Reinforcement Learning-based Hybrid Policy Path Planner for Diverse Parking Scenarios

    Authors: Mingyang Jiang, Yueyuan Li, Songan Zhang, Chunxiang Wang, Ming Yang

    Abstract: Path planning plays a pivotal role in automated parking, yet current methods struggle to efficiently handle the intricate and diverse parking scenarios. One potential solution is the reinforcement learning-based method, leveraging its exploration in unrecorded situations. However, a key challenge lies in training reinforcement learning methods is the inherent randomness in converging to a feasible…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 tables, 5 figures, 1 page appendix

  15. arXiv:2405.14092  [pdf, other]

    cs.CL

    Large Language Models Can Self-Correct with Minimal Effort

    Authors: Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang

    Abstract: Intrinsic self-correct was a method that instructed large language models (LLMs) to verify and correct their responses without external feedback. Unfortunately, the study concluded that the LLMs could not self-correct reasoning yet. We find that a simple yet effective verification method can unleash inherent capabilities of the LLMs. That is to mask a key condition in the question, add the current…

    Submitted 23 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Work in Progress

  16. arXiv:2405.13300   

    cs.LG cs.AI

    FAITH: Frequency-domain Attention In Two Horizons for Time Series Forecasting

    Authors: Ruiqi Li, Maowei Jiang, Kai Wang, Kaiduo Feng, Quangao Liu, Yue Sun, Xiufang Zhou

    Abstract: Time Series Forecasting plays a crucial role in various fields such as industrial equipment maintenance, meteorology, energy consumption, traffic flow and financial investment. However, despite their considerable advantages over traditional statistical approaches, current deep learning-based predictive models often exhibit a significant deviation between their forecasting outcomes and the ground t…

    Submitted 1 July, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: We think there are some errors in the experiment result, it may lead to a wrong conclusion. So we think it will be responsible to withdraw it

  17. arXiv:2405.09148  [pdf, ps, other]

    cs.CV

    A Hierarchically Feature Reconstructed Autoencoder for Unsupervised Anomaly Detection

    Authors: Honghui Chen, **** Chen, Huan Mao, Mengxi Jiang

    Abstract: Anomaly detection and localization without any manual annotations and prior knowledge is a challenging task under the setting of unsupervised learning. The existing works achieve excellent performance in the anomaly detection, but with complex networks or cumbersome pipelines. To address this issue, this paper explores a simple but effective architecture in the anomaly detection. It consists of a…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 12 pages, 4 figures

    MSC Class: 68T01 ACM Class: I.2.10

  18. arXiv:2405.02644  [pdf, other]

    cs.LG

    Interpretable Multi-View Clustering

    Authors: Mudi Jiang, Lianyu Hu, Zengyou He, Zhikui Chen

    Abstract: Multi-view clustering has become a significant area of research, with numerous methods proposed over the past decades to enhance clustering accuracy. However, in many real-world applications, it is crucial to demonstrate a clear decision-making process-specifically, explaining why samples are assigned to particular clusters. Consequently, there remains a notable gap in developing interpretable met…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 12 pages,6 figures

    ACM Class: I.2.6

  19. arXiv:2404.14604  [pdf, other]

    cs.CL

    Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training

    Authors: Mengzhao Jia, Zhihan Zhang, Wenhao Yu, Fangkai Jiao, Meng Jiang

    Abstract: Open-source multimodal large language models (MLLMs) excel in various tasks involving textual and visual inputs but still struggle with complex multimodal mathematical reasoning, lagging behind proprietary models like GPT-4V(ision) and Gemini-Pro. Although fine-tuning with intermediate steps (i.e., rationales) elicits some mathematical reasoning skills, the resulting models still fall short in vis…

    Submitted 25 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  20. arXiv:2404.12569  [pdf, other]

    cs.LG cs.AI

    Multi-View Subgraph Neural Networks: Self-Supervised Learning with Scarce Labeled Data

    Authors: Zhenzhong Wang, Qingyuan Zeng, Wanyu Lin, Min Jiang, Kay Chen Tan

    Abstract: While graph neural networks (GNNs) have become the de-facto standard for graph-based node classification, they impose a strong assumption on the availability of sufficient labeled samples. This assumption restricts the classification performance of prevailing GNNs on many real-world applications suffering from low-data regimes. Specifically, features extracted from scarce labeled nodes could not p…

    Submitted 18 April, 2024; originally announced April 2024.

  21. arXiv:2404.12235  [pdf, other]

    cs.CV

    Beyond Average: Individualized Visual Scanpath Prediction

    Authors: Xianyu Chen, Ming Jiang, Qi Zhao

    Abstract: Understanding how attention varies across individuals has significant scientific and societal impacts. However, existing visual scanpath models treat attention uniformly, neglecting individual differences. To bridge this gap, this paper focuses on individualized scanpath prediction (ISP), a new attention modeling task that aims to accurately predict how different individuals shift their attention…

    Submitted 18 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: To appear in CVPR2024

  22. arXiv:2404.11449  [pdf, other]

    cs.CL cs.LG

    AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts

    Authors: Meng Jiang, Yi Jing Yu, Qing Zhao, Jianqiang Li, Changwei Song, Hongzhi Qi, Wei Zhai, Dan Luo, Xiaoqin Wang, Guanghui Fu, Bing Xiang Yang

    Abstract: Cognitive Behavioral Therapy (CBT) is an effective technique for addressing the irrational thoughts stemming from mental illnesses, but it necessitates precise identification of cognitive pathways to be successfully implemented in patient care. In current society, individuals frequently express negative emotions on social media on specific topics, often exhibiting cognitive distortions, including…

    Submitted 17 April, 2024; originally announced April 2024.

  23. arXiv:2404.01943  [pdf, other]

    cs.CV cs.RO

    Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation

    Authors: Zihan Wang, Xiangyang Li, Jiahao Yang, Yeqi Liu, Junjie Hu, Ming Jiang, Shuqiang Jiang

    Abstract: Vision-and-language navigation (VLN) enables the agent to navigate to a remote location following the natural language instruction in 3D environments. At each navigation step, the agent selects from possible candidate locations and then makes the move. For better navigation planning, the lookahead exploration strategy aims to effectively evaluate the agent's next action by accurately anticipating…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024. The code is available at https://github.com/MrZihan/HNR-VLN

  24. arXiv:2403.12744  [pdf, other]

    cs.CL

    Instructing Large Language Models to Identify and Ignore Irrelevant Conditions

    Authors: Zhenyu Wu, Chao Shen, Meng Jiang

    Abstract: Math word problem (MWP) solving requires generating a reasoning path based on a given problem description that often contains irrelevant conditions. Existing chain-of-thought (CoT) prompting methods elicited multi-step reasoning abilities of large language models (LLMs) to solve MWPs. However, they were seriously confused by the irrelevant conditions, resulting in low accuracy. In this paper, we p…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: NAACL 2024 - Camera Ready

  25. arXiv:2403.12242  [pdf, other]

    cs.CL cs.AI cs.LG

    Reference-based Metrics Disprove Themselves in Question Generation

    Authors: Bang Nguyen, Mengxia Yu, Yun Huang, Meng Jiang

    Abstract: Reference-based metrics such as BLEU and BERTScore are widely used to evaluate question generation (QG). In this study, on QG benchmarks such as SQuAD and HotpotQA, we find that using human-written references cannot guarantee the effectiveness of the reference-based metrics. Most QG benchmarks have only one reference; we replicated the annotation process and collect another reference. A good metri…

    Submitted 17 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Revised Jun 14 2024; Under Review

  26. arXiv:2403.11758  [pdf, other]

    cs.SE

    Demystifying the DAO Governance Process

    Authors: Junjie Ma, Muhui Jiang, Jinan Jiang, Xiapu Luo, Yufeng Hu, Yajin Zhou, Qi Wang, Fengwei Zhang

    Abstract: Decentralized Autonomous Organization (DAO) becomes a popular governance solution for decentralized applications (dApps) to achieve decentralized governance. In the DAO, no single entity can arbitrarily control the dApps without approval from the majority of members. However, despite its advantages, DAO has also been targeted by several attacks, leading to the loss of millions of dollars. In this…

    Submitted 18 March, 2024; originally announced March 2024.

  27. arXiv:2403.10931  [pdf, other]

    eess.IV cs.CV

    Uncertainty-Aware Adapter: Adapting Segment Anything Model (SAM) for Ambiguous Medical Image Segmentation

    Authors: Mingzhou Jiang, Jiaying Zhou, Junde Wu, Tianyang Wang, Yueming Jin, Min Xu

    Abstract: The Segment Anything Model (SAM) gained significant success in natural image segmentation, and many methods have tried to fine-tune it to medical image segmentation. An efficient way to do so is by using Adapters, specialized modules that learn just a few parameters to tailor SAM specifically for medical images. However, unlike natural images, many tissues and lesions in medical images have blurry…

    Submitted 18 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

  28. arXiv:2403.03465  [pdf, other]

    cs.LG cs.SI

    Self-Attention Empowered Graph Convolutional Network for Structure Learning and Node Embedding

    Authors: Mengying Jiang, Guizhong Liu, Yuanchao Su, Xinliang Wu

    Abstract: In representation learning on graph-structured data, many popular graph neural networks (GNNs) fail to capture long-range dependencies, leading to performance degradation. Furthermore, this weakness is magnified when the concerned graph is characterized by heterophily (low homophily). To solve this issue, this paper proposes a novel graph learning framework called the graph convolutional network w…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 33 pages,6 figures,9 tables

  29. arXiv:2402.18127  [pdf, other]

    cs.LG

    Hierarchical Multi-Relational Graph Representation Learning for Large-Scale Prediction of Drug-Drug Interactions

    Authors: Mengying Jiang, Guizhong Liu, Yuanchao Su, Weiqiang Jin, Biao Zhao

    Abstract: Most existing methods for predicting drug-drug interactions (DDI) predominantly concentrate on capturing the explicit relationships among drugs, overlooking the valuable implicit correlations present between drug pairs (DPs), which leads to weak predictions. To address this issue, this paper introduces a hierarchical multi-relational graph representation learning (HMGRL) approach. Within the frame…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 14 pages,10 figures

  30. arXiv:2402.17065  [pdf, other]

    cs.CV cs.AI cs.LG

    Taming the Tail in Class-Conditional GANs: Knowledge Sharing via Unconditional Training at Lower Resolutions

    Authors: Saeed Khorram, Mingqi Jiang, Mohamad Shahbazi, Mohamad H. Danesh, Li Fuxin

    Abstract: Despite extensive research on training generative adversarial networks (GANs) with limited training data, learning to generate images from long-tailed training distributions remains fairly unexplored. In the presence of imbalanced multi-class training data, GANs tend to favor classes with more samples, leading to the generation of low-quality and less diverse samples in tail classes. In this study…

    Submitted 16 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  31. arXiv:2402.16822  [pdf, other]

    cs.CL cs.AI cs.LG

    Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

    Authors: Mikayel Samvelyan, Sharath Chandra Raparthy, Andrei Lupu, Eric Hambro, Aram H. Markosyan, Manish Bhatt, Yuning Mao, Minqi Jiang, Jack Parker-Holder, Jakob Foerster, Tim Rocktäschel, Roberta Raileanu

    Abstract: As large language models (LLMs) become increasingly prevalent across many real-world applications, understanding and enhancing their robustness to user inputs is of paramount importance. Existing methods for identifying adversarial prompts tend to focus on specific domains, lack diversity, or require extensive human annotations. To address these limitations, we present Rainbow Teaming, a novel app…

    Submitted 26 February, 2024; originally announced February 2024.

  32. arXiv:2402.15943  [pdf]

    cs.SE cs.AI

    Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware

    Authors: Ahmed E. Hassan, Dayi Lin, Gopi Krishnan Rajbahadur, Keheliya Gallaba, Filipe R. Cogo, Boyuan Chen, Haoxiang Zhang, Kishanthan Thangarajah, Gustavo Ansaldi Oliva, Jiahuei Lin, Wali Mohammad Abdullah, Zhen Ming Jiang

    Abstract: Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized software development by enabling new use cases and business models. We refer to software built using FMs as FMware. The unique properties of FMware (e.g., prompts, agents, and the need for orchestration), coupled with the intrinsic limitations of FMs (e.g., hallucination) lead to a completely new set of software eng…

    Submitted 3 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  33. arXiv:2402.15215  [pdf, other]

    cs.IR

    Item-side Fairness of Large Language Model-based Recommendation System

    Authors: Meng Jiang, Keqin Bao, Jizhi Zhang, Wenjie Wang, Zhengyi Yang, Fuli Feng, Xiangnan He

    Abstract: Recommendation systems for Web content distribution intricately connect to the information access and exposure opportunities for vulnerable populations. The emergence of Large Language Models-based Recommendation System (LRS) may introduce additional societal challenges to recommendation systems due to the inherent biases in Large Language Models (LLMs). From the perspective of item-side fairness,…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted by the Proceedings of the ACM Web Conference 2024

  34. arXiv:2402.12284  [pdf, other]

    cs.LG cs.AI

    Refining Minimax Regret for Unsupervised Environment Design

    Authors: Michael Beukman, Samuel Coward, Michael Matthews, Mattie Fellows, Minqi Jiang, Michael Dennis, Jakob Foerster

    Abstract: In unsupervised environment design, reinforcement learning agents are trained on environment configurations (levels) generated by an adversary that maximises some objective. Regret is a commonly used objective that theoretically results in a minimax regret (MMR) policy with desirable robustness guarantees; in particular, the agent's maximum regret is bounded. However, once the agent reaches this r…

    Submitted 8 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: ICML 2024. The first two authors contributed equally

  35. arXiv:2402.10670  [pdf, other]

    cs.CL cs.RO

    OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models

    Authors: Yuxuan Kuang, Hai Lin, Meng Jiang

    Abstract: Object navigation (ObjectNav) requires an agent to navigate through unseen environments to find queried objects. Many previous methods attempted to solve this task by relying on supervised or reinforcement learning, where they are trained on limited household datasets with close-set objects. However, two key challenges are unsolved: understanding free-form natural language instructions that demand…

    Submitted 24 March, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: NAACL 2024 Findings

  36. arXiv:2402.10058  [pdf, other]

    cs.CL

    Towards Safer Large Language Models through Machine Unlearning

    Authors: Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, Meng Jiang

    Abstract: The rapid advancement of Large Language Models (LLMs) has demonstrated their vast potential across various domains, attributed to their extensive pretraining knowledge and exceptional generalizability. However, LLMs often encounter challenges in generating harmful content when faced with problematic prompts. To address this problem, existing work attempted to implement a gradient ascent based appr…

    Submitted 5 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted by ACL 2024 Findings

  37. arXiv:2402.07386  [pdf, other]

    cs.CL

    Chain-of-Layer: Iteratively Prompting Large Language Models for Taxonomy Induction from Limited Examples

    Authors: Qingkai Zeng, Yuyang Bai, Zhaoxuan Tan, Shangbin Feng, Zhenwen Liang, Zhihan Zhang, Meng Jiang

    Abstract: Automatic taxonomy induction is crucial for web search, recommendation systems, and question answering. Manual curation of taxonomies is expensive in terms of human effort, making automatic taxonomy construction highly desirable. In this work, we introduce Chain-of-Layer which is an in-context learning framework designed to induct taxonomies from a given set of entities. Chain-of-Layer breaks down…

    Submitted 11 February, 2024; originally announced February 2024.

  38. arXiv:2402.04401  [pdf, other]

    cs.CL

    Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning

    Authors: Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang

    Abstract: Personalization in large language models (LLMs) is increasingly important, aiming to align LLM's interactions, content, and recommendations with individual user preferences. Recent advances in LLM personalization have spotlighted effective prompt design, by enriching user queries with non-parametric knowledge through behavior history retrieval and textual profiles. However, these approaches were l…

    Submitted 6 February, 2024; originally announced February 2024.

  39. arXiv:2402.02973  [pdf, other]

    cs.SE

    Are We There Yet? Unraveling the State-of-the-Art Smart Contract Fuzzers

    Authors: Shuohan Wu, Zihao Li, Luyi Yan, Weimin Chen, Muhui Jiang, Chenxu Wang, Xiapu Luo, Hao Zhou

    Abstract: Given the growing importance of smart contracts in various applications, ensuring their security and reliability is critical. Fuzzing, an effective vulnerability detection technique, has recently been widely applied to smart contracts. Despite numerous studies, a systematic investigation of smart contract fuzzing techniques remains lacking. In this paper, we fill this gap by: 1) providing a compre…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: ICSE 2024

  40. arXiv:2401.13858  [pdf, other]

    cs.LG q-bio.BM

    Graph Diffusion Transformer for Multi-Conditional Molecular Generation

    Authors: Gang Liu, Jiaxin Xu, Tengfei Luo, Meng Jiang

    Abstract: Inverse molecular design with diffusion models holds great potential for advancements in material and drug discovery. Despite success in unconditional molecule generation, integrating multiple properties such as synthetic score and gas permeability as condition constraints into diffusion models remains unexplored. We present the Graph Diffusion Transformer (Graph DiT) for multi-conditional molecul…

    Submitted 6 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: 21 pages, 9 figures, 7 tables

  41. arXiv:2401.13460  [pdf, other]

    cs.LG cs.AI cs.MA

    Multi-Agent Diagnostics for Robustness via Illuminated Diversity

    Authors: Mikayel Samvelyan, Davide Paglieri, Minqi Jiang, Jack Parker-Holder, Tim Rocktäschel

    Abstract: In the rapidly advancing field of multi-agent systems, ensuring robustness in unfamiliar and adversarial settings is crucial. Notwithstanding their outstanding performance in familiar environments, these systems often falter in new situations due to overfitting during the training phase. This is especially pronounced in settings where both cooperative and competitive behaviours are present, encaps…

    Submitted 28 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  42. arXiv:2401.13444  [pdf, other]

    cs.CL cs.AI

    Clue-Guided Path Exploration: An Efficient Knowledge Base Question-Answering Framework with Low Computational Resource Consumption

    Authors: Dehao Tao, Feng Huang, Yongfeng Huang, Minghu Jiang

    Abstract: In recent times, large language models (LLMs) have showcased remarkable capabilities. However, updating their knowledge poses challenges, potentially leading to inaccuracies when confronted with unfamiliar queries. While integrating knowledge graphs with LLMs has been explored, existing approaches treat LLMs as primary decision-makers, imposing high demands on their capabilities. This is particula…

    Submitted 24 January, 2024; originally announced January 2024.

  43. arXiv:2401.11337  [pdf, other]

    cs.CV cs.AI

    Prompting Large Vision-Language Models for Compositional Reasoning

    Authors: Timothy Ossowski, Ming Jiang, Junjie Hu

    Abstract: Vision-language models such as CLIP have shown impressive capabilities in encoding texts and images into aligned embeddings, enabling the retrieval of multimodal data in a shared embedding space. However, these embedding-based models still face challenges in effectively matching images and texts with similar visio-linguistic compositionality, as evidenced by their performance on the recent Winogro…

    Submitted 20 January, 2024; originally announced January 2024.

  44. arXiv:2401.10512  [pdf, other]

    cs.CV

    Exploring Color Invariance through Image-Level Ensemble Learning

    Authors: Yunpeng Gong, Jiaquan Li, Lifei Chen, Min Jiang

    Abstract: In the field of computer vision, the persistent presence of color bias, resulting from fluctuations in real-world lighting and camera conditions, presents a substantial challenge to the robustness of models. This issue is particularly pronounced in complex wide-area surveillance scenarios, such as person re-identification and industrial dust segmentation, where models often experience a decline in…

    Submitted 19 January, 2024; originally announced January 2024.

  45. arXiv:2401.10090  [pdf, other]

    cs.CV

    Cross-Modality Perturbation Synergy Attack for Person Re-identification

    Authors: Yunpeng Gong, Zhun Zhong, Zhiming Luo, Yansong Qu, Rongrong Ji, Min Jiang

    Abstract: In recent years, there has been significant research focusing on addressing security concerns in single-modal person re-identification (ReID) systems that are based on RGB images. However, the safety of cross-modality scenarios, which are more commonly encountered in practical applications involving images captured by infrared cameras, has not received adequate attention. The main challenge in cro…

    Submitted 18 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  46. arXiv:2401.07487  [pdf, other]

    cs.RO cs.CV

    Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation

    Authors: Yuanchen Ju, Kaizhe Hu, Guowei Zhang, Gu Zhang, Mingrun Jiang, Huazhe Xu

    Abstract: Enabling robotic manipulation that generalizes to out-of-distribution scenes is a crucial step toward open-world embodied intelligence. For human beings, this ability is rooted in the understanding of semantic correspondence among objects, which naturally transfers the interaction experience of familiar objects to novel ones. Although robots lack such a reservoir of interaction experience, the vas…

    Submitted 15 January, 2024; originally announced January 2024.

  47. arXiv:2401.06059  [pdf, other]

    cs.CL cs.AI cs.LG

    Investigating Data Contamination for Pre-training Language Models

    Authors: Minhao Jiang, Ken Ziyu Liu, Ming Zhong, Rylan Schaeffer, Siru Ouyang, Jiawei Han, Sanmi Koyejo

    Abstract: Language models pre-trained on web-scale corpora demonstrate impressive capabilities on diverse downstream tasks. However, there is increasing concern whether such capabilities might arise from evaluation datasets being included in the pre-training corpus -- a phenomenon known as \textit{data contamination} -- in a manner that artificially increases performance. There has been little understanding…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 16 pages, 5 figures

  48. arXiv:2401.05561  [pdf, other]

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Lichao Sun, Yue Huang, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang, et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in…

    Submitted 17 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  49. arXiv:2401.02402  [pdf, other]

    cs.CV

    3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

    Authors: Zihao Xiao, Longlong Jing, Shangxuan Wu, Alex Zihao Zhu, Jingwei Ji, Chiyu Max Jiang, Wei-Chih Hung, Thomas Funkhouser, Weicheng Kuo, Anelia Angelova, Yin Zhou, Shiwei Sheng

    Abstract: 3D panoptic segmentation is a challenging perception task, especially in autonomous driving. It aims to predict both semantic and instance annotations for 3D points in a scene. Although prior 3D panoptic segmentation approaches have achieved great performance on closed-set benchmarks, generalizing these approaches to unseen things and unseen stuff categories remains an open problem. For unseen obj…

    Submitted 2 April, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

  50. arXiv:2312.17538  [pdf, other]

    cs.CV cs.LG eess.IV

    Distance Guided Generative Adversarial Network for Explainable Binary Classifications

    Authors: Xiangyu Xiong, Yue Sun, Xiaohong Liu, Wei Ke, Chan-Tong Lam, Jiangang Chen, Mingfeng Jiang, Mingwei Wang, Hui Xie, Tong Tong, Qinquan Gao, Hao Chen, Tao Tan

    Abstract: Despite the potential benefits of data augmentation for mitigating the data insufficiency, traditional augmentation methods primarily rely on the prior intra-domain knowledge. On the other hand, advanced generative adversarial networks (GANs) generate inter-domain samples with limited variety. These previous methods make limited contributions to describing the decision boundaries for binary classi…

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 12 pages, 8 figures. This work has been submitted to the IEEE TNNLS for possible publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media