Showing 1–50 of 265 results for author: Feng, L

Searching in archive cs.
  1. arXiv:2406.16942  [pdf, other]

    eess.IV cs.AI cs.CV

    Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

    Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

    Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RE…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

  2. arXiv:2406.15031  [pdf, other]

    cs.IT

    New Upper Bounds for Noisy Permutation Channels

    Authors: Lugaoze Feng, Baoji Wang, Guocheng Lv, Xvnan Li, Luhua Wang, Ye **

    Abstract: The noisy permutation channel is a useful abstraction introduced by Makur for point-to-point communication networks and biological storage. While the asymptotic capacity results exist for this model, the characterization of the second-order asymptotics is not available. Therefore, we analyze the converse bounds for the noisy permutation channel in the finite blocklength regime. To do this, we pres…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 24 Pages, Submitted to IEEE Transactions on Communications

  3. arXiv:2406.14537  [pdf, other]

    cs.LG q-fin.TR

    MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading

    Authors: Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An

    Abstract: High-frequency trading (HFT) that executes algorithmic trading in short time scales, has recently occupied the majority of cryptocurrency market. Besides traditional quantitative trading methods, reinforcement learning (RL) has become another appealing approach for HFT due to its terrific ability of handling high-dimensional financial data and solving sophisticated sequential decision-making probl…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to KDD 2024

  4. arXiv:2406.14359  [pdf, other]

    cs.NE

    Learning to Transfer for Evolutionary Multitasking

    Authors: Sheng-Hao Wu, Yuxiao Huang, Xingyu Wu, Liang Feng, Zhi-Hui Zhan, Kay Chen Tan

    Abstract: Evolutionary multitasking (EMT) is an emerging approach for solving multitask optimization problems (MTOPs) and has garnered considerable research interest. The implicit EMT is a significant research branch that utilizes evolution operators to enable knowledge transfer (KT) between tasks. However, current approaches in implicit EMT face challenges in adaptability, due to the use of a limited numbe…

    Submitted 22 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  5. arXiv:2406.11168  [pdf, other]

    math.OC cs.LG

    Two-Timescale Optimization Framework for Decentralized Linear-Quadratic Optimal Control

    Authors: Lechen Feng, Yuan-Hua Ni, Xuebo Zhang

    Abstract: This study investigates a decentralized linear-quadratic optimal control problem, and several approximate separable constrained optimization problems are formulated for the first time based on the selection of sparsity promoting functions. First, for the optimization problem with weighted $\ell_1$ sparsity promoting function, a two-timescale algorithm is adopted that is based on the BSUM (Block Su…

    Submitted 16 June, 2024; originally announced June 2024.

  6. arXiv:2406.10502  [pdf, other]

    cs.LG cs.AI cs.CV

    Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data

    Authors: Jiahan Zhang, Qi Wei, Feng Liu, Lei Feng

    Abstract: Fine-tuning vision-language models (VLMs) with abundant unlabeled data recently has attracted increasing attention. Existing methods that resort to the pseudolabeling strategy would suffer from heavily incorrect hard pseudolabels when VLMs exhibit low zero-shot performance in downstream tasks. To alleviate this issue, we propose a Candidate Pseudolabel Learning method, termed CPL, to fine-tune VLM…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML2024

  7. arXiv:2406.09385  [pdf, other]

    cs.CV

    Towards Vision-Language Geo-Foundation Model: A Survey

    Authors: Yue Zhou, Litong Feng, Yi** Ke, Xue Jiang, Junchi Yan, Xue Yang, Wayne Zhang

    Abstract: Vision-Language Foundation Models (VLFMs) have made remarkable progress on various multimodal tasks, such as image captioning, image-text retrieval, visual question answering, and visual grounding. However, most methods rely on training with general image datasets, and the lack of geospatial data leads to poor performance on earth observation. Numerous geospatial image-text pair datasets and VLFMs…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 18 pages, 4 figures

  8. arXiv:2406.08987  [pdf, other]

    cs.NE

    Towards Next Era of Multi-objective Optimization: Large Language Models as Architects of Evolutionary Operators

    Authors: Yuxiao Huang, Shenghao Wu, Wenjie Zhang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Multi-objective optimization problems (MOPs) are prevalent in various real-world applications, necessitating sophisticated solutions that balance conflicting objectives. Traditional evolutionary algorithms (EAs), while effective, often rely on domain-specific expert knowledge and iterative tuning, which can impede innovation when encountering novel MOPs. Very recently, the emergence of Large Langu…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures, 5 tables

  9. arXiv:2406.08754  [pdf, other]

    cs.CL cs.CR

    StructuralSleight: Automated Jailbreak Attacks on Large Language Models Utilizing Uncommon Text-Encoded Structure

    Authors: Bangxin Li, Hengrui Xing, Chao Huang, ** Qian, Huangqing Xiao, Linfeng Feng, Cong Tian

    Abstract: Large Language Models (LLMs) are widely used in natural language processing but face the risk of jailbreak attacks that maliciously induce them to generate harmful content. Existing jailbreak attacks, including character-level and context-level attacks, mainly focus on the prompt of the plain text without specifically exploring the significant influence of its structure. In this paper, we focus on…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 12 pages, 4 figures

  10. arXiv:2406.08079  [pdf, other]

    cs.CV

    A$^{2}$-MAE: A spatial-temporal-spectral unified remote sensing pre-training method based on anchor-aware masked autoencoder

    Authors: Lixian Zhang, Yi Zhao, Runmin Dong, **xiao Zhang, Shuai Yuan, Shilei Cao, Mengxuan Chen, Juepeng Zheng, Weijia Li, Wei Liu, Wayne Zhang, Litong Feng, Haohuan Fu

    Abstract: Vast amounts of remote sensing (RS) data provide Earth observations across multiple dimensions, encompassing critical spatial, temporal, and spectral information which is essential for addressing global-scale challenges such as land use monitoring, disaster prevention, and environmental change mitigation. Despite various pre-training methods tailored to the characteristics of RS data, a key limita…

    Submitted 16 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  11. arXiv:2406.07069  [pdf, other]

    cs.RO eess.SY

    Optimal Gait Control for a Tendon-driven Soft Quadruped Robot by Model-based Reinforcement Learning

    Authors: Xuezhi Niu, Kaige Tan, Lei Feng

    Abstract: This study presents an innovative approach to optimal gait control for a soft quadruped robot enabled by four Compressible Tendon-driven Soft Actuators (CTSAs). Improving our previous studies of using model-free reinforcement learning for gait control, we employ model-based reinforcement learning (MBRL) to further enhance the performance of the gait controller. Compared to rigid robots, the propos…

    Submitted 11 June, 2024; originally announced June 2024.

  12. arXiv:2406.07065  [pdf, other]

    cs.RO eess.SY

    Optimal Gait Design for a Soft Quadruped Robot via Multi-fidelity Bayesian Optimization

    Authors: Kaige Tan, Xuezhi Niu, Qinglei Ji, Lei Feng, Martin Törngren

    Abstract: This study focuses on the locomotion capability improvement in a tendon-driven soft quadruped robot through an online adaptive learning approach. Leveraging the inverse kinematics model of the soft quadruped robot, we employ a central pattern generator to design a parametric gait pattern, and use Bayesian optimization (BO) to find the optimal parameters. Further, to address the challenges of model…

    Submitted 11 June, 2024; originally announced June 2024.

  13. arXiv:2406.04609  [pdf, other]

    cs.LG cs.AI

    Diverse Intra- and Inter-Domain Activity Style Fusion for Cross-Person Generalization in Activity Recognition

    Authors: Junru Zhang, Lang Feng, Zhidan Liu, Yuhan Wu, Yang He, Yabo Dong, Duanqing Xu

    Abstract: Existing domain generalization (DG) methods for cross-person generalization tasks often face challenges in capturing intra- and inter-domain style diversity, resulting in domain gaps with the target domain. In this study, we explore a novel perspective to tackle this problem, a process conceptualized as domain padding. This proposal aims to enrich the domain diversity by synthesizing intra- and in…

    Submitted 28 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2024)

  14. arXiv:2406.03150  [pdf, other]

    cs.LG cs.CV

    Sample-specific Masks for Visual Reprogramming-based Prompting

    Authors: Chengyi Cai, Zesheng Ye, Lei Feng, Jianzhong Qi, Feng Liu

    Abstract: Visual reprogramming (VR) is a prompting technique that aims to re-purpose a pre-trained model (e.g., a classifier on ImageNet) to target tasks (e.g., medical data prediction) by learning a small-scale pattern added into input images instead of tuning considerable parameters within the model. The location of the pattern within input samples is usually determined by a pre-defined mask shared across…

    Submitted 5 June, 2024; originally announced June 2024.

  15. arXiv:2406.02915  [pdf, other]

    cs.CV cs.LG

    Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models

    Authors: **hao Li, Haopeng Li, Sarah Erfani, Lei Feng, James Bailey, Feng Liu

    Abstract: It has recently been discovered that using a pre-trained vision-language model (VLM), e.g., CLIP, to align a whole query image with several finer text descriptions generated by a large language model can significantly enhance zero-shot performance. However, in this paper, we empirically find that the finer descriptions tend to align more effectively with local areas of the query image rather than…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 22 pages, 16 figures, published to ICML 2024

    MSC Class: 68T45; 68T10 ACM Class: I.2.10; I.4.10

  16. arXiv:2405.20705  [pdf, other]

    cs.AI

    ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments

    Authors: Sören Schleibaum, Lu Feng, Sarit Kraus, Jörg P. Müller

    Abstract: In the evolving landscape of human-centered AI, fostering a synergistic relationship between humans and AI agents in decision-making processes stands as a paramount challenge. This work considers a problem setup where an intelligent agent comprising a neural network-based prediction component and a deep reinforcement learning component provides advice to a human decision-maker in complex repeated…

    Submitted 31 May, 2024; originally announced May 2024.

  17. arXiv:2405.17879  [pdf, other]

    cs.LG cs.AI

    Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree

    Authors: Lang Feng, Pengjie Gu, Bo An, Gang Pan

    Abstract: Diffusion planners have shown promise in handling long-horizon and sparse-reward tasks due to the non-autoregressive plan generation. However, their inherent stochastic risk of generating infeasible trajectories presents significant challenges to their reliability and stability. We introduce a novel approach, the Trajectory Aggregation Tree (TAT), to address this issue in diffusion planners. Compa…

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML 2024 (Spotlight)

  18. arXiv:2405.15269  [pdf, other]

    cs.CV cs.LG

    BDetCLIP: Multimodal Prompting Contrastive Test-Time Backdoor Detection

    Authors: Yuwei Niu, Shuo He, Qi Wei, Feng Liu, Lei Feng

    Abstract: Multimodal contrastive learning methods (e.g., CLIP) have shown impressive zero-shot classification performance due to their strong ability to joint representation learning for visual and textual modalities. However, recent research revealed that multimodal contrastive learning on poisoned pre-training data with a small proportion of maliciously backdoored data can induce backdoored CLIP that coul…

    Submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.14474  [pdf, other]

    cs.NE

    Time Cell Inspired Temporal Codebook in Spiking Neural Networks for Enhanced Image Generation

    Authors: Linghao Feng, Dongcheng Zhao, Sicheng Shen, Yiting Dong, Guobin Shen, Yi Zeng

    Abstract: This paper presents a novel approach leveraging Spiking Neural Networks (SNNs) to construct a Variational Quantized Autoencoder (VQ-VAE) with a temporal codebook inspired by hippocampal time cells. This design captures and utilizes temporal dependencies, significantly enhancing the generative capabilities of SNNs. Neuroscientific research has identified hippocampal "time cells" that fire sequentia…

    Submitted 23 May, 2024; originally announced May 2024.

  20. arXiv:2405.14111  [pdf, other]

    cs.LG

    Improving Generalization of Deep Neural Networks by Optimum Shifting

    Authors: Yuyan Zhou, Ye Li, Lei Feng, Sheng-Jun Huang

    Abstract: Recent studies showed that the generalization of neural networks is correlated with the sharpness of the loss landscape, and flat minima suggests a better generalization ability than sharp minima. In this paper, we propose a novel method called \emph{optimum shifting}, which changes the parameters of a neural network from a sharp minimum to a flatter one while maintaining the same training loss va…

    Submitted 22 May, 2024; originally announced May 2024.

  21. arXiv:2405.13956  [pdf, other]

    cs.LG

    Attention as an RNN

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Mohamed Osama Ahmed, Yoshua Bengio, Greg Mori

    Abstract: The advent of Transformers marked a significant breakthrough in sequence modelling, providing a highly performant architecture capable of leveraging GPU parallelism. However, Transformers are computationally expensive at inference time, limiting their applications, particularly in low-resource settings (e.g., mobile and embedded devices). Addressing this, we (1) begin by showing that attention can…

    Submitted 28 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  22. arXiv:2405.09721  [pdf, other]

    cs.CR

    DP-RuL: Differentially-Private Rule Learning for Clinical Decision Support Systems

    Authors: Josephine Lamp, Lu Feng, David Evans

    Abstract: Serious privacy concerns arise with the use of patient data in rule-based clinical decision support systems (CDSS). The goal of a privacy-preserving CDSS is to learn a population ruleset from individual clients' local rulesets, while protecting the potentially sensitive information contained in the rulesets. We present the first work focused on this problem and develop a framework for learning pop…

    Submitted 15 May, 2024; originally announced May 2024.

  23. arXiv:2405.07777  [pdf, other]

    cs.CV eess.IV

    GMSR:Gradient-Guided Mamba for Spectral Reconstruction from RGB Images

    Authors: Xinying Wang, Zhixiong Huang, Sifan Zhang, Jiawen Zhu, Lin Feng

    Abstract: Mainstream approaches to spectral reconstruction (SR) primarily focus on designing Convolution- and Transformer-based architectures. However, CNN methods often face challenges in handling long-range dependencies, whereas Transformers are constrained by computational efficiency limitations. Recent breakthroughs in state-space model (e.g., Mamba) has attracted significant attention due to its near-l…

    Submitted 13 May, 2024; originally announced May 2024.

  24. arXiv:2405.01616  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra, et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu…

    Submitted 2 May, 2024; originally announced May 2024.

  25. arXiv:2404.19438  [pdf, other]

    cs.NE

    Neuro-Vision to Language: Enhancing Visual Reconstruction and Language Interaction through Brain Recordings

    Authors: Guobin Shen, Dongcheng Zhao, Xiang He, Linghao Feng, Yiting Dong, Jihang Wang, Qian Zhang, Yi Zeng

    Abstract: Decoding non-invasive brain recordings is pivotal for advancing our understanding of human cognition but faces challenges due to individual differences and complex neural signal representations. Traditional methods often require customized models and extensive trials, lacking interpretability in visual reconstruction tasks. Our framework integrates 3D brain structures with visual semantics using a…

    Submitted 22 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  26. arXiv:2404.17883  [pdf, other]

    cs.CV

    Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement

    Authors: Zhixiong Huang, Xinying Wang, Chengpei Xu, **jiang Li, Lin Feng

    Abstract: Underwater scenes intrinsically involve degradation problems owing to heterogeneous ocean elements. Prevailing underwater image enhancement (UIE) methods stick to straightforward feature modeling to learn the mapping function, which leads to limited vision gain as it lacks more explicit physical cues (e.g., depth). In this work, we investigate injecting the depth prior into the deep UIE model for…

    Submitted 10 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

  27. arXiv:2404.15557  [pdf, other]

    cs.RO

    Safe POMDP Online Planning among Dynamic Agents via Adaptive Conformal Prediction

    Authors: Shili Sheng, Pian Yu, David Parker, Marta Kwiatkowska, Lu Feng

    Abstract: Online planning for partially observable Markov decision processes (POMDPs) provides efficient techniques for robot decision-making under uncertainty. However, existing methods fall short of preventing safety violations in dynamic environments. This work presents a novel safe POMDP online planning approach that offers probabilistic safety guarantees amidst environments populated by multiple dynami…

    Submitted 23 April, 2024; originally announced April 2024.

  28. arXiv:2404.12674  [pdf, other]

    cs.DC cs.LG cs.PF

    Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms

    Authors: Zhongyi Lin, Ning Sun, Pallab Bhattacharya, Xizhou Feng, Louis Feng, John D. Owens

    Abstract: Characterizing and predicting the training performance of modern machine learning (ML) workloads on compute systems with compute and communication spread between CPUs, GPUs, and network devices is not only the key to optimization and planning but also a complex goal to achieve. The primary challenges include the complexity of synchronization and load balancing between CPUs and GPUs, the variance i…

    Submitted 27 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: 12 pages, 11 figures, 4 tables

  29. arXiv:2404.06349  [pdf, other]

    cs.LG

    CausalBench: A Comprehensive Benchmark for Causal Learning Capability of Large Language Models

    Authors: Yu Zhou, Xingyu Wu, Beicheng Huang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Causality reveals fundamental principles behind data distributions in real-world scenarios, and the capability of large language models (LLMs) to understand causality directly impacts their efficacy across explaining outputs, adapting to new evidence, and generating counterfactuals. With the proliferation of LLMs, the evaluation of this capacity is increasingly garnering attention. However, the ab…

    Submitted 9 April, 2024; originally announced April 2024.

  30. arXiv:2404.06290  [pdf, other]

    cs.NE

    Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models

    Authors: Beichen Huang, Xingyu Wu, Yu Zhou, Jibin Wu, Liang Feng, Ran Cheng, Kay Chen Tan

    Abstract: Large language models (LLMs) have gained widespread popularity and demonstrated exceptional performance not only in natural language processing (NLP) tasks but also in non-linguistic domains. Their potential as artificial general intelligence extends beyond NLP, showcasing promising capabilities in diverse optimization scenarios. Despite this rising trend, whether the integration of LLMs into thes…

    Submitted 9 April, 2024; originally announced April 2024.

  31. arXiv:2403.20213  [pdf, other]

    cs.CV

    H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

    Authors: Chao Pang, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Xingxing Weng, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He

    Abstract: The generic large Vision-Language Models (VLMs) is rapidly developing, but still perform poorly in Remote Sensing (RS) domain, which is due to the unique and specialized nature of RS imagery and the comparatively limited spatial perception of current VLMs. Existing Remote Sensing specific Vision Language Models (RSVLMs) still have considerable potential for improvement, primarily owing to the lack…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Chao Pang, Jiang Wu; Corresponding author: Gui-Song Xia, Conghui He

  32. arXiv:2403.18379  [pdf, other]

    cs.LG cs.AI

    IIP-Mixer:Intra-Inter Patch Mixing Architecture for Battery Remaining Useful Life Prediction

    Authors: Guangzai Ye, Li Feng, Jianlan Guo, Yuqiang Chen

    Abstract: Accurately estimating the Remaining Useful Life (RUL) of lithium-ion batteries is crucial for maintaining the safe and stable operation of rechargeable battery management systems. However, this task is often challenging due to the complex temporal dynamics involved. Recently, attention-based networks, such as Transformers and Informer, have been the popular architecture in time series forecasting.…

    Submitted 27 March, 2024; originally announced March 2024.

  33. arXiv:2403.15098  [pdf, other]

    cs.CV

    UniTraj: A Unified Framework for Scalable Vehicle Trajectory Prediction

    Authors: Lan Feng, Mohammadhossein Bahari, Kaouther Messaoud Ben Amor, Éloi Zablocki, Matthieu Cord, Alexandre Alahi

    Abstract: Vehicle trajectory prediction has increasingly relied on data-driven solutions, but their ability to scale to different data domains and the impact of larger dataset sizes on their generalization remain under-explored. While these questions can be studied by employing multiple datasets, it is challenging due to several discrepancies, e.g., in data formats, map resolution, and semantic annotation t…

    Submitted 27 March, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  34. arXiv:2403.14983  [pdf, other]

    physics.soc-ph cs.SI

    Reconstructing the evolution history of networked complex systems

    Authors: Junya Wang, Yi-Jiao Zhang, Cong Xu, Jiaze Li, Jiachen Sun, Jiarong Xie, Ling Feng, Tianshou Zhou, Yanqing Hu

    Abstract: The evolution processes of complex systems carry key information in the systems' functional properties. Applying machine learning algorithms, we demonstrate that the historical formation process of various networked complex systems can be extracted, including protein-protein interaction, ecology, and social network systems. The recovered evolution process has demonstrations of immense scientific v…

    Submitted 22 March, 2024; originally announced March 2024.

  35. Typist Experiment: an Investigation of Human-to-Human Dictation via Role-play to Inform Voice-based Text Authoring

    Authors: Can Liu, Siying Hu, Li Feng, Mingming Fan

    Abstract: Voice dictation is increasingly used for text entry, especially in mobile scenarios. However, the speech-based experience gets disrupted when users must go back to a screen and keyboard to review and edit the text. While existing dictation systems focus on improving transcription and error correction, little is known about how to support speech input for the entire text creation process, including…

    Submitted 8 March, 2024; originally announced March 2024.

    Journal ref: Proc. ACM Hum.-Comput. Interact. 6, CSCW2, Article 338 (November 2022), 33 pages

  36. arXiv:2403.05102  [pdf, other]

    cs.CV

    Enhancing Texture Generation with High-Fidelity Using Advanced Texture Priors

    Authors: Kuo Xu, Maoyu Wang, Muyu Wang, Lincong Feng, Tianhui Zhang, Xiaoli Liu

    Abstract: The recent advancements in 2D generation technology have sparked a widespread discussion on using 2D priors for 3D shape and texture content generation. However, these methods often overlook the subsequent user operations, such as texture aliasing and blurring that occur when the user acquires the 3D model and simplifies its structure. Traditional graphics methods partially alleviate this issue, b…

    Submitted 8 March, 2024; originally announced March 2024.

  37. arXiv:2403.02871  [pdf, other]

    quant-ph cs.LG

    Quantum Mixed-State Self-Attention Network

    Authors: Fu Chen, Qinglin Zhao, Li Feng, Chuangtao Chen, Yangbin Lin, Jianhong Lin

    Abstract: The rapid advancement of quantum computing has increasingly highlighted its potential in the realm of machine learning, particularly in the context of natural language processing (NLP) tasks. Quantum machine learning (QML) leverages the unique capabilities of quantum computing to offer novel perspectives and methodologies for complex data processing and pattern recognition challenges. This paper i…

    Submitted 8 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  38. arXiv:2403.01757  [pdf, other]

    cs.AI cs.CL cs.LG cs.NE math.OC

    How Multimodal Integration Boost the Performance of LLM for Optimization: Case Study on Capacitated Vehicle Routing Problems

    Authors: Yuxiao Huang, Wenjie Zhang, Liang Feng, Xingyu Wu, Kay Chen Tan

    Abstract: Recently, large language models (LLMs) have notably positioned them as capable tools for addressing complex optimization challenges. Despite this recognition, a predominant limitation of existing LLM-based optimization methods is their struggle to capture the relationships among decision variables when relying exclusively on numerical text prompts, especially in high-dimensional problems. Keeping…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 8pages,3 figures, 2 tables

  39. arXiv:2402.05453  [pdf, other]

    cs.LG cs.CR

    Mitigating Privacy Risk in Membership Inference by Convex-Concave Loss

    Authors: Zhenlong Liu, Lei Feng, Hui** Zhuang, Xiaofeng Cao, Hongxin Wei

    Abstract: Machine learning models are susceptible to membership inference attacks (MIAs), which aim to infer whether a sample is in the training set. Existing work utilizes gradient ascent to enlarge the loss variance of training data, alleviating the privacy risk. However, optimizing toward a reverse direction may cause the model parameters to oscillate near local minima, leading to instability and subopti…

    Submitted 18 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  40. arXiv:2402.04344  [pdf, other]

    cs.LG

    Does Confidence Calibration Help Conformal Prediction?

    Authors: Huajun Xi, Jianguo Huang, Lei Feng, Hongxin Wei

    Abstract: Conformal prediction, as an emerging uncertainty qualification technique, constructs prediction sets that are guaranteed to contain the true label with high probability. Previous works usually employ temperature scaling to calibrate the classifier, assuming that confidence calibration can benefit conformal prediction. In this work, we first show that post-hoc calibration methods surprisingly lead…

    Submitted 6 February, 2024; originally announced February 2024.

  41. arXiv:2402.01922  [pdf, other]

    cs.LG cs.AI

    A General Framework for Learning from Weak Supervision

    Authors: Hao Chen, **dong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

    Abstract: Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment. This paper introduces a general framework for learning from weak supervision (GLWS) with a novel algorithm. Central to GLWS is an Expectation-Maximization (EM) formulati…

    Submitted 5 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 24 pages, 20 tables, 9 figures

  42. arXiv:2402.01619  [pdf, other]

    cs.CL

    KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases

    Authors: Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li

    Abstract: Program induction (PI) has become a promising paradigm for using knowledge bases (KBs) to help large language models (LLMs) answer complex knowledge-intensive questions. Nonetheless, PI typically relies on a large number of parallel question-program pairs to make the LLM aware of the schema of the given KB, and is thus challenging for many low-resourced KBs that lack annotated data. To this end, w…

    Submitted 2 February, 2024; originally announced February 2024.

  43. arXiv:2401.13360  [pdf, other]

    cs.LG

    Debiased Sample Selection for Combating Noisy Labels

    Authors: Qi Wei, Lei Feng, Haobo Wang, Bo An

    Abstract: Learning with noisy labels aims to ensure model generalization given a label-corrupted training set. The sample selection strategy achieves promising performance by selecting a label-reliable subset for model training. In this paper, we empirically reveal that existing sample selection methods suffer from both data and training bias that are represented as imbalanced selected sets and accumulation…

    Submitted 24 January, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  44. arXiv:2401.10034  [pdf, other]

    cs.NE cs.AI cs.CL

    Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap

    Authors: Xingyu Wu, Sheng-hao Wu, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Large language models (LLMs) have not only revolutionized natural language processing but also extended their prowess to various domains, marking a significant stride towards artificial general intelligence. The interplay between LLMs and evolutionary algorithms (EAs), despite differing in objectives and methodologies, share a common pursuit of applicability in complex problems. Meanwhile, EA can…

    Submitted 29 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: evolutionary algorithm (EA), large language model (LLM), optimization problem, prompt engineering, algorithm generation, neural architecture search

  45. arXiv:2401.08013  [pdf, other]

    cs.GT cs.MA econ.GN

    A Day-to-Day Dynamical Approach to the Most Likely User Equilibrium Problem

    Authors: Jiayang Li, Qianni Wang, Liyang Feng, Jun Xie, Yu Marco Nie

    Abstract: The lack of a unique user equilibrium (UE) route flow in traffic assignment has posed a significant challenge to many transportation applications. The maximum-entropy principle, which advocates for the consistent selection of the most likely solution as a representative, is often used to address the challenge. Built on a recently proposed day-to-day (DTD) discrete-time dynamical model called cumul…

    Submitted 15 January, 2024; originally announced January 2024.

  46. arXiv:2401.03426  [pdf, other]

    cs.CL cs.AI

    On Leveraging Large Language Models for Enhancing Entity Resolution

    Authors: Huahang Li, Longyu Feng, Shuangyin Li, Fei Hao, Chen Jason Zhang, Yuanfeng Song, Lei Chen

    Abstract: Entity resolution, the task of identifying and consolidating records that pertain to the same real-world entity, plays a pivotal role in various sectors such as e-commerce, healthcare, and law enforcement. The emergence of Large Language Models (LLMs) like GPT-4 has introduced a new dimension to this task, leveraging their advanced linguistic capabilities. This paper explores the potential of LLMs…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 12 pages, 6 figures, ICDE 2024

  47. arXiv:2401.01563  [pdf, other]

    cs.NE

    Towards Multi-Objective High-Dimensional Feature Selection via Evolutionary Multitasking

    Authors: Yinglan Feng, Liang Feng, Songbai Liu, Sam Kwong, Kay Chen Tan

    Abstract: Evolutionary Multitasking (EMT) paradigm, an emerging research topic in evolutionary computation, has been successfully applied in solving high-dimensional feature selection (FS) problems recently. However, existing EMT-based FS methods suffer from several limitations, such as a single mode of multitask generation, conducting the same generic evolutionary search for all tasks, relying on implicit…

    Submitted 3 January, 2024; originally announced January 2024.

  48. arXiv:2401.00426  [pdf, other]

    cs.CL cs.AI

    keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM

    Authors: Chaojie Wang, Yishi Xu, Zhong Peng, Chenxi Zhang, Bo Chen, Xinrun Wang, Lei Feng, Bo An

    Abstract: Large language models (LLMs) have exhibited remarkable performance on various natural language processing (NLP) tasks, especially for question answering. However, in the face of problems beyond the scope of knowledge, these LLMs tend to talk nonsense with a straight face, where the potential solution could be incorporating an Information Retrieval (IR) module and generating response based on these…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: 12 pages, 6 figures

  49. arXiv:2311.13811  [pdf]

    cs.AI

    Education distillation:getting student models to learn in shcools

    Authors: Ling Feng, Danyang Li, Tianhao Wu, Xuliang Duan

    Abstract: Knowledge distillation is one of the methods for model compression, and existing knowledge distillation techniques focus on how to improve the distillation algorithm so as to enhance the distillation efficiency. This paper introduces dynamic incremental learning into knowledge distillation and proposes a distillation strategy for education distillation. Specifically, it is proposed to take fragmen…

    Submitted 26 November, 2023; v1 submitted 23 November, 2023; originally announced November 2023.

  50. arXiv:2311.10123  [pdf, other]

    cs.CV

    MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture

    Authors: Lincong Feng, Muyu Wang, Maoyu Wang, Kuo Xu, Xiaoli Liu

    Abstract: Generative models for 3D object synthesis have seen significant advancements with the incorporation of prior knowledge distilled from 2D diffusion models. Nevertheless, challenges persist in the form of multi-view geometric inconsistencies and slow generation speeds within the existing 3D synthesis frameworks. This can be attributed to two factors: firstly, the deficiency of abundant geometric a p…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2306.17843, arXiv:2209.14988 by other authors